BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 026959
(230 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255552788|ref|XP_002517437.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223543448|gb|EEF44979.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 311
Score = 380 bits (975), Expect = e-103, Method: Compositional matrix adjust.
Identities = 179/225 (79%), Positives = 202/225 (89%), Gaps = 2/225 (0%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKSI SEVRTSSGMF++KAQDEIVA IEARIAAWTFLP ENGE+MQILHYEHG
Sbjct: 85 MVADNESGKSIESEVRTSSGMFIAKAQDEIVADIEARIAAWTFLPEENGESMQILHYEHG 144
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
QKYEPHFD+F DK NQ+LGGHR+ATVLMYLS+VEKGGETVFPN+E +SQ ++ +WS+CA
Sbjct: 145 QKYEPHFDYFHDKANQELGGHRVATVLMYLSNVEKGGETVFPNAEGKLSQPKEDSWSDCA 204
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
+ GYAVKP KGDALLFFSLHPDA+TDS SLHGSCPVIEGEKWSATKWIHVR+F+K K+
Sbjct: 205 KGGYAVKPEKGDALLFFSLHPDATTDSDSLHGSCPVIEGEKWSATKWIHVRSFEKSFKQL 264
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
DCVDE+ +C +WAKAGECKKNPLYM+GS + GYCRKSCKVC
Sbjct: 265 GKGDCVDENDHCPLWAKAGECKKNPLYMIGSGGANGYCRKSCKVC 309
>gi|50845214|gb|AAT84604.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
Length = 316
Score = 367 bits (941), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 177/226 (78%), Positives = 194/226 (85%), Gaps = 3/226 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKSI SEVRTSSGMFL KAQD++VA+IEARIAAWTFLP ENGEAMQILHYE G
Sbjct: 91 MVADNESGKSIPSEVRTSSGMFLQKAQDDVVAAIEARIAAWTFLPIENGEAMQILHYERG 150
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGN--WSECA 118
QKYEPHFD+F DK+NQQLGGHRIATVLMYLS+VE+GGETVFPN+E N S+CA
Sbjct: 151 QKYEPHFDYFHDKVNQQLGGHRIATVLMYLSNVEEGGETVFPNAEAKLQLANNESLSDCA 210
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK-E 177
+ GY+VKP KGDALLFFSLHPDASTDS SLHGSCPVIEGEKWSATKWIHVR+FD+ K +
Sbjct: 211 KGGYSVKPKKGDALLFFSLHPDASTDSLSLHGSCPVIEGEKWSATKWIHVRSFDRIRKDD 270
Query: 178 PEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
P DCVD++ C WA AGECKKNPLYMVGSK +GYCRKSC VC
Sbjct: 271 PPSGDCVDDNALCAQWALAGECKKNPLYMVGSKDMKGYCRKSCNVC 316
>gi|224141325|ref|XP_002324024.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
trichocarpa]
gi|222867026|gb|EEF04157.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
trichocarpa]
Length = 308
Score = 366 bits (939), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 172/225 (76%), Positives = 194/225 (86%), Gaps = 2/225 (0%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKSI SEVRTSSGMF+ K+QDEIV IEARIAAWTFLP ENGE++QILHYEHG
Sbjct: 82 MVADNESGKSIESEVRTSSGMFIGKSQDEIVDDIEARIAAWTFLPQENGESIQILHYEHG 141
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
QKYEPHFD+F DK NQ+LGGHR+ TVLMYLS+V KGGETVFPNSE Q +D +WS+CA
Sbjct: 142 QKYEPHFDYFHDKANQELGGHRVVTVLMYLSNVGKGGETVFPNSEGKTIQPKDDSWSDCA 201
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
+ GYAVKP KGDALLFFSLHPDA+TD+ SLHGSCPVIEGEKWSATKWIHVR+F+K K
Sbjct: 202 KNGYAVKPQKGDALLFFSLHPDATTDTNSLHGSCPVIEGEKWSATKWIHVRSFEKSLKHA 261
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
C+DE+ NC +WAKAGEC+KNP+YMVGS+ S G CRKSCKVC
Sbjct: 262 ASGGCIDENENCPLWAKAGECQKNPVYMVGSEGSYGSCRKSCKVC 306
>gi|225452614|ref|XP_002281420.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296087745|emb|CBI35001.3| unnamed protein product [Vitis vinifera]
Length = 316
Score = 365 bits (937), Expect = 8e-99, Method: Compositional matrix adjust.
Identities = 173/225 (76%), Positives = 196/225 (87%), Gaps = 2/225 (0%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKSI SEVRTSSGMFL KAQDEIVA IEARIAAWTFLP ENGE++QILHYE+G
Sbjct: 90 MVADNESGKSIMSEVRTSSGMFLLKAQDEIVADIEARIAAWTFLPVENGESIQILHYENG 149
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
+KYEPHFD+F DK+NQ LGGHRIATVLMYL+ VE+GGETVFPNSE SQ +D +WS+CA
Sbjct: 150 EKYEPHFDYFHDKVNQLLGGHRIATVLMYLATVEEGGETVFPNSEGRFSQPKDDSWSDCA 209
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
++GYAV P KGDALLFFSLHPDA+TD +SLHGSCPVI GEKWSATKWIHVR+FDKP K
Sbjct: 210 KKGYAVNPKKGDALLFFSLHPDATTDPSSLHGSCPVIAGEKWSATKWIHVRSFDKPSKRG 269
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+CVDED +C WA GEC+KNP+YMVGS++S G+CRKSC VC
Sbjct: 270 AQGECVDEDEHCPKWAAVGECEKNPVYMVGSENSDGFCRKSCGVC 314
>gi|356550516|ref|XP_003543632.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 318
Score = 363 bits (932), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 173/225 (76%), Positives = 193/225 (85%), Gaps = 2/225 (0%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKSI SEVRTSSGMFL+KAQDEIVA IEARIAAWTFLP ENGE+MQILHYE+G
Sbjct: 92 MVADNESGKSIMSEVRTSSGMFLNKAQDEIVAGIEARIAAWTFLPIENGESMQILHYENG 151
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPN--SEVSQSRDGNWSECA 118
QKYEPHFD+F DK NQ +GGHRIATVLMYLS VEKGGET+FPN +++ Q +D +WSECA
Sbjct: 152 QKYEPHFDYFHDKANQVMGGHRIATVLMYLSDVEKGGETIFPNAKAKLLQPKDESWSECA 211
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
+GYAVKP KGDALLFFSLH DASTD+ SLHGSCPVIEGEKWSATKWIHV +F KP K+
Sbjct: 212 HKGYAVKPRKGDALLFFSLHLDASTDNKSLHGSCPVIEGEKWSATKWIHVSDFQKPIKQV 271
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+ DCVDE+ NC WAK GEC+KNPLYMVG + +G C KSC VC
Sbjct: 272 DSGDCVDENENCPRWAKVGECEKNPLYMVGGEGVKGSCMKSCNVC 316
>gi|356572148|ref|XP_003554232.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
Length = 319
Score = 363 bits (932), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 171/225 (76%), Positives = 194/225 (86%), Gaps = 2/225 (0%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADN+SGKSI S++RTSSGMFL+KAQDEIVA IEARIAAWTFLP ENGE+MQILHYE+G
Sbjct: 93 MVADNDSGKSIMSDIRTSSGMFLNKAQDEIVAGIEARIAAWTFLPVENGESMQILHYENG 152
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
QKYEPHFD+F DK NQ +GGHRIATVLMYLS VEKGGET+FPN+E + Q +D +WSECA
Sbjct: 153 QKYEPHFDYFHDKANQVMGGHRIATVLMYLSDVEKGGETIFPNAEAKLLQPKDESWSECA 212
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
+GYAVKP KGDALLFFSLH DASTD+ SLHGSCPVIEGEKWSATKWIHV +F+KP K+
Sbjct: 213 HKGYAVKPQKGDALLFFSLHLDASTDTKSLHGSCPVIEGEKWSATKWIHVSDFEKPFKQV 272
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
++ +CVDE+ NC WAK GEC KNPLYMVG + RG C KSC VC
Sbjct: 273 DNGECVDENENCPRWAKVGECDKNPLYMVGGEGVRGSCMKSCNVC 317
>gi|449522594|ref|XP_004168311.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Cucumis
sativus]
Length = 313
Score = 362 bits (929), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 168/225 (74%), Positives = 193/225 (85%), Gaps = 2/225 (0%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADN+SGKS++SEVRTSSGMFL KAQDE+VA +EARIAAWT LP ENGE++QILHYE+G
Sbjct: 89 MVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENG 148
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV--SQSRDGNWSECA 118
QKYEPHFDFF DK+NQ+LGGHRIATVLMYLS+VEKGGET+FPNSE SQ++D +WS+C+
Sbjct: 149 QKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQAKDESWSDCS 208
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
R+GYAVK KGDALLFFSL+ DA+TD SLHGSCPVI GEKWSATKWIHVR+F+K
Sbjct: 209 RKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKITSRV 268
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
CVDE+ NC+ WAK GECKKNP YMVGS + GYCRKSCK C
Sbjct: 269 SRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC 313
>gi|449461905|ref|XP_004148682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 295
Score = 362 bits (929), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 169/228 (74%), Positives = 194/228 (85%), Gaps = 5/228 (2%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADN+SGKS++SEVRTSSGMFL KAQDE+VA +EARIAAWT LP ENGE++QILHYE+G
Sbjct: 68 MVADNDSGKSVSSEVRTSSGMFLRKAQDEVVAGVEARIAAWTLLPAENGESIQILHYENG 127
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV-----SQSRDGNWS 115
QKYEPHFDFF DK+NQ+LGGHRIATVLMYLS+VEKGGET+FPNSEV SQ++D +WS
Sbjct: 128 QKYEPHFDFFHDKVNQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSESQAKDESWS 187
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
+C+R+GYAVK KGDALLFFSL+ DA+TD SLHGSCPVI GEKWSATKWIHVR+F+K
Sbjct: 188 DCSRKGYAVKAQKGDALLFFSLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFEKIT 247
Query: 176 KEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
CVDE+ NC+ WAK GECKKNP YMVGS + GYCRKSCK C
Sbjct: 248 SRVSRQGCVDENENCLAWAKKGECKKNPTYMVGSGGALGYCRKSCKAC 295
>gi|255637501|gb|ACU19077.1| unknown [Glycine max]
Length = 318
Score = 360 bits (925), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 172/225 (76%), Positives = 192/225 (85%), Gaps = 2/225 (0%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKSI SEVRTSSGMFL+KAQDEIVA IEARIAAWTFLP ENGE+MQILHYE+G
Sbjct: 92 MVADNESGKSIMSEVRTSSGMFLNKAQDEIVAGIEARIAAWTFLPIENGESMQILHYENG 151
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
QKYEPHFD+F DK NQ +GGHRIATVLMYLS VEKGGET+F N++ + Q +D +WSECA
Sbjct: 152 QKYEPHFDYFHDKANQVMGGHRIATVLMYLSDVEKGGETIFSNAKAKLLQPKDESWSECA 211
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
+GYAVKP KGDALLFFSLH DASTD+ SLHGSCPVIEGEKWSATKWIHV +F KP K+
Sbjct: 212 HKGYAVKPRKGDALLFFSLHLDASTDNKSLHGSCPVIEGEKWSATKWIHVSDFQKPIKQV 271
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+ DCVDE+ NC WAK GEC+KNPLYMVG + +G C KSC VC
Sbjct: 272 DSGDCVDENENCPRWAKVGECEKNPLYMVGGEGVKGSCMKSCNVC 316
>gi|215490183|dbj|BAG86625.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
Length = 318
Score = 358 bits (920), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 168/225 (74%), Positives = 194/225 (86%), Gaps = 7/225 (3%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKS+ SEVRTSSGMF KAQD++VA++EARIAAWTFLP ENGE++QILHYEHG
Sbjct: 98 MVADNESGKSVESEVRTSSGMFFRKAQDQVVANVEARIAAWTFLPEENGESIQILHYEHG 157
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV--SQSRDGNWSECA 118
QKYEPHFD+F DK+NQ+LGGHR+ATVLMYLS VEKGGETVFPNSE +Q++ +WS+CA
Sbjct: 158 QKYEPHFDYFHDKVNQELGGHRVATVLMYLSDVEKGGETVFPNSEAKKTQAKGDDWSDCA 217
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
++GYAVKP KGDALLFFSLHPDA+TD SLHGSCPVIEGEKWSATKWIHVR+F E
Sbjct: 218 KKGYAVKPRKGDALLFFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSF-----ET 272
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
C D++ NC WA AGEC+KNPLYM+GS+ S G+CRKSCKVC
Sbjct: 273 TSSVCKDQNPNCPQWATAGECEKNPLYMMGSEDSVGHCRKSCKVC 317
>gi|242039723|ref|XP_002467256.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
gi|241921110|gb|EER94254.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
Length = 303
Score = 356 bits (914), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 167/226 (73%), Positives = 193/226 (85%), Gaps = 3/226 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKS+ SEVRTSSGMFL K QDE+V IE RIAAWTFLPPENGE++QILHY++G
Sbjct: 76 MVADNESGKSVQSEVRTSSGMFLEKKQDEVVRGIEERIAAWTFLPPENGESIQILHYQNG 135
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
+KYEPH+D+F DK NQ LGGHRIATVLMYLS+VEKGGET+FPN+E + Q +D WS+CA
Sbjct: 136 EKYEPHYDYFHDKNNQALGGHRIATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCA 195
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
R GYAVKP+KGDALLFFSLHPDA+TDS SLHGSCPVIEG+KWSATKWIHVR+FD P K+P
Sbjct: 196 RNGYAVKPVKGDALLFFSLHPDATTDSESLHGSCPVIEGQKWSATKWIHVRSFDLPVKQP 255
Query: 179 -EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D C D+++ C WA GEC KNP YMVG+K + G+CRKSCKVC
Sbjct: 256 GSSDGCEDDNVLCPQWAAVGECAKNPNYMVGTKEAPGFCRKSCKVC 301
>gi|293337056|ref|NP_001169835.1| uncharacterized protein LOC100383727 precursor [Zea mays]
gi|224031897|gb|ACN35024.1| unknown [Zea mays]
gi|347978800|gb|AEP37742.1| prolyl 4-hydroxylase 2 [Zea mays]
gi|414871435|tpg|DAA49992.1| TPA: hypothetical protein ZEAMMB73_500506 [Zea mays]
Length = 299
Score = 356 bits (913), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 165/226 (73%), Positives = 193/226 (85%), Gaps = 3/226 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKS+ SEVRTSSGMFL + QDE+V IE RI+AWTFLPPENGE++QILHY++G
Sbjct: 72 MVADNESGKSVQSEVRTSSGMFLERKQDEVVTRIEERISAWTFLPPENGESIQILHYQNG 131
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
+KYEPH+D+F DK NQ LGGHRIATVLMYLS+VEKGGET+FPN+E + Q +D WS+CA
Sbjct: 132 EKYEPHYDYFHDKKNQALGGHRIATVLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCA 191
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
R GYAVKP+KGDALLFFSLHPDA+TDS SLHGSCPVIEG+KWSATKWIHVR+FD P K+P
Sbjct: 192 RNGYAVKPVKGDALLFFSLHPDATTDSDSLHGSCPVIEGQKWSATKWIHVRSFDLPVKQP 251
Query: 179 -EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D C D+++ C WA GEC KNP YMVG+K + G+CRKSCKVC
Sbjct: 252 GSSDGCEDDNILCPQWAAVGECAKNPNYMVGTKEAPGFCRKSCKVC 297
>gi|357496283|ref|XP_003618430.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
gi|217073992|gb|ACJ85356.1| unknown [Medicago truncatula]
gi|355493445|gb|AES74648.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
gi|388494436|gb|AFK35284.1| unknown [Medicago truncatula]
Length = 313
Score = 352 bits (903), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 170/226 (75%), Positives = 193/226 (85%), Gaps = 4/226 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKSI SEVRTSSGMFL+K QDEIV+ IEARIAAWTFLP ENGE+MQ+LHY +G
Sbjct: 87 MVADNESGKSIQSEVRTSSGMFLNKQQDEIVSGIEARIAAWTFLPVENGESMQVLHYMNG 146
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
+KYEPHFDFF DK NQ+LGGHR+ATVLMYLS+VEKGGET+FP++E +SQ +D +WSECA
Sbjct: 147 EKYEPHFDFFHDKANQRLGGHRVATVLMYLSNVEKGGETIFPHAEGKLSQPKDESWSECA 206
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
+GYAVKP KGDALLFFSLH DA+TDS SLHGSCPVIEGEKWSATKWIHV +F+KP ++
Sbjct: 207 HKGYAVKPRKGDALLFFSLHLDATTDSKSLHGSCPVIEGEKWSATKWIHVADFEKPVRQA 266
Query: 179 -EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
ED C DE+ NC WAK GEC+KNPLYMVG K G C KSC VC
Sbjct: 267 LEDRVCADENENCARWAKVGECEKNPLYMVG-KGGNGKCMKSCNVC 311
>gi|195627276|gb|ACG35468.1| prolyl 4-hydroxylase [Zea mays]
Length = 298
Score = 351 bits (901), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 164/226 (72%), Positives = 192/226 (84%), Gaps = 3/226 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADN+SGKS+ SEVRTSSGMFL K QDE+V IE RI+AWTFLPPENGEA+QILHY++G
Sbjct: 71 MVADNKSGKSVQSEVRTSSGMFLEKKQDEVVTRIEERISAWTFLPPENGEAIQILHYQNG 130
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
+KYEPH+D+F DK NQ LGGHRIATVLMYLS+VEKGGET+FPN+E + Q +D WS+CA
Sbjct: 131 EKYEPHYDYFHDKNNQALGGHRIATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCA 190
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
R GYAVKP+KGDALLFFSLHPD++TDS SLHGSCPVIEG+KWSATKWIHVR+FD K+P
Sbjct: 191 RNGYAVKPVKGDALLFFSLHPDSTTDSDSLHGSCPVIEGQKWSATKWIHVRSFDLTVKQP 250
Query: 179 -EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D C D+++ C WA GEC KNP YMVG+K + G+CRKSCKVC
Sbjct: 251 GPSDGCEDDNVLCPQWAAVGECAKNPNYMVGTKEAPGFCRKSCKVC 296
>gi|294461211|gb|ADE76168.1| unknown [Picea sitchensis]
Length = 280
Score = 350 bits (897), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 168/225 (74%), Positives = 191/225 (84%), Gaps = 3/225 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKS+ SE+RTSSGMFL+KAQDEIVAS+E RIAAWTFLP ENGEAMQ+LHYE G
Sbjct: 57 MVADNESGKSVMSEIRTSSGMFLNKAQDEIVASVEDRIAAWTFLPIENGEAMQVLHYELG 116
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV--SQSRDGNWSECA 118
QKYEPHFD+F DK+NQ +GGHRIATVLMYLS V KGGETVFPN+E SQ +D +WSECA
Sbjct: 117 QKYEPHFDYFHDKINQAMGGHRIATVLMYLSDVVKGGETVFPNAETKDSQPKDDSWSECA 176
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
+ GY+VKP KGDALLFFSL PDA+TD +SLHGSCPVIEGEKWSATKWIHVR+F+ ++
Sbjct: 177 KGGYSVKPNKGDALLFFSLRPDATTDQSSLHGSCPVIEGEKWSATKWIHVRSFEVSNRKI 236
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+ CVDE+ +C WA GECKKNP YMVGS S G CRKSC+VC
Sbjct: 237 S-EGCVDENDSCTHWASIGECKKNPTYMVGSPDSPGACRKSCQVC 280
>gi|212720650|ref|NP_001132477.1| uncharacterized protein LOC100193935 precursor [Zea mays]
gi|194694488|gb|ACF81328.1| unknown [Zea mays]
gi|347978828|gb|AEP37756.1| prolyl 4-hydroxylase 7 [Zea mays]
gi|413934218|gb|AFW68769.1| prolyl 4-hydroxylase [Zea mays]
Length = 298
Score = 349 bits (896), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 163/226 (72%), Positives = 191/226 (84%), Gaps = 3/226 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADN+SGKS+ SEVRTSSGMFL K QDE+V IE RI+AWTFLPPENGEA+QILHY++G
Sbjct: 71 MVADNKSGKSVQSEVRTSSGMFLEKKQDEVVTRIEERISAWTFLPPENGEAIQILHYQNG 130
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
+KYEPH+D+F DK NQ LGGHRIATVLMYLS+VEKGGET+FPN+E + Q +D WS+CA
Sbjct: 131 EKYEPHYDYFHDKNNQALGGHRIATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCA 190
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
R GYAVKP+KGDALLFFSLHPD++TDS SLHGSCP IEG+KWSATKWIHVR+FD K+P
Sbjct: 191 RNGYAVKPVKGDALLFFSLHPDSTTDSDSLHGSCPAIEGQKWSATKWIHVRSFDLTVKQP 250
Query: 179 -EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D C D+++ C WA GEC KNP YMVG+K + G+CRKSCKVC
Sbjct: 251 GPSDGCEDDNVLCPQWAAVGECAKNPNYMVGTKEAPGFCRKSCKVC 296
>gi|297818456|ref|XP_002877111.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297322949|gb|EFH53370.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 316
Score = 349 bits (895), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 160/225 (71%), Positives = 194/225 (86%), Gaps = 4/225 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADN+SG+S+ SEVRTSSGMFLSK QD+IVA++EA++AAWTF+P ENGE+MQILHYE+G
Sbjct: 92 MVADNDSGESVESEVRTSSGMFLSKRQDDIVANVEAKLAAWTFIPEENGESMQILHYENG 151
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFP--NSEVSQSRDGNWSECA 118
QKYEPHFD+F D+ N +LGGHRIATVLMYLS+VEKGGETVFP + +Q +D +W+ECA
Sbjct: 152 QKYEPHFDYFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKTTQLKDDSWTECA 211
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
++GYAVKP KGDALLFF+LHP+A+TDS SLHGSCPV+EGEKWSAT+WIHVR+FD+
Sbjct: 212 KQGYAVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVRSFDRA--FS 269
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+ CVDE+++C WAKAGEC+KNP YMVGS GYCRKSC VC
Sbjct: 270 KQSGCVDENVSCEKWAKAGECQKNPTYMVGSDKDHGYCRKSCNVC 314
>gi|357140446|ref|XP_003571778.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 298
Score = 348 bits (892), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 163/226 (72%), Positives = 190/226 (84%), Gaps = 3/226 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKS+ SEVRTSSGMFL K QDE+VA IE RIAAWTFLP ENGE++QILHY++G
Sbjct: 71 MVADNESGKSVQSEVRTSSGMFLEKRQDEVVARIEERIAAWTFLPSENGESIQILHYKNG 130
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
+KYEPH+D+F DK NQ LGGHRIATVLMYLS+VEKGGET+FPN+E ++Q +D SECA
Sbjct: 131 EKYEPHYDYFHDKNNQALGGHRIATVLMYLSNVEKGGETIFPNAEGKLTQHKDETASECA 190
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE- 177
+ GYAVKPMKGDALLFFSLHPDA+TD SLHGSCPVIEG+KWSATKWIHVR+F+ P K+
Sbjct: 191 KNGYAVKPMKGDALLFFSLHPDATTDPDSLHGSCPVIEGQKWSATKWIHVRSFENPGKQG 250
Query: 178 PEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D C DE++ C WA GEC KNP YMVG+K + G+CRKSC +C
Sbjct: 251 ASGDGCEDENVLCAQWAAVGECAKNPNYMVGTKEAPGFCRKSCNLC 296
>gi|9294583|dbj|BAB02864.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
Length = 332
Score = 345 bits (886), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 157/225 (69%), Positives = 194/225 (86%), Gaps = 4/225 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADN+SG+S+ SEVRTSSGMFLSK QD+IV+++EA++AAWTFLP ENGE+MQILHYE+G
Sbjct: 108 MVADNDSGESVESEVRTSSGMFLSKRQDDIVSNVEAKLAAWTFLPEENGESMQILHYENG 167
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFP--NSEVSQSRDGNWSECA 118
QKYEPHFD+F D+ N +LGGHRIATVLMYLS+VEKGGETVFP + +Q +D +W+ECA
Sbjct: 168 QKYEPHFDYFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECA 227
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
++GYAVKP KGDALLFF+LHP+A+TDS SLHGSCPV+EGEKWSAT+WIHV++F++
Sbjct: 228 KQGYAVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKSFERAFN-- 285
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+ C+DE+++C WAKAGEC+KNP YMVGS GYCRKSCK C
Sbjct: 286 KQSGCMDENVSCEKWAKAGECQKNPTYMVGSDKDHGYCRKSCKAC 330
>gi|18405808|ref|NP_566838.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
gi|21617881|gb|AAM66931.1| prolyl 4-hydroxylase, putative [Arabidopsis thaliana]
gi|332643929|gb|AEE77450.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 316
Score = 344 bits (883), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 157/225 (69%), Positives = 194/225 (86%), Gaps = 4/225 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADN+SG+S+ SEVRTSSGMFLSK QD+IV+++EA++AAWTFLP ENGE+MQILHYE+G
Sbjct: 92 MVADNDSGESVESEVRTSSGMFLSKRQDDIVSNVEAKLAAWTFLPEENGESMQILHYENG 151
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFP--NSEVSQSRDGNWSECA 118
QKYEPHFD+F D+ N +LGGHRIATVLMYLS+VEKGGETVFP + +Q +D +W+ECA
Sbjct: 152 QKYEPHFDYFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECA 211
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
++GYAVKP KGDALLFF+LHP+A+TDS SLHGSCPV+EGEKWSAT+WIHV++F++
Sbjct: 212 KQGYAVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKSFERAFN-- 269
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+ C+DE+++C WAKAGEC+KNP YMVGS GYCRKSCK C
Sbjct: 270 KQSGCMDENVSCEKWAKAGECQKNPTYMVGSDKDHGYCRKSCKAC 314
>gi|110289076|gb|ABB47602.2| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
Length = 309
Score = 343 bits (881), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 162/227 (71%), Positives = 187/227 (82%), Gaps = 4/227 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKS+ SEVRTSSGMFL K QDE+VA IE RIAAWTFLPP+NGE++QILHY++G
Sbjct: 81 MVADNESGKSVMSEVRTSSGMFLEKKQDEVVARIEERIAAWTFLPPDNGESIQILHYQNG 140
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS---QSRDGNWSEC 117
+KYEPH+D+F DK NQ LGGHRIATVLMYLS V KGGET+FP +EV Q +D WS+C
Sbjct: 141 EKYEPHYDYFHDKNNQALGGHRIATVLMYLSDVGKGGETIFPEAEVGKLLQPKDDTWSDC 200
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE 177
A+ GYAVKP+KGDALLFFSLHPDA+TDS SLHGSCPVIEG+KWSATKWIHVR+FD K+
Sbjct: 201 AKNGYAVKPVKGDALLFFSLHPDATTDSDSLHGSCPVIEGQKWSATKWIHVRSFDISVKQ 260
Query: 178 -PEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D C DE++ C WA GEC KNP YMVG+ + G+CRKSC VC
Sbjct: 261 GASTDGCEDENVLCPQWAAVGECAKNPNYMVGTNEAPGFCRKSCNVC 307
>gi|18086437|gb|AAL57673.1| AT3g28480/MFJ20_16 [Arabidopsis thaliana]
gi|24796986|gb|AAN64505.1| At3g28480/MFJ20_16 [Arabidopsis thaliana]
Length = 316
Score = 343 bits (880), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 157/225 (69%), Positives = 193/225 (85%), Gaps = 4/225 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADN+SG+S+ SEVRTSSGMFLSK QD+IV ++EA++AAWTFLP ENGE+MQILHYE+G
Sbjct: 92 MVADNDSGESVESEVRTSSGMFLSKRQDDIVNNVEAKLAAWTFLPEENGESMQILHYENG 151
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFP--NSEVSQSRDGNWSECA 118
QKYEPHFD+F D+ N +LGGHRIATVLMYLS+VEKGGETVFP + +Q +D +W+ECA
Sbjct: 152 QKYEPHFDYFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECA 211
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
++GYAVKP KGDALLFF+LHP+A+TDS SLHGSCPV+EGEKWSAT+WIHV++F++
Sbjct: 212 KQGYAVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKSFERAFN-- 269
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+ C+DE+++C WAKAGEC+KNP YMVGS GYCRKSCK C
Sbjct: 270 KQSGCMDENVSCEKWAKAGECQKNPTYMVGSDKDHGYCRKSCKAC 314
>gi|218184507|gb|EEC66934.1| hypothetical protein OsI_33548 [Oryza sativa Indica Group]
Length = 308
Score = 342 bits (878), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 161/226 (71%), Positives = 187/226 (82%), Gaps = 3/226 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKS+ SEVRTSSGMFL K QDE+VA IE RIAAWTFLPP+NGE++QILHY++G
Sbjct: 81 MVADNESGKSVMSEVRTSSGMFLEKKQDEVVARIEERIAAWTFLPPDNGESIQILHYQNG 140
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
+KYEPH+D+F DK NQ LGGHRIATVLMYLS V KGGET+FP +E + Q +D WS+CA
Sbjct: 141 EKYEPHYDYFHDKNNQALGGHRIATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCA 200
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE- 177
+ GYAVKP+KGDALLFFSLHPDA+TDS SLHGSCPVIEG+KWSATKWIHVR+FD K+
Sbjct: 201 KNGYAVKPVKGDALLFFSLHPDATTDSDSLHGSCPVIEGQKWSATKWIHVRSFDISVKQG 260
Query: 178 PEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D C DE++ C WA GEC KNP YMVG+ + G+CRKSC VC
Sbjct: 261 ASTDGCEDENVLCPQWAAVGECAKNPNYMVGTNEAPGFCRKSCNVC 306
>gi|115481998|ref|NP_001064592.1| Os10g0413500 [Oryza sativa Japonica Group]
gi|110289075|gb|ABG66075.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|113639201|dbj|BAF26506.1| Os10g0413500 [Oryza sativa Japonica Group]
gi|215692577|dbj|BAG87997.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222612821|gb|EEE50953.1| hypothetical protein OsJ_31503 [Oryza sativa Japonica Group]
Length = 308
Score = 342 bits (878), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 161/226 (71%), Positives = 187/226 (82%), Gaps = 3/226 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKS+ SEVRTSSGMFL K QDE+VA IE RIAAWTFLPP+NGE++QILHY++G
Sbjct: 81 MVADNESGKSVMSEVRTSSGMFLEKKQDEVVARIEERIAAWTFLPPDNGESIQILHYQNG 140
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
+KYEPH+D+F DK NQ LGGHRIATVLMYLS V KGGET+FP +E + Q +D WS+CA
Sbjct: 141 EKYEPHYDYFHDKNNQALGGHRIATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCA 200
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE- 177
+ GYAVKP+KGDALLFFSLHPDA+TDS SLHGSCPVIEG+KWSATKWIHVR+FD K+
Sbjct: 201 KNGYAVKPVKGDALLFFSLHPDATTDSDSLHGSCPVIEGQKWSATKWIHVRSFDISVKQG 260
Query: 178 PEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D C DE++ C WA GEC KNP YMVG+ + G+CRKSC VC
Sbjct: 261 ASTDGCEDENVLCPQWAAVGECAKNPNYMVGTNEAPGFCRKSCNVC 306
>gi|297727581|ref|NP_001176154.1| Os10g0415128 [Oryza sativa Japonica Group]
gi|255679404|dbj|BAH94882.1| Os10g0415128 [Oryza sativa Japonica Group]
Length = 241
Score = 341 bits (875), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 161/226 (71%), Positives = 187/226 (82%), Gaps = 3/226 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKS+ SEVRTSSGMFL K QDE+VA IE RIAAWTFLPP+NGE++QILHY++G
Sbjct: 1 MVADNESGKSVMSEVRTSSGMFLEKKQDEVVARIEERIAAWTFLPPDNGESIQILHYQNG 60
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
+KYEPH+D+F DK NQ LGGHRIATVLMYLS V KGGET+FP +E + Q +D WS+CA
Sbjct: 61 EKYEPHYDYFHDKNNQALGGHRIATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCA 120
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE- 177
+ GYAVKP+KGDALLFFSLHPDA+TDS SLHGSCPVIEG+KWSATKWIHVR+FD K+
Sbjct: 121 KNGYAVKPVKGDALLFFSLHPDATTDSDSLHGSCPVIEGQKWSATKWIHVRSFDISVKQG 180
Query: 178 PEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D C DE++ C WA GEC KNP YMVG+ + G+CRKSC VC
Sbjct: 181 ASTDGCEDENVLCPQWAAVGECAKNPNYMVGTNEAPGFCRKSCNVC 226
>gi|148537204|dbj|BAF63493.1| prolyl 4-hydroxylase [Potamogeton distinctus]
Length = 246
Score = 335 bits (860), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 158/225 (70%), Positives = 184/225 (81%), Gaps = 4/225 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKS+ SE+RTSSGMFL + QDE + IE RIAAWTFLP ENGE +QILHYE G
Sbjct: 24 MVADNESGKSVMSEIRTSSGMFLERRQDETITRIEKRIAAWTFLPEENGEPIQILHYEKG 83
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
QKY+ H+D+F DK NQ++GGHR+ATVLMYLS V+KGGETVFP++E + Q +D WS+CA
Sbjct: 84 QKYDAHYDYFHDKNNQRVGGHRMATVLMYLSDVKKGGETVFPDAEGKLLQVKDDTWSDCA 143
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
R GYAVKP KGDALLFFS HP+A+TD SLH SCPVIEGEKWSAT+WIHVR+F K KE
Sbjct: 144 RSGYAVKPRKGDALLFFSCHPNATTDPNSLHASCPVIEGEKWSATRWIHVRSFAK--KER 201
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D+CVDE+ NC WA GEC+KN LYMVG+ + GYCRKSCKVC
Sbjct: 202 NKDECVDEEDNCSFWASNGECEKNVLYMVGNNETLGYCRKSCKVC 246
>gi|297818458|ref|XP_002877112.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297322950|gb|EFH53371.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 289
Score = 332 bits (850), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 154/225 (68%), Positives = 189/225 (84%), Gaps = 6/225 (2%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+VAD+ SG+SI SE RTSSG+FL+K QD+IVA++EA++A WTFLP ENGEA+QILHYE+G
Sbjct: 69 VVADDNSGESIDSEERTSSGVFLTKRQDDIVANVEAKLATWTFLPEENGEALQILHYENG 128
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFP--NSEVSQSRDGNWSECA 118
QKY+PHFD++ DK +LGGHRIATVLMYLS+V KGGETVFP + Q +D WSECA
Sbjct: 129 QKYDPHFDYYYDKETLKLGGHRIATVLMYLSNVTKGGETVFPMWKGKTPQLKDDTWSECA 188
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
++GYAVKP KGDALLFF+LHP+A+TD TSLHGSCPVIEGEKWSAT+WIHVR+F K +
Sbjct: 189 KQGYAVKPRKGDALLFFNLHPNATTDPTSLHGSCPVIEGEKWSATRWIHVRSFGKKQS-- 246
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D CVD+ +C +WAKAGEC+KNP+YM+GS++ GYCRKSCK C
Sbjct: 247 --DGCVDDHESCEIWAKAGECEKNPMYMMGSETDLGYCRKSCKAC 289
>gi|302791635|ref|XP_002977584.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
gi|300154954|gb|EFJ21588.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
Length = 296
Score = 329 bits (843), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 156/223 (69%), Positives = 179/223 (80%), Gaps = 4/223 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKS+ S +RTSSGMFLSK QDE++ IE RIAAWTFLP ENGEA+Q+L YE G
Sbjct: 78 MVADNESGKSVLSNIRTSSGMFLSKGQDEVINRIEERIAAWTFLPKENGEAIQVLRYEFG 137
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
+KYEPH+D+F DK NQ LGGHRIATVLMYLS V KGGETVFP+SE + +D +WS+CA++
Sbjct: 138 EKYEPHYDYFHDKYNQALGGHRIATVLMYLSDVVKGGETVFPSSEDTTVKDDSWSDCAKK 197
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPED 180
G AVKP KGDALLF+SLHPDA+ D +SLHG CPVIEGEKWSATKWIHV F KP+KE
Sbjct: 198 GIAVKPRKGDALLFYSLHPDATPDESSLHGGCPVIEGEKWSATKWIHVLPFGKPKKE--- 254
Query: 181 DDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
C DE+ C WA GEC KNP YMVG++ G CRKSCKVC
Sbjct: 255 -GCADENEKCGEWAAYGECDKNPSYMVGTQEWPGACRKSCKVC 296
>gi|302786814|ref|XP_002975178.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
gi|300157337|gb|EFJ23963.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
Length = 283
Score = 323 bits (829), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 155/224 (69%), Positives = 178/224 (79%), Gaps = 5/224 (2%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKS+ S +RTSSGMFLSK QDE++ IE RIAAWTFLP ENGEA+Q+L YE G
Sbjct: 64 MVADNESGKSVLSNIRTSSGMFLSKGQDEVINRIEERIAAWTFLPKENGEAIQVLRYEFG 123
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQS-RDGNWSECAR 119
+KYEPH+D+F DK NQ LGGHRIATVLMYLS KGGETVFP+SE + +D +WS+CA+
Sbjct: 124 EKYEPHYDYFHDKYNQALGGHRIATVLMYLSDAVKGGETVFPSSEEDTTVKDDSWSDCAK 183
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPE 179
+G AVKP KGDALLF+SLHPDA+ D +SLHG CPVIEGEKWSATKWIHV F KP+KE
Sbjct: 184 KGIAVKPRKGDALLFYSLHPDATPDESSLHGGCPVIEGEKWSATKWIHVLPFGKPKKE-- 241
Query: 180 DDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
C DE+ C WA GEC KNP YMVG++ G CRKSCKVC
Sbjct: 242 --GCADENEKCGEWAAYGECDKNPSYMVGTQEWPGACRKSCKVC 283
>gi|218193936|gb|EEC76363.1| hypothetical protein OsI_13952 [Oryza sativa Indica Group]
Length = 1062
Score = 323 bits (829), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 154/228 (67%), Positives = 178/228 (78%), Gaps = 2/228 (0%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADN+SGKSI S+VRTSSG FLSK +D+IV+ IE R+AAWTFLP EN E++QILHYE G
Sbjct: 73 MVADNDSGKSIMSQVRTSSGTFLSKHEDDIVSGIEKRVAAWTFLPEENAESIQILHYELG 132
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS--QSRDGNWSECA 118
QKY+ HFD+F DK N + GGHR+ATVLMYL+ V+KGGETVFPN+ Q +D WS+CA
Sbjct: 133 QKYDAHFDYFHDKNNLKRGGHRVATVLMYLTDVKKGGETVFPNAAGRHLQLKDETWSDCA 192
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
R G AVKP KGDALLFFSLH +A+TD SLHGSCPVIEGEKWSATKWIHVR+FD P
Sbjct: 193 RSGLAVKPKKGDALLFFSLHVNATTDPASLHGSCPVIEGEKWSATKWIHVRSFDNPPDVS 252
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCKPS 226
D C DE+ C WA GEC +NP YMVG+K S G+CRKSC VC S
Sbjct: 253 LDLPCSDENERCTRWAAVGECYRNPKYMVGTKDSLGFCRKSCGVCSRS 300
>gi|357125236|ref|XP_003564301.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 293
Score = 323 bits (828), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 150/225 (66%), Positives = 181/225 (80%), Gaps = 2/225 (0%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADN+SGKS+ S+VRTSSG FL+K +DEI++ IE R+AAWTFLP EN E++Q+LHYE G
Sbjct: 67 MVADNDSGKSVMSQVRTSSGTFLNKHEDEIISGIEKRVAAWTFLPEENAESIQVLHYEVG 126
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS--QSRDGNWSECA 118
QKY+ HFD+F DK NQ+LGGHR+ATVLMYL+ V+KGGETVFPN+E Q +D WSECA
Sbjct: 127 QKYDAHFDYFHDKNNQKLGGHRVATVLMYLTDVKKGGETVFPNAEGRHLQHKDETWSECA 186
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
R G AVKP KGDALLFFSLH +A+TD +SLHGSCPVIEGEKWSATKWIHVR+FD P
Sbjct: 187 RSGLAVKPRKGDALLFFSLHINATTDPSSLHGSCPVIEGEKWSATKWIHVRSFDNPPIVR 246
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D C D++ C WA GEC +NP YM+G+K + G+CRKSC +C
Sbjct: 247 MDVRCSDDNELCSKWAAVGECYRNPKYMIGTKDTLGFCRKSCGIC 291
>gi|30689216|ref|NP_189490.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
gi|332643931|gb|AEE77452.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
Length = 288
Score = 321 bits (823), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 153/225 (68%), Positives = 187/225 (83%), Gaps = 7/225 (3%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+VAD +SG+S SEVRTSSGMFL+K QD+IVA++EA++AAWTFLP ENGEA+QILHYE+G
Sbjct: 69 VVADVDSGESEDSEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQILHYENG 128
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPN--SEVSQSRDGNWSECA 118
QKY+PHFD+F DK +LGGHRIATVLMYLS+V KGGETVFPN + Q +D +WS+CA
Sbjct: 129 QKYDPHFDYFYDKKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCA 188
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
++GYAVKP KGDALLFF+LH + +TD SLHGSCPVIEGEKWSAT+WIHVR+F K +
Sbjct: 189 KQGYAVKPRKGDALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIHVRSFGKKKLV- 247
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
CVD+ +C WA AGEC+KNP+YMVGS++S G+CRKSCK C
Sbjct: 248 ----CVDDHESCQEWADAGECEKNPMYMVGSETSLGFCRKSCKAC 288
>gi|28393447|gb|AAO42145.1| putative prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 253
Score = 321 bits (823), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 153/225 (68%), Positives = 187/225 (83%), Gaps = 7/225 (3%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+VAD +SG+S SEVRTSSGMFL+K QD+IVA++EA++AAWTFLP ENGEA+QILHYE+G
Sbjct: 34 VVADVDSGESEDSEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQILHYENG 93
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPN--SEVSQSRDGNWSECA 118
QKY+PHFD+F DK +LGGHRIATVLMYLS+V KGGETVFPN + Q +D +WS+CA
Sbjct: 94 QKYDPHFDYFYDKKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCA 153
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
++GYAVKP KGDALLFF+LH + +TD SLHGSCPVIEGEKWSAT+WIHVR+F K +
Sbjct: 154 KQGYAVKPRKGDALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIHVRSFGKKKLV- 212
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
CVD+ +C WA AGEC+KNP+YMVGS++S G+CRKSCK C
Sbjct: 213 ----CVDDHESCQEWADAGECEKNPMYMVGSETSLGFCRKSCKAC 253
>gi|115456019|ref|NP_001051610.1| Os03g0803500 [Oryza sativa Japonica Group]
gi|29150365|gb|AAO72374.1| putative oxidoreductase [Oryza sativa Japonica Group]
gi|108711618|gb|ABF99413.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative,
expressed [Oryza sativa Japonica Group]
gi|113550081|dbj|BAF13524.1| Os03g0803500 [Oryza sativa Japonica Group]
gi|215765410|dbj|BAG87107.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222625993|gb|EEE60125.1| hypothetical protein OsJ_13003 [Oryza sativa Japonica Group]
Length = 299
Score = 320 bits (819), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 153/225 (68%), Positives = 177/225 (78%), Gaps = 2/225 (0%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADN+SGKSI S+VRTSSG FLSK +D+IV+ IE R+AAWTFLP EN E++QILHYE G
Sbjct: 73 MVADNDSGKSIMSQVRTSSGTFLSKHEDDIVSGIEKRVAAWTFLPEENAESIQILHYELG 132
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS--QSRDGNWSECA 118
QKY+ HFD+F DK N + GGHR+ATVLMYL+ V+KGGETVFPN+ Q +D WS+CA
Sbjct: 133 QKYDAHFDYFHDKNNLKRGGHRVATVLMYLTDVKKGGETVFPNAAGRHLQLKDETWSDCA 192
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
R G AVKP KGDALLFFSLH +A+TD SLHGSCPVIEGEKWSATKWIHVR+FD P
Sbjct: 193 RSGLAVKPKKGDALLFFSLHVNATTDPASLHGSCPVIEGEKWSATKWIHVRSFDNPPDVS 252
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D C DE+ C WA GEC +NP YMVG+K S G+CRKSC VC
Sbjct: 253 LDLPCSDENERCTRWAAVGECYRNPKYMVGTKDSLGFCRKSCGVC 297
>gi|242047772|ref|XP_002461632.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
gi|241925009|gb|EER98153.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
Length = 307
Score = 318 bits (816), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 153/226 (67%), Positives = 179/226 (79%), Gaps = 3/226 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADN+SGKS+ SEVRTSSGMFL+K QD +V+ IE RIAAWTFLP EN E MQIL YEHG
Sbjct: 80 MVADNQSGKSVMSEVRTSSGMFLNKRQDPVVSRIEERIAAWTFLPQENAENMQILRYEHG 139
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
QKYEPHFD+F DK+NQ GGHR ATVLMYLS V+KGGETVFPN++ SQ +D +SECA
Sbjct: 140 QKYEPHFDYFHDKINQVRGGHRYATVLMYLSTVDKGGETVFPNAKGWESQPKDDTFSECA 199
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
+G AVKP+KGDA+LFFSLH D D SLHGSCPVI+GEKWSA KWIHVR+++ P P
Sbjct: 200 HQGLAVKPVKGDAVLFFSLHVDGVPDPLSLHGSCPVIQGEKWSAPKWIHVRSYENPPVVP 259
Query: 179 EDD-DCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+D C D+ +C WA AGEC KNP+YMVG++ + G CRKSC VC
Sbjct: 260 KDTRGCADKSEHCAEWAAAGECGKNPVYMVGAEGAPGQCRKSCNVC 305
>gi|363543301|ref|NP_001241866.1| prolyl 4-hydroxylase 6 precursor [Zea mays]
gi|195624808|gb|ACG34234.1| oxidoreductase [Zea mays]
gi|347978818|gb|AEP37751.1| prolyl 4-hydroxylase 6 [Zea mays]
Length = 297
Score = 317 bits (811), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 146/225 (64%), Positives = 178/225 (79%), Gaps = 2/225 (0%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADN+SGKS+AS+ RTSSG FL+K +DEIV++IE R+AAWTFLP EN E++Q+L YE G
Sbjct: 71 MVADNDSGKSVASQARTSSGTFLAKREDEIVSAIEKRVAAWTFLPEENAESLQVLRYETG 130
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS--QSRDGNWSECA 118
QKY+ HFD+F D+ N +LGG R+ATVLMYL+ V+KGGETVFPN+E S Q +D WSEC+
Sbjct: 131 QKYDAHFDYFHDRNNLKLGGQRVATVLMYLTDVKKGGETVFPNAEGSHLQYKDETWSECS 190
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
R G AVKP KGDALLFF+LH +A+ D+ SLHGSCPVIEGEKWSATKWIHVR+FD P
Sbjct: 191 RSGLAVKPKKGDALLFFNLHVNATADTGSLHGSCPVIEGEKWSATKWIHVRSFDNPPDVR 250
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D C D+ C WA GEC +NP YMVG+K + G+CRKSC +C
Sbjct: 251 TDAPCSDDKELCPRWAAIGECHRNPTYMVGTKDTLGFCRKSCGIC 295
>gi|55741040|gb|AAV64184.1| unknown [Zea mays]
Length = 394
Score = 316 bits (810), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 144/201 (71%), Positives = 170/201 (84%), Gaps = 3/201 (1%)
Query: 26 AQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIAT 85
QDE+V IE RI+AWTFLPPENGE++QILHY++G+KYEPH+D+F DK NQ LGGHRIAT
Sbjct: 192 TQDEVVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRIAT 251
Query: 86 VLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDAST 143
VLMYLS+VEKGGET+FPN+E + Q +D WS+CAR GYAVKP+KGDALLFFSLHPDA+T
Sbjct: 252 VLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCARNGYAVKPVKGDALLFFSLHPDATT 311
Query: 144 DSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP-EDDDCVDEDLNCVVWAKAGECKKN 202
DS SLHGSCPVIEG+KWSATKWIHVR+FD P K+P D C D+++ C WA GEC KN
Sbjct: 312 DSDSLHGSCPVIEGQKWSATKWIHVRSFDLPVKQPGSSDGCEDDNILCPQWAAVGECAKN 371
Query: 203 PLYMVGSKSSRGYCRKSCKVC 223
P YMVG+K + G+CRKSCKVC
Sbjct: 372 PNYMVGTKEAPGFCRKSCKVC 392
>gi|413932756|gb|AFW67307.1| oxidoreductase [Zea mays]
Length = 297
Score = 316 bits (809), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 146/225 (64%), Positives = 177/225 (78%), Gaps = 2/225 (0%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADN+SGKS+AS+ RTSSG FL+K +DEIV++IE R+AAWTFLP EN E++Q+L YE G
Sbjct: 71 MVADNDSGKSVASQARTSSGTFLAKREDEIVSAIEKRVAAWTFLPEENAESLQVLRYETG 130
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS--QSRDGNWSECA 118
QKY+ HFD+F D+ N +LGG R+ATVLMYL+ V KGGETVFPN+E S Q +D WSEC+
Sbjct: 131 QKYDAHFDYFHDRNNLKLGGQRVATVLMYLTDVNKGGETVFPNAEGSHLQYKDETWSECS 190
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
R G AVKP KGDALLFF+LH +A+ D+ SLHGSCPVIEGEKWSATKWIHVR+FD P
Sbjct: 191 RSGLAVKPKKGDALLFFNLHVNATADTGSLHGSCPVIEGEKWSATKWIHVRSFDNPPDVR 250
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D C D+ C WA GEC +NP YMVG+K + G+CRKSC +C
Sbjct: 251 TDAPCSDDKELCPRWAAIGECHRNPTYMVGTKDTLGFCRKSCGIC 295
>gi|168046048|ref|XP_001775487.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673157|gb|EDQ59684.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 263
Score = 315 bits (808), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 153/224 (68%), Positives = 176/224 (78%), Gaps = 3/224 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKS+ SE+RTSSGMFL K QD+I++ IE RIAAWTFLP ENGEA+Q+L Y+ G
Sbjct: 42 MVADNESGKSVKSEIRTSSGMFLMKGQDDIISRIEDRIAAWTFLPKENGEAIQVLRYQDG 101
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE-VSQSRDGNWSECAR 119
+KYEPHFD+F DK NQ LGGHRIATVLMYLS V KGGETVFP+SE +D +WS C +
Sbjct: 102 EKYEPHFDYFHDKNNQALGGHRIATVLMYLSDVVKGGETVFPSSEDRGGPKDDSWSACGK 161
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPE 179
G AVKP KGDALLFFSLHP A D +SLH CPVIEGEKWSATKWIHV F+KP P+
Sbjct: 162 TGVAVKPRKGDALLFFSLHPSAVPDESSLHTGCPVIEGEKWSATKWIHVAAFEKP--RPK 219
Query: 180 DDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+ CV+E +C WA GEC+KNP YMVG+K GYCRK+C VC
Sbjct: 220 NGACVNEVDSCEEWAAYGECQKNPAYMVGTKEWPGYCRKACHVC 263
>gi|115471029|ref|NP_001059113.1| Os07g0194500 [Oryza sativa Japonica Group]
gi|113610649|dbj|BAF21027.1| Os07g0194500 [Oryza sativa Japonica Group]
gi|215768445|dbj|BAH00674.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 319
Score = 315 bits (808), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 150/225 (66%), Positives = 175/225 (77%), Gaps = 2/225 (0%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADN+SGKS+ SEVRTSSGMFL K QD +V+ IE RIAAWTFLP EN E +QIL YEHG
Sbjct: 93 MVADNKSGKSVMSEVRTSSGMFLDKRQDPVVSRIEKRIAAWTFLPEENAENIQILRYEHG 152
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
QKYEPHFD+F DK+NQ LGGHR ATVLMYLS VEKGGETVFPN+E +Q +D +SECA
Sbjct: 153 QKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSECA 212
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
++G AVKP+KGD +LFFSLH D D SLHGSCPVIEGEKWSA KWI +R+++ P
Sbjct: 213 QKGLAVKPVKGDTVLFFSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIRIRSYEHPPVSK 272
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+ C D C WA+AGEC+KNP+YMVG++ G CRKSC VC
Sbjct: 273 VTEGCSDNSARCAKWAEAGECEKNPVYMVGAEGLPGNCRKSCGVC 317
>gi|242032633|ref|XP_002463711.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
gi|241917565|gb|EER90709.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
Length = 297
Score = 315 bits (806), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 149/225 (66%), Positives = 178/225 (79%), Gaps = 2/225 (0%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADN+SGKS+ S+VRTSSG FL+K +DEIV++IE R+AAWTFLP EN E+MQ+L YE G
Sbjct: 71 MVADNDSGKSLMSQVRTSSGAFLAKHEDEIVSAIEKRVAAWTFLPEENAESMQVLRYEIG 130
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS--QSRDGNWSECA 118
QKY+ HFD+F DK N + GG R ATVLMYL+ V+KGGETVFPN+E S Q +D WSEC+
Sbjct: 131 QKYDAHFDYFHDKNNVKHGGQRFATVLMYLTDVKKGGETVFPNAEGSHLQYKDETWSECS 190
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
R G AVKP KGDALLFF LH +A+TD++SLHGSCPVIEGEKWSATKWIHVR+FD P
Sbjct: 191 RSGLAVKPKKGDALLFFGLHLNATTDTSSLHGSCPVIEGEKWSATKWIHVRSFDNPPNVR 250
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D C D++ C WA GEC KNP YMVG+K + G+CRKSC +C
Sbjct: 251 MDAPCSDDNELCPKWAAIGECYKNPTYMVGTKDTNGFCRKSCGLC 295
>gi|34393269|dbj|BAC83179.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
sativa Japonica Group]
gi|50509101|dbj|BAD30161.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
sativa Japonica Group]
Length = 313
Score = 315 bits (806), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 150/225 (66%), Positives = 175/225 (77%), Gaps = 2/225 (0%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADN+SGKS+ SEVRTSSGMFL K QD +V+ IE RIAAWTFLP EN E +QIL YEHG
Sbjct: 87 MVADNKSGKSVMSEVRTSSGMFLDKRQDPVVSRIEKRIAAWTFLPEENAENIQILRYEHG 146
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
QKYEPHFD+F DK+NQ LGGHR ATVLMYLS VEKGGETVFPN+E +Q +D +SECA
Sbjct: 147 QKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSECA 206
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
++G AVKP+KGD +LFFSLH D D SLHGSCPVIEGEKWSA KWI +R+++ P
Sbjct: 207 QKGLAVKPVKGDTVLFFSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIRIRSYEHPPVSK 266
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+ C D C WA+AGEC+KNP+YMVG++ G CRKSC VC
Sbjct: 267 VTEGCSDNSARCAKWAEAGECEKNPVYMVGAEGLPGNCRKSCGVC 311
>gi|334185677|ref|NP_001189994.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
gi|332643930|gb|AEE77451.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 324
Score = 311 bits (797), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 148/233 (63%), Positives = 187/233 (80%), Gaps = 12/233 (5%)
Query: 1 MVADNESGKSIASE----VRTSSGMFLSKAQ----DEIVASIEARIAAWTFLPPENGEAM 52
MVADN+SG+S+ SE V S F++ D+IV+++EA++AAWTFLP ENGE+M
Sbjct: 92 MVADNDSGESVESEDSVSVVRQSSSFIANMDSLEIDDIVSNVEAKLAAWTFLPEENGESM 151
Query: 53 QILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFP--NSEVSQSR 110
QILHYE+GQKYEPHFD+F D+ N +LGGHRIATVLMYLS+VEKGGETVFP + +Q +
Sbjct: 152 QILHYENGQKYEPHFDYFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLK 211
Query: 111 DGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
D +W+ECA++GYAVKP KGDALLFF+LHP+A+TDS SLHGSCPV+EGEKWSAT+WIHV++
Sbjct: 212 DDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKS 271
Query: 171 FDKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
F++ + C+DE+++C WAKAGEC+KNP YMVGS GYCRKSCK C
Sbjct: 272 FERAFN--KQSGCMDENVSCEKWAKAGECQKNPTYMVGSDKDHGYCRKSCKAC 322
>gi|449459442|ref|XP_004147455.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449515722|ref|XP_004164897.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 319
Score = 309 bits (791), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 146/222 (65%), Positives = 178/222 (80%), Gaps = 5/222 (2%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
+G+S+ S+ RTS+GMFL KAQDEIVA IE+RIAAWTFLP +NGE +QIL YE+GQKYEPH
Sbjct: 101 TGQSVTSKERTSTGMFLHKAQDEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPH 160
Query: 67 FDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV--SQSRDGNWSECARRGYAV 124
FDFF+D N +GGHRIAT+LMYLS+VEKGGETVFPNS V S+ + SEC + GY V
Sbjct: 161 FDFFQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPVKLSEEEKADLSECGKVGYGV 220
Query: 125 KPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDDCV 184
+P GDALLFFS++P+ + D+TS HGSCPVIEGEKWSATKWIH+ D+ + P CV
Sbjct: 221 RPKLGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKWSATKWIHMLPIDEFWRNPA---CV 277
Query: 185 DEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCKPS 226
DE+ +C WAKAGEC+KNP+YM+GSK+ G+CR SCKVC PS
Sbjct: 278 DENDHCTAWAKAGECEKNPVYMMGSKNELGFCRFSCKVCSPS 319
>gi|326501992|dbj|BAK06488.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 306
Score = 308 bits (790), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 147/232 (63%), Positives = 175/232 (75%), Gaps = 4/232 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MV D ++GKS+ SEVRTSSG FL+K QD++VA+IEARIAAWT LP ENGE++Q+L YE+G
Sbjct: 75 MVVDRQTGKSVMSEVRTSSGTFLAKKQDQVVATIEARIAAWTLLPQENGESIQVLRYENG 134
Query: 61 QKYEPHFDFFRD--KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS--QSRDGNWSE 116
QKYEPH DF R K + GGHR+ATVLMYLS V+ GGETVFPNS+ Q +D SE
Sbjct: 135 QKYEPHVDFIRHAAKGHHSRGGHRVATVLMYLSDVKMGGETVFPNSDAKTLQPKDDTQSE 194
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
CARRGYAVKP+KGDA+LFFSLHP+ +TD SLHG CPVIEGEKWSATKWIHVR FD +
Sbjct: 195 CARRGYAVKPVKGDAVLFFSLHPNGTTDRDSLHGGCPVIEGEKWSATKWIHVRPFDNRRR 254
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCKPSSV 228
P C D+D C A GEC +NP YMVG+ S G+CRKSC C +++
Sbjct: 255 VPSTAGCGDDDELCPRLAANGECDRNPRYMVGTAGSPGFCRKSCNACNGTTL 306
>gi|413934216|gb|AFW68767.1| hypothetical protein ZEAMMB73_452923 [Zea mays]
Length = 210
Score = 308 bits (790), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 142/200 (71%), Positives = 168/200 (84%), Gaps = 3/200 (1%)
Query: 27 QDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATV 86
QDE+V IE RI+AWTFLPPENGEA+QILHY++G+KYEPH+D+F DK NQ LGGHRIATV
Sbjct: 9 QDEVVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATV 68
Query: 87 LMYLSHVEKGGETVFPNSE--VSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTD 144
LMYLS+VEKGGET+FPN+E + Q +D WS+CAR GYAVKP+KGDALLFFSLHPD++TD
Sbjct: 69 LMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDSTTD 128
Query: 145 STSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP-EDDDCVDEDLNCVVWAKAGECKKNP 203
S SLHGSCP IEG+KWSATKWIHVR+FD K+P D C D+++ C WA GEC KNP
Sbjct: 129 SDSLHGSCPAIEGQKWSATKWIHVRSFDLTVKQPGPSDGCEDDNVLCPQWAAVGECAKNP 188
Query: 204 LYMVGSKSSRGYCRKSCKVC 223
YMVG+K + G+CRKSCKVC
Sbjct: 189 NYMVGTKEAPGFCRKSCKVC 208
>gi|413934217|gb|AFW68768.1| hypothetical protein ZEAMMB73_452923 [Zea mays]
Length = 204
Score = 307 bits (786), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 141/201 (70%), Positives = 168/201 (83%), Gaps = 3/201 (1%)
Query: 26 AQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIAT 85
+ DE+V IE RI+AWTFLPPENGEA+QILHY++G+KYEPH+D+F DK NQ LGGHRIAT
Sbjct: 2 SNDEVVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIAT 61
Query: 86 VLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDAST 143
VLMYLS+VEKGGET+FPN+E + Q +D WS+CAR GYAVKP+KGDALLFFSLHPD++T
Sbjct: 62 VLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDSTT 121
Query: 144 DSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP-EDDDCVDEDLNCVVWAKAGECKKN 202
DS SLHGSCP IEG+KWSATKWIHVR+FD K+P D C D+++ C WA GEC KN
Sbjct: 122 DSDSLHGSCPAIEGQKWSATKWIHVRSFDLTVKQPGPSDGCEDDNVLCPQWAAVGECAKN 181
Query: 203 PLYMVGSKSSRGYCRKSCKVC 223
P YMVG+K + G+CRKSCKVC
Sbjct: 182 PNYMVGTKEAPGFCRKSCKVC 202
>gi|215490181|dbj|BAG86624.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
Length = 294
Score = 305 bits (781), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 148/227 (65%), Positives = 172/227 (75%), Gaps = 7/227 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADNESG S SEVRTSSGMF+ KA+D IV+ IE +IA WTFLP ENGE +Q+L YE GQ
Sbjct: 70 VADNESGNSKTSEVRTSSGMFIPKAKDPIVSGIEEKIATWTFLPKENGEEIQVLRYEEGQ 129
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR-----DGNWSE 116
KYEPH+D+F DK+N GGHR+ATVLMYL++VEKGGETVFP +E S R D + SE
Sbjct: 130 KYEPHYDYFVDKVNIARGGHRLATVLMYLTNVEKGGETVFPKAEESPRRRSMIADDSLSE 189
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
CA++G VKP KGDALLF+SLHP+A+ D SLHG CPVI+GEKWSATKWIHV +FDK
Sbjct: 190 CAKKGIPVKPRKGDALLFYSLHPNATPDPLSLHGGCPVIQGEKWSATKWIHVDSFDKTVD 249
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+ +C D D NC WA GEC KNP YM+GS GYCRKSCKVC
Sbjct: 250 --TEGNCSDRDENCERWAALGECTKNPEYMLGSAGLPGYCRKSCKVC 294
>gi|29150368|gb|AAO72377.1| putative oxidoreductase [Oryza sativa Japonica Group]
gi|108711617|gb|ABF99412.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|125546090|gb|EAY92229.1| hypothetical protein OsI_13949 [Oryza sativa Indica Group]
gi|125588294|gb|EAZ28958.1| hypothetical protein OsJ_13002 [Oryza sativa Japonica Group]
Length = 310
Score = 304 bits (779), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 141/225 (62%), Positives = 174/225 (77%), Gaps = 2/225 (0%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKS+ SEVRTSSGMFL K QD +V+ IE RIAAWT LP EN E +QIL YE+G
Sbjct: 84 MVADNESGKSVMSEVRTSSGMFLDKQQDPVVSGIEERIAAWTLLPQENAENIQILRYENG 143
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
QKY+PHFD+F+DK+NQ GGHR ATVL YLS VEKGGETVFPN+E SQ +D ++S+CA
Sbjct: 144 QKYDPHFDYFQDKVNQLQGGHRYATVLTYLSTVEKGGETVFPNAEGWESQPKDDSFSDCA 203
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
++G AVK +KGD++LFF+L PD + D SLHGSCPVIEGEKWSA KWIHVR++D
Sbjct: 204 KKGLAVKAVKGDSVLFFNLQPDGTPDPLSLHGSCPVIEGEKWSAPKWIHVRSYDNASSMK 263
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+ ++C D NC WA +GEC N +YM+G++ + G C+KSC C
Sbjct: 264 QSEECSDLSENCAAWAASGECNNNAVYMIGTEDAPGQCQKSCNAC 308
>gi|114796723|gb|ABI79328.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
Length = 297
Score = 304 bits (779), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 150/228 (65%), Positives = 173/228 (75%), Gaps = 10/228 (4%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADNESGKS SEVRTSSG F+SKA+D IV IE ++A WTFLP ENGE +Q+L YE GQ
Sbjct: 74 VADNESGKSQVSEVRTSSGAFISKAKDAIVQRIEEKLATWTFLPIENGEDIQVLRYEEGQ 133
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR------DGNWS 115
KYE HFDFF DK+N GGHR ATVLMYLS+VEKGG+TVFPN+E+S+ + + + S
Sbjct: 134 KYENHFDFFSDKVNIARGGHRYATVLMYLSNVEKGGDTVFPNAELSERQKAAIAANDDLS 193
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
ECA+RG +VKP KGDALLFFSL P A+ D SLHG CPVIEGEKWSATKWIHV +FDK
Sbjct: 194 ECAKRGISVKPRKGDALLFFSLTPTATPDQLSLHGGCPVIEGEKWSATKWIHVDSFDK-- 251
Query: 176 KEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+D C D + NC WA GEC KNP YMVG+ S GYCR+SCKVC
Sbjct: 252 --ILEDGCNDHNQNCERWAALGECTKNPEYMVGTSSLPGYCRRSCKVC 297
>gi|218199253|gb|EEC81680.1| hypothetical protein OsI_25242 [Oryza sativa Indica Group]
Length = 487
Score = 300 bits (767), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 142/213 (66%), Positives = 166/213 (77%), Gaps = 2/213 (0%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADN+SGKS+ SEVRTSSGMFL K QD +V+ IE RIAAWTFLP EN E +QIL YEHG
Sbjct: 93 MVADNKSGKSVMSEVRTSSGMFLDKRQDPVVSRIEKRIAAWTFLPEENAENIQILRYEHG 152
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
QKYEPHFD+F DK+NQ LGGHR ATVLMYLS VEKGGETVFPN+E +Q +D +SECA
Sbjct: 153 QKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSECA 212
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
++G AVKP+KGDA+LFFSLH D D SLHGSCPVIEGEKWSA KWI +R+++ P
Sbjct: 213 QKGLAVKPVKGDAVLFFSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIRIRSYEHPPVSK 272
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKS 211
+ C D C WA+AGEC+KNP+YM + S
Sbjct: 273 VTEGCSDNSARCAKWAEAGECEKNPVYMTVNSS 305
>gi|9294584|dbj|BAB02865.1| unnamed protein product [Arabidopsis thaliana]
Length = 328
Score = 298 bits (764), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 144/216 (66%), Positives = 175/216 (81%), Gaps = 7/216 (3%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+VAD +SG+S SEVRTSSGMFL+K QD+IVA++EA++AAWTFLP ENGEA+QILHYE+G
Sbjct: 2 VVADVDSGESEDSEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQILHYENG 61
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPN--SEVSQSRDGNWSECA 118
QKY+PHFD+F DK +LGGHRIATVLMYLS+V KGGETVFPN + Q +D +WS+CA
Sbjct: 62 QKYDPHFDYFYDKKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCA 121
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
++GYAVKP KGDALLFF+LH + +TD SLHGSCPVIEGEKWSAT+WIHVR+F K +
Sbjct: 122 KQGYAVKPRKGDALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIHVRSFGKKKLV- 180
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRG 214
CVD+ +C WA AGEC+KNP+YMVG G
Sbjct: 181 ----CVDDHESCQEWADAGECEKNPMYMVGVGKKTG 212
>gi|224102545|ref|XP_002312720.1| predicted protein [Populus trichocarpa]
gi|222852540|gb|EEE90087.1| predicted protein [Populus trichocarpa]
Length = 300
Score = 298 bits (763), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 145/227 (63%), Positives = 173/227 (76%), Gaps = 7/227 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADNESGKS SEVRTSSGMF++KA+D IVA IE +IA WTFLP ENGE +Q+L YEHGQ
Sbjct: 76 VADNESGKSKLSEVRTSSGMFITKAKDPIVAGIEDKIATWTFLPRENGEDIQVLRYEHGQ 135
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-----NWSE 116
KY+PH+D+F DK+N GGHR+ATVLMYL+ VEKGGETVFP++E R + SE
Sbjct: 136 KYDPHYDYFSDKVNIARGGHRVATVLMYLTDVEKGGETVFPSAEELPRRKASVSHEDLSE 195
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
CAR+G AVKP +GDALLFFSL+P A D++S+H CPVIEGEKWSATKWIHV +FDK +
Sbjct: 196 CARKGIAVKPRRGDALLFFSLYPTAVPDTSSIHAGCPVIEGEKWSATKWIHVDSFDKNLE 255
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+C D++ +C WA GEC KN YMVGS GYCR+SCKVC
Sbjct: 256 --AGGNCTDQNESCGRWAALGECTKNVEYMVGSSGLPGYCRRSCKVC 300
>gi|222636605|gb|EEE66737.1| hypothetical protein OsJ_23428 [Oryza sativa Japonica Group]
Length = 487
Score = 298 bits (763), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 141/213 (66%), Positives = 165/213 (77%), Gaps = 2/213 (0%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADN+SGKS+ SEVRTSSGMFL K QD +V+ IE RIAAWTFLP EN E +QIL YEHG
Sbjct: 93 MVADNKSGKSVMSEVRTSSGMFLDKRQDPVVSRIEKRIAAWTFLPEENAENIQILRYEHG 152
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
QKYEPHFD+F DK+NQ LGGHR ATVLMYLS VEKGGETVFPN+E +Q +D +SECA
Sbjct: 153 QKYEPHFDYFHDKVNQALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSECA 212
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
++G AVKP+KGD +LFFSLH D D SLHGSCPVIEGEKWSA KWI +R+++ P
Sbjct: 213 QKGLAVKPVKGDTVLFFSLHIDGVPDPLSLHGSCPVIEGEKWSAPKWIRIRSYEHPPVSK 272
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKS 211
+ C D C WA+AGEC+KNP+YM + S
Sbjct: 273 VTEGCSDNSARCAKWAEAGECEKNPVYMTVNSS 305
>gi|359477455|ref|XP_002278454.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Vitis
vinifera]
Length = 296
Score = 297 bits (760), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 150/225 (66%), Positives = 169/225 (75%), Gaps = 5/225 (2%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN SGKS SEVRTSSGMF+ K +D IVA IE +IAAWTFLP +NGE MQ+L YE GQ
Sbjct: 74 VADNVSGKSRLSEVRTSSGMFIGKGKDPIVAGIEDKIAAWTFLPKDNGEDMQVLRYEPGQ 133
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQS---RDGNWSECA 118
KY+ H+D+F DK+N GGHRIATVLMYLS V KGGETVFP +EVS S + + SECA
Sbjct: 134 KYDAHYDYFVDKVNIARGGHRIATVLMYLSDVVKGGETVFPMAEVSSSTLPTNDDLSECA 193
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
R+G AVKP KGDALLFFSLHP A D SLHG CPVIEGEKWSATKWIHV +FDK K
Sbjct: 194 RKGIAVKPRKGDALLFFSLHPTAIPDPMSLHGGCPVIEGEKWSATKWIHVDSFDKILK-- 251
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+C DE+ +C WA GEC KNP YM+GS G CR+SCKVC
Sbjct: 252 PGGNCTDENDSCERWAALGECTKNPEYMLGSSDLPGACRRSCKVC 296
>gi|218192156|gb|EEC74583.1| hypothetical protein OsI_10158 [Oryza sativa Indica Group]
Length = 299
Score = 296 bits (759), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 139/224 (62%), Positives = 171/224 (76%), Gaps = 3/224 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + +SG+S+ S+ RTSSGMFL + QDE+VA IE RIAAWT P ENGE+MQ+L Y G+
Sbjct: 75 VVNGKSGESVMSKTRTSSGMFLIRKQDEVVARIEERIAAWTMFPAENGESMQMLRYGQGE 134
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECAR 119
KYEPHFD+ R + GGHRIATVLMYLS+V+ GGETVFP++E +SQ +D WS+CA
Sbjct: 135 KYEPHFDYIRGRQASARGGHRIATVLMYLSNVKMGGETVFPDAEARLSQPKDETWSDCAE 194
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPE 179
+G+AVKP KG A+LFFSL+P+A+ D SLHGSCPVI+GEKWSATKWIHVR++D+ +
Sbjct: 195 QGFAVKPTKGSAVLFFSLYPNATFDPGSLHGSCPVIQGEKWSATKWIHVRSYDENGRR-S 253
Query: 180 DDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D C DE C WA AGEC KNP YMVG+ S G+CRKSC VC
Sbjct: 254 SDKCEDEHALCSSWAAAGECAKNPGYMVGTSESPGFCRKSCNVC 297
>gi|357447555|ref|XP_003594053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
gi|355483101|gb|AES64304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
Length = 303
Score = 296 bits (758), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 143/229 (62%), Positives = 172/229 (75%), Gaps = 9/229 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN SG+S SEVRTSSGMF+SK +D IV+ IE +I++WTFLP ENGE +Q+L YEHGQ
Sbjct: 77 VADNLSGESKLSEVRTSSGMFISKNKDAIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQ 136
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQS-------RDGNW 114
KY+PH+D+F DK+N GGHR+ATVLMYL++V KGGETVFPN+E+ +S D +
Sbjct: 137 KYDPHYDYFADKVNIARGGHRVATVLMYLTNVTKGGETVFPNAELQESPRHKLSETDEDL 196
Query: 115 SECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
SEC ++G AVKP +GDALLFFSLHP+A D+ SLH CPVIEGEKWSATKWIHV +FDK
Sbjct: 197 SECGKKGVAVKPRRGDALLFFSLHPNAIPDTLSLHAGCPVIEGEKWSATKWIHVDSFDKT 256
Query: 175 EKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
DC D+ +C WA GEC KNP YMVG+ GYCRKSCK C
Sbjct: 257 VG--AGGDCTDQHESCERWAALGECTKNPEYMVGTSGLPGYCRKSCKTC 303
>gi|108706361|gb|ABF94156.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|222624253|gb|EEE58385.1| hypothetical protein OsJ_09545 [Oryza sativa Japonica Group]
Length = 299
Score = 296 bits (757), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 138/224 (61%), Positives = 171/224 (76%), Gaps = 3/224 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + +SG+S+ S+ RTSSGMFL + QDE+VA IE RIAAWT P ENGE+MQ+L Y G+
Sbjct: 75 VVNGKSGESVMSKTRTSSGMFLIRKQDEVVARIEERIAAWTMFPAENGESMQMLRYGQGE 134
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECAR 119
KYEPHFD+ R + GGHRIATVLMYLS+V+ GGETVFP++E +SQ +D WS+CA
Sbjct: 135 KYEPHFDYIRGRQASARGGHRIATVLMYLSNVKMGGETVFPDAEARLSQPKDETWSDCAE 194
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPE 179
+G+AVKP KG A+LFFSL+P+A+ D SLHGSCPVI+GEKWSATKWIHVR++D+ +
Sbjct: 195 QGFAVKPTKGSAVLFFSLYPNATFDPGSLHGSCPVIQGEKWSATKWIHVRSYDENGRR-S 253
Query: 180 DDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D C D+ C WA AGEC KNP YMVG+ S G+CRKSC VC
Sbjct: 254 SDKCEDQHALCSSWAAAGECAKNPGYMVGTSESPGFCRKSCNVC 297
>gi|255551575|ref|XP_002516833.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223543921|gb|EEF45447.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 297
Score = 295 bits (754), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 143/227 (62%), Positives = 169/227 (74%), Gaps = 7/227 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADNESGKS SEVRTSSGMF++K +D I+A IE +I+ WTFLP ENGE +Q+L YEHGQ
Sbjct: 73 VADNESGKSKLSEVRTSSGMFIAKGKDPIIAGIEEKISTWTFLPKENGEDLQVLRYEHGQ 132
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-----NWSE 116
KY+PH+D+F DK+N GGHR+ATVLMYLS V KGGETVFPN+E R + SE
Sbjct: 133 KYDPHYDYFADKINIARGGHRMATVLMYLSDVVKGGETVFPNAEEPPRRKATESHEDLSE 192
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
CA++G +VKP +GDALLFFSLHP A D SLH CPVIEGEKWSATKWIHV +FDK +
Sbjct: 193 CAKKGISVKPRRGDALLFFSLHPTAIPDPNSLHAGCPVIEGEKWSATKWIHVDSFDKNIE 252
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+C D++ +C WA GEC NP YMVGS GYCR+SCKVC
Sbjct: 253 --AGGNCTDKNESCERWAALGECTNNPEYMVGSPELPGYCRRSCKVC 297
>gi|357447553|ref|XP_003594052.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
gi|355483100|gb|AES64303.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
Length = 301
Score = 294 bits (753), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 143/227 (62%), Positives = 170/227 (74%), Gaps = 7/227 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN SG+S SEVRTSSGMF+SK +D IV+ IE +I++WTFLP ENGE +Q+L YEHGQ
Sbjct: 77 VADNLSGESKLSEVRTSSGMFISKNKDAIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQ 136
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS-----QSRDGNWSE 116
KY+PH+D+F DK+N GGHR+ATVLMYL++V KGGETVFPN+E S D + SE
Sbjct: 137 KYDPHYDYFADKVNIARGGHRVATVLMYLTNVTKGGETVFPNAEESPRHKLSETDEDLSE 196
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
C ++G AVKP +GDALLFFSLHP+A D+ SLH CPVIEGEKWSATKWIHV +FDK
Sbjct: 197 CGKKGVAVKPRRGDALLFFSLHPNAIPDTLSLHAGCPVIEGEKWSATKWIHVDSFDKTVG 256
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
DC D+ +C WA GEC KNP YMVG+ GYCRKSCK C
Sbjct: 257 --AGGDCTDQHESCERWAALGECTKNPEYMVGTSGLPGYCRKSCKTC 301
>gi|449432777|ref|XP_004134175.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 303
Score = 294 bits (752), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 145/227 (63%), Positives = 173/227 (76%), Gaps = 7/227 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN SGKS SEVRTSSG F+ KA+D IV+ IE +IAAWTFLP +NGE +Q+L YE+GQ
Sbjct: 78 VADNLSGKSKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQ 137
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-----NWSE 116
KY+ HFD+F DK+N GGHR+ATVLMYLS VEKGGETVFP++E SQ R + S+
Sbjct: 138 KYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSD 197
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
CA++G AVKP KGDALLFFSLHP+A D++SLHG CPVIEGEKWSATKWI V +FD +
Sbjct: 198 CAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVVR 257
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+ +C DE+ +C WA+ GEC NP YMVGS GYCRKSCK C
Sbjct: 258 --DHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC 302
>gi|359806348|ref|NP_001241485.1| uncharacterized protein LOC100783075 precursor [Glycine max]
gi|255645457|gb|ACU23224.1| unknown [Glycine max]
Length = 298
Score = 292 bits (747), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 141/227 (62%), Positives = 172/227 (75%), Gaps = 7/227 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN SG+S S+VRTSSGMF+SK +D I++ IE +I++WTFLP ENGE +Q+L YEHGQ
Sbjct: 74 VADNLSGESQLSDVRTSSGMFISKNKDPIISGIEDKISSWTFLPKENGEDIQVLRYEHGQ 133
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-----NWSE 116
KY+PH+D+F DK+N GGHRIATVLMYL++V KGGETVFP++E R G + SE
Sbjct: 134 KYDPHYDYFTDKVNIARGGHRIATVLMYLTNVTKGGETVFPSAEEPPRRRGTETSSDLSE 193
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
CA++G AVKP +GDALLFFSLH +A+ D++SLH CPVIEGEKWSATKWIHV +FDK
Sbjct: 194 CAKKGIAVKPHRGDALLFFSLHTNATPDTSSLHAGCPVIEGEKWSATKWIHVDSFDKTVG 253
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
DC D ++C WA GEC KNP YM+GS GYCRKSCK C
Sbjct: 254 --AGGDCSDHHVSCERWASLGECTKNPEYMIGSSDVPGYCRKSCKSC 298
>gi|359477453|ref|XP_003631980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Vitis
vinifera]
gi|297736941|emb|CBI26142.3| unnamed protein product [Vitis vinifera]
Length = 298
Score = 292 bits (747), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 148/227 (65%), Positives = 167/227 (73%), Gaps = 7/227 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN SGKS SEVRTSSGMF+ K +D IVA IE +IAAWTFLP +NGE MQ+L YE GQ
Sbjct: 74 VADNVSGKSRLSEVRTSSGMFIGKGKDPIVAGIEDKIAAWTFLPKDNGEDMQVLRYEPGQ 133
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR-----DGNWSE 116
KY+ H+D+F DK+N GGHRIATVLMYLS V KGGETVFP +E R + + SE
Sbjct: 134 KYDAHYDYFVDKVNIARGGHRIATVLMYLSDVVKGGETVFPMAEEPSRRKPLPTNDDLSE 193
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
CAR+G AVKP KGDALLFFSLHP A D SLHG CPVIEGEKWSATKWIHV +FDK K
Sbjct: 194 CARKGIAVKPRKGDALLFFSLHPTAIPDPMSLHGGCPVIEGEKWSATKWIHVDSFDKILK 253
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+C DE+ +C WA GEC KNP YM+GS G CR+SCKVC
Sbjct: 254 --PGGNCTDENDSCERWAALGECTKNPEYMLGSSDLPGACRRSCKVC 298
>gi|20260280|gb|AAM13038.1| unknown protein [Arabidopsis thaliana]
gi|22136524|gb|AAM91340.1| unknown protein [Arabidopsis thaliana]
Length = 298
Score = 291 bits (744), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 140/227 (61%), Positives = 170/227 (74%), Gaps = 7/227 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN+SG+S SEVRTSSG F+SK +D IV+ IE +I+ WTFLP ENGE +Q+L YEHGQ
Sbjct: 74 VADNDSGESKFSEVRTSSGTFISKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQ 133
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR-----DGNWSE 116
KY+ HFD+F DK+N GGHR+AT+LMYLS+V KGGETVFP++E+ R + + S+
Sbjct: 134 KYDAHFDYFHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENEEDLSD 193
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
CA+RG AVKP KGDALLFF+LHPDA D SLHG CPVIEGEKWSATKWIHV +FD+
Sbjct: 194 CAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIHVDSFDRIVT 253
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+C D + +C WA GEC KNP YMVG+ GYCR+SCK C
Sbjct: 254 --PSGNCTDMNESCERWAVLGECTKNPEYMVGTTELPGYCRRSCKAC 298
>gi|363807286|ref|NP_001242363.1| uncharacterized protein LOC100796794 precursor [Glycine max]
gi|255641119|gb|ACU20838.1| unknown [Glycine max]
Length = 297
Score = 290 bits (743), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 142/227 (62%), Positives = 170/227 (74%), Gaps = 7/227 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN SG+S S+VRTSSGMF+SK +D IVA IE +I++WTFLP ENGE +Q+ YEHGQ
Sbjct: 73 VADNLSGESQLSDVRTSSGMFISKNKDPIVAGIEDKISSWTFLPKENGEDIQVSRYEHGQ 132
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-----NWSE 116
KY+PH+D+F DK+N GGHRIATVLMYL+ V KGGETVFP++E R G + SE
Sbjct: 133 KYDPHYDYFTDKVNIARGGHRIATVLMYLTDVAKGGETVFPSAEEPPRRRGAETSSDLSE 192
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
CA++G AVKP +GDALLFFSLH +A+ D++SLH CPVIEGEKWSATKWIHV +FDK
Sbjct: 193 CAKKGIAVKPRRGDALLFFSLHTNATPDTSSLHAGCPVIEGEKWSATKWIHVDSFDKTVG 252
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
DC D ++C WA GEC KNP YM+GS GYCRKSCK C
Sbjct: 253 --AGGDCSDNHVSCERWASLGECTKNPEYMIGSSDIPGYCRKSCKAC 297
>gi|15239594|ref|NP_197391.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|21593296|gb|AAM65245.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
gi|332005243|gb|AED92626.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 298
Score = 290 bits (742), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 140/227 (61%), Positives = 169/227 (74%), Gaps = 7/227 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN+SG+S SEVRTSSG F+SK +D IV+ IE +I+ WTFLP ENGE +Q+L YEHGQ
Sbjct: 74 VADNDSGESKFSEVRTSSGTFISKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQ 133
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR-----DGNWSE 116
KY+ HFD+F DK+N GGHR+AT+LMYLS+V KGGETVFP++E+ R + S+
Sbjct: 134 KYDAHFDYFHDKVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSD 193
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
CA+RG AVKP KGDALLFF+LHPDA D SLHG CPVIEGEKWSATKWIHV +FD+
Sbjct: 194 CAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIHVDSFDRIVT 253
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+C D + +C WA GEC KNP YMVG+ GYCR+SCK C
Sbjct: 254 --PSGNCTDMNESCERWAVLGECTKNPEYMVGTTELPGYCRRSCKAC 298
>gi|226495689|ref|NP_001149322.1| LOC100282945 precursor [Zea mays]
gi|194697650|gb|ACF82909.1| unknown [Zea mays]
gi|194708468|gb|ACF88318.1| unknown [Zea mays]
gi|195626376|gb|ACG35018.1| oxidoreductase [Zea mays]
gi|347978842|gb|AEP37763.1| prolyl 4-hydroxylase 9 [Zea mays]
gi|413945802|gb|AFW78451.1| oxidoreductase [Zea mays]
Length = 308
Score = 290 bits (742), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 140/223 (62%), Positives = 169/223 (75%), Gaps = 3/223 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN SGKS SEVRTSSG FL K QD IV IE +IAAWTFLP ENGE +Q+L Y+HG+
Sbjct: 88 VADNMSGKSTLSEVRTSSGTFLRKGQDPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGE 147
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNS-EVSQSRDGNWSECARR 120
KYEPH+D+F D +N GGHR ATVL+YL+ V +GGETVFP + E ++D SECA++
Sbjct: 148 KYEPHYDYFTDNVNTVRGGHRYATVLLYLTDVPEGGETVFPLAEEPDDAKDATLSECAQK 207
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPED 180
G AV+P KGDALLFF+L+PD +TDS SLHG CPVI+GEKWSATKWI V +FDK P+
Sbjct: 208 GIAVRPRKGDALLFFNLNPDGTTDSVSLHGGCPVIKGEKWSATKWIRVASFDKVH-HPQ- 265
Query: 181 DDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+C DE+ +C WA GEC KNP YMVG+ + GYCR+SC VC
Sbjct: 266 GNCTDENESCAKWAALGECIKNPEYMVGTTALPGYCRRSCNVC 308
>gi|356546462|ref|XP_003541645.1| PREDICTED: uncharacterized protein LOC100818794 [Glycine max]
Length = 839
Score = 290 bits (741), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 143/227 (62%), Positives = 168/227 (74%), Gaps = 7/227 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN SG+S SEVRTSSGMF+ K +D IVA IE +I++WTFLP ENGE +Q+L YEHGQ
Sbjct: 615 VADNLSGESKLSEVRTSSGMFIPKNKDLIVAGIEDKISSWTFLPKENGEDIQVLRYEHGQ 674
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-----NWSE 116
KY+PH+D+F DK+N GGHR+ATVLMYL+ V KGGETVFP++E S G N SE
Sbjct: 675 KYDPHYDYFADKVNIARGGHRVATVLMYLTDVTKGGETVFPDAEESPRHKGSETNENLSE 734
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
CA++G AVKP +GDALLFFSL+P+A D+ SLH CPVIEGEKWSATKWIHV +FDK
Sbjct: 735 CAQKGIAVKPRRGDALLFFSLYPNAIPDTLSLHAGCPVIEGEKWSATKWIHVDSFDKVVG 794
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+ DC D+ NC WA GEC NP YMVGS GYC KSCK C
Sbjct: 795 --DGGDCNDKHENCERWATLGECTSNPEYMVGSPGLPGYCMKSCKEC 839
>gi|356555587|ref|XP_003546112.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Glycine max]
Length = 297
Score = 289 bits (739), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 140/223 (62%), Positives = 167/223 (74%), Gaps = 3/223 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN SG+S SEVRTSSGMF+ K +D IVA +E +I++WT LP ENGE +Q+L YEHGQ
Sbjct: 77 VADNLSGESKLSEVRTSSGMFIPKNKDPIVAGVEDKISSWTLLPKENGEDIQVLRYEHGQ 136
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARR 120
KY+PH+D+F DK+N GGHR+ATVLMYL+ V KGGETVFPN+E+ S + SECA++
Sbjct: 137 KYDPHYDYFADKVNIARGGHRVATVLMYLTDVTKGGETVFPNAELKSSETKEDLSECAQK 196
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPED 180
G AVKP +GDALLFFSL+P+A D+ SLH CPVIEGEKWSATKWIHV +FDK +
Sbjct: 197 GIAVKPRRGDALLFFSLYPNAIPDTMSLHAGCPVIEGEKWSATKWIHVDSFDK--MVADG 254
Query: 181 DDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
DC D+ NC WA GEC NP YMVGS GYC KSCK C
Sbjct: 255 GDCNDKQENCDRWATLGECTSNPNYMVGSPGLPGYCMKSCKAC 297
>gi|388496942|gb|AFK36537.1| unknown [Lotus japonicus]
Length = 302
Score = 288 bits (738), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 141/230 (61%), Positives = 170/230 (73%), Gaps = 13/230 (5%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN SG S S+VRTSSGMF+SK +D IVA IE +I++WTFLP ENGE +Q+L YEHGQ
Sbjct: 78 VADNLSGDSKLSDVRTSSGMFISKNKDPIVAGIEDKISSWTFLPKENGEDIQVLRYEHGQ 137
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV--------SQSRDGN 113
KY+PH+DFF DK+N GGHR+ATVLMYL++V +GGETVFPN+EV S++ D +
Sbjct: 138 KYDPHYDFFADKVNIARGGHRVATVLMYLTNVTRGGETVFPNAEVEEFPRHRGSETID-D 196
Query: 114 WSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
SECA++G AVKP +GDALLFFSL+P+A D+ SLH CPVIEGEKWSATKWIHV +FD+
Sbjct: 197 LSECAKKGIAVKPRRGDALLFFSLYPNAVPDTMSLHAGCPVIEGEKWSATKWIHVDSFDR 256
Query: 174 PEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
DC D +C WA GEC NP YMVGS GYC +SCK C
Sbjct: 257 ----KAGGDCTDHHESCASWAAVGECTNNPEYMVGSAGLPGYCMRSCKAC 302
>gi|449454448|ref|XP_004144967.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449474082|ref|XP_004154068.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449515181|ref|XP_004164628.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 300
Score = 287 bits (735), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 141/227 (62%), Positives = 170/227 (74%), Gaps = 7/227 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN+SGKS S VRTSSGMF+SK +D IV+ IE +I+AWTFLP ENGE +Q+L YEHGQ
Sbjct: 76 VADNDSGKSKLSTVRTSSGMFISKNKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQ 135
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR-----DGNWSE 116
KYE H+D+F DK+N GGHR+ATVLMYLS+V +GGETVFP +E R D + SE
Sbjct: 136 KYESHYDYFVDKVNIAWGGHRLATVLMYLSNVTQGGETVFPLAEKPSHRRAYETDEDLSE 195
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
CA++G AVKP KGDALLFFSL P+A D+ SLHG CPV+EGEKWSATKWIHV +F K
Sbjct: 196 CAKKGVAVKPKKGDALLFFSLEPNAIPDTNSLHGGCPVLEGEKWSATKWIHVDSFSK--N 253
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+ +C D + +C WA GEC KNP YMVGS GYCR+SC++C
Sbjct: 254 LGDIGNCTDLNESCERWAALGECTKNPEYMVGSPEMPGYCRRSCRIC 300
>gi|356555585|ref|XP_003546111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Glycine max]
Length = 301
Score = 287 bits (734), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 141/227 (62%), Positives = 167/227 (73%), Gaps = 7/227 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN SG+S SEVRTSSGMF+ K +D IVA +E +I++WT LP ENGE +Q+L YEHGQ
Sbjct: 77 VADNLSGESKLSEVRTSSGMFIPKNKDPIVAGVEDKISSWTLLPKENGEDIQVLRYEHGQ 136
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-----NWSE 116
KY+PH+D+F DK+N GGHR+ATVLMYL+ V KGGETVFPN+E S G + SE
Sbjct: 137 KYDPHYDYFADKVNIARGGHRVATVLMYLTDVTKGGETVFPNAEESPRHRGSETKEDLSE 196
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
CA++G AVKP +GDALLFFSL+P+A D+ SLH CPVIEGEKWSATKWIHV +FDK
Sbjct: 197 CAQKGIAVKPRRGDALLFFSLYPNAIPDTMSLHAGCPVIEGEKWSATKWIHVDSFDK--M 254
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+ DC D+ NC WA GEC NP YMVGS GYC KSCK C
Sbjct: 255 VADGGDCNDKQENCDRWATLGECTSNPNYMVGSPGLPGYCMKSCKAC 301
>gi|297812067|ref|XP_002873917.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297319754|gb|EFH50176.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 298
Score = 287 bits (734), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 142/227 (62%), Positives = 166/227 (73%), Gaps = 7/227 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN+SG+S SEVRTSSG F+ K +D IV+ IE +I+ WTFLP ENGE +Q+L YEHGQ
Sbjct: 74 VADNDSGESKFSEVRTSSGTFIPKGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQ 133
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR-----DGNWSE 116
KY+ HFD+F DK+N GGHRIATVLMYLS+V KGGETVFP++EV R + S+
Sbjct: 134 KYDAHFDYFHDKVNIVRGGHRIATVLMYLSNVTKGGETVFPDAEVPSCRVLSENKEDLSD 193
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
CA+RG AVKP KGDALLFF+LHPDA D SLHG CPVIEGEKWSATKWIHV +FDK
Sbjct: 194 CAKRGIAVKPRKGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKWSATKWIHVDSFDKIVT 253
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+C + +C WA GEC KNP YMVG+ GYCR SCK C
Sbjct: 254 --PSGNCTNMHESCERWAVLGECTKNPEYMVGTTELPGYCRHSCKAC 298
>gi|255641919|gb|ACU21228.1| unknown [Glycine max]
Length = 301
Score = 286 bits (732), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 142/227 (62%), Positives = 168/227 (74%), Gaps = 7/227 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN SG+S SEVRTSSGMF+ K +D IVA IE +I++WTFLP ENGE +Q+L YEHGQ
Sbjct: 77 VADNLSGESKLSEVRTSSGMFIPKNKDLIVAGIEDKISSWTFLPKENGEDIQVLRYEHGQ 136
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-----NWSE 116
KY+PH+D+F DK+N GGHR+ATVLMYL+ V KGGETVFP++E S G N SE
Sbjct: 137 KYDPHYDYFADKVNIARGGHRVATVLMYLTDVTKGGETVFPDAEESPRHKGSETNENLSE 196
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
CA++G AVKP +GDALLFFSL+P+A D+ SLH CPVIEGEKWSAT+WIHV +FDK
Sbjct: 197 CAQKGIAVKPRRGDALLFFSLYPNAIPDTLSLHAGCPVIEGEKWSATEWIHVDSFDKVVG 256
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+ DC D+ NC WA GEC NP YMVGS GYC KSCK C
Sbjct: 257 --DGGDCNDKHENCERWATLGECTSNPEYMVGSPGLPGYCMKSCKEC 301
>gi|242088305|ref|XP_002439985.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
gi|241945270|gb|EES18415.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
Length = 308
Score = 286 bits (732), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 138/223 (61%), Positives = 165/223 (73%), Gaps = 3/223 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN SGKS S+VRTSSG FL K QD IV IE +IAAWTFLP ENGE +Q+L Y+HG+
Sbjct: 88 VADNMSGKSTLSDVRTSSGTFLRKGQDPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGE 147
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNS-EVSQSRDGNWSECARR 120
KYEPH+D+F D +N GGHR ATVL+YL+ V +GGETVFP + EV ++D +SECA++
Sbjct: 148 KYEPHYDYFTDNVNTIRGGHRYATVLLYLTDVAEGGETVFPLAEEVDDAKDATFSECAQK 207
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPED 180
G AVKP KGDALLFF+L PD +TD SLHG C VI GEKWSATKWI V +FDK
Sbjct: 208 GIAVKPRKGDALLFFNLKPDGTTDPVSLHGGCAVIRGEKWSATKWIRVASFDKVHY--PQ 265
Query: 181 DDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+C DE+ +C WA GEC KNP YMVG+ + GYCR+SC VC
Sbjct: 266 GNCTDENESCSKWAALGECIKNPEYMVGTTALPGYCRRSCNVC 308
>gi|357478545|ref|XP_003609558.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355510613|gb|AES91755.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 299
Score = 284 bits (727), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 140/227 (61%), Positives = 166/227 (73%), Gaps = 7/227 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN SG S S+VRTSSGMF+SK +D IV+ IE RI+AWTFLP ENGE +Q+L YEHGQ
Sbjct: 74 VADNLSGDSQLSDVRTSSGMFISKNKDPIVSGIEDRISAWTFLPKENGEDIQVLRYEHGQ 133
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-----NWSE 116
KY+PH+D+F DK+N GGHR+ATVLMYL++V KGGETVFP +E R G + SE
Sbjct: 134 KYDPHYDYFADKVNIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSE 193
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
CA++G AVKP +GDALLFFSL +A D+ SLH CPV+EGEKWSATKWIHV +FDK
Sbjct: 194 CAKKGIAVKPRRGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKWSATKWIHVDSFDKIVG 253
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
C D+ +C WA GEC NP+YMVGS GYCRKSCK C
Sbjct: 254 --AGGGCSDQHDSCERWASLGECTNNPVYMVGSSDLPGYCRKSCKAC 298
>gi|326526235|dbj|BAJ97134.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 308
Score = 283 bits (725), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 140/226 (61%), Positives = 167/226 (73%), Gaps = 9/226 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VAD SGKS SEVRTSSG F+SK +D IVA IE +IAAWTFLP ENGE MQ+L Y+ G+
Sbjct: 88 VADETSGKSQLSEVRTSSGTFISKGKDPIVAGIEDKIAAWTFLPKENGEDMQVLRYKRGE 147
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGN----WSEC 117
KYEPH+DFF D +N LGGHR+ATVL+YL+ V +GGETVFP +++ R G+ SEC
Sbjct: 148 KYEPHYDFFTDSVNTILGGHRVATVLLYLTDVAEGGETVFP---LAKGRKGSHHKGLSEC 204
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE 177
A++G AVKP KGDALLFF+L PDA+TD TSLHG C VI+GEKWSATKWI V +FDK
Sbjct: 205 AQKGIAVKPRKGDALLFFNLRPDAATDPTSLHGGCEVIKGEKWSATKWIRVASFDKVYHS 264
Query: 178 PEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
P +C D +C WA GEC KNP YMVG+ G+CR+SC VC
Sbjct: 265 P--GNCTDNSNSCSQWAALGECTKNPAYMVGTAVLPGHCRRSCNVC 308
>gi|449495423|ref|XP_004159836.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 304
Score = 283 bits (725), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 143/228 (62%), Positives = 171/228 (75%), Gaps = 8/228 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN SGKS SEVRTSSG F+ KA+D IV+ IE +IAAWTFLP +NGE +Q+L YE+GQ
Sbjct: 78 VADNLSGKSKVSEVRTSSGAFIHKAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQ 137
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVF----PNSEVSQSRDGN--WS 115
KY+ HFD+F DK+N GGHR+ATVLMYLS VEKGGETVF S+ Q+ + N S
Sbjct: 138 KYDAHFDYFADKVNIARGGHRMATVLMYLSDVEKGGETVFLLRRSESQRRQASETNEDLS 197
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
+CA++G AVKP KGDALLFFSLHP+A D++SLHG CPVIEGEKWSATKWI V +FD
Sbjct: 198 DCAKKGIAVKPRKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSFDMVV 257
Query: 176 KEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+ + +C DE+ +C WA+ GEC NP YMVGS GYCRKSCK C
Sbjct: 258 R--DHTNCGDENPSCERWAELGECTNNPEYMVGSPELPGYCRKSCKAC 303
>gi|388500582|gb|AFK38357.1| unknown [Medicago truncatula]
Length = 299
Score = 282 bits (722), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 139/227 (61%), Positives = 165/227 (72%), Gaps = 7/227 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN SG S S+VRTSSGM +SK +D IV+ IE RI+AWTFLP ENGE +Q+L YEHGQ
Sbjct: 74 VADNLSGDSQLSDVRTSSGMLISKNKDPIVSGIEDRISAWTFLPKENGEDIQVLRYEHGQ 133
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-----NWSE 116
KY+PH+D+F DK+N GGHR+ATVLMYL++V KGGETVFP +E R G + SE
Sbjct: 134 KYDPHYDYFADKVNIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSE 193
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
CA++G AVKP +GDALLFFSL +A D+ SLH CPV+EGEKWSATKWIHV +FDK
Sbjct: 194 CAKKGIAVKPRRGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKWSATKWIHVDSFDKIVG 253
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
C D+ +C WA GEC NP+YMVGS GYCRKSCK C
Sbjct: 254 --AGGGCSDQHDSCERWASLGECTNNPVYMVGSSDLPGYCRKSCKAC 298
>gi|388492638|gb|AFK34385.1| unknown [Medicago truncatula]
Length = 299
Score = 282 bits (722), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 139/227 (61%), Positives = 165/227 (72%), Gaps = 7/227 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN SG S S+VRTSSGMF+SK +D IV+ IE RI+AWTFLP ENGE +Q+L YEHGQ
Sbjct: 74 VADNLSGDSQLSDVRTSSGMFISKNKDPIVSGIEDRISAWTFLPKENGEDIQVLRYEHGQ 133
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-----NWSE 116
KY+PH+D+F DK+N GGHR+ATVLMYL++V KGGETVFP +E R G + SE
Sbjct: 134 KYDPHYDYFADKVNIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSE 193
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
CA++G AVKP +GDALLFFSL +A D+ SLH CPV+EGEKWSATKWIHV + DK
Sbjct: 194 CAKKGIAVKPRRGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKWSATKWIHVDSLDKIVG 253
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
C D+ +C WA GEC NP+YMVGS GYCRKSCK C
Sbjct: 254 --AGGGCSDQHDSCERWASLGECTNNPVYMVGSSDLPGYCRKSCKAC 298
>gi|115464581|ref|NP_001055890.1| Os05g0489100 [Oryza sativa Japonica Group]
gi|50511363|gb|AAT77286.1| putative prolyl 4-hydroxylase alpha subunit [Oryza sativa Japonica
Group]
gi|113579441|dbj|BAF17804.1| Os05g0489100 [Oryza sativa Japonica Group]
gi|125587281|gb|EAZ27945.1| hypothetical protein OsJ_11906 [Oryza sativa Japonica Group]
gi|215737307|dbj|BAG96236.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 319
Score = 281 bits (720), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 137/227 (60%), Positives = 166/227 (73%), Gaps = 7/227 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN SGKS S+ RTSSG F+ K+QD IVA IE +IAAWTFLP ENGE +Q+L Y+HG+
Sbjct: 95 VADNLSGKSELSDARTSSGTFIRKSQDPIVAGIEEKIAAWTFLPKENGEDIQVLRYKHGE 154
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV-----SQSRDGNWSE 116
KYE H+D+F D +N GGHRIATVLMYL+ V +GGETVFP +E + + D SE
Sbjct: 155 KYERHYDYFSDNVNTLRGGHRIATVLMYLTDVAEGGETVFPLAEEFTESGTNNEDSTLSE 214
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
CA++G AVKP KGDALLFF+L PDAS DS SLH CPVI+GEKWSATKWI V +FDK
Sbjct: 215 CAKKGVAVKPRKGDALLFFNLSPDASKDSLSLHAGCPVIKGEKWSATKWIRVASFDKVYH 274
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+C D++ +C WA GEC KNP YM+G+ + GYCRKSC +C
Sbjct: 275 --TQGNCTDDNESCEKWAALGECIKNPEYMIGTAALPGYCRKSCNIC 319
>gi|125552794|gb|EAY98503.1| hypothetical protein OsI_20415 [Oryza sativa Indica Group]
Length = 319
Score = 281 bits (719), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 137/227 (60%), Positives = 166/227 (73%), Gaps = 7/227 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN SGKS S+ RTSSG F+ K+QD IVA IE +IAAWTFLP ENGE +Q+L Y+HG+
Sbjct: 95 VADNLSGKSELSDARTSSGTFIRKSQDPIVAGIEEKIAAWTFLPKENGEDIQVLRYKHGE 154
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV-----SQSRDGNWSE 116
KYE H+D+F D +N GGHRIATVLMYL+ V +GGETVFP +E + + D SE
Sbjct: 155 KYERHYDYFSDNVNTLRGGHRIATVLMYLTDVAEGGETVFPLAEEFTESGTNNEDSTLSE 214
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
CA++G AVKP KGDALLFF+L PDAS DS SLH CPVI+GEKWSATKWI V +FDK
Sbjct: 215 CAKKGVAVKPRKGDALLFFNLSPDASKDSLSLHAGCPVIKGEKWSATKWIRVASFDKVYH 274
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+C D++ +C WA GEC KNP YM+G+ + GYCRKSC +C
Sbjct: 275 --TQGNCTDDNESCEKWAALGECIKNPEYMIGTAALPGYCRKSCNIC 319
>gi|388495016|gb|AFK35574.1| unknown [Lotus japonicus]
Length = 297
Score = 281 bits (718), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 140/227 (61%), Positives = 164/227 (72%), Gaps = 7/227 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN G S SEVRTSSGMF+SK +D IVA IE +I+AWTFLP ENGE MQ+L YEHGQ
Sbjct: 73 VADNLPGDSKLSEVRTSSGMFISKKKDPIVAGIEDKISAWTFLPKENGEDMQVLRYEHGQ 132
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-----NWSE 116
KY+PH+D+F DK+N GGHR+ATVL+YL++V +GGETVFP +E R G + SE
Sbjct: 133 KYDPHYDYFTDKVNIVRGGHRMATVLLYLTNVTRGGETVFPVAEEPPRRRGLETNSDLSE 192
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
CA++G AVKP +GDALLFFSLH A D+ SLH CPVIEGEKWSATKWIHV +FDK
Sbjct: 193 CAKKGIAVKPRRGDALLFFSLHTTAIPDTDSLHAGCPVIEGEKWSATKWIHVDSFDKTVG 252
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
DC D+ +C WA GEC NP YMVGS G CR+SCK C
Sbjct: 253 --AGGDCSDQHESCQRWASLGECTNNPEYMVGSSDLPGSCRRSCKAC 297
>gi|125542543|gb|EAY88682.1| hypothetical protein OsI_10157 [Oryza sativa Indica Group]
Length = 321
Score = 279 bits (714), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 141/242 (58%), Positives = 168/242 (69%), Gaps = 22/242 (9%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPE-------------- 47
V D ESG+S+ S+VRTSSGMFL K QDE+VA IE RIAAWT LP E
Sbjct: 80 VVDGESGESVTSKVRTSSGMFLDKKQDEVVARIEERIAAWTMLPTECIIFYCFANFAILK 139
Query: 48 ---NGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNS 104
NGE+MQIL Y G+KYEPHFD+ + G R+ATVLMYLS+V+ GGET+FP+
Sbjct: 140 LSENGESMQILRYGQGEKYEPHFDYISGRQGSTREGDRVATVLMYLSNVKMGGETIFPDC 199
Query: 105 E--VSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSA 162
E +SQ +D WS+CA +G+AVKP KG A+LFFSLHP+A+ D+ SLHGSCPVIEGEKWSA
Sbjct: 200 EARLSQPKDETWSDCAEQGFAVKPAKGSAVLFFSLHPNATLDTDSLHGSCPVIEGEKWSA 259
Query: 163 TKWIHVRNFDKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVG-SKSSRGYCRKSCK 221
TKWIHVR++ + C DE + C WA AGEC KNP YMVG S S G+CRKSC
Sbjct: 260 TKWIHVRSYSYRRRSA--GKCEDEHVLCSSWAAAGECAKNPGYMVGTSDSPPGFCRKSCN 317
Query: 222 VC 223
VC
Sbjct: 318 VC 319
>gi|110738390|dbj|BAF01121.1| hypothetical protein [Arabidopsis thaliana]
Length = 299
Score = 277 bits (709), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 135/228 (59%), Positives = 171/228 (75%), Gaps = 9/228 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN++G+S S+VRTSSG F+SK +D IV+ IE +++ WTFLP ENGE +Q+L YEHGQ
Sbjct: 75 VADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQ 134
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE------VSQSRDGNWS 115
KY+ HFD+F DK+N GGHRIATVL+YLS+V KGGETVFP+++ +S+++D + S
Sbjct: 135 KYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKD-DLS 193
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
+CA++G AVKP KG+ALLFF+L DA D SLHG CPVIEGEKWSATKWIHV +FDK
Sbjct: 194 DCAKKGIAVKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHVDSFDKIL 253
Query: 176 KEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D +C D + +C WA GEC KNP YMVG+ G CR+SCK C
Sbjct: 254 T--HDGNCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299
>gi|18397528|ref|NP_566279.1| P4H isoform 2 [Arabidopsis thaliana]
gi|332640849|gb|AEE74370.1| P4H isoform 2 [Arabidopsis thaliana]
Length = 299
Score = 277 bits (709), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 135/228 (59%), Positives = 171/228 (75%), Gaps = 9/228 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN++G+S S+VRTSSG F+SK +D IV+ IE +++ WTFLP ENGE +Q+L YEHGQ
Sbjct: 75 VADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQ 134
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE------VSQSRDGNWS 115
KY+ HFD+F DK+N GGHRIATVL+YLS+V KGGETVFP+++ +S+++D + S
Sbjct: 135 KYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKD-DLS 193
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
+CA++G AVKP KG+ALLFF+L DA D SLHG CPVIEGEKWSATKWIHV +FDK
Sbjct: 194 DCAKKGIAVKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHVDSFDKIL 253
Query: 176 KEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D +C D + +C WA GEC KNP YMVG+ G CR+SCK C
Sbjct: 254 T--HDGNCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 299
>gi|21618073|gb|AAM67123.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
Length = 297
Score = 277 bits (709), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 135/228 (59%), Positives = 171/228 (75%), Gaps = 9/228 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN++G+S S+VRTSSG F+SK +D IV+ IE +++ WTFLP ENGE +Q+L YEHGQ
Sbjct: 73 VADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQ 132
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE------VSQSRDGNWS 115
KY+ HFD+F DK+N GGHRIATVL+YLS+V KGGETVFP+++ +S+++D + S
Sbjct: 133 KYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQEFSRRSLSENKD-DLS 191
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
+CA++G AVKP KG+ALLFF+L DA D SLHG CPVIEGEKWSATKWIHV +FDK
Sbjct: 192 DCAKKGIAVKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHVDSFDKIL 251
Query: 176 KEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D +C D + +C WA GEC KNP YMVG+ G CR+SCK C
Sbjct: 252 T--HDGNCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 297
>gi|388520887|gb|AFK48505.1| unknown [Lotus japonicus]
Length = 187
Score = 275 bits (704), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 126/179 (70%), Positives = 150/179 (83%), Gaps = 2/179 (1%)
Query: 47 ENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE- 105
ENGE++QILHYE+G+KYEPH+D+F D+ NQ +GGHRIATVLMYLS V KGGET+FPN+E
Sbjct: 7 ENGESIQILHYENGRKYEPHYDYFHDRANQFMGGHRIATVLMYLSDVGKGGETIFPNAES 66
Query: 106 -VSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATK 164
+SQ +D +WSECA +GYAVKP KGDALLFFSLH +A+TDS SLHGSCPVIEGEKWSATK
Sbjct: 67 KLSQPKDESWSECAHKGYAVKPRKGDALLFFSLHLNATTDSNSLHGSCPVIEGEKWSATK 126
Query: 165 WIHVRNFDKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
WIHV +F+K K+ ++ DC DE+ NC WAK GEC KNPLYM+G K +GYC KSC VC
Sbjct: 127 WIHVSDFEKAIKQDDNGDCTDENENCSRWAKLGECVKNPLYMIGGKGVKGYCMKSCNVC 185
>gi|357128903|ref|XP_003566109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 313
Score = 275 bits (703), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 135/227 (59%), Positives = 162/227 (71%), Gaps = 7/227 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN SGKS SEVRTS G F+SK +D IVA IE +IAAWTFLP ENGE MQ+L Y+ G+
Sbjct: 89 VADNTSGKSTLSEVRTSYGTFISKGKDPIVAGIEDKIAAWTFLPKENGEDMQVLRYKRGE 148
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFP-----NSEVSQSRDGNWSE 116
K EP FDFF D +N GGHR+ATVL+YL+ V +GGETVFP +D SE
Sbjct: 149 KDEPQFDFFTDTVNTVRGGHRVATVLLYLTDVAEGGETVFPLAKDFTDTGLHDKDTTLSE 208
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
CA++G AVKP KGDALLFF+L PDA+TD SLHG C VI+GEKW+ATKWI V +FDK
Sbjct: 209 CAQKGIAVKPRKGDALLFFNLRPDAATDPLSLHGGCTVIKGEKWTATKWIRVASFDKVYH 268
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
P +C D + +CV WA GEC KNP YM+G+ + G+CR+SC VC
Sbjct: 269 MP--GNCSDNNDSCVRWAALGECIKNPPYMIGTAALPGHCRRSCNVC 313
>gi|226494249|ref|NP_001141909.1| uncharacterized protein LOC100274058 [Zea mays]
gi|194706408|gb|ACF87288.1| unknown [Zea mays]
gi|413932757|gb|AFW67308.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
gi|413932758|gb|AFW67309.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
Length = 217
Score = 275 bits (703), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 127/201 (63%), Positives = 155/201 (77%), Gaps = 2/201 (0%)
Query: 25 KAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIA 84
+ +DEIV++IE R+AAWTFLP EN E++Q+L YE GQKY+ HFD+F D+ N +LGG R+A
Sbjct: 15 QPKDEIVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRVA 74
Query: 85 TVLMYLSHVEKGGETVFPNSEVS--QSRDGNWSECARRGYAVKPMKGDALLFFSLHPDAS 142
TVLMYL+ V KGGETVFPN+E S Q +D WSEC+R G AVKP KGDALLFF+LH +A+
Sbjct: 75 TVLMYLTDVNKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVNAT 134
Query: 143 TDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDDCVDEDLNCVVWAKAGECKKN 202
D+ SLHGSCPVIEGEKWSATKWIHVR+FD P D C D+ C WA GEC +N
Sbjct: 135 ADTGSLHGSCPVIEGEKWSATKWIHVRSFDNPPDVRTDAPCSDDKELCPRWAAIGECHRN 194
Query: 203 PLYMVGSKSSRGYCRKSCKVC 223
P YMVG+K + G+CRKSC +C
Sbjct: 195 PTYMVGTKDTLGFCRKSCGIC 215
>gi|224141327|ref|XP_002324025.1| predicted protein [Populus trichocarpa]
gi|222867027|gb|EEF04158.1| predicted protein [Populus trichocarpa]
Length = 239
Score = 272 bits (695), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 129/178 (72%), Positives = 150/178 (84%), Gaps = 2/178 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVA++E+G+S+ S+ RTSSGMF+ K +DEIV IEARIAAWTFLP ENGE +QIL YEHG
Sbjct: 54 MVANDETGESMESQERTSSGMFIFKTEDEIVNGIEARIAAWTFLPEENGEPIQILRYEHG 113
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV--SQSRDGNWSECA 118
QKYE H D+F DK NQ+ GGHR ATVLMYLS V+KGGETVFP SE SQ++D +WS+CA
Sbjct: 114 QKYEAHIDYFVDKANQEEGGHRAATVLMYLSDVKKGGETVFPTSEAEGSQAKDDSWSDCA 173
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
++GYAVKP KGDALLFFSLHPDA+ D SLH SCPVIEGEKWSATKWIHVR+F +P K
Sbjct: 174 KKGYAVKPNKGDALLFFSLHPDATPDPGSLHASCPVIEGEKWSATKWIHVRSFSEPVK 231
>gi|297829156|ref|XP_002882460.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
lyrata]
gi|297328300|gb|EFH58719.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
lyrata]
Length = 299
Score = 272 bits (695), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 134/228 (58%), Positives = 169/228 (74%), Gaps = 9/228 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN++G+S S+VRTSSG F+SK +D IV+ IE +++ WTFLP ENGE +Q+L YE GQ
Sbjct: 75 VADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEPGQ 134
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE------VSQSRDGNWS 115
KY+ HFD+F DK+N GGHRIATVL+YLS+V KGGETVFP+++ +S+++D + S
Sbjct: 135 KYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQEYSRRSLSENKD-DLS 193
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
+CA++G AVKP KG+ALLFF+L DA D SLHG CPVIEGEKWSATKWIHV +FDK
Sbjct: 194 DCAKKGIAVKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHVDSFDKIL 253
Query: 176 KEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D +C D + +C WA GEC KNP YMVG+ G CR SCK C
Sbjct: 254 T--HDGNCTDVNESCERWAVLGECGKNPEYMVGTPELPGNCRHSCKAC 299
>gi|55741082|gb|AAV64222.1| unknown [Zea mays]
Length = 369
Score = 271 bits (694), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 124/172 (72%), Positives = 147/172 (85%), Gaps = 3/172 (1%)
Query: 26 AQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIAT 85
QDE+V IE RI+AWTFLPPENGE++QILHY++G+KYEPH+D+F DK NQ LGGHRIAT
Sbjct: 192 TQDEVVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRIAT 251
Query: 86 VLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDAST 143
VLMYLS+VEKGGET+FPN+E + Q +D WS+CAR GYAVKP+KGDALLFFSLHPDA+T
Sbjct: 252 VLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCARNGYAVKPVKGDALLFFSLHPDATT 311
Query: 144 DSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP-EDDDCVDEDLNCVVWA 194
DS SLHGSCPVIEG+KWSATKWIHVR+FD P K+P D C D+++ C WA
Sbjct: 312 DSDSLHGSCPVIEGQKWSATKWIHVRSFDLPVKQPGSSDGCEDDNILCPQWA 363
>gi|116784858|gb|ABK23496.1| unknown [Picea sitchensis]
Length = 208
Score = 266 bits (681), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 127/210 (60%), Positives = 152/210 (72%), Gaps = 9/210 (4%)
Query: 21 MFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGG 80
MF+ K +D I++ IE +IAAWTFLP ENGE MQ+L YE G+KY+PHFDFF+DK+N GG
Sbjct: 1 MFIPKGKDAIISRIEDKIAAWTFLPKENGEDMQVLRYEPGEKYDPHFDFFQDKVNIVRGG 60
Query: 81 HRIATVLMYLSHVEKGGETVFPNSEVSQSR-------DGNWSECARRGYAVKPMKGDALL 133
HR+ATVLMYL+ V KGGETVFP++E R D S+CA+RG AVKP +GDALL
Sbjct: 61 HRVATVLMYLTDVSKGGETVFPSAEEDTHRRISSIIKDDTLSDCAKRGTAVKPKRGDALL 120
Query: 134 FFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDDCVDEDLNCVVW 193
FFSL A D+ SLH CPVIEGEKWS TKWIHV +FDKP + D+CVD++ C W
Sbjct: 121 FFSLTTQAKPDTRSLHAGCPVIEGEKWSVTKWIHVESFDKPRQ--SSDNCVDQNPRCGEW 178
Query: 194 AKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
A GEC NP+YM+GS G CRKSCKVC
Sbjct: 179 AAYGECNNNPIYMLGSPDLPGACRKSCKVC 208
>gi|224034451|gb|ACN36301.1| unknown [Zea mays]
gi|413945801|gb|AFW78450.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
Length = 295
Score = 261 bits (668), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 130/223 (58%), Positives = 159/223 (71%), Gaps = 16/223 (7%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN SGKS SE D IV IE +IAAWTFLP ENGE +Q+L Y+HG+
Sbjct: 88 VADNMSGKSTLSE-------------DPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGE 134
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNS-EVSQSRDGNWSECARR 120
KYEPH+D+F D +N GGHR ATVL+YL+ V +GGETVFP + E ++D SECA++
Sbjct: 135 KYEPHYDYFTDNVNTVRGGHRYATVLLYLTDVPEGGETVFPLAEEPDDAKDATLSECAQK 194
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPED 180
G AV+P KGDALLFF+L+PD +TDS SLHG CPVI+GEKWSATKWI V +FDK P+
Sbjct: 195 GIAVRPRKGDALLFFNLNPDGTTDSVSLHGGCPVIKGEKWSATKWIRVASFDKVH-HPQ- 252
Query: 181 DDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+C DE+ +C WA GEC KNP YMVG+ + GYCR+SC VC
Sbjct: 253 GNCTDENESCAKWAALGECIKNPEYMVGTTALPGYCRRSCNVC 295
>gi|363543295|ref|NP_001241863.1| prolyl 4-hydroxylase 4 precursor [Zea mays]
gi|347978806|gb|AEP37745.1| prolyl 4-hydroxylase 4 [Zea mays]
gi|414591890|tpg|DAA42461.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
Length = 274
Score = 261 bits (667), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 128/188 (68%), Positives = 149/188 (79%), Gaps = 6/188 (3%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKS+ SEVRTSSGMFL K QD +V+ IE RIAAWTFLP EN E MQ+L YE G
Sbjct: 81 MVADNESGKSVKSEVRTSSGMFLDKRQDPVVSRIEERIAAWTFLPQENAENMQVLRYEPG 140
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
QKYEPHFD+F D++NQ GGHR ATVLMYLS V +GGETVFPN++ SQ +D +SECA
Sbjct: 141 QKYEPHFDYFHDRVNQARGGHRYATVLMYLSTVREGGETVFPNAKGWESQPKDATFSECA 200
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF-DKPEKE 177
+G AVKP+KGDA+LFFSLH D + D SLHGSCPVI GEKWSA KWIHVR++ D+P+
Sbjct: 201 HKGLAVKPVKGDAVLFFSLHADGTPDPLSLHGSCPVIRGEKWSAPKWIHVRSYEDEPQAV 260
Query: 178 ---PEDDD 182
PE+ D
Sbjct: 261 LVLPEETD 268
>gi|6437556|gb|AAF08583.1|AC011623_16 unknown protein [Arabidopsis thaliana]
Length = 278
Score = 259 bits (663), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 128/222 (57%), Positives = 157/222 (70%), Gaps = 18/222 (8%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN++G+S S+VRTSSG F+SK +D IV+ IE +++ WTFLP ENGE +Q+L YEHGQ
Sbjct: 75 VADNDNGESQVSDVRTSSGTFISKGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQ 134
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
KY+ HFD+F DK+N GGHRIATVL+YLS+V KGGETVFP+++V
Sbjct: 135 KYDAHFDYFHDKVNIARGGHRIATVLLYLSNVTKGGETVFPDAQV--------------- 179
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDD 181
+KP KG+ALLFF+L DA D SLHG CPVIEGEKWSATKWIHV +FDK D
Sbjct: 180 -CLKPKKGNALLFFNLQQDAIPDPFSLHGGCPVIEGEKWSATKWIHVDSFDKILT--HDG 236
Query: 182 DCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+C D + +C WA GEC KNP YMVG+ G CR+SCK C
Sbjct: 237 NCTDVNESCERWAVLGECGKNPEYMVGTPEIPGNCRRSCKAC 278
>gi|255085592|ref|XP_002505227.1| predicted protein [Micromonas sp. RCC299]
gi|226520496|gb|ACO66485.1| predicted protein [Micromonas sp. RCC299]
Length = 267
Score = 252 bits (644), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 126/227 (55%), Positives = 158/227 (69%), Gaps = 6/227 (2%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V DN++G+S+ S +RTS GMF + +D+I+ IE RIA WT +P ENGE +Q+L YE GQ
Sbjct: 42 VVDNKTGQSVPSNIRTSDGMFFDRHEDDIIEDIERRIAEWTNVPWENGEGIQVLRYEVGQ 101
Query: 62 KYEPHFDFFRDKMN--QQLGGHRIATVLMYLSHVEKGGETVFPNS-EVSQSRDGNWSECA 118
KYEPH D F DK N + GG R+ATVLMYLS VE+GGETVFP S + D WSECA
Sbjct: 102 KYEPHLDAFSDKFNTEESKGGQRMATVLMYLSDVEEGGETVFPRSVDKPHKGDPKWSECA 161
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE--K 176
+RG AVK KGDALLF+SL D++ D SLHG CPVI+G KWSATKW+H+++FD K
Sbjct: 162 QRGVAVKARKGDALLFWSLDIDSNVDELSLHGGCPVIKGTKWSATKWMHLKSFDTANSFK 221
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
PE C D + C WA GEC+KNP YM+G+ + GYC ++C C
Sbjct: 222 FPE-GVCDDVNEQCEGWASTGECEKNPKYMIGNGKTDGYCVRACGKC 267
>gi|384246332|gb|EIE19822.1| hypothetical protein COCSUDRAFT_25518 [Coccomyxa subellipsoidea
C-169]
Length = 347
Score = 252 bits (644), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 129/232 (55%), Positives = 156/232 (67%), Gaps = 10/232 (4%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V DN++GKSI S VRTS+G F + +DE++ IE RI+ T LP NGE +QILHYE GQ
Sbjct: 120 VVDNDTGKSIDSTVRTSTGTFFGREEDEVIQGIERRISMITHLPEVNGEGLQILHYEDGQ 179
Query: 62 KYEPHFDFFRDKMNQ--QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
KYE H DFF DK N + GG RIATVLMYL+ E+GGETVFP + ++ WSECAR
Sbjct: 180 KYEAHHDFFHDKFNSRPENGGQRIATVLMYLTTAEEGGETVFPMA-ANKVTGPQWSECAR 238
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF-DKPEKEP 178
G AVK +GDALLF+SL P+ TD TSLHGSCP +GEKWSATKWIHV F E++
Sbjct: 239 GGAAVKSRRGDALLFYSLLPNGETDPTSLHGSCPTTKGEKWSATKWIHVGPFGGSSEQQR 298
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCKPSSVSS 230
+C+D D C WA GECKKNP YM+ S CR SC C P+S ++
Sbjct: 299 AKGECIDADERCSGWAADGECKKNPGYMMSS------CRLSCHTCTPASKTT 344
>gi|159478673|ref|XP_001697425.1| predicted protein [Chlamydomonas reinhardtii]
gi|158274304|gb|EDP00087.1| predicted protein [Chlamydomonas reinhardtii]
Length = 297
Score = 248 bits (634), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 123/228 (53%), Positives = 158/228 (69%), Gaps = 16/228 (7%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V DNESGKS+ SE+RTS+G + +K +D +++ IE R+A T +P EN E +Q+LHY GQ
Sbjct: 79 VVDNESGKSVDSEIRTSTGTWFAKGEDSVISKIEKRVAQVTMIPLENHEGLQVLHYHDGQ 138
Query: 62 KYEPHFDFFRDKMNQ--QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
KYEPH+D+F D +N + GG R+ T+LMYL+ VE+GGETV PN+E + DG WSECA+
Sbjct: 139 KYEPHYDYFHDPVNAGPEHGGQRVVTMLMYLTTVEEGGETVLPNAEQKVTGDG-WSECAK 197
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK--- 176
RG AVKP+KGDAL+F+SL PD S D SLHGSCP ++G+KWSATKWIHV +K
Sbjct: 198 RGLAVKPIKGDALMFYSLKPDGSNDPASLHGSCPTLKGDKWSATKWIHVAPIGGKKKLNL 257
Query: 177 -EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
PE C DED C WA GEC+KNP +M C++SCK C
Sbjct: 258 GTPE---CHDEDERCQEWAFFGECEKNPGFM------DAQCKRSCKKC 296
>gi|302845234|ref|XP_002954156.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
nagariensis]
gi|300260655|gb|EFJ44873.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
nagariensis]
Length = 309
Score = 246 bits (629), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 123/226 (54%), Positives = 156/226 (69%), Gaps = 10/226 (4%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V DN SGKS+ SE+RTS+G +L+K +DEI++ IE R+A T +P EN E +Q+LHY GQ
Sbjct: 91 VVDNASGKSVDSEIRTSTGAWLAKGEDEIISRIEKRVAQVTMIPLENHEGLQVLHYHDGQ 150
Query: 62 KYEPHFDFFRDKMNQ--QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
KYEPH+D+F D +N + GG R+ TVLMYL+ VE+GGETV P+++ S +G WSECA+
Sbjct: 151 KYEPHYDYFHDPVNASPEHGGQRVVTVLMYLTTVEEGGETVLPHADQKVSGEG-WSECAK 209
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF-DKPEKEP 178
RG AVKP+KGDAL+F+SL PD S D SLHGSCP ++G+KWSATKWIHV K
Sbjct: 210 RGLAVKPVKGDALMFYSLKPDGSNDPASLHGSCPTLKGDKWSATKWIHVGPIGGKKAVSL 269
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCK 224
+C D C WA GEC+KNP YM R C +SCK CK
Sbjct: 270 GTPECHDSMEQCTEWAFFGECEKNPGYM------RENCARSCKTCK 309
>gi|307106819|gb|EFN55064.1| hypothetical protein CHLNCDRAFT_35843 [Chlorella variabilis]
Length = 287
Score = 237 bits (605), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 120/224 (53%), Positives = 152/224 (67%), Gaps = 13/224 (5%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V DN++GKS+ S VRTSSG FL++ +DE+V +IE RI+ T +P ENGEA+QIL Y GQ
Sbjct: 75 VVDNKTGKSMDSTVRTSSGTFLARGEDEVVRAIEKRISLVTMIPEENGEAIQILKYVDGQ 134
Query: 62 KYEPHFDFFRDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
KYEPH D+F DK N + GG R+AT+LMYLS E+GGETVFP +E +G WSECAR
Sbjct: 135 KYEPHTDYFHDKYNSRTENGGQRVATILMYLSTPEEGGETVFPYAEKKVEGEG-WSECAR 193
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPE 179
+G AVK +KG ALLF+SL P+ D S HGSCP + GEKWSAT+WIHV F +
Sbjct: 194 KGLAVKAVKGSALLFYSLKPNGEEDQASTHGSCPTLAGEKWSATRWIHVGAFQPGGAK-- 251
Query: 180 DDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
C DE+ C WA GEC+ NP +M + C+KSC++C
Sbjct: 252 --GCKDENEKCEEWAVMGECQNNPAFM------KSNCKKSCELC 287
>gi|145345764|ref|XP_001417370.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577597|gb|ABO95663.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 328
Score = 234 bits (596), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 114/233 (48%), Positives = 154/233 (66%), Gaps = 7/233 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D +G S+ S++RTSSGMFL + +D++VASIE RIA+WT +P +GE Q+L YE GQ
Sbjct: 93 VVDASNGGSVPSDIRTSSGMFLLRGEDDVVASIERRIASWTHVPESHGEGFQVLRYEFGQ 152
Query: 62 KYEPHFDFFRDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG--NWSEC 117
+Y PHFD+F+D+ NQ+ GG R+ATVLMYL+ VE+GGET+FP++E + G + S C
Sbjct: 153 EYRPHFDYFQDEFNQKREKGGQRVATVLMYLTDVEEGGETIFPDAEAGANPGGGDDASSC 212
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE 177
A AVKP KGDAL F SLH + ++D+ S H CPV++G K+SATKW+HV +
Sbjct: 213 AAGKLAVKPRKGDALFFRSLHHNGTSDAMSSHAGCPVVKGVKFSATKWMHVAPIEDSATA 272
Query: 178 P---EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCKPSS 227
E C D + C WA +GEC KNP +MVG + G C +SC C P +
Sbjct: 273 SVRFEPGVCKDVNAACEGWASSGECTKNPSFMVGRGRANGNCMRSCGACPPGT 325
>gi|307102962|gb|EFN51227.1| hypothetical protein CHLNCDRAFT_28161 [Chlorella variabilis]
Length = 300
Score = 232 bits (591), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 108/226 (47%), Positives = 155/226 (68%), Gaps = 7/226 (3%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
DN G+S+ S++RTS GMF + +DE+V +E R++ W+ +PP +GE +Q+L YE+G++Y
Sbjct: 57 DNPGGESV-SDIRTSYGMFFDRGEDEVVREVERRLSEWSLIPPGHGEGIQVLRYENGEEY 115
Query: 64 EPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV--SQSRDGNWSECARRG 121
+PHFD+F D ++ Q GG+R+AT+LMYL+ E GGETVFPN + Q+ + +SECA +G
Sbjct: 116 KPHFDYFFDNLSVQNGGNRLATILMYLAEPEFGGETVFPNVKAPPEQTLEAGYSECATQG 175
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF----DKPEKE 177
AVKP KGDA+LFFSL + + D SLHGSCP ++G K++ATKW HV ++ ++
Sbjct: 176 LAVKPRKGDAVLFFSLRTEGTLDKGSLHGSCPTLKGFKFAATKWYHVAHYAMGGERAPVL 235
Query: 178 PEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
P C DE CV WA+ GEC+ NP +MVG+K G C +C C
Sbjct: 236 PASAGCKDEKDACVGWAEGGECESNPGFMVGTKEQPGACLLACGRC 281
>gi|307111754|gb|EFN59988.1| hypothetical protein CHLNCDRAFT_49444 [Chlorella variabilis]
Length = 344
Score = 228 bits (581), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 113/234 (48%), Positives = 153/234 (65%), Gaps = 12/234 (5%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + +SGKS VRTS G FL++ D ++A IEARIA WT +P NGE +Q+L YEHGQ
Sbjct: 100 VVETDSGKSKIDNVRTSKGTFLNRGHDSVIADIEARIAKWTLMPAGNGEGLQVLKYEHGQ 159
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARR 120
+YE H+D+F K GG+R TVLMYL+ VE+GGET FPN +G +SECAR+
Sbjct: 160 EYEGHYDYFFHKAGTANGGNRYLTVLMYLNDVEEGGETCFPNIPSPNGDNGPEFSECARK 219
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF----DKP-- 174
A KP KG+A+LF S+ P + SLH +CPVI+G KWSA KW+HV ++ +KP
Sbjct: 220 VLAAKPKKGNAVLFHSIKPTGELERRSLHTACPVIKGVKWSAPKWVHVGHYAVGGEKPQH 279
Query: 175 -EKEPEDD----DCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
++ P+ D +C ++D C WA GEC+KNP++MVG+K G+C K+C C
Sbjct: 280 IQQIPQGDSTYPECKNKDAACDSWAGNGECEKNPVFMVGTKQRPGHCIKACGKC 333
>gi|356517655|ref|XP_003527502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
Length = 290
Score = 226 bits (576), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 106/173 (61%), Positives = 135/173 (78%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+++GKS S VRTSSGMFL + +D+IV +IE RIA +TF+P ENGE +QILHYE GQ
Sbjct: 116 VVDSKTGKSTESRVRTSSGMFLKRGKDKIVQNIEKRIADFTFIPEENGEGLQILHYEVGQ 175
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
KYEPH+D+F D+ N + GG RIATVLMYLS VE+GGETVFP + + S W S+CA
Sbjct: 176 KYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVEEGGETVFPAANANFSSVPWWNDLSQCA 235
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
R+G +VKP GDALLF+S+ PDA+ D +SLHG CPVI+G KWS+TKW+H+R +
Sbjct: 236 RKGLSVKPKMGDALLFWSMRPDATLDPSSLHGGCPVIKGNKWSSTKWMHLREY 288
>gi|242047774|ref|XP_002461633.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
gi|241925010|gb|EER98154.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
Length = 275
Score = 226 bits (575), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 111/174 (63%), Positives = 131/174 (75%), Gaps = 4/174 (2%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVA N S S + RTSSGMFL K QD +V+ IE RIAAWT LP EN E MQI Y+HG
Sbjct: 81 MVAHNRS--SYYRQTRTSSGMFLRKRQDPVVSRIEERIAAWTLLPRENVEKMQIQRYQHG 138
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
QKY+PHFD+F DK++ GG R ATVLMYLS V+KGGETVFP ++ SQ +D +SECA
Sbjct: 139 QKYDPHFDYFDDKIHHTRGGPRYATVLMYLSTVDKGGETVFPKAKGWESQPKDDTFSECA 198
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
+G AVKP+KGDA+LFFSLH D D +LHGSCPVI+GEKWSA WIHVR+F+
Sbjct: 199 HKGLAVKPVKGDAVLFFSLHVDGGPDPLTLHGSCPVIQGEKWSAPNWIHVRSFE 252
>gi|308799217|ref|XP_003074389.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116000560|emb|CAL50240.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 294
Score = 224 bits (571), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 115/226 (50%), Positives = 145/226 (64%), Gaps = 4/226 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D +G +SE+RTSSGMFL +A+D+++ +IEARIAAWT +P +GE Q+L YE Q
Sbjct: 63 VVDASTGGDASSEIRTSSGMFLGRAEDDVIEAIEARIAAWTHVPESHGEGFQVLRYEKHQ 122
Query: 62 KYEPHFDFFRDKMN--QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+Y H+D+F DK N ++ GG R+ TVLMYLS VE+GGETVFP E SECAR
Sbjct: 123 EYRAHYDYFHDKFNVKREKGGQRMGTVLMYLSDVEEGGETVFPKFEDGTPAGSEASECAR 182
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPE 179
AV+P KGDAL F SL D D+ S H CPVI G K+SATKW+HV +
Sbjct: 183 NKLAVRPRKGDALFFRSLRHDGVPDTFSEHAGCPVIRGVKFSATKWMHVSPIEDGSNGLL 242
Query: 180 DDDCVDEDLN--CVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
V +DL+ CV WAK+GEC+KN YMVG S+G C +SC C
Sbjct: 243 LPPGVCKDLHAACVAWAKSGECEKNKNYMVGRGRSKGNCMRSCGAC 288
>gi|303282201|ref|XP_003060392.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457863|gb|EEH55161.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 369
Score = 224 bits (571), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 118/229 (51%), Positives = 147/229 (64%), Gaps = 11/229 (4%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G+ + S +RTS GMF + +D++V ++E RI+AWT LP ENGE MQ+L Y GQ
Sbjct: 113 VVDTDTGEGVPSAIRTSDGMFFDRGEDDVVDAVERRISAWTRLPTENGEGMQVLRYAGGQ 172
Query: 62 KYEPHFDFFRDKMN--QQLGGHRIATVLMYLSHVEKGGETVFPNSEVS-QSRDGNWSECA 118
KY+ H D F DK N GG R+ATVLMYL+ V+ GGETVFP + D +S CA
Sbjct: 173 KYDAHLDAFVDKFNADDAHGGQRVATVLMYLNDVDDGGETVFPETTAKPHVGDERYSACA 232
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPV-IEGEKWSATKWIHVRNFDKPEKE 177
RRG AVKP +GDALLF+S+ T + SLHG CPV G KWS TKWIH F + K
Sbjct: 233 RRGVAVKPRRGDALLFWSMD---ETFTRSLHGGCPVGAGGVKWSMTKWIHKGAFSRGHKM 289
Query: 178 --PEDDDCVDEDLNCVVWAKAGECKKNPLYMVG-SKSSRGYCRKSCKVC 223
PE C DED NC WAK+GEC+KNP YM G + + G+C SC C
Sbjct: 290 KFPE-GVCDDEDANCAGWAKSGECEKNPAYMTGDGRENDGHCAFSCGTC 337
>gi|302773668|ref|XP_002970251.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
gi|300161767|gb|EFJ28381.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
Length = 256
Score = 224 bits (571), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 106/173 (61%), Positives = 135/173 (78%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V DN++GKS S VRTSSG FL + QDEI++ IE RIA +TF+P E+GE +Q+LHYE GQ
Sbjct: 83 VVDNQTGKSKDSRVRTSSGTFLRRGQDEIISRIEERIAKFTFIPKEHGEGLQVLHYEVGQ 142
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
KY+ H D+F DK+N + GG R+ATVLMYLS VE+GGETVFP+++V+ S W SECA
Sbjct: 143 KYDAHHDYFHDKVNTKNGGQRVATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECA 202
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
++G +VKP KGDALLF+S+ PDA D SLHG CPVI+G KWSATKW+H+R +
Sbjct: 203 KKGVSVKPRKGDALLFWSMSPDAELDPFSLHGGCPVIKGNKWSATKWMHLREY 255
>gi|326489721|dbj|BAK01841.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 315
Score = 223 bits (567), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 102/173 (58%), Positives = 134/173 (77%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+E+GKS S VRTSSGMFL + +D+++ +IE RIA +TF+P E+GE +Q+LHYE GQ
Sbjct: 141 VVDSETGKSKDSRVRTSSGMFLQRGRDKVIRAIERRIADYTFIPAEHGEGLQVLHYEVGQ 200
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
KYEPHFD+F D+ N + GG R+AT+LMYLS +E+GGET+FP++ V+ S SECA
Sbjct: 201 KYEPHFDYFLDEFNTKNGGQRMATILMYLSDIEEGGETIFPDANVNSSSLPWYNELSECA 260
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
R+G AVKP GDALLF+S+ PDA+ D SLHG CPVI+G KWS+TKW+HV +
Sbjct: 261 RKGLAVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWLHVGEY 313
>gi|302793288|ref|XP_002978409.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
gi|300153758|gb|EFJ20395.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
Length = 256
Score = 223 bits (567), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 105/173 (60%), Positives = 134/173 (77%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V DN++GKS S VRTSSG FL + QDEI++ IE RIA +TF+P E+GE +Q+LHYE GQ
Sbjct: 83 VVDNQTGKSKDSRVRTSSGTFLRRGQDEIISRIEERIAKFTFIPKEHGEGLQVLHYEVGQ 142
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
KY+ H D+F DK+N + GG R+ATVLMYLS VE+GGETVFP+++V+ S W SEC
Sbjct: 143 KYDAHHDYFHDKVNTKNGGQRVATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECG 202
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
++G +VKP KGDALLF+S+ PDA D SLHG CPVI+G KWSATKW+H+R +
Sbjct: 203 KKGVSVKPRKGDALLFWSMSPDAELDPFSLHGGCPVIKGNKWSATKWMHLREY 255
>gi|302834449|ref|XP_002948787.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
nagariensis]
gi|300265978|gb|EFJ50167.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
nagariensis]
Length = 329
Score = 222 bits (566), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 115/238 (48%), Positives = 151/238 (63%), Gaps = 17/238 (7%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V+D +G+ S++RTSSGMF ++ ++++V IE R+A WT LP ENGE +Q+L YE Q
Sbjct: 87 VSDATTGEGGVSDIRTSSGMFYTRGENDVVKRIETRLAMWTMLPVENGEGIQVLRYEKTQ 146
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV--SQSRDGNWSECAR 119
KY+PH D+F + GG+R+ATVLMYL+ E+GGETVFP V Q+R N+SEC
Sbjct: 147 KYDPHHDYFSFEGRDANGGNRMATVLMYLATPEEGGETVFPKIPVPAGQTR-ANFSECGM 205
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK-PEKEP 178
+G AVKP+KGDA+LF+S+ PD + SLHGSCPVI G KWSATKWIHV + EK
Sbjct: 206 KGLAVKPVKGDAVLFWSIRPDGRFEPGSLHGSCPVIRGVKWSATKWIHVGPYSMGAEKAV 265
Query: 179 E-------------DDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
E C++ C WA++GEC+ NP YMVG S G C +C C
Sbjct: 266 EVTRVIYAPPPPPAVPGCINTHKLCDHWAESGECESNPGYMVGQLGSPGACNLACNRC 323
>gi|48716447|dbj|BAD23054.1| putative prolyl 4-hydroxylase [Oryza sativa Japonica Group]
Length = 310
Score = 222 bits (566), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 103/173 (59%), Positives = 134/173 (77%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +GKS S VRTSSGMFL + +D+++ +IE RIA +TF+P E+GE +Q+LHYE GQ
Sbjct: 136 VVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQ 195
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
KYEPHFD+F D+ N + GG R+AT+LMYLS VE+GGET+FP++ V+ S SECA
Sbjct: 196 KYEPHFDYFLDEYNTKNGGQRMATLLMYLSDVEEGGETIFPDANVNSSSLPWYNELSECA 255
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
R+G AVKP GDALLF+S+ PDA+ D SLHG CPVI+G KWS+TKW+HVR +
Sbjct: 256 RKGLAVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHVREY 308
>gi|159794881|pdb|2JIJ|A Chain A, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
gi|159794882|pdb|2JIJ|B Chain B, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
gi|159794883|pdb|2JIJ|C Chain C, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
Length = 233
Score = 221 bits (564), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 101/169 (59%), Positives = 131/169 (77%), Gaps = 3/169 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V DNESGKS+ SE+RTS+G + +K +D +++ IE R+A T +P EN E +Q+LHY GQ
Sbjct: 59 VVDNESGKSVDSEIRTSTGTWFAKGEDSVISKIEKRVAQVTMIPLENHEGLQVLHYHDGQ 118
Query: 62 KYEPHFDFFRDKMNQ--QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
KYEPH+D+F D +N + GG R+ T+LMYL+ VE+GGETV PN+E + DG WSECA+
Sbjct: 119 KYEPHYDYFHDPVNAGPEHGGQRVVTMLMYLTTVEEGGETVLPNAEQKVTGDG-WSECAK 177
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
RG AVKP+KGDAL+F+SL PD S D SLHGSCP ++G+KWSATKWIHV
Sbjct: 178 RGLAVKPIKGDALMFYSLKPDGSNDPASLHGSCPTLKGDKWSATKWIHV 226
>gi|159794879|pdb|2JIG|A Chain A, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
Dicarboxylate
gi|159794880|pdb|2JIG|B Chain B, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
Dicarboxylate
Length = 224
Score = 221 bits (564), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 101/169 (59%), Positives = 131/169 (77%), Gaps = 3/169 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V DNESGKS+ SE+RTS+G + +K +D +++ IE R+A T +P EN E +Q+LHY GQ
Sbjct: 50 VVDNESGKSVDSEIRTSTGTWFAKGEDSVISKIEKRVAQVTMIPLENHEGLQVLHYHDGQ 109
Query: 62 KYEPHFDFFRDKMNQ--QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
KYEPH+D+F D +N + GG R+ T+LMYL+ VE+GGETV PN+E + DG WSECA+
Sbjct: 110 KYEPHYDYFHDPVNAGPEHGGQRVVTMLMYLTTVEEGGETVLPNAEQKVTGDG-WSECAK 168
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
RG AVKP+KGDAL+F+SL PD S D SLHGSCP ++G+KWSATKWIHV
Sbjct: 169 RGLAVKPIKGDALMFYSLKPDGSNDPASLHGSCPTLKGDKWSATKWIHV 217
>gi|241913390|pdb|3GZE|A Chain A, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913391|pdb|3GZE|B Chain B, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913392|pdb|3GZE|C Chain C, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913393|pdb|3GZE|D Chain D, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
Length = 225
Score = 221 bits (564), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 101/169 (59%), Positives = 131/169 (77%), Gaps = 3/169 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V DNESGKS+ SE+RTS+G + +K +D +++ IE R+A T +P EN E +Q+LHY GQ
Sbjct: 51 VVDNESGKSVDSEIRTSTGTWFAKGEDSVISKIEKRVAQVTMIPLENHEGLQVLHYHDGQ 110
Query: 62 KYEPHFDFFRDKMNQ--QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
KYEPH+D+F D +N + GG R+ T+LMYL+ VE+GGETV PN+E + DG WSECA+
Sbjct: 111 KYEPHYDYFHDPVNAGPEHGGQRVVTMLMYLTTVEEGGETVLPNAEQKVTGDG-WSECAK 169
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
RG AVKP+KGDAL+F+SL PD S D SLHGSCP ++G+KWSATKWIHV
Sbjct: 170 RGLAVKPIKGDALMFYSLKPDGSNDPASLHGSCPTLKGDKWSATKWIHV 218
>gi|145345836|ref|XP_001417405.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577632|gb|ABO95698.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 330
Score = 221 bits (564), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 113/245 (46%), Positives = 155/245 (63%), Gaps = 27/245 (11%)
Query: 6 ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
E+G S+ S++RTS+GMFL K QD+IV +IE RIA + P +NGE MQIL Y+ GQKY+P
Sbjct: 82 EAGDSVPSDIRTSAGMFLRKGQDKIVKAIEERIARLSGTPVDNGEGMQILRYDVGQKYDP 141
Query: 66 HFDFFRDKMN--QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE------- 116
HFD+F DK+N + GG R+AT+L+YL +KGGET FPN+++ QS + + E
Sbjct: 142 HFDYFHDKVNPAPKRGGQRLATMLIYLVDTDKGGETTFPNAKLPQSFEADEPENPFASHI 201
Query: 117 ----CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
CA++G VK ++GDA+LFFS+ D D SLHG+CPVIEG+KW+A KWI V FD
Sbjct: 202 EHTDCAKKGIPVKSVRGDAILFFSMTQDGVLDRGSLHGACPVIEGQKWTAVKWIRVGKFD 261
Query: 173 ----------KPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSR----GYCRK 218
K + +++ CVD+ C WA G C+ NP +M + S+R C K
Sbjct: 262 GNYQEEIPMPKLSRRTDEEPCVDDWDECAKWASQGWCELNPEFMTTADSARDSQSAACAK 321
Query: 219 SCKVC 223
SC +C
Sbjct: 322 SCGLC 326
>gi|357137804|ref|XP_003570489.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 318
Score = 221 bits (564), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 102/173 (58%), Positives = 133/173 (76%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +GKS S VRTSSGMFL + +D+++ +IE RIA +TF+P E+GE +Q+LHYE GQ
Sbjct: 144 VVDSTTGKSKDSRVRTSSGMFLRRGRDKVIRAIERRIADYTFIPAEHGEGLQVLHYEVGQ 203
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
KYEPHFD+F D+ N + GG R+AT+LMYLS VE+GGET+FP++ V+ S SECA
Sbjct: 204 KYEPHFDYFLDEFNTKNGGQRMATILMYLSDVEEGGETIFPDANVNSSSLPWHNELSECA 263
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
R+G AVKP GDALLF+S++PDA+ D SLHG CPVI G KWS+TKW+HV +
Sbjct: 264 RKGLAVKPKMGDALLFWSMNPDATLDPLSLHGGCPVIRGNKWSSTKWMHVGEY 316
>gi|326495334|dbj|BAJ85763.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 300
Score = 220 bits (561), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 105/173 (60%), Positives = 131/173 (75%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +G S S VRTSSG FL + QD+IV +IE RI+ +TF+P ENGE +Q+LHYE GQ
Sbjct: 126 VVDSATGGSKDSRVRTSSGTFLRRGQDKIVRTIEKRISDFTFIPVENGEGLQVLHYEVGQ 185
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD---GNWSECA 118
KYEPHFD+F D N + GG RIATVLMYLS VE+GGETVFP+++V+ S SECA
Sbjct: 186 KYEPHFDYFHDDFNTKNGGQRIATVLMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECA 245
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+RG +VKP GDALLF+S+ PD + D TSLHG CPVI+G+KWS+TKWI V +
Sbjct: 246 KRGISVKPKMGDALLFWSMRPDGTLDPTSLHGGCPVIKGDKWSSTKWIRVHEY 298
>gi|168002780|ref|XP_001754091.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694645|gb|EDQ80992.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 214
Score = 220 bits (560), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 100/173 (57%), Positives = 134/173 (77%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+++GKS S +RTSSG FL + QD ++ IE RIA +TF+P E GE +Q+L Y+ +
Sbjct: 40 VVDSDTGKSKDSRLRTSSGTFLMRGQDPVIKRIEKRIADFTFIPAEQGEGLQVLQYKESE 99
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
KYEPH+D+F D N + GG RIATVLMYLS+VE+GGETVFP ++V+++ +W SECA
Sbjct: 100 KYEPHYDYFHDAYNTKNGGQRIATVLMYLSNVEEGGETVFPAAQVNKTEVPDWDKLSECA 159
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
++G +V+P GDALLF+S+ PDA+ DSTSLHG CPVI+G KWSATKW+HV N+
Sbjct: 160 QKGLSVRPRMGDALLFWSMKPDATLDSTSLHGGCPVIKGTKWSATKWLHVENY 212
>gi|168060785|ref|XP_001782374.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666166|gb|EDQ52828.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 211
Score = 219 bits (559), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 104/173 (60%), Positives = 128/173 (73%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +GKS S VRTSSG FL + QD I+ IE RIA +TF+P E GE +Q+L Y +
Sbjct: 39 VIDSATGKSKDSRVRTSSGTFLVRGQDHIIKRIEKRIADFTFIPVEQGEGLQVLQYRESE 98
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
KYEPH+D+F D N + GG RIATVLMYLS VEKGGETVFP S+V+ S +W SECA
Sbjct: 99 KYEPHYDYFHDAFNTKNGGQRIATVLMYLSDVEKGGETVFPASKVNASEVPDWDQRSECA 158
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+RG +V+P GDALLF+S+ PDA D TSLHG+CPVI+G KWSATKW+HV +
Sbjct: 159 KRGLSVRPRMGDALLFWSMKPDAKLDPTSLHGACPVIQGTKWSATKWLHVEKY 211
>gi|388520325|gb|AFK48224.1| unknown [Lotus japonicus]
Length = 188
Score = 219 bits (558), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 98/173 (56%), Positives = 134/173 (77%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+++GKS+ S VRTSSGMFL + +D+++ +IE RIA + F+P ENGE +Q+LHYE GQ
Sbjct: 14 VVDSQTGKSVGSRVRTSSGMFLKRGKDKVIQTIEKRIADFAFIPVENGEGLQVLHYEVGQ 73
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
KYEPH+D+F D+ N + GG RIATVLMYLS VE+GGET+FP ++ + S + S CA
Sbjct: 74 KYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVEEGGETIFPAAKANFSSVPWYNDLSVCA 133
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
++G +VKP +GDALLF+S+ PDA+ D +SLHG CPVI G KWS+TKW+H+ +
Sbjct: 134 KKGLSVKPKRGDALLFWSIRPDATLDPSSLHGGCPVIRGNKWSSTKWMHLEEY 186
>gi|357146834|ref|XP_003574128.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 306
Score = 218 bits (556), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 102/173 (58%), Positives = 131/173 (75%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +G S S VRTSSG FL + QD+++ +IE RI+ +TF+P ENGE +Q+LHYE GQ
Sbjct: 132 VVDSATGGSKDSRVRTSSGTFLRRGQDKVIRTIEKRISDFTFIPAENGEGLQVLHYEVGQ 191
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD---GNWSECA 118
KYEPHFD+F D N + GG RIAT+LMYLS VE+GGETVFP+++V+ S SECA
Sbjct: 192 KYEPHFDYFHDDFNTKNGGQRIATLLMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECA 251
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+RG +VKP GDALLF+S+ PD + D TSLHG CPVI+G+KWS+TKWI V +
Sbjct: 252 KRGISVKPKMGDALLFWSMRPDGTLDPTSLHGGCPVIKGDKWSSTKWIRVHEY 304
>gi|357467085|ref|XP_003603827.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492875|gb|AES74078.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 280
Score = 218 bits (555), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 100/173 (57%), Positives = 134/173 (77%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+++GKS S VRTSSGMFL + +D+I+ +IE RIA +TF+P ENGE +Q+LHY G+
Sbjct: 106 VVDSKTGKSTESRVRTSSGMFLKRGKDKIIQNIERRIADFTFIPVENGEGLQVLHYGVGE 165
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
KYEPH+D+F D+ N + GG R+ATVLMYLS VE+GGETVFP ++ + S W SECA
Sbjct: 166 KYEPHYDYFLDEFNTKNGGQRVATVLMYLSDVEEGGETVFPAAKANFSSVPWWNDLSECA 225
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
R+G ++KP GDALLF+S+ PDA+ D++SLHG CPVI G KWS+TKW+H+ +
Sbjct: 226 RKGLSLKPKMGDALLFWSMRPDATLDASSLHGGCPVIVGNKWSSTKWMHLEEY 278
>gi|116788056|gb|ABK24739.1| unknown [Picea sitchensis]
Length = 303
Score = 218 bits (555), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 105/166 (63%), Positives = 128/166 (77%), Gaps = 3/166 (1%)
Query: 9 KSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFD 68
KS S VRTSSGMFL++ QD+ + SIE RIA +TF+P E+GE +Q+LHYE GQKYEPHFD
Sbjct: 136 KSNDSRVRTSSGMFLNRGQDKTIRSIEKRIADFTFIPAEHGEGLQVLHYEVGQKYEPHFD 195
Query: 69 FFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECARRGYAVK 125
+F D+ N + GG RIATVLMYLS VEKGGETVFP S+V+ S W SECA+ G +V+
Sbjct: 196 YFLDEFNTKNGGQRIATVLMYLSDVEKGGETVFPASKVNSSSVPWWDELSECAKAGISVR 255
Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
P GDALLF+S+ PDA D +SLH CPVI+G+KWSATKWIHV +
Sbjct: 256 PRMGDALLFWSMRPDAELDPSSLHAGCPVIQGDKWSATKWIHVGEY 301
>gi|307102963|gb|EFN51228.1| hypothetical protein CHLNCDRAFT_141231 [Chlorella variabilis]
Length = 313
Score = 218 bits (554), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 112/235 (47%), Positives = 147/235 (62%), Gaps = 16/235 (6%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D +G S S++RTS GMFL + D+ VA+IE RIA WT LP NGE +Q+L+Y G+
Sbjct: 69 VVDTATGGSEISDIRTSKGMFLERGHDDTVAAIEERIARWTLLPVGNGEGLQVLNYHPGE 128
Query: 62 KYEPHFDFFRDKMN-QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECAR 119
KY+ D+F DK+N + GG+R ATVLMYL+ VE+GGETVFPN +G ++ECAR
Sbjct: 129 KYD---DYFFDKVNGESNGGNRYATVLMYLNTVEEGGETVFPNIPAPGGDNGPTFTECAR 185
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF-------- 171
R A KP KG A+LF S+ P + SLH +CPV++GEKWSA KWIHV ++
Sbjct: 186 RHLAAKPTKGSAVLFHSIKPSGDLERRSLHTACPVVKGEKWSAPKWIHVGHYAMGGEAAV 245
Query: 172 ---DKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
P+K C D D NC WA GEC+ N ++M+G++ G C KSC C
Sbjct: 246 PVPQHPQKVGNLLGCEDADENCEQWAANGECENNKVFMIGTRDRPGSCVKSCDAC 300
>gi|224033439|gb|ACN35795.1| unknown [Zea mays]
Length = 180
Score = 218 bits (554), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 99/173 (57%), Positives = 133/173 (76%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +GKS S VRTSSGMFL + +D+++ +IE RIA +TF+P ++GE +Q+LHYE GQ
Sbjct: 6 VVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQ 65
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
KYEPHFD+F D+ N + GG RIAT+LMYLS VE+GGET+FP++ V+ S S+CA
Sbjct: 66 KYEPHFDYFLDEFNTKNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCA 125
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+RG +VKP GDALLF+S+ PDA+ D SLHG CPVI+G KWS+TKW+H+ +
Sbjct: 126 KRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHEY 178
>gi|242063586|ref|XP_002453082.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
gi|241932913|gb|EES06058.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
Length = 307
Score = 216 bits (551), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 99/173 (57%), Positives = 132/173 (76%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +GKS S VRTSSGMFL + +D+++ +IE RIA +TF+P ++GE +Q+LHYE GQ
Sbjct: 133 VVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPADHGEGLQVLHYEVGQ 192
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
KYEPHFD+F D+ N + GG R+AT+LMYLS VE+GGET+FP++ V+ S SECA
Sbjct: 193 KYEPHFDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSECA 252
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+RG +VKP GDALLF+S+ PDA+ D SLHG CPVI G KWS+TKW+H+ +
Sbjct: 253 KRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHEY 305
>gi|259490206|ref|NP_001159002.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
gi|195626402|gb|ACG35031.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
gi|347978830|gb|AEP37757.1| prolyl 4-hydroxylase 8 [Zea mays]
gi|347978832|gb|AEP37758.1| prolyl 4-hydroxylase 8-1 [Zea mays]
gi|413939569|gb|AFW74120.1| prolyl 4-hydroxylase alpha-2 subunit isoform 1 [Zea mays]
gi|413939570|gb|AFW74121.1| prolyl 4-hydroxylase alpha-2 subunit isoform 2 [Zea mays]
gi|413939571|gb|AFW74122.1| prolyl 4-hydroxylase alpha-2 subunit isoform 3 [Zea mays]
gi|413939572|gb|AFW74123.1| prolyl 4-hydroxylase alpha-2 subunit isoform 4 [Zea mays]
Length = 307
Score = 216 bits (551), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 99/173 (57%), Positives = 133/173 (76%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +GKS S VRTSSGMFL + +D+++ +IE RIA +TF+P ++GE +Q+LHYE GQ
Sbjct: 133 VVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQ 192
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
KYEPHFD+F D+ N + GG RIAT+LMYLS VE+GGET+FP++ V+ S S+CA
Sbjct: 193 KYEPHFDYFLDEFNTKNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCA 252
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+RG +VKP GDALLF+S+ PDA+ D SLHG CPVI+G KWS+TKW+H+ +
Sbjct: 253 KRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHEY 305
>gi|238007346|gb|ACR34708.1| unknown [Zea mays]
Length = 180
Score = 216 bits (550), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 99/173 (57%), Positives = 131/173 (75%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +GKS S VRTSSGMFL + +D+++ IE RIA +TF+P ++GE +Q+LHYE GQ
Sbjct: 6 VVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQ 65
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
KYEPHFD+F D+ N + GG R+AT+LMYLS VE+GGET+FP++ V+ S SECA
Sbjct: 66 KYEPHFDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETIFPDANVNVSSLPWYNELSECA 125
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+RG +VKP GDALLF+S+ PDA+ D SLHG CPVI G KWS+TKW+H+ +
Sbjct: 126 KRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHEY 178
>gi|159795555|pdb|2V4A|A Chain A, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795556|pdb|2V4A|B Chain B, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795557|pdb|2V4A|C Chain C, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795558|pdb|2V4A|D Chain D, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii
Length = 233
Score = 216 bits (550), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 100/169 (59%), Positives = 128/169 (75%), Gaps = 3/169 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V DNESGKS+ SE+RTS+G + +K +D +++ IE R+A T +P EN E +Q+LHY GQ
Sbjct: 59 VVDNESGKSVDSEIRTSTGTWFAKGEDSVISKIEKRVAQVTXIPLENHEGLQVLHYHDGQ 118
Query: 62 KYEPHFDFFRDKMNQ--QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
KYEPH+D+F D +N + GG R+ T L YL+ VE+GGETV PN+E + DG WSECA+
Sbjct: 119 KYEPHYDYFHDPVNAGPEHGGQRVVTXLXYLTTVEEGGETVLPNAEQKVTGDG-WSECAK 177
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
RG AVKP+KGDAL F+SL PD S D SLHGSCP ++G+KWSATKWIHV
Sbjct: 178 RGLAVKPIKGDALXFYSLKPDGSNDPASLHGSCPTLKGDKWSATKWIHV 226
>gi|115482738|ref|NP_001064962.1| Os10g0497800 [Oryza sativa Japonica Group]
gi|78708853|gb|ABB47828.1| prolyl 4-hydroxylase alpha subunit, putative, expressed [Oryza
sativa Japonica Group]
gi|113639571|dbj|BAF26876.1| Os10g0497800 [Oryza sativa Japonica Group]
gi|215767852|dbj|BAH00081.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218184821|gb|EEC67248.1| hypothetical protein OsI_34188 [Oryza sativa Indica Group]
Length = 321
Score = 216 bits (549), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 103/173 (59%), Positives = 131/173 (75%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D +G S S VRTSSGMFL + QD+I+ +IE RI+ +TF+P ENGE +Q+LHYE GQ
Sbjct: 147 VVDASTGGSKDSRVRTSSGMFLGRGQDKIIRTIEKRISDYTFIPVENGEGLQVLHYEVGQ 206
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD---GNWSECA 118
KYEPHFD+F D+ N + GG RIAT+LMYLS VE+GGET+FP+S+ + S SECA
Sbjct: 207 KYEPHFDYFHDEFNTKNGGQRIATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECA 266
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
++G AVKP GDALLF+S+ PD S D+TSLHG CPVI+G KWS+TKW+ V +
Sbjct: 267 KKGLAVKPKMGDALLFWSMRPDGSLDATSLHGGCPVIKGNKWSSTKWMRVHEY 319
>gi|303285562|ref|XP_003062071.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226456482|gb|EEH53783.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 522
Score = 216 bits (549), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 114/244 (46%), Positives = 148/244 (60%), Gaps = 23/244 (9%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+VAD KS S +RTS+GMFL+K Q V +E R+AA LP ENGE MQIL YEHG
Sbjct: 266 VVADG-GKKSTKSGIRTSAGMFLTKGQTPTVRMVEERVAAAVGLPEENGEGMQILRYEHG 324
Query: 61 QKYEPHFDFFRDKMN--QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQS-----RDGN 113
QKY+PH+D+F DK+N GG R+AT+L+YL E+GGET+FPN++ + +DG
Sbjct: 325 QKYDPHYDYFHDKINPSPNRGGQRMATMLIYLKDTEEGGETIFPNAKKPEGFHDGEKDGA 384
Query: 114 WSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
+S+CA+RG VK +GDA+LF+SL D D SLHG+CPV+ GEKW+A KWI V FD
Sbjct: 385 FSDCAKRGLPVKSKRGDAVLFWSLTSDYKLDEGSLHGACPVLRGEKWTAVKWIRVAKFDG 444
Query: 174 ------PEKEPEDDD---------CVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRK 218
P D CVDE C WA+ G C++NP +M G +R
Sbjct: 445 RFTGELPMPSLTRGDRAAVDATARCVDEWDECAEWARKGWCERNPEFMTGVNGARDSKGP 504
Query: 219 SCKV 222
+C V
Sbjct: 505 ACAV 508
>gi|255072321|ref|XP_002499835.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
gi|226515097|gb|ACO61093.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
Length = 454
Score = 215 bits (547), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 114/244 (46%), Positives = 154/244 (63%), Gaps = 25/244 (10%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+V D SG S+ S++RTS+GMFL + QD V +IE RIAA + LP NGE +QIL YE+G
Sbjct: 207 VVGDKGSG-SMVSKIRTSAGMFLGRGQDPTVRAIEERIAAASGLPEPNGEGLQILRYENG 265
Query: 61 QKYEPHFDFFRDKMNQ--QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGN----- 113
QKY+PHFD+F D++N + GG R+AT+L+YL +GGET+FPN + D +
Sbjct: 266 QKYDPHFDYFHDQVNSSPRRGGQRMATMLIYLEDTTEGGETIFPNGVRPEDWDADEPGNH 325
Query: 114 --WSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
WS+CA++G VK +GDA+LF+SL D + D+ SLHG+CPVI GEKW+A KWI V F
Sbjct: 326 NSWSDCAKKGIPVKSHRGDAVLFWSLKEDYTLDNGSLHGACPVIAGEKWTAVKWIRVAKF 385
Query: 172 DKPEKEP-----------EDDDCVDEDLNCVVWAKAGECKKNPLYMV---GSKSSRG-YC 216
D +P +C+DE C WAK G C +NP +M G++ SRG C
Sbjct: 386 DGGFTDPLPMPALARSDRTKGECLDEWDECGEWAKKGWCDRNPSFMTGLEGARDSRGPAC 445
Query: 217 RKSC 220
+SC
Sbjct: 446 PQSC 449
>gi|222613083|gb|EEE51215.1| hypothetical protein OsJ_32038 [Oryza sativa Japonica Group]
Length = 222
Score = 215 bits (547), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 103/173 (59%), Positives = 131/173 (75%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D +G S S VRTSSGMFL + QD+I+ +IE RI+ +TF+P ENGE +Q+LHYE GQ
Sbjct: 48 VVDASTGGSKDSRVRTSSGMFLGRGQDKIIRTIEKRISDYTFIPVENGEGLQVLHYEVGQ 107
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD---GNWSECA 118
KYEPHFD+F D+ N + GG RIAT+LMYLS VE+GGET+FP+S+ + S SECA
Sbjct: 108 KYEPHFDYFHDEFNTKNGGQRIATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECA 167
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
++G AVKP GDALLF+S+ PD S D+TSLHG CPVI+G KWS+TKW+ V +
Sbjct: 168 KKGLAVKPKMGDALLFWSMRPDGSLDATSLHGGCPVIKGNKWSSTKWMRVHEY 220
>gi|363543369|ref|NP_001241694.1| prolyl 4-hydroxylase 8-4 [Zea mays]
gi|347978838|gb|AEP37761.1| prolyl 4-hydroxylase 8-4 [Zea mays]
Length = 307
Score = 214 bits (546), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 98/173 (56%), Positives = 133/173 (76%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +GKS S VRTSSGMFL + +++++ +IE RIA +TF+P ++GE +Q+LHYE GQ
Sbjct: 133 VVDSTTGKSKDSRVRTSSGMFLQRGRNKVIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQ 192
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
KYEPHFD+F D+ N + GG RIAT+LMYLS VE+GGET+FP++ V+ S S+CA
Sbjct: 193 KYEPHFDYFLDEFNTKNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCA 252
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+RG +VKP GDALLF+S+ PDA+ D SLHG CPVI+G KWS+TKW+H+ +
Sbjct: 253 KRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHEY 305
>gi|308802438|ref|XP_003078532.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
tauri]
gi|116056985|emb|CAL51412.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
tauri]
Length = 369
Score = 214 bits (546), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 112/248 (45%), Positives = 155/248 (62%), Gaps = 34/248 (13%)
Query: 6 ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
+ G S+ASE+RTS+GMFL K+QD+ V IE RIA + +P +NGE MQIL Y+ GQKY+P
Sbjct: 123 DGGSSVASEIRTSAGMFLRKSQDDTVREIEERIARLSGVPVDNGEGMQILRYDKGQKYDP 182
Query: 66 HFDFFRDKMN--QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGN---------- 113
HFD+F DK+N + GG R+ATVL+YL E+GGET FPN + ++ + +
Sbjct: 183 HFDYFHDKVNPAPKRGGQRVATVLIYLVDTEEGGETTFPNGRLPENFEEDEPDNPFAAHI 242
Query: 114 -WSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
++CA+ G VK ++GDA+LFFS+ D D SLHG+CPVI G+KW+A KW+ V FD
Sbjct: 243 KHTDCAKNGIPVKSVRGDAILFFSMTKDGELDHGSLHGACPVIAGQKWTAVKWLRVAKFD 302
Query: 173 --------------KPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYM--VGSKSSRG-Y 215
+ E+EP CVDE +C WA+ G C++NP +M G++ S
Sbjct: 303 GGFKDELPMIPLTRRTEREP----CVDEWDDCASWARDGWCERNPEFMKFAGARDSHTPA 358
Query: 216 CRKSCKVC 223
C KSC +C
Sbjct: 359 CPKSCGLC 366
>gi|226529219|ref|NP_001151238.1| LOC100284871 [Zea mays]
gi|195645242|gb|ACG42089.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
gi|347978812|gb|AEP37748.1| prolyl 4-hydroxylase 5 [Zea mays]
gi|413923983|gb|AFW63915.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
Length = 308
Score = 214 bits (546), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 99/173 (57%), Positives = 131/173 (75%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +GKS S VRTSSGMFL + +D+++ IE RIA +TF+P ++GE +Q+LHYE GQ
Sbjct: 134 VVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQ 193
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
KYEPHFD+F D+ N + GG R+AT+LMYLS VE+GGET+FP++ V+ S SECA
Sbjct: 194 KYEPHFDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETIFPDANVNVSSLPWYNELSECA 253
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+RG +VKP GDALLF+S+ PDA+ D SLHG CPVI G KWS+TKW+H+ +
Sbjct: 254 KRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHEY 306
>gi|21537370|gb|AAM61711.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
Length = 287
Score = 214 bits (545), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 100/170 (58%), Positives = 130/170 (76%), Gaps = 3/170 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+E+GKS S VRTSSG FL + +D+I+ +IE RIA +TF+P ++GE +Q+LHYE GQ
Sbjct: 113 VVDSETGKSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQ 172
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
KYEPH+D+F D+ N + GG R+AT+LMYLS VE+GGETVFP + ++ S SEC
Sbjct: 173 KYEPHYDYFVDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECG 232
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
++G +VKP GDALLF+S+ PDA+ D TSLHG CPVI G KWS+TKWIHV
Sbjct: 233 KKGLSVKPRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWIHV 282
>gi|363543371|ref|NP_001241695.1| prolyl 4-hydroxylase 8-5 [Zea mays]
gi|347978840|gb|AEP37762.1| prolyl 4-hydroxylase 8-5 [Zea mays]
Length = 307
Score = 214 bits (544), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 98/173 (56%), Positives = 132/173 (76%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +GKS S VRTSSGMFL + +D+++ +IE RIA +TF+P ++GE +Q+LHYE GQ
Sbjct: 133 VVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQ 192
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
KYEPHFD+F D+ N + GG RIAT+LMYLS VE+GGET+FP++ V+ S S+CA
Sbjct: 193 KYEPHFDYFLDEFNTKNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCA 252
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+RG +VKP GDALLF+S+ P A+ D SLHG CPVI+G KWS+TKW+H+ +
Sbjct: 253 KRGLSVKPKMGDALLFWSMKPGATLDPLSLHGGCPVIKGNKWSSTKWMHIHEY 305
>gi|384251901|gb|EIE25378.1| hypothetical protein COCSUDRAFT_35772 [Coccomyxa subellipsoidea
C-169]
Length = 222
Score = 214 bits (544), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 99/172 (57%), Positives = 127/172 (73%), Gaps = 1/172 (0%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V DNE+GKS S+VRTSSGMFL++ +D+++ IEARIA +T +P ENGE +QILHY+ +
Sbjct: 38 VVDNETGKSAPSKVRTSSGMFLNRGEDDVIERIEARIAKYTAIPKENGEGLQILHYQASE 97
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFP-NSEVSQSRDGNWSECARR 120
+Y PHFD+F D N Q GG RIAT+LMYLS VE GGETVFP +S+ + +S+CA+
Sbjct: 98 EYRPHFDYFHDNFNTQNGGQRIATMLMYLSDVEDGGETVFPESSDKPNVGNTKFSQCAQA 157
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
G A KP KGDAL F+SL PD D SLH CPV++G+KWSATKW+ V F+
Sbjct: 158 GAAAKPKKGDALFFYSLTPDGRMDEKSLHAGCPVMKGDKWSATKWLRVDRFE 209
>gi|297850430|ref|XP_002893096.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297338938|gb|EFH69355.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 213 bits (543), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 100/170 (58%), Positives = 130/170 (76%), Gaps = 3/170 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+E+GKS S VRTSSG FL + +D+I+ +IE RIA +TF+P ++GE +QILHYE GQ
Sbjct: 113 VVDSETGKSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQILHYEAGQ 172
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
KYEPH+D+F D+ N + GG R+AT+LMYLS VE+GGETVFP + ++ S SEC
Sbjct: 173 KYEPHYDYFVDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECG 232
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
++G +VKP GDALLF+S+ PDA+ D TSLHG CPVI G KWS+TKW+HV
Sbjct: 233 KKGLSVKPRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWMHV 282
>gi|449443243|ref|XP_004139389.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 284
Score = 213 bits (542), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 101/173 (58%), Positives = 133/173 (76%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V DNE+GK++ VRTSSGMFL++ QD+IV++IE RIA +TF+P E+GE +QILHYE GQ
Sbjct: 111 VVDNETGKNVEDSVRTSSGMFLNRGQDKIVSNIEKRIADFTFIPIEHGEGLQILHYEVGQ 170
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
KY+ H+D+F D+ N + GG R+AT+LMYLS VE+GGETVFP ++ + S W+E C
Sbjct: 171 KYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSKCG 230
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ G +VKP GDALLF+S+ PDA+ D TSLHG+CPVI G KWS TKW+HV +
Sbjct: 231 KGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVDKY 283
>gi|18394842|ref|NP_564109.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|9558598|gb|AAF88161.1|AC026234_12 Contains similarity to a prolyl 4-hydroxylase alpha subunit protein
from Gallus gallus gi|212530 [Arabidopsis thaliana]
gi|90962978|gb|ABE02413.1| At1g20270 [Arabidopsis thaliana]
gi|332191835|gb|AEE29956.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 287
Score = 213 bits (542), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 99/170 (58%), Positives = 130/170 (76%), Gaps = 3/170 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+E+GKS S VRTSSG FL + +D+I+ +IE RIA +TF+P ++GE +Q+LHYE GQ
Sbjct: 113 VVDSETGKSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQ 172
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
KYEPH+D+F D+ N + GG R+AT+LMYLS VE+GGETVFP + ++ S SEC
Sbjct: 173 KYEPHYDYFVDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECG 232
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
++G +VKP GDALLF+S+ PDA+ D TSLHG CPVI G KWS+TKW+HV
Sbjct: 233 KKGLSVKPRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWMHV 282
>gi|363543299|ref|NP_001241865.1| prolyl 4-hydroxylase 5-1 [Zea mays]
gi|347978814|gb|AEP37749.1| prolyl 4-hydroxylase 5-1 [Zea mays]
Length = 180
Score = 213 bits (541), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 97/173 (56%), Positives = 130/173 (75%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +GKS S VRTSSGMFL + +D+++ IE RI +TF+P ++GE +Q+LHYE GQ
Sbjct: 6 VVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRVIEKRITDYTFIPVDHGEGLQVLHYEVGQ 65
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
KYEPHFD+F D+ N + GG R+AT+LM+LS VE+GGET+FP++ V+ S SECA
Sbjct: 66 KYEPHFDYFLDEFNTKNGGQRMATLLMHLSDVEEGGETIFPDANVNDSSLPWYNELSECA 125
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+RG +VKP GDALLF+S+ PDA+ D SLHG CPVI G KWS+TKW+H+ +
Sbjct: 126 KRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHEY 178
>gi|225459748|ref|XP_002285898.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Vitis vinifera]
gi|302141716|emb|CBI18919.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 213 bits (541), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 99/173 (57%), Positives = 132/173 (76%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+E+G+S S VRTSSGMFL + +D+I+ IE RIA +TF+P E+GE +Q+LHYE GQ
Sbjct: 114 VVDSETGRSKDSRVRTSSGMFLRRGRDKIIRDIEKRIADFTFIPVEHGEGLQVLHYEVGQ 173
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
KY+ H+D+F D+ N + GG RIAT+LMYLS VE+GGETVFP ++ + S W SEC
Sbjct: 174 KYDAHYDYFLDEFNTKNGGQRIATLLMYLSDVEEGGETVFPATKANFSSVPWWNELSECG 233
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
++G +VKP GDALLF+S+ PDA+ D +SLHG CPVI+G KWS+TKW+HV +
Sbjct: 234 KKGLSVKPKMGDALLFWSMRPDATLDPSSLHGGCPVIKGNKWSSTKWMHVEEY 286
>gi|42567428|ref|NP_195306.2| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|332661174|gb|AEE86574.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 290
Score = 213 bits (541), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 102/174 (58%), Positives = 130/174 (74%), Gaps = 3/174 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++GKSI S VRTSSG FL++ DEIV IE RI+ +TF+PPENGE +Q+LHYE GQ
Sbjct: 117 VVDVKTGKSIDSRVRTSSGTFLNRGHDEIVEEIENRISDFTFIPPENGEGLQVLHYEVGQ 176
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
+YEPH D+F D+ N + GG RIATVLMYLS V++GGETVFP ++ + S W S+C
Sbjct: 177 RYEPHHDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCG 236
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
+ G +V P K DALLF+S+ PDAS D +SLHG CPVI+G KWS+TKW HV ++
Sbjct: 237 KEGLSVLPKKRDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWFHVHEYN 290
>gi|302842389|ref|XP_002952738.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300262082|gb|EFJ46291.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 281
Score = 212 bits (539), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 106/217 (48%), Positives = 136/217 (62%), Gaps = 6/217 (2%)
Query: 13 SEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
S +RTS G+FL + +DEIV +E RIAAWT +P NGE +Q+L Y+ QKY+ H+D+F
Sbjct: 36 SNIRTSYGVFLDRGEDEIVKRVEERIAAWTLMPVGNGEGLQVLRYQKEQKYDAHWDYFFH 95
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
K GG+R ATVLMYL E+GGETVFPN + +SECAR A KP KG A+
Sbjct: 96 KDGITNGGNRYATVLMYLVDTEEGGETVFPNVAAPGGENVGFSECARYHLAAKPKKGTAI 155
Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH----VRNFDKPEKEPED--DDCVDE 186
LF S+ P + SLH +CPVI G KWSA KWIH + +P+ +P+D C D
Sbjct: 156 LFHSIKPTGELERKSLHTACPVIRGIKWSAAKWIHHAETIEQHPQPKVKPQDLPPGCEDS 215
Query: 187 DLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D C WA AGEC++N +MVGS++ G C SCK C
Sbjct: 216 DEMCPEWADAGECERNASFMVGSRARPGKCVASCKRC 252
>gi|357483925|ref|XP_003612249.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355513584|gb|AES95207.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 289
Score = 212 bits (539), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 98/173 (56%), Positives = 134/173 (77%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+E+GKS S VRTSSG FL++ +D+IV +IE +IA +TF+P E+GE +Q+LHYE GQ
Sbjct: 115 VVDSETGKSKDSRVRTSSGTFLARGRDKIVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQ 174
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
KYEPH+D+F D+ N + GG RIATVLMYL+ VE+GGETVFP ++ + S S+C
Sbjct: 175 KYEPHYDYFLDEFNTKNGGQRIATVLMYLTDVEEGGETVFPAAKGNFSNVPWYNELSDCG 234
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
++G ++KP +GDALLF+S+ PDA+ D++SLHG CPVI+G KWS+TKWI V +
Sbjct: 235 KKGLSIKPKRGDALLFWSMKPDATLDASSLHGGCPVIKGNKWSSTKWIRVNEY 287
>gi|356540840|ref|XP_003538892.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Glycine max]
Length = 290
Score = 212 bits (539), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 99/173 (57%), Positives = 132/173 (76%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+E+GKS S VRTSSG FL++ +D+IV IE RIA ++F+P E+GE +Q+LHYE GQ
Sbjct: 116 VVDSETGKSKDSRVRTSSGTFLARGRDKIVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQ 175
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
KYEPH+D+F D N + GG RIATVLMYL+ VE+GGETVFP ++ + S W SEC
Sbjct: 176 KYEPHYDYFLDDFNTKNGGQRIATVLMYLTDVEEGGETVFPAAKGNFSSVPWWNELSECG 235
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
++G ++KP +GDALLF+S+ PDA+ D +SLHG CPVI+G KWS+TKW+ V +
Sbjct: 236 KKGLSIKPKRGDALLFWSMKPDATLDPSSLHGGCPVIKGNKWSSTKWMRVSEY 288
>gi|255539064|ref|XP_002510597.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223551298|gb|EEF52784.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 289
Score = 212 bits (539), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 99/170 (58%), Positives = 131/170 (77%), Gaps = 3/170 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+++G+S S VRTSSGMFL + +D+I+ +IE RIA ++F+P E+GE +Q+LHYE GQ
Sbjct: 115 VVDSKTGRSKDSRVRTSSGMFLRRGRDKIIRNIEKRIADFSFIPIEHGEGLQVLHYEVGQ 174
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
KYE H+D+F D+ N + GG R AT+LMYLS VE+GGETVFP ++ + S +W SECA
Sbjct: 175 KYEAHYDYFLDEFNTKNGGQRTATLLMYLSDVEEGGETVFPAAKANISNVPSWNELSECA 234
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
R+G +VKP G+ALLF+S PDA+ D SLHGSCPVI G KWSATKW+H+
Sbjct: 235 RQGLSVKPKMGNALLFWSTRPDATLDPASLHGSCPVIRGNKWSATKWMHL 284
>gi|242039227|ref|XP_002467008.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
gi|241920862|gb|EER94006.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
Length = 307
Score = 211 bits (538), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 104/173 (60%), Positives = 128/173 (73%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +G S S VRTSSGMFL + QD+I+ +IE RIA +TF+P E+GE +Q+LHYE GQ
Sbjct: 133 VVDSATGASKDSRVRTSSGMFLRRGQDKIIQTIEKRIADFTFIPVEHGEGLQVLHYEVGQ 192
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD---GNWSECA 118
KYEPHFD+F D N + GG RIAT+LMYLS VE GGETVFP+S + S SECA
Sbjct: 193 KYEPHFDYFHDDYNTKNGGQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECA 252
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ G +VKP GDALLF+S+ PD S DSTSLHG CPVI+G KWS+TKW+ V +
Sbjct: 253 KGGLSVKPKMGDALLFWSMKPDGSMDSTSLHGGCPVIKGNKWSSTKWMRVHEY 305
>gi|308801080|ref|XP_003075321.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116061875|emb|CAL52593.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 541
Score = 211 bits (536), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 112/237 (47%), Positives = 147/237 (62%), Gaps = 27/237 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G S SEVRTS+G F+S+ D+I+A +E RI W+ +P + EA QIL YE GQ
Sbjct: 295 VVDAQTGGSSLSEVRTSTGTFISRKYDDIIAGVEERIELWSQIPQSHHEAFQILRYEPGQ 354
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGN-WSECARR 120
+Y+ HFD+F K + +RIATVL+YLS VE+GGETVFPN++V SR+ + +SEC
Sbjct: 355 EYKAHFDYFFHKSGMR--NNRIATVLLYLSDVEEGGETVFPNTDVPTSRNRSMYSECGNG 412
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPED 180
G A+K KGDALLF+S+ P D+ S H CPVI+GEKW+ATKW+HV P P D
Sbjct: 413 GKALKARKGDALLFWSMKPGGELDAGSSHAGCPVIKGEKWTATKWMHV----NPLAGPND 468
Query: 181 D--------------DCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D C D C WA++GEC KNP +M R C+ SC+VC
Sbjct: 469 DAHNVFYDGGPRSTASCSDAQAECRGWAESGECDKNPGFM------RESCKMSCRVC 519
>gi|449491267|ref|XP_004158845.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 287
Score = 211 bits (536), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 99/173 (57%), Positives = 132/173 (76%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+E+G+S S VRTSSG FL + +D+ V +IE R++ ++F+P E+GE +Q+LHYE GQ
Sbjct: 113 VVDSETGQSKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQ 172
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
KYEPHFD+F D+ N + GG RIATVLMYLS VE+GGETVFP ++ + S W+E C
Sbjct: 173 KYEPHFDYFLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCG 232
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
++G +VKP +GDALLF+S+ PDAS D +SLHG CPVI+G KWSATKW+ V +
Sbjct: 233 KKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWVRVEEY 285
>gi|449520146|ref|XP_004167095.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 249
Score = 210 bits (535), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 101/173 (58%), Positives = 130/173 (75%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V DNE+GK++ VRTSSGMFL++ QD+IV++IE RIA +TF+P E+GE +QILHYE GQ
Sbjct: 76 VVDNETGKNVEDSVRTSSGMFLNRGQDKIVSNIEKRIADFTFIPIEHGEGLQILHYEVGQ 135
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
KY+ H+DFF D+ N + G R+AT+LMYLS VE+GGETVFP ++ + S W+E C
Sbjct: 136 KYDAHYDFFDDEFNLKEIGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSKCG 195
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ G +VKP GDALLF+S+ PD + D TSLHG+CPVI G KWS TKWIHV
Sbjct: 196 KGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWIHVNQL 248
>gi|363806698|ref|NP_001242522.1| uncharacterized protein LOC100806046 [Glycine max]
gi|255647110|gb|ACU24023.1| unknown [Glycine max]
Length = 289
Score = 210 bits (535), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 98/174 (56%), Positives = 134/174 (77%), Gaps = 3/174 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+E+GKS S VRTSSG FL++ +D+IV +IE +I+ +TF+P E+GE +Q+LHYE GQ
Sbjct: 115 VVDSETGKSKDSRVRTSSGTFLARGRDKIVRNIEKKISDFTFIPVEHGEGLQVLHYEVGQ 174
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS---ECA 118
KYEPH+D+F D N + GG RIATVLMYL+ VE+GGETVFP ++ + S W+ EC
Sbjct: 175 KYEPHYDYFLDDFNTKNGGQRIATVLMYLTDVEEGGETVFPAAKGNFSFVPWWNELFECG 234
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
++G ++KP +GDALLF+S+ PDAS D +SLHG CPVI+G KWS+TKW+ V ++
Sbjct: 235 KKGLSIKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWMRVSEYN 288
>gi|224133600|ref|XP_002327635.1| predicted protein [Populus trichocarpa]
gi|222836720|gb|EEE75113.1| predicted protein [Populus trichocarpa]
Length = 291
Score = 210 bits (535), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 102/173 (58%), Positives = 130/173 (75%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +GKS S+VRTSSG FL + +D+IV IE RIA ++F+P E+GE +QILHYE GQ
Sbjct: 117 VVDSSTGKSKDSKVRTSSGTFLPRGRDKIVRDIEKRIADFSFIPVEHGEGLQILHYEVGQ 176
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
+YEPHFD+F D+ N + GG RIATVLMYLS VE+GGETVFP++E + S W SEC
Sbjct: 177 RYEPHFDYFMDEYNTKNGGQRIATVLMYLSDVEEGGETVFPSAEGNISAVPWWNELSECG 236
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ G +VKP GDALLF+S++PD S D +SLHG CPVI G KWS+TKW+ V +
Sbjct: 237 KGGLSVKPKMGDALLFWSMNPDGSPDPSSLHGGCPVIRGNKWSSTKWMRVNEY 289
>gi|147800995|emb|CAN64470.1| hypothetical protein VITISV_014644 [Vitis vinifera]
Length = 288
Score = 210 bits (534), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 101/173 (58%), Positives = 129/173 (74%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +GKS S VRTSSG FL++ QD+I+ IE R++ +TFLP E+GE +QILHYE GQ
Sbjct: 114 VVDSSTGKSKDSRVRTSSGTFLTRGQDKIIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQ 173
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
KYEPH+D+F D N + GG R+ATVLMYLS VE+GGETVFP ++ + S W+E C
Sbjct: 174 KYEPHYDYFLDDYNTKNGGQRMATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSXCG 233
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ G +VKP GDALLF+S+ PDAS D +SLHG CPVI+G KWS+TKWI V +
Sbjct: 234 KEGLSVKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
>gi|225468574|ref|XP_002263060.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296084059|emb|CBI24447.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 210 bits (534), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 101/173 (58%), Positives = 129/173 (74%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +GKS S VRTSSG FL++ QD+I+ IE R++ +TFLP E+GE +QILHYE GQ
Sbjct: 114 VVDSSTGKSKDSRVRTSSGTFLTRGQDKIIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQ 173
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
KYEPH+D+F D N + GG R+ATVLMYLS VE+GGETVFP ++ + S W+E C
Sbjct: 174 KYEPHYDYFLDDYNTKNGGQRMATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCG 233
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ G +VKP GDALLF+S+ PDAS D +SLHG CPVI+G KWS+TKWI V +
Sbjct: 234 KEGLSVKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
>gi|449529555|ref|XP_004171765.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 284
Score = 210 bits (534), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 100/173 (57%), Positives = 133/173 (76%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+++G+S+ S VRTSSGMFL++ QD+I+ +IE RIA +TF+P E+GE +QILHYE GQ
Sbjct: 111 VVDSKTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQ 170
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
KY+ H+D+F D+ N + GG R+AT+LMYLS VE+GGETVFP ++ + S W SEC
Sbjct: 171 KYDAHYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECG 230
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ G +VKP GDALLF+S+ PDA+ D TSLHG+CPVI G KWS TKW+HV +
Sbjct: 231 KGGLSVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVDKY 283
>gi|449434114|ref|XP_004134841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 287
Score = 210 bits (534), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 99/173 (57%), Positives = 132/173 (76%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+E+G+S S VRTSSG FL + +D+ V +IE R++ ++F+P E+GE +Q+LHYE GQ
Sbjct: 113 VVDSETGQSKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQ 172
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
KYEPHFD+F D+ N + GG RIATVLMYLS VE+GGETVFP ++ + S W+E C
Sbjct: 173 KYEPHFDYFLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCG 232
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
++G +VKP +GDALLF+S+ PDAS D +SLHG CPVI+G KWSATKW+ V +
Sbjct: 233 KKGLSVKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEY 285
>gi|414870899|tpg|DAA49456.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 364
Score = 209 bits (533), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 103/173 (59%), Positives = 126/173 (72%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +G S S VRTSSGMFL + QD+I+ +IE RIA +TF+P E GE +Q+LHYE GQ
Sbjct: 190 VVDSATGGSKDSRVRTSSGMFLRRGQDKIIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQ 249
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD---GNWSECA 118
KYEPHFD+F D N + GG RIAT+LMYLS VE GGETVFP+S + S SECA
Sbjct: 250 KYEPHFDYFHDDYNTKNGGQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECA 309
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ G +VKP GDALLF+S+ PD S D TSLHG CPVI+G KWS+TKW+ V +
Sbjct: 310 KGGLSVKPKMGDALLFWSMKPDGSLDPTSLHGGCPVIKGNKWSSTKWMRVHEY 362
>gi|159476104|ref|XP_001696154.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Chlamydomonas reinhardtii]
gi|158275325|gb|EDP01103.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Chlamydomonas reinhardtii]
Length = 343
Score = 209 bits (533), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 111/238 (46%), Positives = 143/238 (60%), Gaps = 17/238 (7%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V+D +G S++RTSSGMF + + E+V IE R+A WT LP ENGE +Q+L YE Q
Sbjct: 104 VSDATTGAGAVSDIRTSSGMFYERGETELVKRIENRLAMWTMLPVENGEGIQVLRYEKTQ 163
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPN--SEVSQSRDGNWSECAR 119
KY+PH D+F GG+R+ATVLMYL+ E+GGETVFP V Q + C R
Sbjct: 164 KYDPHHDYFSFDGADDNGGNRMATVLMYLATPEEGGETVFPKVVGWVVQLTTTASAPC-R 222
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPE 179
+G AVKP KGDA+LF+S+ PD D SLHGSCPVI+G KWSATKWIHV ++ + E
Sbjct: 223 QGLAVKPAKGDAVLFWSIRPDGRFDPGSLHGSCPVIKGVKWSATKWIHVGHYAMSGERSE 282
Query: 180 D--------------DDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
C ++ C WA++GEC+ NP YM+G K G C +C C
Sbjct: 283 TVKRVQYVPPPPPAVPGCENQHKLCSHWAESGECESNPGYMIGKKGMPGACILACNRC 340
>gi|255579590|ref|XP_002530636.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223529809|gb|EEF31744.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 287
Score = 209 bits (533), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 99/173 (57%), Positives = 130/173 (75%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+E+G+S S VRTSSG FLS+ +D+ + IE RIA ++F+P E+GE +Q+LHYE GQ
Sbjct: 113 VVDSETGRSKDSRVRTSSGTFLSRGRDKKIRDIEKRIADFSFIPVEHGEGLQVLHYEVGQ 172
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
KYEPHFD+F D+ N + GG R+AT+LMYLS VE+GGETVFP ++ + S W SEC
Sbjct: 173 KYEPHFDYFNDEFNTKNGGQRVATLLMYLSDVEEGGETVFPAAKGNFSAVPWWNELSECG 232
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
++G +VKP GDALLF+S+ PDA+ D +SLHG CPVI G KWSATKW+ V +
Sbjct: 233 KKGLSVKPNMGDALLFWSMKPDATLDPSSLHGGCPVINGNKWSATKWMRVNEY 285
>gi|212720775|ref|NP_001131953.1| uncharacterized protein LOC100193348 [Zea mays]
gi|194693016|gb|ACF80592.1| unknown [Zea mays]
gi|347978798|gb|AEP37741.1| prolyl 4-hydroxylase 1 [Zea mays]
gi|414870898|tpg|DAA49455.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 307
Score = 209 bits (533), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 103/173 (59%), Positives = 126/173 (72%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +G S S VRTSSGMFL + QD+I+ +IE RIA +TF+P E GE +Q+LHYE GQ
Sbjct: 133 VVDSATGGSKDSRVRTSSGMFLRRGQDKIIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQ 192
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD---GNWSECA 118
KYEPHFD+F D N + GG RIAT+LMYLS VE GGETVFP+S + S SECA
Sbjct: 193 KYEPHFDYFHDDYNTKNGGQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECA 252
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ G +VKP GDALLF+S+ PD S D TSLHG CPVI+G KWS+TKW+ V +
Sbjct: 253 KGGLSVKPKMGDALLFWSMKPDGSLDPTSLHGGCPVIKGNKWSSTKWMRVHEY 305
>gi|297802350|ref|XP_002869059.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297314895|gb|EFH45318.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 290
Score = 209 bits (532), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 102/174 (58%), Positives = 128/174 (73%), Gaps = 3/174 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++GKSI S VRTSSG FL + DEIV IE RI+ +TF+P ENGE +Q+LHYE GQ
Sbjct: 117 VVDVKTGKSIDSRVRTSSGTFLKRGHDEIVEEIENRISDFTFIPIENGEGLQVLHYEVGQ 176
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
KYEPH D+F D+ N + GG RIATVLMYLS V++GGETVFP ++ + S W S+C
Sbjct: 177 KYEPHHDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNISDVPWWDELSQCG 236
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
+ G +V P K DALLF+S+ PDAS D +SLHG CPVI+G KWS+TKW HV ++
Sbjct: 237 KEGLSVLPKKRDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWFHVHEYN 290
>gi|412993142|emb|CCO16675.1| predicted protein [Bathycoccus prasinos]
Length = 564
Score = 207 bits (527), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 114/242 (47%), Positives = 150/242 (61%), Gaps = 24/242 (9%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G S+ S +RTS+GMFL KA D+ + +IE RIAA + P NGE MQIL Y+ GQKY+PHF
Sbjct: 321 GTSVPSTIRTSAGMFLRKAADKTLENIEYRIAAASGTPEPNGEGMQILRYDVGQKYDPHF 380
Query: 68 DFFRDKMN--QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD----GN---WSECA 118
D+F D +N + GG R+AT+L+YL + ++GGET+FP +++ D GN WSEC
Sbjct: 381 DYFHDAVNPSPKRGGQRMATMLIYLENTKEGGETIFPRGTRAETFDLTEEGNPHEWSECT 440
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
+ G VK +KGDALLF+SL D D SLHG+CPV++G+KW+A KWI V FD P
Sbjct: 441 KHGLPVKSVKGDALLFWSLTDDYKLDMGSLHGACPVVKGQKWTAVKWIRVAKFDGMFTSP 500
Query: 179 -----------EDDDCVDEDLNCVVWAKAGECKKNPLYMV---GSKSSRG-YCRKSCKVC 223
+ CVDE C WAK G C+KN +MV G++ S+G C SC V
Sbjct: 501 LPMPALSRRTEQHGKCVDEWDECAKWAKDGWCEKNKDFMVSNGGARDSKGPACPVSCNVP 560
Query: 224 KP 225
P
Sbjct: 561 CP 562
>gi|224085946|ref|XP_002307750.1| predicted protein [Populus trichocarpa]
gi|222857199|gb|EEE94746.1| predicted protein [Populus trichocarpa]
Length = 288
Score = 207 bits (527), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 97/170 (57%), Positives = 129/170 (75%), Gaps = 3/170 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+++G+S S VRTSSGMFL + +D ++ IE RIA ++F+P E+GE +Q+LHYE GQ
Sbjct: 114 VVDSKTGRSKDSRVRTSSGMFLRRGRDRVIREIEKRIADFSFIPVEHGEGLQVLHYEVGQ 173
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
KYE HFD+F D+ N + GG R AT+LMYLS VE+GGETVFP + ++ S W SECA
Sbjct: 174 KYEAHFDYFLDEFNTKNGGQRTATLLMYLSDVEEGGETVFPAANMNISAVPWWNELSECA 233
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
++G ++KP G+ALLF+S PDA+ D +SLHGSCPVI G KWSATKW+H+
Sbjct: 234 KQGLSLKPKMGNALLFWSTRPDATLDPSSLHGSCPVIRGNKWSATKWMHL 283
>gi|145343778|ref|XP_001416487.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576712|gb|ABO94780.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 255
Score = 207 bits (527), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 108/233 (46%), Positives = 148/233 (63%), Gaps = 19/233 (8%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G S S++RTS+G F+S+A D + +IE RI W+ +P ++GEA+Q+L YE+GQ
Sbjct: 31 VVDAKTGGSTTSDIRTSTGTFISRAHDPTITAIEERIELWSQIPVDHGEALQVLRYENGQ 90
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD-GNWSECARR 120
+Y+ HFD+F K ++ +RIATVL+YLS VE+GGETVFPN++V RD +SEC
Sbjct: 91 EYKAHFDYFFHKGGKR--NNRIATVLLYLSDVEEGGETVFPNTDVPTDRDRSQYSECGNG 148
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP------ 174
G +VK KGDALLF+S+ P D S H CPVI+G KW+ATKW+HV K
Sbjct: 149 GKSVKARKGDALLFWSMKPGGELDPGSSHAGCPVIKGVKWTATKWMHVNAIGKHGDDVHK 208
Query: 175 ---EKEPE-DDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
E P+ + C D D C WA++GEC KNP +M+ S C SC+ C
Sbjct: 209 IFYEGGPQATESCKDTDDACRGWAESGECDKNPGFMLKS------CAMSCRAC 255
>gi|302815629|ref|XP_002989495.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
gi|300142673|gb|EFJ09371.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
Length = 213
Score = 207 bits (526), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 97/173 (56%), Positives = 128/173 (73%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+++G S S VRTSSGMFL++ QD +++ IE +IA TF+P ++GE +Q+LHYE GQ
Sbjct: 39 VVDSQTGGSRDSRVRTSSGMFLNRGQDRVISEIEDKIAKLTFIPKDHGEGIQVLHYEPGQ 98
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
KY+ H DFF D +N + GG RIAT+LMYL+ VE+GGETVFP S + S SEC
Sbjct: 99 KYDAHHDFFYDTVNTRNGGQRIATLLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECG 158
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
RRG +V+P +GDALLF+S+ PDA D +SLHG CPVI+G+KWSATKW+ V +
Sbjct: 159 RRGVSVRPKRGDALLFWSMSPDAQLDHSSLHGGCPVIKGDKWSATKWMRVSEY 211
>gi|302762452|ref|XP_002964648.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
gi|300168377|gb|EFJ34981.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
Length = 225
Score = 207 bits (526), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 97/173 (56%), Positives = 128/173 (73%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+++G S S VRTSSGMFL++ QD +++ IE +IA TF+P ++GE +Q+LHYE GQ
Sbjct: 51 VVDSQTGGSRDSRVRTSSGMFLNRGQDRVISEIEDKIAKLTFIPKDHGEGIQVLHYEPGQ 110
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
KY+ H DFF D +N + GG RIAT+LMYL+ VE+GGETVFP S + S SEC
Sbjct: 111 KYDAHHDFFYDTVNTRNGGQRIATLLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECG 170
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
RRG +V+P +GDALLF+S+ PDA D +SLHG CPVI+G+KWSATKW+ V +
Sbjct: 171 RRGVSVRPKRGDALLFWSMSPDAQLDHSSLHGGCPVIKGDKWSATKWMRVSEY 223
>gi|357135727|ref|XP_003569460.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 2
[Brachypodium distachyon]
Length = 314
Score = 205 bits (522), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 103/204 (50%), Positives = 135/204 (66%), Gaps = 10/204 (4%)
Query: 23 LSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHR 82
L+ ++D +V+ IE RI+ W+F+P E+GE+MQIL Y Q D +D GG+R
Sbjct: 118 LADSKDIVVSKIEDRISLWSFIPKEHGESMQILKYGSNQS-----DHNKDGTQSSSGGNR 172
Query: 83 IATVLMYLSHVEKGGETVFPNSEV--SQSRDGNWSECARRGYAVKPMKGDALLFFSLHPD 140
+ T+LMYLS V++GGETVFP SE+ +Q+++G SECA GYAVKP+KGDA+L F+L PD
Sbjct: 173 LVTILMYLSDVKQGGETVFPRSELKDTQAKEGALSECA--GYAVKPVKGDAILLFNLRPD 230
Query: 141 ASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE-PEDDDCVDEDLNCVVWAKAGEC 199
TDS S + C V+EGEKW A K +H+ DK P +D C DED CV WA AGEC
Sbjct: 231 GVTDSDSHYEDCSVLEGEKWLAIKHLHISKIDKSRSSLPSEDLCTDEDDKCVSWAAAGEC 290
Query: 200 KKNPLYMVGSKSSRGYCRKSCKVC 223
NP++M+GS G CRKSC C
Sbjct: 291 YSNPVFMIGSPDYYGTCRKSCHAC 314
>gi|159464219|ref|XP_001690339.1| hypothetical protein CHLREDRAFT_114525 [Chlamydomonas reinhardtii]
gi|158279839|gb|EDP05598.1| predicted protein [Chlamydomonas reinhardtii]
Length = 244
Score = 205 bits (522), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 106/223 (47%), Positives = 140/223 (62%), Gaps = 8/223 (3%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+VA N G S S++RTS G+FL + +D +V +E RI+A T +P NGE +Q+L Y+
Sbjct: 30 VVATN--GGSEESQIRTSFGVFLERGEDPVVKGVEERISALTLMPVGNGEGLQVLRYQKE 87
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
QKY+ H+D+F K GG+R ATVLMYL E+GGETVFPN + +SECAR
Sbjct: 88 QKYDAHWDYFFHKDGIANGGNRYATVLMYLVDTEEGGETVFPNIAAPGGENVGFSECARY 147
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPED 180
A KP KG A+LF S+ P + SLH +CPVI+G KWSA KWIHV KP+ P
Sbjct: 148 HLAAKPKKGTAILFHSIKPTGELERKSLHTACPVIKGIKWSAAKWIHV----KPQNLP-- 201
Query: 181 DDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
C D D C WA+AGEC++N +M+G+++ G C SCK C
Sbjct: 202 PGCEDSDEMCPDWAEAGECERNASFMIGTRARPGKCVASCKRC 244
>gi|357517881|ref|XP_003629229.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523251|gb|AET03705.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 278
Score = 205 bits (522), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 98/173 (56%), Positives = 126/173 (72%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V DNE+GKS S VRTSSG FL + DEIV +IE RIA +TF+P ENGE+ +L YE GQ
Sbjct: 104 VVDNETGKSKDSSVRTSSGTFLDRGGDEIVRNIEKRIADFTFIPVENGESFNVLRYEVGQ 163
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
KY+PH D+F D N GG RIAT+LMYLS VE+GGETVFP ++ + S W+E C
Sbjct: 164 KYDPHLDYFADDYNTVNGGQRIATMLMYLSDVEEGGETVFPAAKGNISSVPWWNELSDCG 223
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
++G ++KP GDALLF+S+ PD + D +SLHG+CPVI+G+KWS TKW+ + F
Sbjct: 224 KKGLSIKPKMGDALLFWSMKPDGTLDPSSLHGACPVIKGDKWSCTKWMRINEF 276
>gi|297832394|ref|XP_002884079.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297329919|gb|EFH60338.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 291
Score = 204 bits (520), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 97/173 (56%), Positives = 124/173 (71%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G S S VRTSSG FL + DE+V IE RI+ +TF+P ENGE +Q+LHY+ GQ
Sbjct: 117 VVDEKTGGSKDSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQ 176
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
KYEPH+D+F D+ N + GG RIATVLMYLS V+ GGETVFP + + S W+E C
Sbjct: 177 KYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCG 236
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ G +V P K DALLF+++ PDAS D +SLHG CPV++G KWS+TKW HV F
Sbjct: 237 KEGLSVLPKKRDALLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKWFHVHEF 289
>gi|240256489|ref|NP_201407.4| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
gi|332010770|gb|AED98153.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
Length = 289
Score = 204 bits (520), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 97/173 (56%), Positives = 128/173 (73%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++GKS S VRTSSG FL++ +D+ + IE RI+ +TF+P E+GE +Q+LHYE GQ
Sbjct: 115 VVDEKTGKSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQ 174
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
KYEPH+D+F D+ N + GG RIATVLMYLS VE+GGETVFP ++ + S W SEC
Sbjct: 175 KYEPHYDYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECG 234
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ G +VKP GDALLF+S+ PDA+ D +SLHG C VI+G KWS+TKW+ V +
Sbjct: 235 KGGLSVKPKMGDALLFWSMTPDATLDPSSLHGGCAVIKGNKWSSTKWLRVHEY 287
>gi|15227885|ref|NP_179363.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|25411813|pir||F84555 similar to prolyl 4-hydroxylase alpha subunit [imported] -
Arabidopsis thaliana
gi|89274129|gb|ABD65585.1| At2g17720 [Arabidopsis thaliana]
gi|110738861|dbj|BAF01353.1| similar to prolyl 4-hydroxylase alpha subunit [Arabidopsis
thaliana]
gi|330251579|gb|AEC06673.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 291
Score = 204 bits (520), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 97/173 (56%), Positives = 124/173 (71%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G S S VRTSSG FL + DE+V IE RI+ +TF+P ENGE +Q+LHY+ GQ
Sbjct: 117 VVDEKTGGSKDSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQ 176
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
KYEPH+D+F D+ N + GG RIATVLMYLS V+ GGETVFP + + S W+E C
Sbjct: 177 KYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCG 236
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ G +V P K DALLF+++ PDAS D +SLHG CPV++G KWS+TKW HV F
Sbjct: 237 KEGLSVLPKKRDALLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKWFHVHEF 289
>gi|90704797|dbj|BAE92293.1| putative prolyl 4-hydroxylase, alpha subunit [Cryptomeria japonica]
Length = 302
Score = 204 bits (518), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 98/174 (56%), Positives = 129/174 (74%), Gaps = 3/174 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MV D+++G S+ S VRTSSG FL++ QD+I+ IE RIA ++ +P E+GE + +LHYE
Sbjct: 127 MVVDSKTGGSMDSNVRTSSGWFLNRGQDKIIRRIEKRIADFSHIPVEHGEGLHVLHYEVE 186
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SEC 117
QKY+ H+D+F D +N + GG R AT+LMYLS VEKGGETVFP S+V+ S W SEC
Sbjct: 187 QKYDAHYDYFSDTINVKNGGQRGATMLMYLSDVEKGGETVFPQSKVNSSSVPWWDELSEC 246
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
R G +V+P GDALLF+S+ PDAS D +SLHGSCPVI+G KWSATKW+ + +
Sbjct: 247 GRSGLSVRPKMGDALLFWSVKPDASLDPSSLHGSCPVIQGNKWSATKWMRLNKY 300
>gi|108706360|gb|ABF94155.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative [Oryza
sativa Japonica Group]
gi|125585047|gb|EAZ25711.1| hypothetical protein OsJ_09544 [Oryza sativa Japonica Group]
Length = 277
Score = 203 bits (517), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 100/175 (57%), Positives = 123/175 (70%), Gaps = 18/175 (10%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPE-------------- 47
V D ESG+S+ S+VRTSSGMFL K QDE+VA IE RIAAWT LP E
Sbjct: 80 VVDGESGESVTSKVRTSSGMFLDKKQDEVVARIEERIAAWTMLPTECIIFYCFANFAILK 139
Query: 48 ---NGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNS 104
NGE+MQIL Y G+KYEPHFD+ + G R+ATVLMYLS+V K G+++ P +
Sbjct: 140 LSENGESMQILRYGQGEKYEPHFDYISGRQGSTREGDRVATVLMYLSNV-KMGDSLLPQA 198
Query: 105 EVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEK 159
+SQ +D WS+CA +G+AVKP KG A+LFFSLHP+A+ D+ SLHGSCPVIEGEK
Sbjct: 199 RLSQPKDETWSDCAEQGFAVKPAKGSAVLFFSLHPNATLDTDSLHGSCPVIEGEK 253
>gi|357135725|ref|XP_003569459.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 1
[Brachypodium distachyon]
Length = 303
Score = 202 bits (515), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 102/199 (51%), Positives = 131/199 (65%), Gaps = 10/199 (5%)
Query: 28 DEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVL 87
D +V+ IE RI+ W+F+P E+GE+MQIL Y Q D +D GG+R+ T+L
Sbjct: 112 DIVVSKIEDRISLWSFIPKEHGESMQILKYGSNQS-----DHNKDGTQSSSGGNRLVTIL 166
Query: 88 MYLSHVEKGGETVFPNSEV--SQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDS 145
MYLS V++GGETVFP SE+ +Q+++G SECA GYAVKP+KGDA+L F+L PD TDS
Sbjct: 167 MYLSDVKQGGETVFPRSELKDTQAKEGALSECA--GYAVKPVKGDAILLFNLRPDGVTDS 224
Query: 146 TSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE-PEDDDCVDEDLNCVVWAKAGECKKNPL 204
S + C V+EGEKW A K +H+ DK P +D C DED CV WA AGEC NP+
Sbjct: 225 DSHYEDCSVLEGEKWLAIKHLHISKIDKSRSSLPSEDLCTDEDDKCVSWAAAGECYSNPV 284
Query: 205 YMVGSKSSRGYCRKSCKVC 223
+M+GS G CRKSC C
Sbjct: 285 FMIGSPDYYGTCRKSCHAC 303
>gi|307110744|gb|EFN58979.1| hypothetical protein CHLNCDRAFT_137600 [Chlorella variabilis]
Length = 327
Score = 202 bits (515), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 110/230 (47%), Positives = 140/230 (60%), Gaps = 14/230 (6%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G S+ E+RTSSGMF+ K D +++ +E R+AA T LP + E +Q+L YE GQKY H+
Sbjct: 84 GGSMLDEIRTSSGMFILKGHDAVISGLEERVAALTHLPVSHQEDLQVLRYELGQKYSAHW 143
Query: 68 DFFRDKMNQQ-------LGGHRIATVLMYLSHVEKGGETVFPNS----EVSQSRDGNWSE 116
D Q LGG R AT+LMYLS VE+GGET FP+ E Q+ ++E
Sbjct: 144 DINDSPERAQQMRAKGVLGGLRTATLLMYLSDVEEGGETAFPHGRWLDEGVQAAP-PYTE 202
Query: 117 CARRGYAVKPMKGDALLFFSLHPDAS-TDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
CA +G VKP KGDA+LFFSL + D SLH CPV+ G K+SATKW+HV F
Sbjct: 203 CASKGVVVKPRKGDAILFFSLKLNGQKKDVYSLHAGCPVVRGVKYSATKWVHVEPFGHTT 262
Query: 176 KEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCKP 225
+ + C D + C WA AGEC NP+YM GS+ S G CR SCKVC+P
Sbjct: 263 VQ-QPSRCEDARVECPQWAAAGECDSNPVYMKGSEVSVGSCRLSCKVCRP 311
>gi|21593091|gb|AAM65040.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
Length = 291
Score = 202 bits (514), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 96/173 (55%), Positives = 123/173 (71%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G S S VRTSSG FL + DE+V IE RI+ +TF+P ENGE +Q+LHY+ GQ
Sbjct: 117 VVDEKTGGSKDSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQ 176
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
KYEPH+D+F D+ N + GG RIATVLMYLS V+ GGETVFP + + S W+E C
Sbjct: 177 KYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCG 236
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ G +V P DALLF+++ PDAS D +SLHG CPV++G KWS+TKW HV F
Sbjct: 237 KEGLSVLPKXRDALLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKWFHVHEF 289
>gi|356502610|ref|XP_003520111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 286
Score = 201 bits (512), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 93/173 (53%), Positives = 128/173 (73%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN+SG+S+ +VR S+G FL + QDEIV +IE RIA TF+P ENGE + ++HYE GQ
Sbjct: 111 VADNQSGQSVVHDVRKSTGAFLDRGQDEIVRNIEKRIADVTFIPIENGEPIYVIHYEVGQ 170
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
Y+PH+D+F D N + GG RIAT+LMYLS+VE+GGET+FP ++ + S W+E C
Sbjct: 171 YYDPHYDYFIDDFNIENGGQRIATMLMYLSNVEEGGETMFPRAKANFSSVPWWNELSNCG 230
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ G ++KP GDALLF+S+ P+A+ D+ +LH +CPVI+G KWS TKW+H F
Sbjct: 231 KMGLSIKPKMGDALLFWSMKPNATLDALTLHSACPVIKGNKWSCTKWMHPTEF 283
>gi|363543293|ref|NP_001241862.1| prolyl 4-hydroxylase 2-1 precursor [Zea mays]
gi|347978802|gb|AEP37743.1| prolyl 4-hydroxylase 2-1 [Zea mays]
Length = 204
Score = 201 bits (512), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 92/125 (73%), Positives = 109/125 (87%), Gaps = 2/125 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKS+ SEVRTSSGMFL + QDE+V IE RI+AWTFLPPENGE++QILHY++G
Sbjct: 72 MVADNESGKSVQSEVRTSSGMFLERKQDEVVTRIEERISAWTFLPPENGESIQILHYQNG 131
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
+KYEPH+D+F DK NQ LGGHRIATVLMYLS+VEKGGET+FPN+E + Q +D WS+CA
Sbjct: 132 EKYEPHYDYFHDKKNQALGGHRIATVLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCA 191
Query: 119 RRGYA 123
R GYA
Sbjct: 192 RNGYA 196
>gi|224117220|ref|XP_002331751.1| predicted protein [Populus trichocarpa]
gi|222874448|gb|EEF11579.1| predicted protein [Populus trichocarpa]
Length = 266
Score = 201 bits (511), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 104/174 (59%), Positives = 129/174 (74%), Gaps = 3/174 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MV D+ SGKS S VRTSSG FL + +D+I+ IE RIA ++F+P E+GE +QILHYE G
Sbjct: 91 MVVDSSSGKSKDSRVRTSSGTFLPRGRDKIIRDIEKRIADFSFIPSEHGEGLQILHYEVG 150
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SEC 117
QKYEPHFD+F D N + GG RIATVLMYLS VE+GGETVFP+++ + S W SEC
Sbjct: 151 QKYEPHFDYFMDDYNTENGGQRIATVLMYLSDVEEGGETVFPSAKGNISSVPWWNELSEC 210
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ G +VKP GDALLF+S+ PDAS D +SLHG CPVI G KWS+TKW+ V +
Sbjct: 211 GKGGLSVKPKMGDALLFWSMKPDASLDPSSLHGGCPVIRGNKWSSTKWMRVNEY 264
>gi|215697788|dbj|BAG91981.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 225
Score = 201 bits (511), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 97/200 (48%), Positives = 137/200 (68%), Gaps = 10/200 (5%)
Query: 27 QDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATV 86
+D +V+ IE RI+ W+FLP ENGE++Q+L Y + +++ G HR+AT+
Sbjct: 33 EDIVVSKIEDRISLWSFLPKENGESIQVLKYGVNRSGS-----IKEEPKSSSGAHRLATI 87
Query: 87 LMYLSHVEKGGETVFPNSEV--SQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTD 144
LMYLS V++GGETVFP SE+ +Q+++G S+C+ GYAV+P KG+A+L F+L PD TD
Sbjct: 88 LMYLSDVKQGGETVFPRSEMKDAQAKEGAPSQCS--GYAVRPAKGNAILLFNLRPDGETD 145
Query: 145 STSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE-PEDDDCVDEDLNCVVWAKAGECKKNP 203
S + CPV+EGEKW A K I++R FD P+ +D+C DED CV WA +GEC +NP
Sbjct: 146 KDSQYEECPVLEGEKWLAIKHINLRKFDYPKSSLASEDECTDEDDRCVSWAASGECDRNP 205
Query: 204 LYMVGSKSSRGYCRKSCKVC 223
++M+GS G CRKSC+VC
Sbjct: 206 VFMIGSSDYYGSCRKSCRVC 225
>gi|115434812|ref|NP_001042164.1| Os01g0174500 [Oryza sativa Japonica Group]
gi|55296794|dbj|BAD68120.1| prolyl 4-hydroxylase -like [Oryza sativa Japonica Group]
gi|113531695|dbj|BAF04078.1| Os01g0174500 [Oryza sativa Japonica Group]
gi|222617830|gb|EEE53962.1| hypothetical protein OsJ_00571 [Oryza sativa Japonica Group]
Length = 303
Score = 201 bits (510), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 97/200 (48%), Positives = 137/200 (68%), Gaps = 10/200 (5%)
Query: 27 QDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATV 86
+D +V+ IE RI+ W+FLP ENGE++Q+L Y + +++ G HR+AT+
Sbjct: 111 EDIVVSKIEDRISLWSFLPKENGESIQVLKYGVNRSGS-----IKEEPKSSSGAHRLATI 165
Query: 87 LMYLSHVEKGGETVFPNSEV--SQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTD 144
LMYLS V++GGETVFP SE+ +Q+++G S+C+ GYAV+P KG+A+L F+L PD TD
Sbjct: 166 LMYLSDVKQGGETVFPRSEMKDAQAKEGAPSQCS--GYAVRPAKGNAILLFNLRPDGETD 223
Query: 145 STSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE-PEDDDCVDEDLNCVVWAKAGECKKNP 203
S + CPV+EGEKW A K I++R FD P+ +D+C DED CV WA +GEC +NP
Sbjct: 224 KDSQYEECPVLEGEKWLAIKHINLRKFDYPKSSLASEDECTDEDDRCVSWAASGECDRNP 283
Query: 204 LYMVGSKSSRGYCRKSCKVC 223
++M+GS G CRKSC+VC
Sbjct: 284 VFMIGSSDYYGSCRKSCRVC 303
>gi|356502598|ref|XP_003520105.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 296
Score = 200 bits (508), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 94/170 (55%), Positives = 129/170 (75%), Gaps = 3/170 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++E+G SI S VRTSSG FL++ +D+IV +IE RIA +TF+P +NGE +Q+LHY+ G+
Sbjct: 122 VIESETGMSIESRVRTSSGTFLARGRDKIVRNIENRIADFTFIPVDNGEELQVLHYQVGE 181
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
KY PH D+F D +N GG RIAT+LMYLS VE+GGETVFP+++ + S W+E C
Sbjct: 182 KYVPHHDYFMDDINTANGGDRIATMLMYLSDVEEGGETVFPDAKGNFSSMPGWNELSVCG 241
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
++G ++KP +ALLF+S+ PDA+ D SLHGSCPVI+G KWS+TKWI +
Sbjct: 242 KKGLSIKPKMRNALLFWSIKPDATYDPLSLHGSCPVIKGNKWSSTKWIRI 291
>gi|357517897|ref|XP_003629237.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523259|gb|AET03713.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|388513409|gb|AFK44766.1| unknown [Medicago truncatula]
gi|388516345|gb|AFK46234.1| unknown [Medicago truncatula]
Length = 275
Score = 199 bits (507), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 93/173 (53%), Positives = 129/173 (74%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++GKS+ S +RTSSG FL + DEIV++IE RIA +TF+P E+GE+ +LHYE GQ
Sbjct: 103 VIDEKTGKSLNSSIRTSSGTFLDREGDEIVSNIEKRIADFTFIPVEHGESFNVLHYEVGQ 162
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
KYEPH+D+F D + + G RIAT+LMYLS VE+GGETVFPN++ + S W+E C
Sbjct: 163 KYEPHYDYFLDTFSTRHAGQRIATMLMYLSDVEEGGETVFPNAKGNFSSVPWWNELSDCG 222
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ G ++KP G+A+LF+S+ PDA+ D +SLHG+CPVI+G+KWS KW+H +
Sbjct: 223 KGGLSIKPKMGNAILFWSMKPDATLDPSSLHGACPVIKGDKWSCAKWMHADEY 275
>gi|159487763|ref|XP_001701892.1| predicted protein [Chlamydomonas reinhardtii]
gi|158281111|gb|EDP06867.1| predicted protein [Chlamydomonas reinhardtii]
Length = 259
Score = 199 bits (505), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 105/222 (47%), Positives = 140/222 (63%), Gaps = 11/222 (4%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
GKS+ RTS G FL + QDEIV IE R+AAWT +P + E QIL Y GQ+Y+ H
Sbjct: 43 GKSVEDNYRTSYGTFLKRYQDEIVERIENRVAAWTQIPVAHQEDTQILRYGLGQQYKVHA 102
Query: 68 DFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE-----VSQSRDGNWSECARRGY 122
D RD + G R+ATVL+YL+ + GGET FP+SE ++++ N+S+CA+
Sbjct: 103 DTLRD----EEAGVRVATVLIYLNEPDGGGETAFPSSEWVNPQLAKTLGANFSDCAKNHV 158
Query: 123 AVKPMKGDALLFFSLHPDASTDST-SLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDD 181
A P +GDALLF+S++PD +T+ T + H CPV+ G KW+ATKWIH R F +P + +
Sbjct: 159 AFAPKRGDALLFWSINPDGNTEDTHASHTGCPVLSGVKWTATKWIHARPF-RPNEMADPG 217
Query: 182 DCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
C DE NC WA G+C+KN YMV + S G CRKSC C
Sbjct: 218 VCYDESPNCPEWAARGDCEKNSDYMVVNAVSPGVCRKSCGAC 259
>gi|218187602|gb|EEC70029.1| hypothetical protein OsI_00603 [Oryza sativa Indica Group]
Length = 549
Score = 196 bits (499), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 96/200 (48%), Positives = 135/200 (67%), Gaps = 10/200 (5%)
Query: 27 QDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATV 86
+D +V+ IE RI+ W+FLP ENGE +Q+L Y ++ +++ GGH +AT+
Sbjct: 357 EDIVVSKIEDRISLWSFLPKENGENIQVLKYGVNRRGS-----IKEEPKSSTGGHWLATI 411
Query: 87 LMYLSHVEKGGETVFPNSEV--SQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTD 144
L+YLS V++GGETVFP SE+ +Q+++G S+C+ GYAV+P KG+ALL F+L PD D
Sbjct: 412 LIYLSDVKQGGETVFPRSEMKDAQAKEGAPSQCS--GYAVRPAKGNALLLFNLRPDGEID 469
Query: 145 STSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE-PEDDDCVDEDLNCVVWAKAGECKKNP 203
S + CPV+EGEKW A K IH+R D P+ +D+C DED CV WA +GEC +NP
Sbjct: 470 KDSQYEECPVLEGEKWLAIKHIHLRKLDSPKSSLASEDECTDEDDRCVSWAASGECDRNP 529
Query: 204 LYMVGSKSSRGYCRKSCKVC 223
++M+GS G CRKSC+VC
Sbjct: 530 VFMIGSSDYYGSCRKSCRVC 549
>gi|159487419|ref|XP_001701720.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280939|gb|EDP06695.1| predicted protein [Chlamydomonas reinhardtii]
Length = 274
Score = 195 bits (495), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 99/233 (42%), Positives = 137/233 (58%), Gaps = 17/233 (7%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G+ + +RTS GMF+ + QD +VA IE RI+ WT LP E+ E +Q+L Y HGQ Y H+
Sbjct: 43 GEGVVDNIRTSYGMFIRRLQDPVVARIEKRISLWTHLPVEHQEDIQVLRYAHGQTYGAHY 102
Query: 68 DFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV------SQSRDGNWSECARRG 121
D DK N+ R+AT LMYLS VE+GGET FP++ V + +S+CA+
Sbjct: 103 D-SGDKSNEPGPKWRLATFLMYLSDVEEGGETAFPHNSVWADPSIPEKVGDKFSDCAKGN 161
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK-------- 173
A KP GDA+LF+S +P+ + D ++H CPVI+G KW+A W+H F
Sbjct: 162 VAAKPKAGDAVLFYSFYPNMTMDPAAMHTGCPVIKGVKWAAPVWMHDIPFRPSEISGMVQ 221
Query: 174 --PEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCK 224
P+ EP+ C D CV WA AGEC+ N +M+G + G CRK+CK C+
Sbjct: 222 RIPDNEPDAGTCTDLHPRCVEWAAAGECEHNKGFMMGGPDNLGTCRKTCKACE 274
>gi|357467087|ref|XP_003603828.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492876|gb|AES74079.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 156
Score = 194 bits (494), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 88/154 (57%), Positives = 119/154 (77%), Gaps = 3/154 (1%)
Query: 21 MFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGG 80
MFL + +D+I+ +IE RIA +TF+P ENGE +Q+LHY G+KYEPH+D+F D+ N + GG
Sbjct: 1 MFLKRGKDKIIQNIERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYFLDEFNTKNGG 60
Query: 81 HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECARRGYAVKPMKGDALLFFSL 137
R+ATVLMYLS VE+GGETVFP ++ + S W SECAR+G ++KP GDALLF+S+
Sbjct: 61 QRVATVLMYLSDVEEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPKMGDALLFWSM 120
Query: 138 HPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
PDA+ D++SLHG CPVI G KWS+TKW+H+ +
Sbjct: 121 RPDATLDASSLHGGCPVIVGNKWSSTKWMHLEEY 154
>gi|302143843|emb|CBI22704.3| unnamed protein product [Vitis vinifera]
Length = 317
Score = 194 bits (493), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 91/200 (45%), Positives = 133/200 (66%), Gaps = 5/200 (2%)
Query: 28 DEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVL 87
DE+ A IE RI+AWTFLP EN E ++++ Y+ + + +++F +K + G +ATVL
Sbjct: 119 DEVAARIEKRISAWTFLPKENSEPLEVVQYQF-ENAKQKYNYFSNKSTSKFGEPLMATVL 177
Query: 88 MYLSHVEKGGETVFPNSEV--SQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDS 145
++LS+V +GGE FP SE+ SQS+ G S+C ++P+KG+A+LFF++HP+AS D
Sbjct: 178 LHLSNVTRGGELFFPESELKNSQSKSGILSDCTESSSGLRPVKGNAILFFNVHPNASPDK 237
Query: 146 TSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDD--DCVDEDLNCVVWAKAGECKKNP 203
+S + CPV+EGE W ATK+ H+R + + D +C DED NC WA GEC++NP
Sbjct: 238 SSSYARCPVLEGEMWCATKFFHLRAIGRENVSFKLDGGECTDEDENCPKWASIGECQRNP 297
Query: 204 LYMVGSKSSRGYCRKSCKVC 223
+YM+GS G CRKSC VC
Sbjct: 298 IYMIGSPDYYGTCRKSCNVC 317
>gi|357517895|ref|XP_003629236.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523258|gb|AET03712.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 326
Score = 194 bits (493), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 92/173 (53%), Positives = 122/173 (70%), Gaps = 3/173 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+G + S RTSSG FL + D IV +IE RIA +TF+P E+GE +LHYE GQ
Sbjct: 152 VIDEETGNGVDSRERTSSGAFLKRGSDRIVKNIERRIADFTFIPVEHGENFNVLHYEVGQ 211
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
KYEPH+D+F D + G RIAT+LMYLS VE+GGETVFPN++ + S W+E C
Sbjct: 212 KYEPHYDYFMDTFSTTYAGQRIATMLMYLSDVEEGGETVFPNAKGNFSSVPWWNELSDCG 271
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ G ++KP G+A+LF+S+ PDA+ D +SLHG+CPVI+G+KW KW+HV F
Sbjct: 272 KGGLSIKPKMGNAILFWSMKPDATLDPSSLHGACPVIKGDKWLCAKWMHVGEF 324
>gi|302844247|ref|XP_002953664.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
nagariensis]
gi|300261073|gb|EFJ45288.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
nagariensis]
Length = 364
Score = 194 bits (493), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 104/234 (44%), Positives = 138/234 (58%), Gaps = 20/234 (8%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G+ + +RTS GMF+ + D I+A IE RI+ WT LP E+ E +Q+L Y HGQ Y H+
Sbjct: 90 GEGVVDNIRTSFGMFIRRLSDPIIARIEKRISLWTHLPIEHQEDIQVLRYAHGQTYGAHY 149
Query: 68 DFFRDKMNQQLGGH-RIATVLMYLSHVEKGGETVFPNSEV-----SQSRDGNWSECARRG 121
D + +G R+AT LMYLS VE+GGET FP + V R G SECA+
Sbjct: 150 D--SGASSDHVGPKWRLATFLMYLSDVEEGGETAFPQNSVWYDPTIPERIGPVSECAKGH 207
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE------ 175
A KP GDA+LF+S P+ + D ++H CPVI+G KW+A W+H F +PE
Sbjct: 208 VAAKPKAGDAVLFYSFLPNNTMDPAAMHTGCPVIKGIKWAAPVWMHDIPF-RPEEVQGGK 266
Query: 176 -----KEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCK 224
++PE CVD C WA AGEC+KNP+YM G +S G CRKSC+ C+
Sbjct: 267 QLIMDRDPEAGLCVDGHPRCGEWAAAGECEKNPMYMAGGPNSLGTCRKSCRTCE 320
>gi|18071415|gb|AAL58274.1|AC068923_16 putative prolyl 4-hydroxylase, alpha subunit [Oryza sativa Japonica
Group]
Length = 343
Score = 191 bits (485), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 92/157 (58%), Positives = 118/157 (75%), Gaps = 3/157 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D +G S S VRTSSGMFL + QD+I+ +IE RI+ +TF+P ENGE +Q+LHYE GQ
Sbjct: 147 VVDASTGGSKDSRVRTSSGMFLGRGQDKIIRTIEKRISDYTFIPVENGEGLQVLHYEVGQ 206
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD---GNWSECA 118
KYEPHFD+F D+ N + GG RIAT+LMYLS VE+GGET+FP+S+ + S SECA
Sbjct: 207 KYEPHFDYFHDEFNTKNGGQRIATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECA 266
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVI 155
++G AVKP GDALLF+S+ PD S D+TSLHG P++
Sbjct: 267 KKGLAVKPKMGDALLFWSMRPDGSLDATSLHGEIPIL 303
>gi|357517885|ref|XP_003629231.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523253|gb|AET03707.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 279
Score = 191 bits (484), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 89/170 (52%), Positives = 127/170 (74%), Gaps = 3/170 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +GKS+ S RTSSG F+ + D+I++ IE RIA +TF+P E+GE + ILHYE GQ
Sbjct: 107 VVDDTTGKSVNSSARTSSGTFIDRGYDKILSDIEKRIADFTFIPVEHGEDVNILHYEVGQ 166
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
KY+ H D+F D++N + GG RIAT+LMYLS VE+GGETVFP+++ + S W+E C
Sbjct: 167 KYDFHTDYFEDEVNTKHGGERIATMLMYLSDVEEGGETVFPSAKGNFSSVPWWNELSDCG 226
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
++G ++KP G+A+LF+ + PDA+ D S+HG+CPVI+G+KWS TKW+ V
Sbjct: 227 KKGLSIKPKMGNAILFWGMKPDATVDPLSVHGACPVIKGDKWSCTKWMRV 276
>gi|307102975|gb|EFN51240.1| hypothetical protein CHLNCDRAFT_28187 [Chlorella variabilis]
Length = 322
Score = 191 bits (484), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 110/244 (45%), Positives = 145/244 (59%), Gaps = 17/244 (6%)
Query: 1 MVADNESGKSIASEVR---TSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHY 57
+V+ + SGK + R +SSG FL+K QD +VA +E RI T LP + E +Q+L Y
Sbjct: 52 VVSRDGSGKLDSVRTRQGLSSSGTFLTKRQDSVVAGVEDRIELATHLPFSHSEQLQVLKY 111
Query: 58 EHGQKYEPHFDFFRDKMNQQL-------GGHRIATVLMYLSHVEKGGETVFPNS----EV 106
E GQKY H+D QL GG R AT+LMYLS VE+GGET FP+ E
Sbjct: 112 ELGQKYSAHYDVHGSNEQAQLAIRRGEQGGSRYATMLMYLSDVEEGGETSFPHGRWIDEG 171
Query: 107 SQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDA-STDSTSLHGSCPVIEGEKWSATKW 165
+Q++ +SEC RG AVKP KGDA+LF+SL D S D SLH CPV +G K+SAT W
Sbjct: 172 AQAQP-PYSECGSRGVAVKPRKGDAILFYSLKSDGQSKDFFSLHAGCPVAKGVKYSATAW 230
Query: 166 IHVRNFDKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCKP 225
IHV + C D + C WA GEC++N ++M G+ + RG+CR SCKVC+P
Sbjct: 231 IHVEPYSN-TGPLHPGFCRDNNAKCPEWAALGECERNVVFMRGNGTYRGHCRLSCKVCQP 289
Query: 226 SSVS 229
+ +
Sbjct: 290 CAAN 293
>gi|359490628|ref|XP_002271805.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Vitis
vinifera]
Length = 312
Score = 189 bits (481), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 89/198 (44%), Positives = 130/198 (65%), Gaps = 6/198 (3%)
Query: 28 DEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVL 87
DE+ A IE RI+AWTFLP EN E ++++ Y+ + + +++F +K + G +ATVL
Sbjct: 119 DEVAARIEKRISAWTFLPKENSEPLEVVQYQF-ENAKQKYNYFSNKSTSKFGEPLMATVL 177
Query: 88 MYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTS 147
++LS+V +GGE FP SE S+ G S+C ++P+KG+A+LFF++HP+AS D +S
Sbjct: 178 LHLSNVTRGGELFFPESE---SKSGILSDCTESSSGLRPVKGNAILFFNVHPNASPDKSS 234
Query: 148 LHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDD--DCVDEDLNCVVWAKAGECKKNPLY 205
+ CPV+EGE W ATK+ H+R + + D +C DED NC WA GEC++NP+Y
Sbjct: 235 SYARCPVLEGEMWCATKFFHLRAIGRENVSFKLDGGECTDEDENCPKWASIGECQRNPIY 294
Query: 206 MVGSKSSRGYCRKSCKVC 223
M+GS G CRKSC VC
Sbjct: 295 MIGSPDYYGTCRKSCNVC 312
>gi|255545252|ref|XP_002513687.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223547595|gb|EEF49090.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 309
Score = 189 bits (481), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 96/221 (43%), Positives = 139/221 (62%), Gaps = 13/221 (5%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G + ++ +S S D+++A IE RI+AWTF+P EN + +Q++HY + E HF
Sbjct: 97 GDGSRNNIQLASSESRSHIYDDLLARIEERISAWTFIPKENSKPLQVMHYGIEEARE-HF 155
Query: 68 DFFRDKM---NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAV 124
D+F +K N L +AT+++YLS+V +GGE +FP SE+ +D WS+C + +
Sbjct: 156 DYFDNKTLISNVSL----MATLVLYLSNVTRGGEILFPKSEL---KDKVWSDCTKDSSIL 208
Query: 125 KPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDD--D 182
+P+KG+A+L F+ H +AS DS S HG CPV+EGE W ATK VR ++ + P+ D D
Sbjct: 209 RPVKGNAVLIFNAHLNASADSRSTHGRCPVLEGEMWCATKQFLVRATNEEKSLPDSDGSD 268
Query: 183 CVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
C DED NC WA GEC++NP++M GS G CRKSC C
Sbjct: 269 CTDEDDNCPKWAALGECQRNPIFMTGSPDYYGTCRKSCNAC 309
>gi|412992163|emb|CCO19876.1| predicted protein [Bathycoccus prasinos]
Length = 350
Score = 187 bits (476), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 100/232 (43%), Positives = 137/232 (59%), Gaps = 17/232 (7%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +G+ +RTS FL++ + +V +E R++ +T LP NGE MQIL Y G+
Sbjct: 117 VVDSITGEIKTDPIRTSKQTFLARGKYPVVTRVEERLSRFTMLPWYNGEDMQILSYGVGE 176
Query: 62 KYEPHFDFFRD--KMNQQL---GGHRIATVLMYLSHVEKGGETVFPNSE----VSQSRDG 112
KY H D K QQL GG R+ATVL+YL E+GGET FP+SE S+
Sbjct: 177 KYSAHHDVGEKNTKSGQQLSADGGQRVATVLLYLQDTEEGGETAFPDSEWIEPESEYAQQ 236
Query: 113 NWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF- 171
+SECA+ G A KP +GD LLFFS+ P+ D S+H CPV++G KW+ATKWIH R F
Sbjct: 237 KFSECAKNGVAFKPKRGDGLLFFSITPEGDIDQKSMHAGCPVVKGTKWTATKWIHARPFH 296
Query: 172 -DKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKV 222
P +P + C + D C WA AGEC++NP +M + C+ +C+V
Sbjct: 297 YKLPNPKPPKEGCENTDERCKGWANAGECERNPGFMTKN------CKWACRV 342
>gi|222623961|gb|EEE58093.1| hypothetical protein OsJ_08962 [Oryza sativa Japonica Group]
Length = 387
Score = 186 bits (472), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 89/157 (56%), Positives = 117/157 (74%), Gaps = 3/157 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +GKS S VRTSSGMFL + +D+++ +IE RIA +TF+P E+GE +Q+LHYE GQ
Sbjct: 136 VVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQ 195
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
KYEPHFD+F D+ N + GG R+AT+LMYLS VE+GGET+FP++ V+ S SECA
Sbjct: 196 KYEPHFDYFLDEYNTKNGGQRMATLLMYLSDVEEGGETIFPDANVNSSSLPWYNELSECA 255
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVI 155
R+G AVKP GDALLF+S+ PDA+ D SLH + V
Sbjct: 256 RKGLAVKPKMGDALLFWSMKPDATLDPLSLHDTLRVF 292
>gi|242051901|ref|XP_002455096.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
gi|241927071|gb|EES00216.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
Length = 303
Score = 186 bits (471), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 95/201 (47%), Positives = 131/201 (65%), Gaps = 12/201 (5%)
Query: 27 QDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGH-RIAT 85
+D IV++IE RI+ W+FLP + GE+MQIL KYE + + + +Q GH R+ T
Sbjct: 111 EDTIVSTIEDRISVWSFLPKDFGESMQIL------KYEVNKSDYNNYESQSSSGHDRLVT 164
Query: 86 VLMYLSHVEKGGETVFPNSEVSQSRD--GNWSECARRGYAVKPMKGDALLFFSLHPDAST 143
VLMYLS V++GGET FP SE+ ++ SECA GYAV+P++G+A+L F+L PD
Sbjct: 165 VLMYLSDVKRGGETAFPRSELKGTKVELAAPSECA--GYAVQPVRGNAILLFNLKPDGVI 222
Query: 144 DSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE-PEDDDCVDEDLNCVVWAKAGECKKN 202
D S + C V+EGE+W A K IH+R D P+ +D+C DED CV WA GEC +N
Sbjct: 223 DKDSQYEMCSVLEGEEWLAIKHIHLRKIDTPKSSLVSEDECTDEDDRCVSWAAGGECDRN 282
Query: 203 PLYMVGSKSSRGYCRKSCKVC 223
P++M+G+ G CRKSC+VC
Sbjct: 283 PIFMIGTPDYYGSCRKSCRVC 303
>gi|218191856|gb|EEC74283.1| hypothetical protein OsI_09531 [Oryza sativa Indica Group]
Length = 376
Score = 185 bits (470), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 88/151 (58%), Positives = 115/151 (76%), Gaps = 3/151 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +GKS S VRTSSGMFL + +D+++ +IE RIA +TF+P E+GE +Q+LHYE GQ
Sbjct: 136 VVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQ 195
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR---DGNWSECA 118
KYEPHFD+F D+ N + GG R+AT+LMYLS VE+GGET+FP++ V+ S SECA
Sbjct: 196 KYEPHFDYFLDEYNTKNGGQRMATLLMYLSDVEEGGETIFPDANVNSSSLPWYNELSECA 255
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLH 149
R+G AVKP GDALLF+S+ PDA+ D SLH
Sbjct: 256 RKGLAVKPKMGDALLFWSMKPDATLDPLSLH 286
>gi|224122338|ref|XP_002318810.1| predicted protein [Populus trichocarpa]
gi|222859483|gb|EEE97030.1| predicted protein [Populus trichocarpa]
Length = 310
Score = 184 bits (468), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 95/222 (42%), Positives = 138/222 (62%), Gaps = 10/222 (4%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHY--EHGQ 61
D++SG+ + + SS L+ D I++ IE R++AWT LP EN + +Q++HY E +
Sbjct: 97 DDDSGRIERNRLFASSTSLLN-MDDNILSRIEERVSAWTLLPKENSKPLQVMHYGIEDAK 155
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
Y FD+F +K +AT++ YLS+V +GGE FP SEV ++ WS+C +
Sbjct: 156 NY---FDYFGNKSAIISSEPLMATLVFYLSNVTQGGEIFFPKSEV---KNKIWSDCTKIS 209
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDD 181
+++P+KG+A+LFF++HP+ S D S H CPV+EGE W ATK ++R K + E
Sbjct: 210 DSLRPIKGNAILFFTVHPNTSPDMGSSHSRCPVLEGEMWYATKKFYLRAI-KVFSDSEGS 268
Query: 182 DCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+C DED NC WA GEC+KNP+YM+GS G CRKSC C
Sbjct: 269 ECTDEDENCPSWAALGECEKNPVYMIGSPDYFGTCRKSCNAC 310
>gi|159487421|ref|XP_001701721.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280940|gb|EDP06696.1| predicted protein [Chlamydomonas reinhardtii]
Length = 336
Score = 184 bits (466), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 98/229 (42%), Positives = 134/229 (58%), Gaps = 21/229 (9%)
Query: 14 EVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK 73
++RTS GMF+ + D +V IE RI+ WT LP E+ E +QIL Y HGQ Y H+D
Sbjct: 67 DIRTSYGMFIRRLSDPVVTRIEKRISLWTHLPVEHQEDIQILRYAHGQTYGAHYD--SGA 124
Query: 74 MNQQLGGH-RIATVLMYLSHVEKGGETVFPNSEV------SQSRDGNWSECARRGYAVKP 126
+ +G R+AT LMYLS VE+GGET FP++ V + +S+CA+ A KP
Sbjct: 125 SSDHVGPKWRLATFLMYLSDVEEGGETAFPHNSVWADPSIPEQVGDKFSDCAKGHVAAKP 184
Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE----------- 175
GDA+LF+S +P+ + D S+H CPVI+G KW+A W+H F +PE
Sbjct: 185 KAGDAVLFYSFYPNNTMDPASMHTGCPVIKGVKWAAPVWMHDIPF-RPEEISGMTQHNMD 243
Query: 176 KEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCK 224
++P+ C D C WA AGEC+ N YM G ++ G CRKSCKVC+
Sbjct: 244 RDPDAGTCTDLHARCTEWAAAGECENNKAYMCGGSNNLGACRKSCKVCE 292
>gi|357467075|ref|XP_003603822.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492870|gb|AES74073.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 683
Score = 182 bits (461), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 88/169 (52%), Positives = 120/169 (71%), Gaps = 3/169 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+V D +G+ S RTSSGMFL + +D+IV +IE RIA T +P ENGE + ++HY G
Sbjct: 148 LVVDGVTGEVKESSSRTSSGMFLDRGKDKIVQNIERRIADITSVPIENGEGLHVIHYGVG 207
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
QK EPH+D+ D + + GG R+ATVLMYLS VE+GGETVFP+ +Q + S+C+
Sbjct: 208 QKCEPHYDYTSDGVVTKNGGPRVATVLMYLSDVEEGGETVFPD---AQPNFTSVSKCSGD 264
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
G +VKP GDALLF+S+ PD + D++SLHG PVI G KW++TKW+H+R
Sbjct: 265 GLSVKPKMGDALLFWSMKPDGTLDTSSLHGGSPVIRGNKWASTKWLHLR 313
Score = 136 bits (342), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 75/171 (43%), Positives = 102/171 (59%), Gaps = 18/171 (10%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+V D +GK S RTSSG FL + +D+IV +IE RIA T +P + M
Sbjct: 393 LVVDGLTGKGRESSARTSSGRFLERGKDKIVQNIEQRIADITSIPRMARDFML------- 445
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
F + + GG R+ATVLMYLS VE+GGETVFPN++ + + + E +
Sbjct: 446 --------FTAGGVVTKNGGPRVATVLMYLSDVEEGGETVFPNAKPNINSVSKYPE---K 494
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G +VKP GDALLF S+ PD + D++SLHG PVI G KW++TKW+H+ F
Sbjct: 495 GLSVKPKMGDALLFRSMKPDGTLDTSSLHGGSPVIRGNKWASTKWLHLTEF 545
>gi|307109700|gb|EFN57937.1| hypothetical protein CHLNCDRAFT_142031 [Chlorella variabilis]
Length = 325
Score = 181 bits (460), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 102/234 (43%), Positives = 135/234 (57%), Gaps = 22/234 (9%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G+S+ RTS GMF+ + DE+V+++E R+A WT + E +Q+L Y Q+Y+ HF
Sbjct: 74 GESVVDNYRTSYGMFIRRHHDEVVSTLEKRVATWTKYNVTHQEDIQVLRYGTTQEYKAHF 133
Query: 68 DFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE----VSQSRDGNWSECARRGYA 123
D D R ATVL+YLS VE GGET FPNSE G +SECA+ A
Sbjct: 134 DSLDDD------SPRTATVLIYLSDVESGGETTFPNSEWIDPALPKALGPFSECAQGHVA 187
Query: 124 VKPMKGDALLFFSLHPDA-STDSTSLHGSCPVIEGEKWSATKWIHVRNFD--------KP 174
+KP +GDA++F SL+PD S D +LH +CPVI G K+ A WIH + F P
Sbjct: 188 MKPKRGDAIVFHSLNPDGRSHDQHALHTACPVIVGVKYVAIFWIHTKPFRPEQLKGPLAP 247
Query: 175 EKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCK---VCKP 225
E +DCVD D C WA +GEC +NP +M G+ ++ G CR SC VCKP
Sbjct: 248 EPPMVPEDCVDADPGCPGWAASGECDRNPGFMRGAATTLGTCRASCGDCVVCKP 301
>gi|363543297|ref|NP_001241864.1| prolyl 4-hydroxylase 4-2 precursor [Zea mays]
gi|194704960|gb|ACF86564.1| unknown [Zea mays]
gi|347978810|gb|AEP37747.1| prolyl 4-hydroxylase 4-2 [Zea mays]
Length = 207
Score = 180 bits (457), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 87/125 (69%), Positives = 100/125 (80%), Gaps = 2/125 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKS+ SEVRTSSGMFL K QD +V+ IE RIAAWTFLP EN E MQ+L YE G
Sbjct: 81 MVADNESGKSVKSEVRTSSGMFLDKRQDPVVSRIEERIAAWTFLPQENAENMQVLRYEPG 140
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
QKYEPHFD+F D++NQ GGHR ATVLMYLS V +GGETVFPN++ SQ +D +SECA
Sbjct: 141 QKYEPHFDYFHDRVNQARGGHRYATVLMYLSTVREGGETVFPNAKGWESQPKDATFSECA 200
Query: 119 RRGYA 123
+G A
Sbjct: 201 HKGLA 205
>gi|255083627|ref|XP_002508388.1| predicted protein [Micromonas sp. RCC299]
gi|226523665|gb|ACO69646.1| predicted protein [Micromonas sp. RCC299]
Length = 253
Score = 180 bits (457), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 102/219 (46%), Positives = 133/219 (60%), Gaps = 13/219 (5%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +G+S +RTS FL++ +IV +E R+A T LP +GE MQIL Y GQ
Sbjct: 35 VIDSVTGQSKVDPIRTSEQTFLNRGTWDIVTKVEERLAVVTQLPAYHGEDMQILKYGLGQ 94
Query: 62 KYEPHFDF--FRDKMNQQL---GGHRIATVLMYLSHVEKGGETVFPNSE-----VSQSRD 111
KY+ H D +QL GGHR+ATVL+YLS VE+GGET FP+SE + + +
Sbjct: 95 KYDAHHDVGELTSASGKQLAAEGGHRVATVLLYLSDVEEGGETAFPDSEWMTPELRKWAE 154
Query: 112 GN-WSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
G WS+CA AVKP KGD LLF+S++ + + D S+H CPVI GEKW+ATKWIH R
Sbjct: 155 GQKWSDCAEGNVAVKPRKGDGLLFWSVNNENAIDPHSMHAGCPVIRGEKWTATKWIHARP 214
Query: 171 F--DKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMV 207
F P C ++ C WA AGECKKNP +M+
Sbjct: 215 FRWTAPPPPKAPPGCDNKHELCKAWANAGECKKNPGFML 253
>gi|414591891|tpg|DAA42462.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
Length = 207
Score = 180 bits (457), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 87/125 (69%), Positives = 100/125 (80%), Gaps = 2/125 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADNESGKS+ SEVRTSSGMFL K QD +V+ IE RIAAWTFLP EN E MQ+L YE G
Sbjct: 81 MVADNESGKSVKSEVRTSSGMFLDKRQDPVVSRIEERIAAWTFLPQENAENMQVLRYEPG 140
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE--VSQSRDGNWSECA 118
QKYEPHFD+F D++NQ GGHR ATVLMYLS V +GGETVFPN++ SQ +D +SECA
Sbjct: 141 QKYEPHFDYFHDRVNQARGGHRYATVLMYLSTVREGGETVFPNAKGWESQPKDATFSECA 200
Query: 119 RRGYA 123
+G A
Sbjct: 201 HKGLA 205
>gi|412985583|emb|CCO19029.1| predicted protein [Bathycoccus prasinos]
Length = 458
Score = 179 bits (455), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 95/237 (40%), Positives = 134/237 (56%), Gaps = 23/237 (9%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+G + S++RTS+G F+ ++++ +E R+A ++ LP ++ EA Q+L YE Q
Sbjct: 215 VVDAETGGTAKSDIRTSTGSFVGIGANDLMKKLEKRVATFSMLPVKHQEATQVLRYEVKQ 274
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR-----DGNWSE 116
+Y H+D+F K + +RI T+LMYL E GGETVFPN+EV R N+SE
Sbjct: 275 EYRAHYDYFFHKGG--MANNRIVTILMYLHEPEFGGETVFPNTEVPLERAEKGWGKNFSE 332
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
C RG A KGDAL+F+S+ P D S H CPV+ GEKW+ATKWIHV ++ +
Sbjct: 333 CGNRGRAAVVRKGDALIFWSMKPGGELDPGSSHAGCPVVRGEKWTATKWIHVNPTNQWNQ 392
Query: 177 E----------PEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+ C D + C WA+ GEC NP +MV S C+ SC+ C
Sbjct: 393 NNHKVHYAGGPANSETCKDTNAACPGWAEGGECTANPGFMVNS------CKVSCRQC 443
>gi|255071007|ref|XP_002507585.1| predicted protein [Micromonas sp. RCC299]
gi|226522860|gb|ACO68843.1| predicted protein [Micromonas sp. RCC299]
Length = 433
Score = 179 bits (453), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 98/236 (41%), Positives = 139/236 (58%), Gaps = 22/236 (9%)
Query: 2 VADNESGKSIASEVRTSSGMFLSK----AQDEIVASIEARIAAWTFLPPENGEAMQILHY 57
V D +G S S +RTS+G F+ +++V IE RIAAWT +P +GE +Q+L Y
Sbjct: 196 VVDASNGGSSFSNIRTSTGSFVPTVFPLGMNDVVRRIERRIAAWTQIPAAHGEPIQVLRY 255
Query: 58 EHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR-DGNWSE 116
+ GQ+Y+ HFD+F + + +RIATVLMYLS V+ GGETVFP++E Q + +
Sbjct: 256 QIGQEYQSHFDYFFHEGGMK--NNRIATVLMYLSDVKDGGETVFPSAESLQVKPEPIHHA 313
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN---FDK 173
CA+ G V P KGDA+LF+++ D S H CPV+ GEKW+ATKW+HV + FD
Sbjct: 314 CAKNGITVIPKKGDAILFWNMKVGGDLDGGSTHAGCPVVLGEKWTATKWLHVSSSTEFDA 373
Query: 174 PE------KEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+ +E C + ++ C VWA+ EC++NP YM R C SC +C
Sbjct: 374 RQRVLREGRETNFGGCRNANIQCQVWAEQNECERNPQYM------RDTCHLSCGMC 423
>gi|297797785|ref|XP_002866777.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297312612|gb|EFH43036.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 266
Score = 177 bits (450), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 86/152 (56%), Positives = 113/152 (74%), Gaps = 3/152 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++GKS S VRTSSG FL++ +D+ + IE RI+ +TF+P E+GE +Q+LHYE GQ
Sbjct: 114 VVDEKTGKSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQ 173
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
KYEPH+D+F D+ N + GG RIATVLMYLS VE+GGETVFP ++ + S W SEC
Sbjct: 174 KYEPHYDYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECG 233
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHG 150
+ G +VKP GDALLF+S+ PDA+ D +SLHG
Sbjct: 234 KGGLSVKPKMGDALLFWSMTPDATLDPSSLHG 265
>gi|10177121|dbj|BAB10411.1| prolyl 4-hydroxylase, alpha subunit-like protein [Arabidopsis
thaliana]
Length = 267
Score = 177 bits (450), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 86/152 (56%), Positives = 113/152 (74%), Gaps = 3/152 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++GKS S VRTSSG FL++ +D+ + IE RI+ +TF+P E+GE +Q+LHYE GQ
Sbjct: 115 VVDEKTGKSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQ 174
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
KYEPH+D+F D+ N + GG RIATVLMYLS VE+GGETVFP ++ + S W SEC
Sbjct: 175 KYEPHYDYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECG 234
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHG 150
+ G +VKP GDALLF+S+ PDA+ D +SLHG
Sbjct: 235 KGGLSVKPKMGDALLFWSMTPDATLDPSSLHG 266
>gi|357467077|ref|XP_003603823.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492871|gb|AES74074.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 291
Score = 177 bits (449), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 87/175 (49%), Positives = 119/175 (68%), Gaps = 10/175 (5%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+V D +G+ I + VRTSSG FL + +D+IV ++E RIA T +P ENGE +QI+HYE G
Sbjct: 121 LVVDGVTGQGILNSVRTSSGTFLERGKDKIVQNVERRIADITSIPIENGEGLQIIHYEVG 180
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR- 119
QK+EPH+D+ + GG R+ATVLMYLS VE+GGETVFPN++ N++ ++
Sbjct: 181 QKFEPHYDYNFNWRITNNGGPRVATVLMYLSDVEEGGETVFPNAK------PNFNSVSKY 234
Query: 120 ---RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+G VKP GDALLF+S+ PD S D+ SLHG PVI G KW++ K +H+ F
Sbjct: 235 HPGKGLVVKPKMGDALLFWSVKPDGSLDTASLHGGSPVIRGSKWASNKLLHLTEF 289
>gi|242085722|ref|XP_002443286.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
gi|241943979|gb|EES17124.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
Length = 147
Score = 176 bits (447), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 85/152 (55%), Positives = 107/152 (70%), Gaps = 5/152 (3%)
Query: 21 MFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGG 80
MFL + QD IV +IE RIA +T +P ENGE +Q+LHY GQK+EPHFD+ ++GG
Sbjct: 1 MFLKRGQDTIVRTIEQRIADYTSVPIENGEPLQVLHYAVGQKFEPHFDYTDGTSVTKIGG 60
Query: 81 HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPD 140
R AT LMYLS VE+GGETVFPN+ S + A+ G +VKP GDALLF+S+ PD
Sbjct: 61 PRKATFLMYLSDVEEGGETVFPNATAKGS-----APSAKSGISVKPKMGDALLFWSMKPD 115
Query: 141 ASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
S D SLHG+ PVI+G+KWSATKWIHV ++
Sbjct: 116 GSLDPKSLHGASPVIKGDKWSATKWIHVNKYN 147
>gi|159469311|ref|XP_001692811.1| predicted protein [Chlamydomonas reinhardtii]
gi|158278064|gb|EDP03830.1| predicted protein [Chlamydomonas reinhardtii]
Length = 273
Score = 176 bits (446), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 82/170 (48%), Positives = 111/170 (65%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D SG S+ S++RTS GMF + +D I+ ++E R+A WT P GE++Q+L Y Q
Sbjct: 73 VVDTGSGGSVVSDIRTSDGMFFERGEDAIIEAVEQRLADWTMTPIWGGESLQVLRYRKDQ 132
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
KY+ H+D+F K GG+R ATVL+YL+ E+GGETVFP + +SECA+
Sbjct: 133 KYDSHWDYFFHKDGSSNGGNRWATVLLYLTETEEGGETVFPKIPAPNGINVGFSECAKYN 192
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
AVKP KGDALLF S+ P + S+HG+CPVI GEK+S TKWIH ++
Sbjct: 193 LAVKPHKGDALLFHSMKPTGELEERSMHGACPVIRGEKFSMTKWIHAGHY 242
>gi|357517893|ref|XP_003629235.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523257|gb|AET03711.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 196
Score = 176 bits (446), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 86/177 (48%), Positives = 121/177 (68%), Gaps = 16/177 (9%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
D+E+GKS+ + RTSSG F+++ D+I+ +IE RIA +TF+P ENGE++ ILHYE GQ
Sbjct: 33 TVDDETGKSVDNSARTSSGTFINRGHDKILRNIEQRIADFTFIPVENGESVNILHYEVGQ 92
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE---CA 118
KYEPH DFF D++N + GG E+GGETVFP +E + S W+E C
Sbjct: 93 KYEPHPDFFTDEINTKNGG-------------EQGGETVFPFAEGNFSSVPWWNELSDCG 139
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
++G ++KP GDALLF+S+ PD + D S+HG+CPVI+G+KWS TKW+ V + P+
Sbjct: 140 KKGLSIKPKMGDALLFWSMKPDGTLDPLSMHGACPVIKGDKWSCTKWMRVGKWSIPK 196
>gi|255083957|ref|XP_002508553.1| predicted protein [Micromonas sp. RCC299]
gi|226523830|gb|ACO69811.1| predicted protein [Micromonas sp. RCC299]
Length = 262
Score = 174 bits (441), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 101/231 (43%), Positives = 136/231 (58%), Gaps = 20/231 (8%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + + +VRTS G FL K D+++ IE R+ ++ + EN E +Q+L Y GQ
Sbjct: 38 VVGGKDDTGVLDDVRTSFGTFLPKKYDDVLYGIERRVEDFSQISYENQEQLQLLKYHDGQ 97
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE----VSQSRDG---NW 114
+Y+ H +D + GG RIATVLM+L EKGGET FP + V+Q G
Sbjct: 98 EYKDH----QDGLTSPNGGRRIATVLMFLHEPEKGGETSFPQGKPLPAVAQRLRGMRDEL 153
Query: 115 SECA---RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
S+CA RG AVKP +GDA+LFFS + +D S H SCP + G KW+ATKWIH + F
Sbjct: 154 SDCAWRDGRGLAVKPRRGDAVLFFSFKKNGGSDIASTHASCPTVGGVKWTATKWIHEKRF 213
Query: 172 DKPE-KEPEDDDCVDED-LNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSC 220
D +EP+ CVDE+ NC WAK+GEC NP YM+G ++ G C +SC
Sbjct: 214 DTGVWREPK---CVDEEPANCPGWAKSGECANNPAYMLGGETP-GKCLRSC 260
>gi|302850293|ref|XP_002956674.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
nagariensis]
gi|300258035|gb|EFJ42276.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
nagariensis]
Length = 325
Score = 174 bits (440), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 93/223 (41%), Positives = 126/223 (56%), Gaps = 18/223 (8%)
Query: 9 KSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFD 68
+ + ++RTS G FL +AQD ++ +IE R+A W+ +PP + E MQ+L Y KY PH D
Sbjct: 84 EGVVDDIRTSYGTFLRRAQDPVIMAIEERLALWSHMPPSHQEDMQVLRYGRTNKYGPHID 143
Query: 69 FFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNS--EVSQSRDGNWSECARRGYAVKP 126
G R+ATVLMYL E G + P S E + N S CA+ A KP
Sbjct: 144 ----------GLERVATVLMYLVG-ESPGPDLAPVSACECMYAEQSNPSACAKGHVAYKP 192
Query: 127 MKGDALLFFSLHPD-ASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKE----PEDD 181
+GDAL+FF + PD +TD S+H CPV+ G KW+A KWIH F + + P+
Sbjct: 193 KRGDALMFFDVKPDYTTTDGHSMHTGCPVVAGVKWNAVKWIHGTPFRRMRRNKPPLPDPG 252
Query: 182 DCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCK 224
C D C WA+AGEC+ NP YM+GS + G CR +CK C+
Sbjct: 253 VCTDLHEMCDTWARAGECQNNPGYMLGSNTGIGNCRLACKDCE 295
>gi|302841711|ref|XP_002952400.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
nagariensis]
gi|300262336|gb|EFJ46543.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
nagariensis]
Length = 269
Score = 174 bits (440), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 82/171 (47%), Positives = 111/171 (64%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D SG S+ S++RTS GMF + +D I+ ++E R+A WT P GEA+Q+L Y Q
Sbjct: 73 VVDTASGSSVVSDIRTSDGMFFERGEDAILEAVEQRLADWTMTPIWAGEALQVLRYRKDQ 132
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
KY+ H ++F K GG+R ATVL YL+ E+GGETVFP + +SECA+
Sbjct: 133 KYDSHVNYFFHKEGSANGGNRWATVLTYLTDTEEGGETVFPKIPAPGGVNVGFSECAKYN 192
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
AVKP KGDA+LF S+ + + SLHG+CPVI+GEK+S TKWIH ++D
Sbjct: 193 LAVKPRKGDAILFHSMKTNGQLEERSLHGACPVIKGEKFSMTKWIHAGHYD 243
>gi|24417248|gb|AAN60234.1| unknown [Arabidopsis thaliana]
Length = 190
Score = 173 bits (439), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 78/99 (78%), Positives = 93/99 (93%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADN+SG+S+ SEVRTSSGMFLSK QD+IV+++EA++AAWTFLP ENGE+MQILHYE+G
Sbjct: 92 MVADNDSGESVESEVRTSSGMFLSKRQDDIVSNVEAKLAAWTFLPEENGESMQILHYENG 151
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGET 99
QKYEPHFD+F D+ N +LGGHRIATVLMYLS+VEKGGET
Sbjct: 152 QKYEPHFDYFHDQANLELGGHRIATVLMYLSNVEKGGET 190
>gi|302831512|ref|XP_002947321.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
nagariensis]
gi|300267185|gb|EFJ51369.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
nagariensis]
Length = 797
Score = 173 bits (438), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 96/232 (41%), Positives = 137/232 (59%), Gaps = 12/232 (5%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+V D+++G+S ++RTS G + +D ++A IE RIA WT LPPE+GE MQIL Y G
Sbjct: 529 LVVDSQTGQSKLDDIRTSYGAAFGRGEDPVIAEIEERIAEWTHLPPEHGEPMQILRYVDG 588
Query: 61 QKYEPHFDFFRDKMNQQ---LGGHRIATVLMYLSHVEKGGETVFPNS---EVSQSRDGNW 114
QKY+ H+D+F D ++ + + G+R ATVL+YLS VE GGET P + ++S N
Sbjct: 589 QKYDAHWDWFDDPVHHRSYLVDGNRYATVLLYLSEVEAGGETNLPLADPIDMSVQAIENP 648
Query: 115 SEC-ARRGYAVKPMKGDALLFFSLHPDAST-DSTSLHGSCPVIEGEKWSATKWIHVRNF- 171
S C A+ G +++P KGDALLF+ + + D +LH SCP ++G KW+ATKWIH + +
Sbjct: 649 SPCAAKMGLSIRPRKGDALLFYDMDIEGQKGDRKALHASCPTLKGMKWTATKWIHSKPYM 708
Query: 172 DKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
+ + C D +C G C + MVG G CRKSC C
Sbjct: 709 GRFDPLRTAGVCRDTAQDCAALVAEGRCTSDLDTMVGPA---GKCRKSCGDC 757
>gi|302844281|ref|XP_002953681.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
nagariensis]
gi|300261090|gb|EFJ45305.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
nagariensis]
Length = 304
Score = 172 bits (437), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 97/232 (41%), Positives = 133/232 (57%), Gaps = 19/232 (8%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G+S+ RT + + QD++V IE R+AAWT + + E MQIL Y GQ+Y+ H
Sbjct: 43 GQSVEDSYRTLYTAGVRRYQDDVVERIENRVAAWTQISVLHQEDMQILRYGIGQQYKVHA 102
Query: 68 DFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE-----VSQSRDGNWSECARRGY 122
D RD G R+ATVL+YL+ E GGET FP+S+ ++++ N+S CA+
Sbjct: 103 DTLRDDE----AGVRVATVLIYLNEPEAGGETAFPDSQWVNPKLAETIGANFSACAKNHV 158
Query: 123 AVKPMKGDALLFFSLHPDAST-DSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE------ 175
A P +GDALLF+S+ PD +T D + H CPV+ G KW+ATKWIH + F E
Sbjct: 159 AFAPKRGDALLFWSIGPDGTTEDYHASHTGCPVLSGVKWTATKWIHAKPFRPQEMAAGRP 218
Query: 176 KEPEDDD---CVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCK 224
+P D C DE C WA G+C+KN YM+ + S G CRK+C CK
Sbjct: 219 HQPYVRDPGVCYDESPRCAEWAARGDCEKNRDYMIVNAVSPGVCRKACGACK 270
>gi|363543309|ref|NP_001241870.1| prolyl 4-hydroxylase 6-3 precursor [Zea mays]
gi|347978824|gb|AEP37754.1| prolyl 4-hydroxylase 6-3 [Zea mays]
Length = 208
Score = 172 bits (437), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 79/125 (63%), Positives = 102/125 (81%), Gaps = 2/125 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADN+SGKS+AS+ RTSSG FL+K +DEIV++IE R+AAWTFLP EN E++Q+L YE G
Sbjct: 71 MVADNDSGKSVASQARTSSGTFLAKREDEIVSAIEKRVAAWTFLPEENAESLQVLRYETG 130
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS--QSRDGNWSECA 118
QKY+ HFD+F D+ N +LGG R+ATVLMYL+ V+KGGE VFP++E S Q +D WS+C+
Sbjct: 131 QKYDAHFDYFHDRNNLKLGGQRVATVLMYLTDVKKGGEAVFPDAEGSHLQYKDETWSDCS 190
Query: 119 RRGYA 123
R G A
Sbjct: 191 RSGLA 195
>gi|302823087|ref|XP_002993198.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
gi|300138968|gb|EFJ05718.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
Length = 269
Score = 172 bits (437), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 91/178 (51%), Positives = 117/178 (65%), Gaps = 14/178 (7%)
Query: 2 VADNESGKS---IASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILH 56
V D +GK+ I S+VRTS+GMFLS ++ +IE RIA ++ +P ENGE +Q+L
Sbjct: 97 VVDTSTGKARHGIESKVRTSTGMFLSNYDRRYPMIQAIERRIAVYSMIPVENGELLQVLR 156
Query: 57 YEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
YE Q Y+PH D+F D+ N + GG R+ATVLMYLS VE+GGET+FP+ DG E
Sbjct: 157 YEPNQYYKPHHDYFSDQFNLKRGGQRVATVLMYLSDVEEGGETIFPSVG-----DGE-CE 210
Query: 117 CA---RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
C R+G VKP KGDA+LF+S D + DS SLHG C V+ GEKWSATKW+ F
Sbjct: 211 CGGELRKGLCVKPRKGDAILFWSAALDGNVDSNSLHGGCSVLRGEKWSATKWLRQSRF 268
>gi|363807814|ref|NP_001242181.1| uncharacterized protein LOC100782154 [Glycine max]
gi|255644463|gb|ACU22735.1| unknown [Glycine max]
Length = 285
Score = 172 bits (437), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 84/167 (50%), Positives = 115/167 (68%), Gaps = 3/167 (1%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+V DNESG+ I + RTS+ + + +D+IV +IE RIA TF+P E+GE + ++ Y G
Sbjct: 119 LVIDNESGEGIETSYRTSTEYVVERGKDKIVRNIEKRIADVTFIPIEHGEPLHVIRYAVG 178
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SEC 117
Q YEPH D+F ++ + GG RIAT+LMYLS+VE GGETVFP + + S W SEC
Sbjct: 179 QYYEPHVDYFEEEFSLVNGGQRIATMLMYLSNVEGGGETVFPIANANFSSVPWWNELSEC 238
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATK 164
+ G ++KP GDALLF+S+ PDA+ D +LH +CPVI+G KWS TK
Sbjct: 239 GQTGLSIKPKMGDALLFWSMKPDATLDPLTLHRACPVIKGNKWSCTK 285
>gi|297803562|ref|XP_002869665.1| ShTK domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297315501|gb|EFH45924.1| ShTK domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 290
Score = 172 bits (436), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 88/196 (44%), Positives = 123/196 (62%), Gaps = 9/196 (4%)
Query: 28 DEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVL 87
D +VA IE +I+AWTFLP ENG ++++ Y +K D+F ++ + L +ATV+
Sbjct: 104 DPVVAGIEEKISAWTFLPRENGGSIKVRSY-TSEKSGKKLDYFGEEPSSVLRESLLATVV 162
Query: 88 MYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTS 147
+YLS+ +GGE +FPNSEV + C+ G ++P+KG+A+LFFS +AS D TS
Sbjct: 163 LYLSNTTQGGELLFPNSEVKPKKS-----CSEDGNILRPVKGNAVLFFSRLLNASLDETS 217
Query: 148 LHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMV 207
H CPV++GE ATK I+ + K + E+ +C DED NC WA GECKKNP+YM+
Sbjct: 218 THLICPVVKGELLVATKLIYAK---KQARNEENGECSDEDENCERWANLGECKKNPVYMI 274
Query: 208 GSKSSRGYCRKSCKVC 223
GS G CRKSC C
Sbjct: 275 GSPDYYGTCRKSCNAC 290
>gi|159489450|ref|XP_001702710.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280732|gb|EDP06489.1| predicted protein [Chlamydomonas reinhardtii]
Length = 252
Score = 172 bits (435), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 101/227 (44%), Positives = 127/227 (55%), Gaps = 16/227 (7%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+VADN G S+ + RTS G F+++ +VA +E R+A T +P E MQ+L Y +G
Sbjct: 38 VVADN--GSSVLDDYRTSYGTFINRYATPVVARVEDRVAVLTRVPVHYQEDMQVLRYGNG 95
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD---GNWSEC 117
Q Y H D + R+ATVL+YLS E GGET FP + G +SEC
Sbjct: 96 QYYHRHTDSLEND------SPRLATVLLYLSDPELGGETAFPLAWAHPDMPKVFGPFSEC 149
Query: 118 ARRGYAVKPMKGDALLFFSLHPDAST-DSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
+ A KP KGDALLF+S+ PD T D S H CPVI G KW+AT W+H + F +PE
Sbjct: 150 VKNNVAFKPRKGDALLFWSVKPDGKTEDPLSEHEGCPVIRGVKWTATVWVHTKPF-RPE- 207
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
E DDC D C W AGEC+KN YM G + G CR SC VC
Sbjct: 208 --EWDDCTDRHKECPKWKAAGECEKNHGYMQGDANQVGSCRLSCGVC 252
>gi|302764100|ref|XP_002965471.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
gi|300166285|gb|EFJ32891.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
Length = 264
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 90/174 (51%), Positives = 116/174 (66%), Gaps = 14/174 (8%)
Query: 2 VADNESGKS---IASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILH 56
V D +GK+ I S+VRTS+GMFLS ++ +IE RIA ++ +P ENGE +Q+L
Sbjct: 96 VVDTSTGKARHGIESKVRTSTGMFLSNYDRRYPMIEAIERRIAVYSMIPVENGELLQVLR 155
Query: 57 YEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
YE Q Y+PH D+F D+ N + GG R+ATVLMYLS VE+GGET+FP+ DG E
Sbjct: 156 YEPNQYYKPHHDYFSDQFNLKRGGQRVATVLMYLSDVEEGGETIFPSVG-----DGE-CE 209
Query: 117 CA---RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
C R+G VKP KGDA+LF+S D + DS SLHG C V+ GEKWSATKW+
Sbjct: 210 CGGELRKGLCVKPRKGDAILFWSAALDGNVDSNSLHGGCSVLRGEKWSATKWLR 263
>gi|159486447|ref|XP_001701251.1| hypothetical protein CHLREDRAFT_122372 [Chlamydomonas reinhardtii]
gi|158271833|gb|EDO97644.1| predicted protein [Chlamydomonas reinhardtii]
Length = 251
Score = 171 bits (433), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 96/225 (42%), Positives = 125/225 (55%), Gaps = 24/225 (10%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
+G S+ +RTS G F+ + D +V + R+AAWT PPEN E +Q+L Y GQKY H
Sbjct: 43 NGSSVLDTIRTSYGTFIRRRHDPVVERVLRRVAAWTKAPPENQEDLQVLRYGPGQKYGAH 102
Query: 67 FDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNS------EVSQSRDGNWSECARR 120
D D R+ATVL+YL E GGET FP+S ++QS G +SECA+
Sbjct: 103 MDSLIDD------SPRMATVLLYLHDTEYGGETAFPDSGHWLDPSLAQSM-GPFSECAQG 155
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKPEKEP 178
A +P KGDAL+F+S+ PD + D SLH CPV+ G KW+AT W+H N+D K
Sbjct: 156 HVAFRPKKGDALMFWSIKPDGTHDPLSLHTGCPVVTGVKWTATSWVHSMPYNYDDYFKP- 214
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
C D C W + GECKKNP YM +C +SC C
Sbjct: 215 --GACTDLHDQCKHWERMGECKKNPAYM------ESHCGRSCGAC 251
>gi|145354086|ref|XP_001421326.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581563|gb|ABO99619.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 309
Score = 171 bits (432), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 87/179 (48%), Positives = 111/179 (62%), Gaps = 12/179 (6%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + G S S+ RTSSG ++S E++A+IE R+AAWT LP GE Q++ YE GQ
Sbjct: 115 VVNEADGTSKTSDERTSSGGWVSGEDSEVMANIERRVAAWTMLPRNRGETTQVMRYEAGQ 174
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNS--------EVSQSRDGN 113
+Y H D+F D++N + GG R ATVLMYLS VE+GGETVFP E S GN
Sbjct: 175 EYAAHDDYFHDEVNVKNGGQRAATVLMYLSDVEEGGETVFPRGTPLGGAAPEKSGVTQGN 234
Query: 114 WSECARRG----YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
E A RG AVKP +GDALLFF++H + D + H CPV+ G KW+AT+W HV
Sbjct: 235 ACERALRGDPNVLAVKPRRGDALLFFNVHLNGEVDERARHAGCPVVRGTKWTATRWQHV 293
>gi|302765413|ref|XP_002966127.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
gi|300165547|gb|EFJ32154.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
Length = 201
Score = 170 bits (430), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 79/168 (47%), Positives = 111/168 (66%), Gaps = 2/168 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G S RTS G FL + D IV+ IE RI++ TF+P E GE++Q++ Y+ GQ
Sbjct: 28 VIDEKTGLGKDSRNRTSWGAFLRRDHDNIVSGIEDRISSITFIPKEYGESLQVVRYKTGQ 87
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD--GNWSECAR 119
K+EPH D+++ N GGHRI T+L+YL++VE GGETVFP + + D N SEC +
Sbjct: 88 KFEPHQDYYKLTENNNNGGHRIGTLLLYLTNVENGGETVFPRALANVINDYSTNTSECTK 147
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
+G ++P +GD LLF+ P D S HG CPV++GEKW ATK++H
Sbjct: 148 KGIVIRPRRGDGLLFWITRPSGEIDPFSFHGGCPVVKGEKWLATKFLH 195
>gi|302838815|ref|XP_002950965.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
nagariensis]
gi|300263660|gb|EFJ47859.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
nagariensis]
Length = 298
Score = 170 bits (430), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 94/239 (39%), Positives = 128/239 (53%), Gaps = 32/239 (13%)
Query: 6 ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQ------------ 53
++G S+ +RTS G F+ + D ++ I R+AAWT PPEN E +Q
Sbjct: 42 QNGSSVTDNIRTSYGTFIRRRHDPVIERILRRVAAWTKAPPENQEDLQAGRGEGGREKER 101
Query: 54 ILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV-----SQ 108
+L Y GQKY H D D R+ATVL+YL E+GGET FP+S
Sbjct: 102 VLRYGIGQKYGAHMDSLIDD------SPRMATVLLYLHDTEEGGETAFPDSSSWLTPDLA 155
Query: 109 SRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
+R G +SECA+ A +P KGDAL+F+S+ PD + D S+H CPV++G KW+AT W+H
Sbjct: 156 TRMGPFSECAQGHVAFRPKKGDALMFWSIKPDGTHDPLSMHTGCPVVKGVKWTATSWVHS 215
Query: 169 RNFDKPEKEPEDDD---CVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCK 224
+ D + C D C VWA AGEC +NP+YM +C SCK C+
Sbjct: 216 MPYAYDRYISHDGEPGACTDLHDMCTVWAAAGECDRNPVYM------STHCGPSCKTCE 268
>gi|412988743|emb|CCO15334.1| predicted protein [Bathycoccus prasinos]
Length = 352
Score = 169 bits (429), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 96/231 (41%), Positives = 133/231 (57%), Gaps = 23/231 (9%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++G+ S++RTS G F+ K DE++ IE R A ++ +P + E MQ+L Y GQ
Sbjct: 105 VVGGQTGR--VSDIRTSFGTFIPKKYDEVLEKIEDRCAVFSGIPVAHQEQMQLLRYRDGQ 162
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFP--------NSEVSQSRDGN 113
KY H D + + GG RIAT+LM+L +GGET F + +++D
Sbjct: 163 KYSDH----TDGLISENGGKRIATILMFLHEPTEGGETSFVLGNPLGKVKERIERTKD-Q 217
Query: 114 WSECARR---GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
+S+C R G+AVKP GDA+LFFS TD+ S+H SCP + G KW+AT WIH R
Sbjct: 218 FSDCGYRSGKGFAVKPKVGDAILFFSFSEAGITDNNSMHASCPTLGGTKWTATMWIHERP 277
Query: 171 FDKPE-KEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSC 220
FD ++P DC D C WA GECKKNP+YM+G++ G C +SC
Sbjct: 278 FDTATWRKP---DCKDLHQECANWANRGECKKNPIYMLGNEVV-GTCSRSC 324
>gi|115457822|ref|NP_001052511.1| Os04g0346000 [Oryza sativa Japonica Group]
gi|38346023|emb|CAE03962.2| OSJNBb0085H11.11 [Oryza sativa Japonica Group]
gi|113564082|dbj|BAF14425.1| Os04g0346000 [Oryza sativa Japonica Group]
gi|125547818|gb|EAY93640.1| hypothetical protein OsI_15426 [Oryza sativa Indica Group]
gi|125589953|gb|EAZ30303.1| hypothetical protein OsJ_14349 [Oryza sativa Japonica Group]
gi|215693934|dbj|BAG89133.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 267
Score = 169 bits (428), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 87/174 (50%), Positives = 116/174 (66%), Gaps = 9/174 (5%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
V D +GK + S VRTSSGMF+S + + ++ SIE RI+ ++ +P ENGE +Q+L YE
Sbjct: 98 VVDVATGKGVKSNVRTSSGMFVSSEERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEP 157
Query: 60 GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
Q Y PH D+F D N + GG R+AT+LMYL+ +GGET FP Q+ DG S +
Sbjct: 158 SQYYRPHHDYFSDTFNIKRGGQRVATMLMYLTDGVEGGETHFP-----QAGDGECSCGGK 212
Query: 120 --RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+G VKP KGDA+LF+S+ D TDS S+HG CPV+EGEKWSATKW+ + F
Sbjct: 213 MVKGLCVKPNKGDAVLFWSMGLDGETDSNSIHGGCPVLEGEKWSATKWMRQKEF 266
>gi|303287328|ref|XP_003062953.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455589|gb|EEH52892.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 259
Score = 169 bits (427), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 98/223 (43%), Positives = 127/223 (56%), Gaps = 18/223 (8%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQ-----ILH 56
V D+ +G+S +RTS FL++ IV+ IE R+ +T LP NGE +Q +L
Sbjct: 38 VVDSTTGESKVDPIRTSEQCFLNRGHFPIVSVIEKRLERYTMLPWYNGEDLQARPSRVLK 97
Query: 57 YEHGQKYEPHFDF--FRDKMNQQL---GGHRIATVLMYLSHVEK--GGETVFPNSE---V 106
Y +GQKY+ H D +QL GGHR+ATVL+YLS V+ GGET FP+SE
Sbjct: 98 YSNGQKYDAHHDVGELDTASGKQLAAEGGHRVATVLLYLSDVDDDGGGETAFPDSEWIDP 157
Query: 107 SQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ R WSECA AVKP KGD LLF+S+ P+ D S+H CPV+ G+ W+ATKWI
Sbjct: 158 TADRGSGWSECAEDHVAVKPKKGDGLLFWSITPEGVIDQQSMHAGCPVL-GKSWTATKWI 216
Query: 167 HVRNFDKP--EKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMV 207
H R F C D C WA +GECKKNP +M+
Sbjct: 217 HARPFRHQFPPPPAAPPGCADTVAMCKSWANSGECKKNPGFML 259
>gi|116309432|emb|CAH66506.1| OSIGBa0111I14.1 [Oryza sativa Indica Group]
Length = 267
Score = 169 bits (427), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 87/174 (50%), Positives = 116/174 (66%), Gaps = 9/174 (5%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
V D +GK + S VRTSSGMF+S + + ++ SIE RI+ ++ +P ENGE +Q+L YE
Sbjct: 98 VVDVATGKGVKSNVRTSSGMFVSSEERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEP 157
Query: 60 GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
Q Y PH D+F D N + GG R+AT+LMYL+ +GGET FP Q+ DG S +
Sbjct: 158 SQYYRPHHDYFSDTFNIKRGGQRVATMLMYLTDGVEGGETHFP-----QAGDGECSCGGK 212
Query: 120 --RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+G VKP KGDA+LF+S+ D TDS S+HG CPV+EGEKWSATKW+ + F
Sbjct: 213 MVKGLCVKPNKGDAVLFWSMGLDGETDSNSIHGGCPVLEGEKWSATKWMRQKEF 266
>gi|308799555|ref|XP_003074558.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
gi|116000729|emb|CAL50409.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
Length = 274
Score = 168 bits (426), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 94/231 (40%), Positives = 135/231 (58%), Gaps = 17/231 (7%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ESGKS+ + +RTS FLS+ D +V + R+++ T LP + E +Q+L Y G+
Sbjct: 46 VIDSESGKSVVNPIRTSKQTFLSR-NDPVVRKVLERMSSVTHLPWYHCEDLQVLEYSAGE 104
Query: 62 KYEPHFDFFRD--KMNQQL---GGHRIATVLMYLSHVEKGGETVFPNSE---VSQSRDGN 113
KY+ H D + K QL GG R+AT+L+YL E+GGET FP+SE +++
Sbjct: 105 KYDAHEDVGEEGTKSGDQLSKNGGKRVATILLYLEEPEEGGETAFPDSEWIDPERAKTET 164
Query: 114 WSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNF 171
WS+CA R A+KP +GD L+F+S+ PD + D +LH CP G KW+AT W+H N+
Sbjct: 165 WSKCAHRRVAMKPTRGDGLMFWSVRPDGTIDHRALHVGCPPTRGTKWTATIWVHADPYNW 224
Query: 172 DKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKV 222
KP C D+ C WA GEC KNP +M+ + C+ SC+V
Sbjct: 225 IKPPDPVPTIGCEDKSDRCRGWANIGECDKNPSFMLEN------CKWSCRV 269
>gi|307110383|gb|EFN58619.1| hypothetical protein CHLNCDRAFT_19485 [Chlorella variabilis]
Length = 328
Score = 168 bits (426), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 94/226 (41%), Positives = 123/226 (54%), Gaps = 30/226 (13%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G S+ ++RTS G FL + QD IV ++E R+A WT L + E MQIL Y GQKY H+
Sbjct: 74 GASVEDQIRTSYGTFLKRLQDPIVTAVEQRLATWTKLNVSHQEDMQILRYGIGQKYGAHY 133
Query: 68 DFFRDKMNQQLGGHRIATVLMYLSHV--EKGGETVFPNSEVSQSRDGNWSECARRGYAVK 125
D + R+ TVL+YLS V + GGET FP R A+
Sbjct: 134 DSLDND------SPRVCTVLLYLSDVPADGGGETAFPGV---------------RRQALY 172
Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF-------DKPEKEP 178
P KGDALLF+SL PD ++D+ SLH CP+I G KW+ATKWIH F ++ E
Sbjct: 173 PKKGDALLFYSLKPDGTSDAYSLHTGCPIISGVKWTATKWIHTLPFRPHLLGKEQAEAIV 232
Query: 179 EDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCK 224
++C D +C WA AGEC+ N +M G + G CR SC C+
Sbjct: 233 YPEECKDAQADCKAWADAGECENNEQFMRGDAFTLGNCRASCGDCE 278
>gi|3805847|emb|CAA21467.1| putative protein [Arabidopsis thaliana]
gi|7270533|emb|CAB81490.1| putative protein [Arabidopsis thaliana]
Length = 307
Score = 168 bits (426), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 82/139 (58%), Positives = 104/139 (74%), Gaps = 3/139 (2%)
Query: 15 VRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM 74
VRTSSG FL++ DEIV IE RI+ +TF+PPENGE +Q+LHYE GQ+YEPH D+F D+
Sbjct: 161 VRTSSGTFLNRGHDEIVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHHDYFFDEF 220
Query: 75 NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECARRGYAVKPMKGDA 131
N + GG RIATVLMYLS V++GGETVFP ++ + S W S+C + G +V P K DA
Sbjct: 221 NVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGKEGLSVLPKKRDA 280
Query: 132 LLFFSLHPDASTDSTSLHG 150
LLF+S+ PDAS D +SLHG
Sbjct: 281 LLFWSMKPDASLDPSSLHG 299
>gi|303279839|ref|XP_003059212.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459048|gb|EEH56344.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 409
Score = 167 bits (423), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 104/243 (42%), Positives = 132/243 (54%), Gaps = 43/243 (17%)
Query: 13 SEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQ---ILHYEHGQKYEPHFDF 69
S+ RTS+G FL K D++V +E R+ A++ LP EN E +Q +L YE GQ+Y H D
Sbjct: 135 SDYRTSTGAFLPKLYDDVVTRVERRVEAFSRLPFENQEQLQARSLLRYELGQEYRDHVDG 194
Query: 70 FRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD---------GNWSECA-- 118
F + GG R+ATVLM+L+ E+GGET FPN E S++ G S+CA
Sbjct: 195 F----ATENGGKRVATVLMFLAEPEEGGETAFPNGEPSEAVAARVAAQRARGELSDCAWR 250
Query: 119 -------------RRGYAVKPMKGDALLFFSLHPDASTDS-------TSLHGSCPVIEGE 158
RG+AVKP GDA+LFFS D S H SCP G
Sbjct: 251 GGGGGTAGGGRGNLRGFAVKPRLGDAVLFFSYDADDDGGYDGAEVSHASTHASCPTTRGV 310
Query: 159 KWSATKWIHVRNFDKPEKE-PEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCR 217
KW+ATKWIH R F E PE CVD D C WA+ GEC KNP +M+G +++ G C
Sbjct: 311 KWTATKWIHERAFATGTWETPE---CVDRDDGCAGWARGGECAKNPGFMLG-EATPGSCL 366
Query: 218 KSC 220
KSC
Sbjct: 367 KSC 369
>gi|30686940|ref|NP_194290.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
gi|26451153|dbj|BAC42680.1| unknown protein [Arabidopsis thaliana]
gi|29893542|gb|AAP06823.1| unknown protein [Arabidopsis thaliana]
gi|332659681|gb|AEE85081.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
Length = 291
Score = 166 bits (420), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 85/196 (43%), Positives = 121/196 (61%), Gaps = 9/196 (4%)
Query: 28 DEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVL 87
D +VA IE +++AWTFLP ENG ++++ Y +K D+F ++ + L +ATV+
Sbjct: 105 DPVVAGIEEKVSAWTFLPGENGGSIKVRSYTS-EKSGKKLDYFGEEPSSVLHESLLATVV 163
Query: 88 MYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTS 147
+YLS+ +GGE +FPNSE+ + C G ++P+KG+A+LFF+ +AS D S
Sbjct: 164 LYLSNTTQGGELLFPNSEMKPK-----NSCLEGGNILRPVKGNAILFFTRLLNASLDGKS 218
Query: 148 LHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMV 207
H CPV++GE ATK I+ + K + E +C DED NC WAK GECKKNP+YM+
Sbjct: 219 THLRCPVVKGELLVATKLIYAK---KQARIEESGECSDEDENCGRWAKLGECKKNPVYMI 275
Query: 208 GSKSSRGYCRKSCKVC 223
GS G CRKSC C
Sbjct: 276 GSPDYYGTCRKSCNAC 291
>gi|356559784|ref|XP_003548177.1| PREDICTED: uncharacterized protein LOC100795761 [Glycine max]
Length = 264
Score = 166 bits (419), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 91/226 (40%), Positives = 134/226 (59%), Gaps = 13/226 (5%)
Query: 2 VADNESGKSIASE-VRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
V + SG SE V TS M +D+I+A IE R++ W FLP E + +Q++HY
Sbjct: 48 VKEKSSGNGGLSEGVETSLDM-----EDDILARIEERLSVWAFLPKEYSKPLQVMHYGPE 102
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSH-VEKGGETVFPNSEVSQSRDGNWSECAR 119
Q + D+F +K +L G +AT+++YLS+ V +GG+ +FP S S S C+
Sbjct: 103 QNGR-NLDYFTNKTQLELSGPLMATIILYLSNDVTQGGQILFPESVPGSSSW---SSCSN 158
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPE 179
++P+KG+A+LFFSLHP AS D +S H CPV+EG+ WSA K+ + + + +
Sbjct: 159 SSNILQPVKGNAILFFSLHPSASPDKSSFHARCPVLEGDMWSAIKYFYAKPISRGKVSAT 218
Query: 180 DD--DCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
D +C DED +C WA GEC++NP++M+GS G CRKSC C
Sbjct: 219 LDGGECTDEDDSCPAWAAVGECQRNPVFMIGSPDYYGTCRKSCNAC 264
>gi|302845026|ref|XP_002954052.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
nagariensis]
gi|300260551|gb|EFJ44769.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
nagariensis]
Length = 311
Score = 165 bits (418), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 96/236 (40%), Positives = 127/236 (53%), Gaps = 21/236 (8%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+VADN G S+ + RTS G F+++ Q ++A++E R+A T P E MQ+L Y G
Sbjct: 38 VVADN--GSSVLDDYRTSYGTFINRYQTPVIAAVEDRVALLTRTPVVYQEDMQVLRYGLG 95
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE-----VSQSRDGNWS 115
Q Y H D + R+ATVL+YLS E GGET FP + G +S
Sbjct: 96 QYYHRHTDSLEND------SPRMATVLLYLSEPELGGETAFPQAASWAHPAMAQLFGPFS 149
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDAST-DSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
+C + A KP +GDALLF+S+ PD T D S H CPVI G KW+AT W+H + F +P
Sbjct: 150 DCVKGNVAFKPRRGDALLFWSVKPDGRTEDPYSEHEGCPVIRGVKWTATVWVHTQPF-RP 208
Query: 175 EKEPEDDD------CVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVCK 224
E P C D C WA+AGEC N YM G + G CR++C VC+
Sbjct: 209 EDFPPQPRSRLSGLCTDRHAECPRWARAGECDNNSNYMKGDANQVGSCRRTCGVCE 264
>gi|125546091|gb|EAY92230.1| hypothetical protein OsI_13950 [Oryza sativa Indica Group]
Length = 178
Score = 165 bits (417), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 76/106 (71%), Positives = 91/106 (85%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MVADN+SGKSI S+VRTSSG FLSK +D+IV+ IE R+AAWTFLP EN E++QILHYE G
Sbjct: 73 MVADNDSGKSIMSQVRTSSGTFLSKHEDDIVSGIEKRVAAWTFLPEENAESIQILHYELG 132
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV 106
QKY+ HFD+F DK N + GGHR+ATVLMYL+ V+KGGETVFPN+ V
Sbjct: 133 QKYDAHFDYFHDKNNLKRGGHRVATVLMYLTDVKKGGETVFPNAAV 178
>gi|356530852|ref|XP_003533993.1| PREDICTED: uncharacterized protein LOC100775928 [Glycine max]
Length = 302
Score = 165 bits (417), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 85/203 (41%), Positives = 123/203 (60%), Gaps = 13/203 (6%)
Query: 27 QDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH---FDFFRDKMNQQLGGHRI 83
+D+I+A IE R++ W FLP E + +Q++HY EP+ D+F +K +L G +
Sbjct: 107 EDDILARIEERLSLWAFLPKEYSKPLQVMHYGP----EPNGRNLDYFTNKTQLELSGPLM 162
Query: 84 ATVLMYLSHV-EKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDAS 142
AT+++YLS+ +GG+ +FP S R +WS C+ ++P+KG+A+LFFSLHP AS
Sbjct: 163 ATIVLYLSNAATQGGQILFPES---VPRSSSWSSCSNSSNILQPVKGNAILFFSLHPSAS 219
Query: 143 TDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDD--DCVDEDLNCVVWAKAGECK 200
D S H CPV+EG WSA K+ + + E D +C DED NC WA GEC+
Sbjct: 220 PDKNSFHARCPVLEGNMWSAIKYFYAKPISSGEVSAISDGGECTDEDDNCPAWAAMGECQ 279
Query: 201 KNPLYMVGSKSSRGYCRKSCKVC 223
+NP++M+GS G CRKSC C
Sbjct: 280 RNPVFMIGSPDYYGTCRKSCNAC 302
>gi|168001068|ref|XP_001753237.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695523|gb|EDQ81866.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 284
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 84/178 (47%), Positives = 116/178 (65%), Gaps = 15/178 (8%)
Query: 2 VADNESGKSIASEVRTSSGMFLS--KAQDEIVASIEARIAAWTFLPPENGEAMQILHYEH 59
V D +GK I S+VRTS+GMFL+ + + +IE RIAA++ +P +NGE +Q+L YE
Sbjct: 115 VVDATTGKGIESKVRTSTGMFLNGNDRRHHTIQAIETRIAAYSMVPVQNGELLQVLRYES 174
Query: 60 GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA- 118
Q Y+ H D+F D+ N + GG R+AT+LMYL+ +GGET+FP Q+ D EC+
Sbjct: 175 DQYYKAHHDYFSDEFNLKRGGQRVATMLMYLTEGVEGGETIFP-----QAGD---KECSC 226
Query: 119 ----RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
+ G VKP +GDA+LF+S+ D D TSLHG C V+ GEKWS+TKW+ R FD
Sbjct: 227 GGEMKIGVCVKPKRGDAVLFWSIKLDGQVDPTSLHGGCKVLSGEKWSSTKWMRQRAFD 284
>gi|356576923|ref|XP_003556579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 287
Score = 163 bits (412), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 83/167 (49%), Positives = 115/167 (68%), Gaps = 5/167 (2%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
V D ++GK I S+VRTSSGMFL+ + + +V +IE RI+ ++ +P ENGE MQ+L YE
Sbjct: 118 VVDTKTGKGIKSDVRTSSGMFLNPQERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEK 177
Query: 60 GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
Q Y+PH D+F D N + GG RIAT+LMYLS +GGET FP ++ S + +
Sbjct: 178 NQYYKPHHDYFSDTFNLKRGGQRIATMLMYLSDNIEGGETYFP---LAGSGECSCGGKLV 234
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+G +VKP+KG+A+LF+S+ D +D S+HG C VI GEKWSATKW+
Sbjct: 235 KGLSVKPIKGNAVLFWSMGLDGQSDPNSVHGGCEVISGEKWSATKWM 281
>gi|159462456|ref|XP_001689458.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283446|gb|EDP09196.1| predicted protein [Chlamydomonas reinhardtii]
Length = 221
Score = 162 bits (409), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 115/214 (53%), Gaps = 29/214 (13%)
Query: 11 IASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF 70
+ ++RTS G FL + D ++A+IE R+A W+ LP + E MQ+L Y KY PH D
Sbjct: 36 VVDDIRTSYGTFLRRVPDPVIAAIEHRLALWSHLPASHQEDMQVLRYGPTNKYGPHID-- 93
Query: 71 RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
G R+ATVL+YL E+ N S+CAR A KP +GD
Sbjct: 94 --------GLERVATVLIYLGQAER----------------ANLSQCARGRVAYKPKRGD 129
Query: 131 ALLFFSLHPD-ASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDDCVDEDLN 189
AL+FF PD TD S+H CPV+EG KW+A KW+H + +P +P C +
Sbjct: 130 ALMFFDTMPDYKQTDVHSMHTGCPVVEGVKWNAVKWLHGTPYGRPLPDP--GICANLHEM 187
Query: 190 CVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
C WA GECK NP +M+G+ +S G CR +C C
Sbjct: 188 CETWALQGECKNNPGFMIGAGASMGSCRLACNDC 221
>gi|357445147|ref|XP_003592851.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355481899|gb|AES63102.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 281
Score = 161 bits (408), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 82/167 (49%), Positives = 111/167 (66%), Gaps = 5/167 (2%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
V D +GK I S+VRTSSGMFLS + + ++ +IE RI+ ++ +P ENGE MQ+L YE
Sbjct: 112 VVDANTGKGIKSDVRTSSGMFLSHEERKYPMIHAIEKRISVYSQIPIENGELMQVLRYEK 171
Query: 60 GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
Q Y PH D+F D N + GG RIAT+LMYL +GGET FP++ + G
Sbjct: 172 NQYYRPHHDYFSDTFNLKRGGQRIATMLMYLGDNVEGGETHFPSAGSDECSCGG---KLT 228
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+G VKP+KG+A+LF+S+ D +D S+HG CPV+ GEKWSATKW+
Sbjct: 229 KGLCVKPVKGNAVLFWSMGLDGQSDPDSVHGGCPVLAGEKWSATKWM 275
>gi|357162904|ref|XP_003579560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 266
Score = 161 bits (407), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 83/169 (49%), Positives = 114/169 (67%), Gaps = 9/169 (5%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
V D +GK + S+VRTSSGMF++ + + ++ +IE RI+ ++ +P ENGE +Q+L YE
Sbjct: 97 VVDVATGKGVKSDVRTSSGMFVNSEERKFPVIQAIEKRISVFSQIPVENGELIQVLRYEP 156
Query: 60 GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
Q Y PH D+F D N + GG R+AT+LMYL+ +GGET FP Q+ DG S R
Sbjct: 157 SQYYRPHHDYFSDTFNLKRGGQRVATMLMYLTDGVEGGETHFP-----QAGDGECSCGGR 211
Query: 120 --RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
RG VKP KGDA+LF+S+ D +TDS S+H C V++GEKWSATKW+
Sbjct: 212 IVRGLCVKPNKGDAVLFWSMGLDGNTDSNSIHSGCAVLKGEKWSATKWM 260
>gi|297824279|ref|XP_002880022.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
gi|297325861|gb|EFH56281.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
Length = 283
Score = 160 bits (405), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 80/168 (47%), Positives = 112/168 (66%), Gaps = 5/168 (2%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQ--DEIVASIEARIAAWTFLPPENGEAMQILHYEH 59
V D ++GK + S+VRTSSGMFL+ + + I+ +IE RIA ++ +P ENGE +Q+L YE
Sbjct: 114 VVDVKTGKGVKSDVRTSSGMFLTHVERSNPIIQAIEKRIAVFSQVPAENGELIQVLRYEP 173
Query: 60 GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
Q Y+PH D+F D N + GG R+AT+LMYL+ +GGET FP ++ D
Sbjct: 174 KQFYKPHHDYFADTFNLKRGGQRVATMLMYLTDDVEGGETYFP---LAGDGDCTCGGKIM 230
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
+G +VKP KGDA+LF+S+ D +D S+HG C V+ GEKWSATKW+
Sbjct: 231 KGISVKPTKGDAVLFWSMGLDGQSDPRSIHGGCEVLSGEKWSATKWMR 278
>gi|326503458|dbj|BAJ86235.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516134|dbj|BAJ88090.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 266
Score = 160 bits (405), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 83/169 (49%), Positives = 113/169 (66%), Gaps = 9/169 (5%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
V D +GK + S+VRTSSGMF++ + + ++ +IE RI+ ++ +P ENGE +Q+L YE
Sbjct: 97 VVDVATGKGVKSDVRTSSGMFVNSEERKLPVIKAIEKRISVFSQIPVENGELIQVLRYEP 156
Query: 60 GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
Q Y PH D+F D N + GG R+AT+LMYL+ +GGET FP Q+ DG R
Sbjct: 157 NQYYRPHHDYFSDTFNLKRGGQRVATMLMYLTDGVEGGETHFP-----QAGDGECICGGR 211
Query: 120 --RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
RG VKP KGDA+LF+S+ D +TDS SLH C V++GEKWSATKW+
Sbjct: 212 LVRGLCVKPNKGDAVLFWSMGLDGNTDSNSLHSGCAVVKGEKWSATKWM 260
>gi|225433714|ref|XP_002268409.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296089634|emb|CBI39453.3| unnamed protein product [Vitis vinifera]
Length = 287
Score = 160 bits (405), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 83/171 (48%), Positives = 114/171 (66%), Gaps = 5/171 (2%)
Query: 2 VADNESGKSIASEVRTSSGMFLS--KAQDEIVASIEARIAAWTFLPPENGEAMQILHYEH 59
V D ++GK I S+VRTSSGMFLS + IV +IE RI+ ++ +P ENGE +Q+L Y+
Sbjct: 118 VVDAQTGKGIQSDVRTSSGMFLSPDDSTYPIVRAIEKRISVYSQVPVENGELIQVLRYKK 177
Query: 60 GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
Q Y+PH D+F D N + GG R+AT+L+YLS +GGET FP + R G S
Sbjct: 178 SQFYKPHHDYFSDSFNLKRGGQRVATMLIYLSDNVEGGETYFPMAGSGFCRCGGKSV--- 234
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
RG +V P+KG+A+LF+S+ D +D S+HG C V+ GEKWSATKW+ R+
Sbjct: 235 RGLSVAPVKGNAVLFWSMGLDGQSDPNSIHGGCEVLAGEKWSATKWMRQRS 285
>gi|224069056|ref|XP_002302889.1| predicted protein [Populus trichocarpa]
gi|222844615|gb|EEE82162.1| predicted protein [Populus trichocarpa]
Length = 287
Score = 160 bits (404), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 81/170 (47%), Positives = 113/170 (66%), Gaps = 5/170 (2%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQD--EIVASIEARIAAWTFLPPENGEAMQILHYEH 59
V D ++GK I S+VRTSSGMFLS + ++V +IE RI+ ++ +P ENGE +Q+L YE
Sbjct: 118 VVDVKTGKGIESKVRTSSGMFLSSEEKTYQVVQAIEKRISVYSQVPIENGELIQVLRYEK 177
Query: 60 GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
Q Y+PH D+F D N + GG R+AT+LMYLS +GGET FP + + G
Sbjct: 178 NQYYKPHHDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPMAGSGKCSCGG---KVV 234
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
G +VKP+KG+A+LF+S+ D +D +S+HG C V+ G KWSATKW+ R
Sbjct: 235 DGLSVKPIKGNAVLFWSMGLDGQSDPSSIHGGCEVLSGVKWSATKWMRQR 284
>gi|15224220|ref|NP_181836.1| P4H isoform 1 [Arabidopsis thaliana]
gi|3763917|gb|AAC64297.1| hypothetical protein [Arabidopsis thaliana]
gi|20197628|gb|AAM15158.1| hypothetical protein [Arabidopsis thaliana]
gi|26450452|dbj|BAC42340.1| unknown protein [Arabidopsis thaliana]
gi|29824245|gb|AAP04083.1| unknown protein [Arabidopsis thaliana]
gi|330255112|gb|AEC10206.1| P4H isoform 1 [Arabidopsis thaliana]
Length = 283
Score = 159 bits (403), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 80/168 (47%), Positives = 111/168 (66%), Gaps = 5/168 (2%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
V D ++GK + S+VRTSSGMFL+ + I+ +IE RIA ++ +P ENGE +Q+L YE
Sbjct: 114 VVDVKTGKGVKSDVRTSSGMFLTHVERSYPIIQAIEKRIAVFSQVPAENGELIQVLRYEP 173
Query: 60 GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
Q Y+PH D+F D N + GG R+AT+LMYL+ +GGET FP ++ D
Sbjct: 174 QQFYKPHHDYFADTFNLKRGGQRVATMLMYLTDDVEGGETYFP---LAGDGDCTCGGKIM 230
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
+G +VKP KGDA+LF+S+ D +D S+HG C V+ GEKWSATKW+
Sbjct: 231 KGISVKPTKGDAVLFWSMGLDGQSDPRSIHGGCEVLSGEKWSATKWMR 278
>gi|414587756|tpg|DAA38327.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 263
Score = 159 bits (403), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 83/173 (47%), Positives = 114/173 (65%), Gaps = 15/173 (8%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
V D +GK + S+VRTSSGMF++ + + +V +IE RI+ ++ +P ENGE +Q+L YE
Sbjct: 94 VVDVATGKGVKSDVRTSSGMFVNSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEA 153
Query: 60 GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA- 118
Q Y PH D+F D N + GG R+AT+LMYL+ GGET FP Q+ DG EC+
Sbjct: 154 SQYYRPHHDYFSDTFNLKRGGQRVATMLMYLTDGVVGGETHFP-----QAGDG---ECSC 205
Query: 119 ----RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
+G VKP KGDA+LF+S+ D +TD S+H CPV++GEKWSATKW+
Sbjct: 206 GGNVVKGLCVKPNKGDAVLFWSMGLDGNTDPNSIHSGCPVLKGEKWSATKWMR 258
>gi|297600382|ref|NP_001049073.2| Os03g0166200 [Oryza sativa Japonica Group]
gi|255674232|dbj|BAF10987.2| Os03g0166200, partial [Oryza sativa Japonica Group]
Length = 135
Score = 159 bits (402), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 73/122 (59%), Positives = 91/122 (74%), Gaps = 1/122 (0%)
Query: 102 PNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWS 161
P + +SQ +D WS+CA +G+AVKP KG A+LFFSL+P+A+ D SLHGSCPVI+GEKWS
Sbjct: 13 PQARLSQPKDETWSDCAEQGFAVKPTKGSAVLFFSLYPNATFDPGSLHGSCPVIQGEKWS 72
Query: 162 ATKWIHVRNFDKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCK 221
ATKWIHVR++D+ + D C D+ C WA AGEC KNP YMVG+ S G+CRKSC
Sbjct: 73 ATKWIHVRSYDENGRR-SSDKCEDQHALCSSWAAAGECAKNPGYMVGTSESPGFCRKSCN 131
Query: 222 VC 223
VC
Sbjct: 132 VC 133
>gi|449468746|ref|XP_004152082.1| PREDICTED: putative prolyl 4-hydroxylase-like [Cucumis sativus]
Length = 290
Score = 158 bits (400), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 82/175 (46%), Positives = 113/175 (64%), Gaps = 5/175 (2%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
V D ++GK + S+ RTSSGMFLS + +V +IE RI+ ++ +P ENGE +Q+L YE
Sbjct: 119 VVDTKTGKGVKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEK 178
Query: 60 GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
Q Y+PH D+F D N + GG RIAT+LMYLS +GGET FP + + G +
Sbjct: 179 NQFYKPHHDYFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKTVP-- 236
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
G +VKP KGDA+LF+S+ D +D S+HG C V+ GEKWSATKW+ ++ P
Sbjct: 237 -GLSVKPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTLVP 290
>gi|449520144|ref|XP_004167094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 323
Score = 158 bits (399), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 84/176 (47%), Positives = 114/176 (64%), Gaps = 6/176 (3%)
Query: 3 ADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQK 62
A N + + + S RTSSG FL+K Q+++V IE RIA +TF+P ENGE + ILHYE GQK
Sbjct: 106 AQNTNWEGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSILHYEVGQK 165
Query: 63 YEPHFDFFR-DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW----SEC 117
+EPH D+ D + + G R AT++MYLS V++GG TVFP ++ S W E
Sbjct: 166 FEPHHDYTHPDSFSFKSLGQRNATLVMYLSGVKEGGATVFPEAKKCASSARRWWKKLPEY 225
Query: 118 AR-RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
+ G +VKP GDALLF+S+ PD + D TSLH S PV++G+KW K +HV+ D
Sbjct: 226 GKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGDKWVGVKLMHVKAKD 281
Score = 52.4 bits (124), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 38/63 (60%), Gaps = 3/63 (4%)
Query: 90 LSHVEKGGETVFPNSEVSQSRDGNWSEC---ARRGYAVKPMKGDALLFFSLHPDASTDST 146
+ ++E+GGETVFP + S W + + G ++KP GDAL F+S+ PD + D T
Sbjct: 9 ILNIEEGGETVFPAANKCVSSVPWWKKLPTHGKDGLSIKPKMGDALFFWSMKPDGTLDYT 68
Query: 147 SLH 149
SLH
Sbjct: 69 SLH 71
>gi|449443245|ref|XP_004139390.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 295
Score = 158 bits (399), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 84/176 (47%), Positives = 114/176 (64%), Gaps = 6/176 (3%)
Query: 3 ADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQK 62
A N + + + S RTSSG FL+K Q+++V IE RIA +TF+P ENGE + ILHYE GQK
Sbjct: 115 AQNTNWEGVVSSRRTSSGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSILHYEVGQK 174
Query: 63 YEPHFDFFR-DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW----SEC 117
+EPH D+ D + + G R AT++MYLS V++GG TVFP ++ S W E
Sbjct: 175 FEPHHDYTHPDSFSFKSLGQRNATLVMYLSGVKEGGATVFPEAKKCASSARRWWKKLPEY 234
Query: 118 AR-RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
+ G +VKP GDALLF+S+ PD + D TSLH S PV++G+KW K +HV+ D
Sbjct: 235 GKDNGLSVKPKMGDALLFWSVKPDGTLDPTSLHASSPVVKGDKWVGVKLMHVKAKD 290
Score = 68.6 bits (166), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 33/74 (44%), Positives = 47/74 (63%), Gaps = 3/74 (4%)
Query: 90 LSHVEKGGETVFPNSEVSQSRDGNWSEC---ARRGYAVKPMKGDALLFFSLHPDASTDST 146
+ ++E+GGETVFP + S W + + G ++KP GDAL F+S+ PD + D T
Sbjct: 9 ILNIEEGGETVFPAANQCVSSVPWWKKLPTHGKDGLSIKPKMGDALFFWSMKPDGTLDYT 68
Query: 147 SLHGSCPVIEGEKW 160
SLHGS PVI G++W
Sbjct: 69 SLHGSYPVIRGDEW 82
>gi|356541677|ref|XP_003539300.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 297
Score = 157 bits (397), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 81/154 (52%), Positives = 106/154 (68%), Gaps = 5/154 (3%)
Query: 15 VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+RTSSG+F+S ++DE I+ +IE +IA T +P +GEA IL YE GQKY H+D F +
Sbjct: 140 IRTSSGVFMSASEDETGILDAIEEKIAKATKIPRTHGEAFNILRYEVGQKYNSHYDAFDE 199
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
L R+A+ L+YL+ V +GGET+FP E +RDGN +C G V+P KGDAL
Sbjct: 200 AEYGPLQSQRVASFLLYLTDVPEGGETMFP-YENGFNRDGNVEDCI--GLRVRPRKGDAL 256
Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
LF+SL P+ + D TS HGSCPVI+GEKW ATKWI
Sbjct: 257 LFYSLLPNGTIDQTSAHGSCPVIKGEKWVATKWI 290
>gi|384250156|gb|EIE23636.1| hypothetical protein COCSUDRAFT_53414 [Coccomyxa subellipsoidea
C-169]
Length = 285
Score = 157 bits (397), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 77/175 (44%), Positives = 111/175 (63%), Gaps = 1/175 (0%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++ K + +++R + ++ + D+++ IE RIA +TFLP +GE I+ Y GQ
Sbjct: 90 VLDAKTKKQVPNKLRNNKEAYIDGSADDVIDQIERRIARYTFLPAAHGEPFHIMQYLPGQ 149
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS-QSRDGNWSECARR 120
Y PH D+ D + +LG RIAT+++YLS V +GGETVFPNS + D +S+CA++
Sbjct: 150 GYAPHTDWLDDWWHPRLGNERIATMIIYLSDVVEGGETVFPNSTMQPHVGDAAYSKCAQQ 209
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
G AVKP+KGDALL ++L + D SLH CPVI G KW+ATK I V P+
Sbjct: 210 GIAVKPVKGDALLLYNLLENGRNDGESLHQGCPVIRGVKWTATKRILVNQLPSPD 264
>gi|255637879|gb|ACU19258.1| unknown [Glycine max]
Length = 287
Score = 157 bits (397), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 81/168 (48%), Positives = 113/168 (67%), Gaps = 5/168 (2%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
V D ++GK I S+VRTSSGMFL+ + + +V +IE RI+ ++ +P ENGE MQ+L YE
Sbjct: 118 VVDTKTGKGIKSDVRTSSGMFLNSKERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEK 177
Query: 60 GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
Q Y+P D+F D N + GG IAT+LMYLS +GGET FP ++ S + +
Sbjct: 178 NQYYKPRHDYFFDTFNLKRGGQGIATMLMYLSDNIEGGETYFP---LAGSGECSCGGKLV 234
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
+G +VKP+KG+A+LF+S+ D +D S+HG C VI GEKWSATKW+
Sbjct: 235 KGLSVKPIKGNAVLFWSMGLDGQSDPNSVHGGCEVISGEKWSATKWLR 282
>gi|168043388|ref|XP_001774167.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674574|gb|EDQ61081.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 284
Score = 157 bits (396), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 80/168 (47%), Positives = 109/168 (64%), Gaps = 7/168 (4%)
Query: 9 KSIASEVRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
++ E+RTSSG FL ++D+ +A +E ++A T +P +NGEA +L Y GQKY+ H
Sbjct: 120 EATTKEIRTSSGTFLRASEDKTQSLAEVEEKMARATMIPRQNGEAFNVLRYNPGQKYDCH 179
Query: 67 FDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVK 125
+D F R+A+ L+YLS VE+GGET+FP G N+ +C G VK
Sbjct: 180 YDVFDPAEYGPQPSQRMASFLLYLSDVEEGGETMFPFENFQNMNTGYNYKDCI--GLKVK 237
Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
P +GDALLF+S+HP+ + D T+LHGSCPVI+GEKW ATKWI RN DK
Sbjct: 238 PRQGDALLFYSMHPNGTFDKTALHGSCPVIKGEKWVATKWI--RNTDK 283
>gi|242075290|ref|XP_002447581.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
gi|241938764|gb|EES11909.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
Length = 263
Score = 156 bits (394), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 81/172 (47%), Positives = 114/172 (66%), Gaps = 15/172 (8%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
V D +GK + S+VRTSSGMF++ + + ++ +IE RI+ ++ +P ENGE +Q+L YE
Sbjct: 94 VVDVATGKGVKSDVRTSSGMFVNSEERKSPVIQAIEKRISVFSQIPKENGELIQVLRYEA 153
Query: 60 GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA- 118
Q Y PH D+F D N + GG R+AT+LMYL+ +GGET F Q+ DG EC+
Sbjct: 154 SQYYRPHHDYFSDTFNLKRGGQRVATMLMYLTDGVEGGETHF-----LQAGDG---ECSC 205
Query: 119 ----RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+G VKP KGDA+LF+S+ D +TD S+H CPV++GEKWSATKW+
Sbjct: 206 GGNVVKGLCVKPNKGDAVLFWSMGLDGNTDPNSIHSGCPVLKGEKWSATKWM 257
>gi|388505024|gb|AFK40578.1| unknown [Medicago truncatula]
Length = 297
Score = 156 bits (394), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 84/170 (49%), Positives = 110/170 (64%), Gaps = 10/170 (5%)
Query: 15 VRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+RTSSG+FLS ++D+ + +IE +IA T +P +GEA IL YE GQ+Y H+D F
Sbjct: 136 IRTSSGVFLSASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYEVGQRYYSHYDAFNP 195
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
R+A+ L+YL+ VE+GGET+FP E + DG + R G VKP +GD L
Sbjct: 196 DEYGPQKSQRVASFLLYLTDVEEGGETMFP-FENGLNMDGTYGYEDRVGLRVKPRQGDGL 254
Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDD 182
LF+SL P+ + D TSLHGSCPVI+GEKW ATKWI RN D+ EDDD
Sbjct: 255 LFYSLLPNGTIDQTSLHGSCPVIKGEKWVATKWI--RNLDQ-----EDDD 297
>gi|7269410|emb|CAB81370.1| hypothetical protein [Arabidopsis thaliana]
Length = 315
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 86/216 (39%), Positives = 121/216 (56%), Gaps = 29/216 (13%)
Query: 28 DEIVASIEARIAAWTFLP--------------------PENGEAMQILHYEHGQKYEPHF 67
D +VA IE +++AWTFLP ENG ++++ Y +K
Sbjct: 109 DPVVAGIEEKVSAWTFLPGGLFSCGQTAGLCFSLDAHFSENGGSIKVRSYTS-EKSGKKL 167
Query: 68 DFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPM 127
D+F ++ + L +ATV++YLS+ +GGE +FPNSEV + C G ++P+
Sbjct: 168 DYFGEEPSSVLHESLLATVVLYLSNTTQGGELLFPNSEVKPK-----NSCLEGGNILRPV 222
Query: 128 KGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDDCVDED 187
KG+A+LFF+ +AS D S H CPV++GE ATK I+ + K + E +C DED
Sbjct: 223 KGNAILFFTRLLNASLDGKSTHLRCPVVKGELLVATKLIYAK---KQARIEESGECSDED 279
Query: 188 LNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
NC WAK GECKKNP+YM+GS G CRKSC C
Sbjct: 280 ENCGRWAKLGECKKNPVYMIGSPDYYGTCRKSCNAC 315
>gi|255577610|ref|XP_002529682.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223530830|gb|EEF32693.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 165
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 77/163 (47%), Positives = 106/163 (65%), Gaps = 3/163 (1%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
+ + + S VRTSSGMFLS + + +IE RI+ ++ +P ENGE +Q+L YE Q Y PH
Sbjct: 3 TNQGMKSNVRTSSGMFLSSEERKSPMAIEKRISVYSQVPIENGELVQVLRYEKSQFYRPH 62
Query: 67 FDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
D+F D N + GG R+AT+LMYLS +GGET FP + + G +G +VKP
Sbjct: 63 HDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPMAGSGECSCGG---KIVKGLSVKP 119
Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
+KGDA+LF+S+ D +D S+HG C V+ GEKWSATKW+ R
Sbjct: 120 IKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMRQR 162
>gi|255584898|ref|XP_002533164.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223527036|gb|EEF29223.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 290
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 81/158 (51%), Positives = 106/158 (67%), Gaps = 9/158 (5%)
Query: 15 VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+RTSSGMFLS ++D+ ++ +IE +IA T LP NGEA IL YE GQKY H+D F
Sbjct: 131 IRTSSGMFLSASEDKTGVLDAIEEKIARATMLPRANGEAFNILRYEIGQKYNSHYDAFNP 190
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFP---NSEVSQSRDGNWSECARRGYAVKPMKG 129
R+A+ L+YLS VE+GGET+FP + +V +S D + +C G V+P +G
Sbjct: 191 AEYGPQKSQRVASFLLYLSDVEEGGETMFPFENDLDVDESYD--FEKCI--GLQVRPRRG 246
Query: 130 DALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
D LLF+SL P+ + D TSLHGSCPVI+GEKW ATKWI
Sbjct: 247 DGLLFYSLFPNNTIDPTSLHGSCPVIKGEKWVATKWIR 284
>gi|308812133|ref|XP_003083374.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
[Ostreococcus tauri]
gi|116055254|emb|CAL57650.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
[Ostreococcus tauri]
Length = 311
Score = 154 bits (389), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 81/179 (45%), Positives = 105/179 (58%), Gaps = 12/179 (6%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D++SG++ + R+S G ++S DE++ +IE R + W LP GE MQ+L YE GQ
Sbjct: 105 VTDDDSGEARPDDARSSIGGWVSGDDDEVIRNIELRASTWAMLPMNRGETMQVLRYEKGQ 164
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD--------GN 113
KY+ H DFF D+ N + GG R+AT+LMYLS VE+GGETVFP RD N
Sbjct: 165 KYDAHDDFFHDEHNVKNGGQRVATILMYLSDVEEGGETVFPLGTPLGGRDPEKSGVTGDN 224
Query: 114 WSECAR----RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
E A R AVKP +GDALLFF+ H D + H CPV G KW+ T+W V
Sbjct: 225 ACELASQNDPRVLAVKPRRGDALLFFNAHLSGEMDEKANHAGCPVNRGTKWTMTRWHRV 283
>gi|2980790|emb|CAA18166.1| hypothetical protein [Arabidopsis thaliana]
Length = 316
Score = 154 bits (389), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 85/216 (39%), Positives = 121/216 (56%), Gaps = 29/216 (13%)
Query: 28 DEIVASIEARIAAWTFLP--------------------PENGEAMQILHYEHGQKYEPHF 67
D +VA IE +++AWTFLP ENG ++++ Y +K
Sbjct: 110 DPVVAGIEEKVSAWTFLPGGLFSCGQTAGLCFSLDAHFSENGGSIKVRSYTS-EKSGKKL 168
Query: 68 DFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPM 127
D+F ++ + L +ATV++YLS+ +GGE +FPNSE+ + C G ++P+
Sbjct: 169 DYFGEEPSSVLHESLLATVVLYLSNTTQGGELLFPNSEMKPK-----NSCLEGGNILRPV 223
Query: 128 KGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDDCVDED 187
KG+A+LFF+ +AS D S H CPV++GE ATK I+ + K + E +C DED
Sbjct: 224 KGNAILFFTRLLNASLDGKSTHLRCPVVKGELLVATKLIYAK---KQARIEESGECSDED 280
Query: 188 LNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
NC WAK GECKKNP+YM+GS G CRKSC C
Sbjct: 281 ENCGRWAKLGECKKNPVYMIGSPDYYGTCRKSCNAC 316
>gi|145341735|ref|XP_001415959.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576182|gb|ABO94251.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 254
Score = 154 bits (389), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 84/216 (38%), Positives = 120/216 (55%), Gaps = 11/216 (5%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +G+S +RTS FL++ +E+V I ++A T LP + E MQ+L Y G+
Sbjct: 40 VVDSVTGESKVDPIRTSKQTFLNR-DEEVVREIYDALSAVTMLPWTHNEDMQVLEYRVGE 98
Query: 62 KYEPHFDF-----FRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSE---VSQSRDGN 113
KY+ H D + + GG R+ATVL+YL E GGET FP+SE + +
Sbjct: 99 KYDAHEDVGAEDSLSGRELSKDGGKRVATVLLYLEEPEAGGETAFPDSEWIDPKMAEGTS 158
Query: 114 WSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF-- 171
WS+CA A+KP +GD L+F+S+ P+ D +LH CPV+ G KW+AT W+H +
Sbjct: 159 WSKCAEHRVAMKPRRGDGLIFWSVDPNGKIDHRALHVGCPVVAGVKWTATVWVHAEPYRW 218
Query: 172 DKPEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMV 207
KP + C D C WA GEC KNP +M+
Sbjct: 219 QKPPEASATPGCEDAHDQCRGWANTGECDKNPGFML 254
>gi|449469338|ref|XP_004152378.1| PREDICTED: uncharacterized protein LOC101218968 [Cucumis sativus]
Length = 311
Score = 154 bits (389), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 85/219 (38%), Positives = 129/219 (58%), Gaps = 7/219 (3%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
SG ++++E+ SSG+ L+ D+IVA IE R+A WT LP ++ QI+ Y + +
Sbjct: 98 SGITVSTELLNSSGVILN-TTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKY 156
Query: 67 FDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
F R M +ATV++YLS GGE +FP S+V + WS ++ ++P
Sbjct: 157 FYGNRSAMLPS-SEPLMATVVLYLSDSASGGEILFPESKV---KSKFWSGRRKKNNFLRP 212
Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKPEKEPEDDDCV 184
+KG+A+LFFS+H +AS D +S H P+ +GE W ATK++++ +K + + D C
Sbjct: 213 VKGNAILFFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIQSDVDGCF 272
Query: 185 DEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
DED +C WA GEC++N ++MVGS G CRKSC C
Sbjct: 273 DEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC 311
>gi|357476355|ref|XP_003608463.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355509518|gb|AES90660.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 297
Score = 152 bits (385), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 83/170 (48%), Positives = 109/170 (64%), Gaps = 10/170 (5%)
Query: 15 VRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+RTSSG+FLS ++D+ + +IE +IA T +P +GEA IL YE GQ+Y H+D F
Sbjct: 136 IRTSSGVFLSASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYEVGQRYNSHYDAFNP 195
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
R+A+ L+YL+ VE+GGET+FP E + DG + G VKP +GD L
Sbjct: 196 DEYGPQKSQRVASFLLYLTDVEEGGETMFP-FENGLNMDGTYGYEDCVGLRVKPRQGDGL 254
Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDD 182
LF+SL P+ + D TSLHGSCPVI+GEKW ATKWI RN D+ EDDD
Sbjct: 255 LFYSLLPNGTIDQTSLHGSCPVIKGEKWVATKWI--RNLDQ-----EDDD 297
>gi|145347188|ref|XP_001418057.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578285|gb|ABO96350.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 317
Score = 152 bits (384), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 92/231 (39%), Positives = 129/231 (55%), Gaps = 21/231 (9%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+V +ESG S RTS G F+++ E + +E R+A ++ +P E+ E +Q+L Y G
Sbjct: 73 VVNSDESGA--VSTARTSFGTFVTRRLTETLQRVEDRVAKYSGIPWEHQEQLQLLRYRDG 130
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNS----EVSQSRDGN--- 113
Q+Y H D + + GG RIATVLM+L GGET FP E + N
Sbjct: 131 QEYVAH----HDGIISENGGKRIATVLMFLREPTSGGETSFPQGTPLPETKAAFLANKDK 186
Query: 114 WSECA---RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
SEC G++V P KG+A+LFFS H + + D + H SCP + G K++ATKWIH
Sbjct: 187 LSECGWNDGNGFSVIPKKGEAVLFFSFHINGTNDPFANHASCPTLGGTKYTATKWIHENP 246
Query: 171 FDK-PEKEPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSC 220
F+ K P C DE C VWA+ EC++NP++M+G +S G C KSC
Sbjct: 247 FETGTAKTP---TCTDETELCPVWAQGHECERNPVFMMGEESV-GACSKSC 293
>gi|255085784|ref|XP_002505323.1| predicted protein [Micromonas sp. RCC299]
gi|226520592|gb|ACO66581.1| predicted protein [Micromonas sp. RCC299]
Length = 215
Score = 152 bits (383), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 83/178 (46%), Positives = 106/178 (59%), Gaps = 20/178 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VAD +G + SG FL + D IV IE RI+A+ +P ++GE M+IL Y G+
Sbjct: 42 VADARTGGTFPG-----SGAFLLRNHDPIVTRIEERISAFAMIPADHGEGMRILRYGRGE 96
Query: 62 KYEPHFDFFRD-KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV------------SQ 108
KY+PH D+F D N + G R+ATVLMYLS VE GGETVFP S
Sbjct: 97 KYDPHHDYFDDGDKNLRFYGQRVATVLMYLSDVESGGETVFPKHGAWIEPDEMDVRGRSS 156
Query: 109 SRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
S+D S+CA+ VKP +GDALLF + H + D TSLH CPV+ GEKW+ATKW+
Sbjct: 157 SKDS--SKCAKGALHVKPRRGDALLFHNCHLNGREDPTSLHAGCPVLRGEKWTATKWM 212
>gi|356496957|ref|XP_003517331.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 299
Score = 151 bits (381), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 80/160 (50%), Positives = 106/160 (66%), Gaps = 3/160 (1%)
Query: 15 VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+RTS G+F+S ++DE I+ SIE +IA T +P +GEA IL YE GQKY PH+D F +
Sbjct: 140 IRTSYGVFMSASEDETGILDSIEEKIAKATKIPRTHGEAFNILRYEVGQKYSPHYDAFDE 199
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
L R A+ L+YL+ V +GGET+FP E +RDG++ G V+P KGD L
Sbjct: 200 AEFGPLQSQRAASFLLYLTDVPEGGETLFP-YENGFNRDGSYDFEDCIGLRVRPRKGDGL 258
Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
LF+SL P+ + D TS+HGSCPVI+GEKW ATKWI + D
Sbjct: 259 LFYSLLPNGTIDQTSVHGSCPVIKGEKWVATKWIRDQVLD 298
>gi|449488641|ref|XP_004158125.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101218968
[Cucumis sativus]
Length = 311
Score = 150 bits (380), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 84/219 (38%), Positives = 128/219 (58%), Gaps = 7/219 (3%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
SG ++++E+ SSG+ L+ D+IVA IE R+A WT LP ++ QI+ Y + +
Sbjct: 98 SGITVSTELLNSSGVILN-TTDDIVARIENRLAIWTLLPKDHSMPFQIMQYRGEEAKHKY 156
Query: 67 FDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
F R M +ATV++YLS GGE +FP S+V + WS ++ ++P
Sbjct: 157 FYGNRSAMLPS-SEPLMATVVLYLSDSASGGEILFPESKV---KSKFWSGRRKKNNFLRP 212
Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKPEKEPEDDDCV 184
+KG+A+L FS+H +AS D +S H P+ +GE W ATK++++ +K + + D C
Sbjct: 213 VKGNAILXFSVHLNASPDKSSYHIRSPIRDGELWVATKFLYLGPPAGNKHTIQSDVDGCF 272
Query: 185 DEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
DED +C WA GEC++N ++MVGS G CRKSC C
Sbjct: 273 DEDKSCPQWAAIGECERNAVFMVGSPDYYGTCRKSCNAC 311
>gi|413945803|gb|AFW78452.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
Length = 239
Score = 150 bits (379), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 71/106 (66%), Positives = 84/106 (79%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VADN SGKS SEVRTSSG FL K QD IV IE +IAAWTFLP ENGE +Q+L Y+HG+
Sbjct: 88 VADNMSGKSTLSEVRTSSGTFLRKGQDPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGE 147
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS 107
KYEPH+D+F D +N GGHR ATVL+YL+ V +GGETVFP +EV+
Sbjct: 148 KYEPHYDYFTDNVNTVRGGHRYATVLLYLTDVPEGGETVFPLAEVN 193
>gi|449520827|ref|XP_004167434.1| PREDICTED: putative prolyl 4-hydroxylase-like, partial [Cucumis
sativus]
Length = 164
Score = 150 bits (378), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 78/166 (46%), Positives = 107/166 (64%), Gaps = 5/166 (3%)
Query: 11 IASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFD 68
+ S+ RTSSGMFLS + +V +IE RI+ ++ +P ENGE +Q+L YE Q Y+PH D
Sbjct: 2 VKSDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHD 61
Query: 69 FFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMK 128
+F D N + GG RIAT+LMYLS +GGET FP + + G + G +VKP K
Sbjct: 62 YFSDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECSCGGKT---VPGLSVKPAK 118
Query: 129 GDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
GDA+LF+S+ D +D S+HG C V+ GEKWSATKW+ ++ P
Sbjct: 119 GDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMRQKSTLVP 164
>gi|40809925|dbj|BAD07294.1| prolyl 4-hydroxylase [Nicotiana tabacum]
Length = 286
Score = 150 bits (378), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 80/161 (49%), Positives = 104/161 (64%), Gaps = 5/161 (3%)
Query: 15 VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+RTSSG F+S ++D+ I+ IE +IA T +P +GEA +L YE GQ+Y+ H+D F
Sbjct: 127 IRTSSGTFISASEDKTGILDLIEEKIAKATMIPKTHGEAFNVLRYEIGQRYQSHYDAFDP 186
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
R A+ L+YLS VE+GGETVFP E Q+ D ++ G VKP +GD L
Sbjct: 187 AQYGPQKSQRAASFLLYLSDVEEGGETVFPY-ENGQNMDASYDFSKCIGLKVKPRRGDGL 245
Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
LF+SL P+ + D TSLHGSCPVI GEKW ATKWI RN D+
Sbjct: 246 LFYSLFPNGTIDLTSLHGSCPVIRGEKWVATKWI--RNQDQ 284
>gi|449448264|ref|XP_004141886.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 294
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 101/155 (65%), Gaps = 3/155 (1%)
Query: 15 VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
VRTSSG+F S ++DE + IE +IA T +P +GEA IL YE GQKY H+D F+
Sbjct: 132 VRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQKYNSHYDAFKP 191
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
R+A+ L+YL+ VE+GGET+FP E + DG ++ G VKP +GD L
Sbjct: 192 SEYGPQKSQRVASFLLYLTDVEEGGETMFP-FENGLNMDGTYNFQTCIGLKVKPRQGDGL 250
Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
LF+S+ P+ + D TSLHGSCPVI+G+KW ATKWI
Sbjct: 251 LFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIR 285
>gi|255573113|ref|XP_002527486.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223533126|gb|EEF34884.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 286
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 80/155 (51%), Positives = 101/155 (65%), Gaps = 5/155 (3%)
Query: 16 RTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK 73
RTSSG FLS ++D + IE +IA T +P +GEA IL YE GQKY+ H+D F
Sbjct: 128 RTSSGTFLSASEDGTGTLDFIEHKIARATMIPRSHGEAFNILRYEIGQKYDSHYDSFNPA 187
Query: 74 MNQQLGGHRIATVLMYLSHVEKGGETVFP-NSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
R+A+ L+YLS VEKGGET+FP + V S ++ +CA G VKP +GD +
Sbjct: 188 EYGPQMSQRVASFLLYLSDVEKGGETMFPFENGVKISSVYDYKKCA--GLKVKPRQGDGI 245
Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
LF+SL P+ + D TSLHGSCPVIEGEKW ATKWI
Sbjct: 246 LFYSLLPNGTIDQTSLHGSCPVIEGEKWVATKWIR 280
>gi|225438938|ref|XP_002279411.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296087348|emb|CBI33722.3| unnamed protein product [Vitis vinifera]
Length = 285
Score = 148 bits (373), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 75/156 (48%), Positives = 103/156 (66%), Gaps = 5/156 (3%)
Query: 15 VRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+RTSSG+F+S ++D+ + IE +IA +P +GEA +L YE GQ+Y H+D F
Sbjct: 126 IRTSSGVFISASEDKTGTLDLIEQKIARVIMIPRTHGEAFNVLRYEIGQRYNSHYDAFDP 185
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFP-NSEVSQSRDGNWSECARRGYAVKPMKGDA 131
HRIAT L+YLS VE+GGET+FP + ++ +D ++ C G VKP +GD
Sbjct: 186 AEYGPQKSHRIATFLVYLSDVEEGGETMFPFENGLNMDKDYDFQRCI--GLKVKPHQGDG 243
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
LLF+S+ P+ + D TSLHGSCPVI+GEKW ATKWI
Sbjct: 244 LLFYSMFPNGTIDPTSLHGSCPVIKGEKWVATKWIR 279
>gi|356536125|ref|XP_003536590.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 286
Score = 147 bits (372), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 77/159 (48%), Positives = 103/159 (64%), Gaps = 5/159 (3%)
Query: 14 EVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+ RTSSG F+S ++D+ I+ +E +IA T +P +GE IL YE GQKY+ H+D F
Sbjct: 126 DTRTSSGTFISASEDKSGILDLVERKIAKVTMIPRTHGEIFNILKYEVGQKYDSHYDAFN 185
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFP-NSEVSQSRDGNWSECARRGYAVKPMKGD 130
+ RIA+ L+YLS+VE GGET+FP ++ R ++ +C G VKP +GD
Sbjct: 186 PDEYGSVESQRIASFLLYLSNVEAGGETMFPYEGGLNIDRGYDYQKCI--GLKVKPRQGD 243
Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
LLF+SL P+ D TSLHGSCPVI+GEKW ATKWI R
Sbjct: 244 GLLFYSLLPNGKIDKTSLHGSCPVIKGEKWVATKWIDDR 282
>gi|168006299|ref|XP_001755847.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693166|gb|EDQ79520.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 299
Score = 147 bits (371), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 77/163 (47%), Positives = 104/163 (63%), Gaps = 7/163 (4%)
Query: 14 EVRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
++RTSSG FL +D + +E ++A T +P ENGEA +L Y GQKY+ H+D F
Sbjct: 140 DIRTSSGTFLRADEDTTRSLEQVEEKMAKATMIPRENGEAFNVLKYNVGQKYDCHYDVFD 199
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGD 130
R+A+ L+YLS VE+GGET+FP G ++ +C G VKP +GD
Sbjct: 200 PAEYGPQPSQRMASFLLYLSDVEEGGETMFPFENFQNMNIGFDYKKCI--GMKVKPRQGD 257
Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
ALLF+S+HP+ + D ++LHGSCPVI+GEKW ATKWI RN DK
Sbjct: 258 ALLFYSMHPNGTFDKSALHGSCPVIKGEKWVATKWI--RNTDK 298
>gi|308804269|ref|XP_003079447.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116057902|emb|CAL54105.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 363
Score = 147 bits (371), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 87/218 (39%), Positives = 125/218 (57%), Gaps = 30/218 (13%)
Query: 15 VRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM 74
RTS G F+++ ++++E R+A ++ +P + E +Q+L YE GQ+Y
Sbjct: 141 ARTSFGTFITRRLTPTLSAVEDRVAEYSGIPWRHQEQLQLLRYEKGQEYGN--------- 191
Query: 75 NQQLGGHRIATVLMYLSHVEKGGETVFPN--------SEVSQSR----DGNWSECARRGY 122
G RIATVLM+L E GGET FP+ SE SR D W+E RG+
Sbjct: 192 ----GEKRIATVLMFLREPEFGGETHFPDATPLPATRSEFLGSRAKLSDCGWNEG--RGF 245
Query: 123 AVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDD 182
+V P KGDA+LFFS H + ++D + H SCP + G K++ATKWIH + FD E
Sbjct: 246 SVIPRKGDAILFFSHHINGTSDDAASHASCPTLRGIKYTATKWIHEKEFDTTTFE--TPM 303
Query: 183 CVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSC 220
C D++ C WA +GEC+KNP++M+G ++ G C KSC
Sbjct: 304 CEDKEDMCDQWANSGECEKNPVFMMGIETV-GSCSKSC 340
>gi|224103711|ref|XP_002313164.1| predicted protein [Populus trichocarpa]
gi|222849572|gb|EEE87119.1| predicted protein [Populus trichocarpa]
Length = 294
Score = 147 bits (371), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 78/157 (49%), Positives = 101/157 (64%), Gaps = 5/157 (3%)
Query: 14 EVRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+ RTSSG F+S ++DE + IE +IA T +P +GEA IL YE GQKY+ H+D F
Sbjct: 132 DTRTSSGSFVSGSEDETGTLDFIEKKIAKATMIPQSHGEAFNILRYEIGQKYDSHYDAFN 191
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGD 130
Q R A+ L+YLS+VE+GGET+FP S G ++ +C G VKP +GD
Sbjct: 192 PDEYGQQSSQRTASFLLYLSNVEEGGETMFPFENGSAVIPGFDYKQCV--GLKVKPRQGD 249
Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
LLF+SL P+ + D TSLHGSCPVI+G KW ATKWI
Sbjct: 250 GLLFYSLFPNGTIDPTSLHGSCPVIKGVKWVATKWIR 286
>gi|297798522|ref|XP_002867145.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297312981|gb|EFH43404.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 288
Score = 146 bits (368), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 79/164 (48%), Positives = 103/164 (62%), Gaps = 7/164 (4%)
Query: 15 VRTSSGMFLSKAQDEIVAS--IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
RTSSG F+S ++D A +E +IA T +P +GE+ IL YE GQKY+ H+D F
Sbjct: 129 TRTSSGTFISASEDSTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNP 188
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGDA 131
RIA+ L+YLS VE+GGET+FP S G ++ +C G VKP KGD
Sbjct: 189 TEYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGTGYDYKQCI--GLKVKPRKGDG 246
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
LLF+S+ P+ + D TSLHGSCPV +GEKW ATKWI R+ D+ E
Sbjct: 247 LLFYSVFPNGTIDQTSLHGSCPVTKGEKWVATKWI--RDQDQEE 288
>gi|449511009|ref|XP_004163837.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-1-like [Cucumis sativus]
Length = 294
Score = 146 bits (368), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 75/155 (48%), Positives = 100/155 (64%), Gaps = 3/155 (1%)
Query: 15 VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
VRTSSG+F S ++DE + IE + A T +P +GEA IL YE GQKY H+D F+
Sbjct: 132 VRTSSGVFFSASEDESGTLGVIEEKXARATMIPRTHGEAYNILRYEIGQKYNSHYDAFKP 191
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
R+A+ L+YL+ VE+GGET+FP E + DG ++ G VKP +GD L
Sbjct: 192 SEYGPQKSQRVASFLLYLTDVEEGGETMFP-FENGLNMDGTYNFQTCIGLKVKPRQGDGL 250
Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
LF+S+ P+ + D TSLHGSCPVI+G+KW ATKWI
Sbjct: 251 LFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIR 285
>gi|297802348|ref|XP_002869058.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297314894|gb|EFH45317.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 245
Score = 145 bits (366), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 77/149 (51%), Positives = 98/149 (65%), Gaps = 19/149 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + +G S RTSSG FL K D+IV IE RI+ +TF+P ENGEA+Q++HYE GQ
Sbjct: 100 VRNAITGLGEESSSRTSSGTFLRKGHDKIVKEIEKRISEFTFIPEENGEALQVIHYEVGQ 159
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
K+EPHFD G RIATVLMYLS V+KGGETVFP ++ +S ++G
Sbjct: 160 KFEPHFD----------GFQRIATVLMYLSDVDKGGETVFPEAKGIKS---------KKG 200
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHG 150
+V+P KGDALLF+S+ PD S D +S HG
Sbjct: 201 VSVRPKKGDALLFWSMRPDGSQDPSSKHG 229
>gi|302764866|ref|XP_002965854.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
gi|300166668|gb|EFJ33274.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
Length = 231
Score = 145 bits (366), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 75/156 (48%), Positives = 100/156 (64%), Gaps = 5/156 (3%)
Query: 14 EVRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+VRTS G FLS QD+ +A +E ++A T +P +GEA +L YE GQKY H+D F
Sbjct: 71 DVRTSHGCFLSSRQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEIGQKYNSHYDVFN 130
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGD 130
R+A+ L+YLS VE+GGET+FP + ++ EC G VKP +GD
Sbjct: 131 PAEYGPQKSQRMASFLLYLSDVEEGGETMFPFENYEHMNENYDYKECI--GLKVKPKQGD 188
Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
ALLF+S+ P+ + D T+LHGSCPVI+GEKW ATKWI
Sbjct: 189 ALLFYSMFPNGTFDKTALHGSCPVIKGEKWVATKWI 224
>gi|302802700|ref|XP_002983104.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
gi|300149257|gb|EFJ15913.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
Length = 292
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 75/156 (48%), Positives = 100/156 (64%), Gaps = 5/156 (3%)
Query: 14 EVRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+VRTS G FLS QD+ +A +E ++A T +P +GEA +L YE GQKY H+D F
Sbjct: 132 DVRTSHGCFLSSRQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEIGQKYNSHYDVFN 191
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGD 130
R+A+ L+YLS VE+GGET+FP + ++ EC G VKP +GD
Sbjct: 192 PAEYGPQKSQRMASFLLYLSDVEEGGETMFPFENYEHMNENYDYKECI--GLKVKPKQGD 249
Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
ALLF+S+ P+ + D T+LHGSCPVI+GEKW ATKWI
Sbjct: 250 ALLFYSMFPNGTFDKTALHGSCPVIKGEKWVATKWI 285
>gi|255647903|gb|ACU24410.1| unknown [Glycine max]
Length = 293
Score = 143 bits (361), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 75/160 (46%), Positives = 101/160 (63%), Gaps = 3/160 (1%)
Query: 15 VRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+RTSSG+F+S ++D+ + IE +IA T +P +GEA IL YE Q+Y H+D F
Sbjct: 134 IRTSSGVFVSASEDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQRYNSHYDAFNP 193
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
R+A+ L+YL+ VE+GGET+FP E + DGN+ G VKP +GD L
Sbjct: 194 AEYGPQKSQRMASFLLYLTDVEEGGETMFP-FENGLNMDGNYGYEGCIGLKVKPRQGDGL 252
Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
LF+SL + + D TSLHGSCPVI+GEKW ATKWI + D
Sbjct: 253 LFYSLLTNGTIDPTSLHGSCPVIKGEKWVATKWIRDQELD 292
>gi|385137888|gb|AFI41205.1| oxygenase protein, partial [Arabidopsis thaliana]
Length = 288
Score = 143 bits (361), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 75/156 (48%), Positives = 98/156 (62%), Gaps = 5/156 (3%)
Query: 15 VRTSSGMFLSKAQDEIVAS--IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
RTSSG F+S +++ A +E +IA T +P +GE+ IL YE GQKY+ H+D F
Sbjct: 129 TRTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNP 188
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGDA 131
RIA+ L+YLS VE+GGET+FP S G ++ +C G VKP KGD
Sbjct: 189 TEYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCI--GLKVKPRKGDG 246
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
LLF+S+ P+ + D TSLHGSCPV +GEKW ATKWI
Sbjct: 247 LLFYSVFPNGTIDQTSLHGSCPVTKGEKWVATKWIR 282
>gi|18418321|ref|NP_567941.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|17381226|gb|AAL36425.1| unknown protein [Arabidopsis thaliana]
gi|20465827|gb|AAM20018.1| unknown protein [Arabidopsis thaliana]
gi|21592377|gb|AAM64328.1| putative dioxygenase [Arabidopsis thaliana]
gi|332660892|gb|AEE86292.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 288
Score = 143 bits (361), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 75/156 (48%), Positives = 98/156 (62%), Gaps = 5/156 (3%)
Query: 15 VRTSSGMFLSKAQDEIVAS--IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
RTSSG F+S +++ A +E +IA T +P +GE+ IL YE GQKY+ H+D F
Sbjct: 129 TRTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNP 188
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGDA 131
RIA+ L+YLS VE+GGET+FP S G ++ +C G VKP KGD
Sbjct: 189 TEYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCI--GLKVKPRKGDG 246
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
LLF+S+ P+ + D TSLHGSCPV +GEKW ATKWI
Sbjct: 247 LLFYSVFPNGTIDQTSLHGSCPVTKGEKWVATKWIR 282
>gi|356574299|ref|XP_003555286.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 290
Score = 143 bits (360), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 75/158 (47%), Positives = 97/158 (61%), Gaps = 2/158 (1%)
Query: 14 EVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+ RTSSG F+S ++D+ I+ +E +IA T +P +GE IL YE QKY+ H+D F
Sbjct: 125 DTRTSSGTFISASEDKSGILDFVERKIAKVTMIPRTHGEKFNILKYEVAQKYDSHYDAFN 184
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ RIA+ L+YLS+VE GGET+FP G + G VKP +GD
Sbjct: 185 PDEYGTVESQRIASFLLYLSNVEAGGETMFPYEGGLNIDKGYYDYKKCIGLKVKPRQGDG 244
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
LLF+SL P+ D TSLHGSCPVI+GEKW ATKWI R
Sbjct: 245 LLFYSLLPNGKIDKTSLHGSCPVIKGEKWVATKWIDDR 282
>gi|226499492|ref|NP_001150030.1| LOC100283657 [Zea mays]
gi|195636206|gb|ACG37571.1| prolyl 4-hydroxylase [Zea mays]
gi|347978804|gb|AEP37744.1| prolyl 4-hydroxylase 3 [Zea mays]
Length = 294
Score = 143 bits (360), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 97/155 (62%), Gaps = 3/155 (1%)
Query: 15 VRTSSGMFLSKAQD--EIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+RTSSG FLS +D E +A IE +IA T LP +GE +L Y GQ+Y H+D F
Sbjct: 137 IRTSSGTFLSANEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQRYASHYDAFDP 196
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
R+A+ L+YL+ VE+GGET+FP E S++ D + G VKP KGD L
Sbjct: 197 AQYGPQKNQRVASFLLYLTDVEEGGETMFP-YENSENMDIGYDYEKCIGLKVKPRKGDGL 255
Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
LF+SL + + D TSLHGSCPVI+GEKW ATKWI
Sbjct: 256 LFYSLMVNGTIDRTSLHGSCPVIKGEKWVATKWIR 290
>gi|223945827|gb|ACN26997.1| unknown [Zea mays]
gi|414872966|tpg|DAA51523.1| TPA: prolyl 4-hydroxylase [Zea mays]
Length = 294
Score = 143 bits (360), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 76/155 (49%), Positives = 97/155 (62%), Gaps = 3/155 (1%)
Query: 15 VRTSSGMFLSKAQD--EIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+RTSSG FLS +D E +A IE +IA T LP +GE +L Y GQ+Y H+D F
Sbjct: 137 IRTSSGTFLSANEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQRYASHYDAFDP 196
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
R+A+ L+YL+ VE+GGET+FP E S++ D + G VKP KGD L
Sbjct: 197 AQYGPQKNQRVASFLLYLTDVEEGGETMFP-YENSENMDIGYDYEKCIGLKVKPRKGDGL 255
Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
LF+SL + + D TSLHGSCPVI+GEKW ATKWI
Sbjct: 256 LFYSLMVNGTIDRTSLHGSCPVIKGEKWVATKWIR 290
>gi|356563543|ref|XP_003550021.1| PREDICTED: putative prolyl 4-hydroxylase-like [Glycine max]
Length = 293
Score = 142 bits (359), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 75/160 (46%), Positives = 101/160 (63%), Gaps = 3/160 (1%)
Query: 15 VRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+RTSSG+F+S ++D+ + IE +IA T +P +GEA IL YE Q+Y H+D F
Sbjct: 134 IRTSSGVFVSASEDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQRYNSHYDAFNP 193
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
R+A+ L+YL+ VE+GGET+FP E + DGN+ G VKP +GD L
Sbjct: 194 AEYGPQKSQRMASFLLYLTDVEEGGETMFP-FENGLNMDGNYGYEDCIGLKVKPRQGDGL 252
Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
LF+SL + + D TSLHGSCPVI+GEKW ATKWI + D
Sbjct: 253 LFYSLLTNGTIDPTSLHGSCPVIKGEKWVATKWIRDQELD 292
>gi|225428938|ref|XP_002262952.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296083079|emb|CBI22483.3| unnamed protein product [Vitis vinifera]
Length = 284
Score = 142 bits (357), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 73/156 (46%), Positives = 100/156 (64%), Gaps = 5/156 (3%)
Query: 15 VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
RTSSG F+S ++D+ I+ +E +IA T +P +GEA IL YE GQ+Y H+D F
Sbjct: 125 TRTSSGTFISASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQRYNSHYDAFNP 184
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFP-NSEVSQSRDGNWSECARRGYAVKPMKGDA 131
R+A+ L+YLS VE+GGET+FP +++ ++ +C G VKP +GD
Sbjct: 185 AEYGPQTSQRVASFLLYLSDVEEGGETMFPFEHDLNIGTGYDYKKCI--GLKVKPQRGDG 242
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
LLF+S+ P+ + D TSLHGSCPVI GEKW ATKWI
Sbjct: 243 LLFYSVFPNGTIDRTSLHGSCPVIAGEKWVATKWIR 278
>gi|363807682|ref|NP_001242420.1| uncharacterized protein LOC100775302 [Glycine max]
gi|255641811|gb|ACU21174.1| unknown [Glycine max]
Length = 293
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 75/155 (48%), Positives = 99/155 (63%), Gaps = 3/155 (1%)
Query: 15 VRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+RTSSG+F+S + D+ +A IE +IA T +P +GEA IL YE Q+Y H+D F
Sbjct: 134 IRTSSGVFVSASGDKTGTLAVIEEKIARATMIPRSHGEAFNILRYEVDQRYNSHYDAFNP 193
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
R+A+ L+YL+ VE+GGET+FP E + DGN+ G VKP +GD L
Sbjct: 194 AEYGPQKSQRMASFLLYLTDVEEGGETMFP-FENGLNMDGNYGYEDCIGLKVKPRQGDGL 252
Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
LF+SL + + D TSLHGSCPVI+GEKW ATKWI
Sbjct: 253 LFYSLLTNGTIDPTSLHGSCPVIKGEKWVATKWIR 287
>gi|326492085|dbj|BAJ98267.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 347
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 74/156 (47%), Positives = 95/156 (60%), Gaps = 5/156 (3%)
Query: 15 VRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+RTSSG FLS +D +A IE +IA T +P +GE +L YE GQKY H+D F
Sbjct: 190 IRTSSGTFLSAEEDPTGALAEIETKIAKATMMPRSHGEPFNVLRYEIGQKYASHYDAFDP 249
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGDA 131
R+A+ L+YL+ VE+GGET+FP G ++ +C G VKP KGD
Sbjct: 250 AQYGPQKSQRVASFLLYLTDVEEGGETMFPYENGDNMNIGYDYEQCI--GLKVKPRKGDG 307
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
LLF+SL + + D TSLHGSCPV+ GEKW ATKWI
Sbjct: 308 LLFYSLMVNGTIDPTSLHGSCPVVRGEKWVATKWIR 343
>gi|147823227|emb|CAN70872.1| hypothetical protein VITISV_009065 [Vitis vinifera]
Length = 276
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 73/156 (46%), Positives = 100/156 (64%), Gaps = 5/156 (3%)
Query: 15 VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
RTSSG F+S ++D+ I+ +E +IA T +P +GEA IL YE GQ+Y H+D F
Sbjct: 117 TRTSSGTFISASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQRYNSHYDAFNP 176
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFP-NSEVSQSRDGNWSECARRGYAVKPMKGDA 131
R+A+ L+YLS VE+GGET+FP +++ ++ +C G VKP +GD
Sbjct: 177 AEYGPQTSQRVASFLLYLSDVEEGGETMFPFEHDLNIGTGYDYKKCI--GLKVKPQRGDG 234
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
LLF+S+ P+ + D TSLHGSCPVI GEKW ATKWI
Sbjct: 235 LLFYSVFPNGTIDRTSLHGSCPVIAGEKWVATKWIR 270
>gi|302845120|ref|XP_002954099.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
nagariensis]
gi|300260598|gb|EFJ44816.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
nagariensis]
Length = 231
Score = 141 bits (356), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 75/163 (46%), Positives = 93/163 (57%), Gaps = 4/163 (2%)
Query: 12 ASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDF 69
+ + RTS+G FL+ A D ++ +E RIAA T LP ENGEA +LHYE Q Y+ H+D
Sbjct: 65 SQQTRTSTGTFLAAAMDPEGVLGWVEQRIAAATLLPAENGEAFNVLHYEKEQHYDSHYDT 124
Query: 70 FRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD--GNWSECARRGYAVKPM 127
F K RIATVL+YLS V +GGETVF V G+W C + P
Sbjct: 125 FDPKEFGPQPSQRIATVLLYLSEVLEGGETVFKREGVDGENRVIGDWRNCDDGSFKYMPR 184
Query: 128 KGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
GDA+LF+ P+ D +LHG CPV GEKW ATKWI R
Sbjct: 185 MGDAVLFWGTKPNGDIDPHALHGGCPVKRGEKWVATKWIRSRG 227
>gi|357114580|ref|XP_003559078.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 295
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 74/156 (47%), Positives = 96/156 (61%), Gaps = 5/156 (3%)
Query: 15 VRTSSGMFLSKAQD--EIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+RTSSG FLS +D +A +E +IA T +P +GE +L YE GQKY H+D F
Sbjct: 138 IRTSSGTFLSADEDPTRTLAEVEKKIAKATMIPRSHGEPFNVLRYEIGQKYASHYDAFDP 197
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGDA 131
R+A+ L+YL+ VE+GGET+FP G ++ +C G VKP KGD
Sbjct: 198 AQYGPQKSQRVASFLLYLTDVEEGGETMFPYENGENMDIGYDYEQCI--GLKVKPRKGDG 255
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
LLF+SL + + D TSLHGSCPVI+GEKW ATKWI
Sbjct: 256 LLFYSLMVNGTIDLTSLHGSCPVIKGEKWVATKWIR 291
>gi|225428943|ref|XP_002263094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296083076|emb|CBI22480.3| unnamed protein product [Vitis vinifera]
Length = 282
Score = 140 bits (354), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 74/155 (47%), Positives = 96/155 (61%), Gaps = 4/155 (2%)
Query: 15 VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+RTSSG F+S ++D+ I+ IE +IA T +P +GE IL YE GQ+Y H+D
Sbjct: 124 IRTSSGTFISASEDKTGILDFIERKIAKATMIPRNHGEVFNILRYEIGQRYNSHYDAISP 183
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
RIA+ L+YLS VE+GGET+FP N +C G VKP +GD L
Sbjct: 184 AEYGLQTSQRIASFLLYLSDVEEGGETMFPFEHDLNINTFNSRKCI--GLKVKPRRGDGL 241
Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
LF+S+ P+ + D TS+HGSCPVIEGEKW ATKWI
Sbjct: 242 LFYSVFPNGTIDWTSMHGSCPVIEGEKWVATKWIR 276
>gi|334188665|ref|NP_001190630.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
gi|332010771|gb|AED98154.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
Length = 243
Score = 140 bits (354), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 68/125 (54%), Positives = 91/125 (72%), Gaps = 3/125 (2%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++GKS S VRTSSG FL++ +D+ + IE RI+ +TF+P E+GE +Q+LHYE GQ
Sbjct: 113 VVDEKTGKSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQ 172
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW---SECA 118
KYEPH+D+F D+ N + GG RIATVLMYLS VE+GGETVFP ++ + S W SEC
Sbjct: 173 KYEPHYDYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECG 232
Query: 119 RRGYA 123
+ G+
Sbjct: 233 KGGWV 237
>gi|388519941|gb|AFK48032.1| unknown [Lotus japonicus]
Length = 151
Score = 140 bits (353), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 72/148 (48%), Positives = 97/148 (65%), Gaps = 5/148 (3%)
Query: 21 MFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQL 78
MFL+ + + +V +IE RI+ ++ +P ENGE MQ+L YE Q Y+PH D+F D N +
Sbjct: 1 MFLTPEERKYPMVHAIEKRISVYSQVPIENGELMQVLRYEKNQYYKPHHDYFADTFNLKR 60
Query: 79 GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLH 138
GG RIAT+LMYLS +GGET FPN Q G + G +VKP KG+A+LF+S+
Sbjct: 61 GGQRIATMLMYLSDNVEGGETYFPNIGSGQCSCGGKT---VEGLSVKPTKGNAVLFWSMG 117
Query: 139 PDASTDSTSLHGSCPVIEGEKWSATKWI 166
D +D S+HG C V+ GEKWSATKW+
Sbjct: 118 LDGQSDPLSVHGGCEVLAGEKWSATKWM 145
>gi|115455509|ref|NP_001051355.1| Os03g0761900 [Oryza sativa Japonica Group]
gi|14488368|gb|AAK63935.1|AC084282_16 putative dioxygenase [Oryza sativa Japonica Group]
gi|17027263|gb|AAL34117.1|AC090713_4 putative hydroxylase subunit [Oryza sativa Japonica Group]
gi|108711218|gb|ABF99013.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|113549826|dbj|BAF13269.1| Os03g0761900 [Oryza sativa Japonica Group]
gi|125545807|gb|EAY91946.1| hypothetical protein OsI_13633 [Oryza sativa Indica Group]
Length = 310
Score = 140 bits (352), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 74/156 (47%), Positives = 96/156 (61%), Gaps = 5/156 (3%)
Query: 15 VRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+RTSSG FLS +D +A +E +IA T +P +GE IL YE GQ+Y H+D F
Sbjct: 151 IRTSSGTFLSSDEDPTGTLAEVEKKIAKATMIPRHHGEPFNILRYEIGQRYASHYDAFDP 210
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGDA 131
R+A+ L+YL+ VE+GGET+FP G ++ +C G VKP KGD
Sbjct: 211 AQYGPQKSQRVASFLLYLTDVEEGGETMFPYENGENMDIGYDYEKCI--GLKVKPRKGDG 268
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
LLF+SL + + D TSLHGSCPVI+GEKW ATKWI
Sbjct: 269 LLFYSLMVNGTIDPTSLHGSCPVIKGEKWVATKWIR 304
>gi|357453665|ref|XP_003597113.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|357482683|ref|XP_003611628.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355486161|gb|AES67364.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355512963|gb|AES94586.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 294
Score = 140 bits (352), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 72/156 (46%), Positives = 99/156 (63%), Gaps = 5/156 (3%)
Query: 15 VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+RTSSGMF+S ++D+ ++ I+ +IA +P +G A IL Y+ GQKY H+D F
Sbjct: 134 IRTSSGMFISASEDKTGLLEVIDEKIARAAKIPKTHGGAYNILRYKVGQKYNSHYDAFNP 193
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGDA 131
R+A+ L+YL+ V +GGET+FP S N+ +C G +KP+KGD
Sbjct: 194 AEYGPQESQRVASFLLYLTDVPEGGETMFPFENGSNMDSSYNFEDCI--GLKIKPLKGDG 251
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
LLF+SL P+ + D TSLHGSCPVI+GEKW ATKWI
Sbjct: 252 LLFYSLFPNGTIDPTSLHGSCPVIKGEKWVATKWIR 287
>gi|159489502|ref|XP_001702736.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280758|gb|EDP06515.1| predicted protein [Chlamydomonas reinhardtii]
Length = 231
Score = 140 bits (352), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 75/163 (46%), Positives = 93/163 (57%), Gaps = 4/163 (2%)
Query: 12 ASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDF 69
+ + RTS+G FLS D ++ +E RIAA T LP +NGEA +LHYEH Q Y+ H D
Sbjct: 65 SQQTRTSTGTFLSSGMDTEGVLGWVEQRIAAATLLPADNGEAFNVLHYEHMQHYDSHMDS 124
Query: 70 FRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD--GNWSECARRGYAVKPM 127
F K RIATVL+YLS V +GGETVF V + +W C + P
Sbjct: 125 FDPKDFGPQPSQRIATVLLYLSEVLEGGETVFKKEGVDGADRPIQDWRNCDDGSFKYAPR 184
Query: 128 KGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
GDA+LF+ P+ D SLHG CPV +GEKW ATKWI R
Sbjct: 185 MGDAVLFWGTRPNGEIDPHSLHGGCPVKKGEKWVATKWIRSRG 227
>gi|125588006|gb|EAZ28670.1| hypothetical protein OsJ_12681 [Oryza sativa Japonica Group]
Length = 280
Score = 140 bits (352), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 74/156 (47%), Positives = 96/156 (61%), Gaps = 5/156 (3%)
Query: 15 VRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+RTSSG FLS +D +A +E +IA T +P +GE IL YE GQ+Y H+D F
Sbjct: 121 IRTSSGTFLSSDEDPTGTLAEVEKKIAKATMIPRHHGEPFNILRYEIGQRYASHYDAFDP 180
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGDA 131
R+A+ L+YL+ VE+GGET+FP G ++ +C G VKP KGD
Sbjct: 181 AQYGPQKSQRVASFLLYLTDVEEGGETMFPYENGENMDIGYDYEKCI--GLKVKPRKGDG 238
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
LLF+SL + + D TSLHGSCPVI+GEKW ATKWI
Sbjct: 239 LLFYSLMVNGTIDPTSLHGSCPVIKGEKWVATKWIR 274
>gi|242038031|ref|XP_002466410.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
gi|241920264|gb|EER93408.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
Length = 294
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 73/156 (46%), Positives = 96/156 (61%), Gaps = 5/156 (3%)
Query: 15 VRTSSGMFLSKAQD--EIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+RTSSG FLS +D +A IE +IA T +P +GE +L Y GQ+Y H+D F
Sbjct: 137 IRTSSGTFLSANEDPTRTLAEIEKKIARATMIPRNHGEPFNVLRYNIGQRYASHYDAFDP 196
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGDA 131
R+A+ L+YL++VE+GGET+FP G ++ +C G VKP KGD
Sbjct: 197 VQYGPQKSQRVASFLLYLTNVEEGGETMFPYENGENMDIGYDYEKCI--GLKVKPRKGDG 254
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
LLF+SL + + D TSLHGSCPVI+GEKW ATKWI
Sbjct: 255 LLFYSLMVNGTIDRTSLHGSCPVIKGEKWVATKWIR 290
>gi|15233345|ref|NP_195307.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|3805848|emb|CAA21468.1| putative protein [Arabidopsis thaliana]
gi|7270534|emb|CAB81491.1| putative protein [Arabidopsis thaliana]
gi|332661175|gb|AEE86575.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 272
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 73/149 (48%), Positives = 96/149 (64%), Gaps = 19/149 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + +G S RTSSG F+ D+IV IE RI+ +TF+P ENGE +Q+++YE GQ
Sbjct: 133 VRNALTGLGEESSSRTSSGTFIRSGHDKIVKEIEKRISEFTFIPQENGETLQVINYEVGQ 192
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
K+EPHFD G RIATVLMYLS V+KGGETVFP ++ +S ++G
Sbjct: 193 KFEPHFD----------GFQRIATVLMYLSDVDKGGETVFPEAKGIKS---------KKG 233
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHG 150
+V+P KGDALLF+S+ PD S D +S HG
Sbjct: 234 VSVRPKKGDALLFWSMRPDGSRDPSSKHG 262
>gi|388523073|gb|AFK49598.1| unknown [Lotus japonicus]
Length = 318
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 77/172 (44%), Positives = 105/172 (61%), Gaps = 10/172 (5%)
Query: 15 VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+RTSSG+F+S +D+ ++ IE +IA T +P +GEA +L Y+ GQKY H+D
Sbjct: 143 IRTSSGVFISAFEDKTGVLDVIEEKIARATKIPRTHGEAFNVLRYKVGQKYSSHYDALHP 202
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
+ R+A+ L+YLS V +GGET+FP E + DG++ G VKP KGD L
Sbjct: 203 DIYGPQKSQRMASFLLYLSDVPEGGETMFP-FENGLNMDGSYYYEKCIGLKVKPRKGDGL 261
Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDDCV 184
LF+SL P+ + D SLHGSCPVI+GEKW ATKWI + D D+D V
Sbjct: 262 LFYSLFPNGTIDPMSLHGSCPVIKGEKWVATKWIRDQVLD-------DEDTV 306
>gi|384250599|gb|EIE24078.1| hypothetical protein COCSUDRAFT_47131 [Coccomyxa subellipsoidea
C-169]
Length = 327
Score = 139 bits (349), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 70/174 (40%), Positives = 100/174 (57%), Gaps = 8/174 (4%)
Query: 8 GKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
G VRTS G F+S+ D ++A +E + A T LP +GE +L Y+ GQ Y+
Sbjct: 153 GPQETENVRTSQGTFMSRKDDPAGVIAWVEEKAAQVTGLPVSHGEPFNVLRYQDGQHYDS 212
Query: 66 HFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFP-----NSEVSQSRDGNWSECARR 120
H+D F + R+AT+L YL+ VE+GGET+FP ++ + N+ C
Sbjct: 213 HYDIFEPESYGPQPSQRMATILFYLTDVEEGGETIFPLEGRYGPDLLKMTGFNYKSCTT- 271
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
G+ KP GDAL+F+S+HP+ + D +LHG CPV+ GEKW ATKWI + F P
Sbjct: 272 GFKYKPRMGDALMFYSMHPNGTFDKHALHGGCPVMAGEKWVATKWIRDKCFTPP 325
>gi|224071291|ref|XP_002303388.1| predicted protein [Populus trichocarpa]
gi|222840820|gb|EEE78367.1| predicted protein [Populus trichocarpa]
Length = 297
Score = 138 bits (347), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 73/156 (46%), Positives = 99/156 (63%), Gaps = 8/156 (5%)
Query: 15 VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+RTSSGMF+ ++D+ ++ IE +IA T +P +GEA +L YE GQKY+ H+D F
Sbjct: 140 IRTSSGMFVFSSEDQAGVLQVIEEKIARATMIPSTHGEAFNVLRYEIGQKYDAHYDAFNP 199
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFP--NSEVSQSRDGNWSECARRGYAVKPMKGD 130
R+AT L+YLS+ E+GGET FP N E + D +C G VKP +GD
Sbjct: 200 AEYGPQTSQRVATFLLYLSNFEEGGETTFPIENDENFEGYDAQ--KC--NGLRVKPHQGD 255
Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
A+LF+S+ P+ + D SLH SC VI+GEKW ATKWI
Sbjct: 256 AILFYSIFPNNTIDPASLHASCHVIKGEKWVATKWI 291
>gi|224056224|ref|XP_002298763.1| predicted protein [Populus trichocarpa]
gi|222846021|gb|EEE83568.1| predicted protein [Populus trichocarpa]
Length = 175
Score = 137 bits (346), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 72/154 (46%), Positives = 95/154 (61%), Gaps = 5/154 (3%)
Query: 17 TSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM 74
T+ F+ ++D+ + IE +IA T +P +GEA IL YE GQKY+ H+D F
Sbjct: 18 TTESTFIGGSEDKTGTLDFIERKIAKATMIPQSHGEAFNILRYEIGQKYDSHYDAFNPDE 77
Query: 75 NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGDALL 133
R+A+ L+YLS VE+GGET+FP S G + +C G VKP +GD LL
Sbjct: 78 YGPQPSQRVASFLLYLSSVEEGGETMFPFENGSAVSSGFEYKQCV--GLKVKPRQGDGLL 135
Query: 134 FFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
F+SL P+ + D TSLHGSCPVI+GEKW ATKWI
Sbjct: 136 FYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 169
>gi|412994121|emb|CCO14632.1| predicted protein [Bathycoccus prasinos]
Length = 341
Score = 137 bits (344), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 70/158 (44%), Positives = 100/158 (63%), Gaps = 6/158 (3%)
Query: 15 VRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+RTSSG FL+ ++ + +E ++A T +P +GEA IL YE GQKY+ H+D F
Sbjct: 179 IRTSSGTFLTSKMEQSGALKRVEEKMARATHIPATHGEAYNILRYEIGQKYDSHYDMFDP 238
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFP---NSEVSQSRDGNWSECARRGYAVKPMKG 129
R+A+ L+YL+ ++GGETVFP + + + R +++ C G VKP KG
Sbjct: 239 SQYGPQRSQRVASFLLYLTTPDEGGETVFPLEGQNGLYRLRGIDYTSC-EAGLKVKPRKG 297
Query: 130 DALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
DALLF+S+HP+ + D +SLHG CPVI G K+ ATKWIH
Sbjct: 298 DALLFWSVHPNNTFDRSSLHGGCPVISGTKFVATKWIH 335
>gi|299532490|ref|ZP_07045880.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
gi|298719437|gb|EFI60404.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
Length = 299
Score = 135 bits (341), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 68/176 (38%), Positives = 99/176 (56%), Gaps = 23/176 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+ DN+SG ++ RTS+GMF + ++E+++ +E RIA P ENGE MQ+LHY G
Sbjct: 141 LTVDNQSGGEAVNDDRTSNGMFFQRGENELISLVEQRIARLLNWPLENGEGMQVLHYRPG 200
Query: 61 QKYEPHFDFFRDK-----MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
+Y+PH+D+F + GG R+ T++MYL+ +GG T FP+
Sbjct: 201 AEYKPHYDYFAPNEPGTPTILKRGGQRVGTLVMYLNEPARGGATTFPDV----------- 249
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G V P +G+A+ F PD +T +LHG PV+EGEKW ATKW+ R F
Sbjct: 250 -----GLQVVPRRGNAVFFSYNRPDPATK--TLHGGAPVLEGEKWIATKWLREREF 298
>gi|171059332|ref|YP_001791681.1| procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
gi|170776777|gb|ACB34916.1| Procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
Length = 287
Score = 135 bits (340), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 72/174 (41%), Positives = 101/174 (58%), Gaps = 23/174 (13%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
DN++G S +E RTS GMF + + E+++ IEARIAA P ENGE +Q+LHY G +Y
Sbjct: 132 DNDTGGSEVNEARTSQGMFFMRGEGELISRIEARIAALLDWPLENGEGVQVLHYRPGAEY 191
Query: 64 EPHFDFFRDKMN-----QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
+PH+D+F + GG R+ T++MYL+ E+GG T FP+ +
Sbjct: 192 KPHYDYFDPAQPGTPTILKRGGQRVGTLVMYLNTPERGGGTTFPDVNLE----------- 240
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
V P+KG+A +FFS + A + SLHG PV+ GEKW ATKW+ FD
Sbjct: 241 -----VAPIKGNA-VFFS-YERAHPSTRSLHGGAPVLAGEKWVATKWLRQARFD 287
>gi|339327280|ref|YP_004686973.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
gi|338167437|gb|AEI78492.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
Length = 297
Score = 134 bits (338), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/175 (41%), Positives = 100/175 (57%), Gaps = 21/175 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G + RTS G A+ ++A IEARIAA T +P E+GE +QIL+Y+ G
Sbjct: 135 VVNPDTGDENLIDARTSMGAMFQVAEHALIARIEARIAAVTGVPAEHGEGLQILNYKPGG 194
Query: 62 KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PHFD+F R +QL GG RIAT+++YL+ E GG T FP
Sbjct: 195 EYQPHFDYFNPQRPGEARQLSVGGQRIATLVIYLNTPEAGGATAFP-------------- 240
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
R G V P+KG+A+ F L PD + D +LH PV GEKW ATKW+ R +
Sbjct: 241 --RVGLEVAPVKGNAVYFSYLLPDGTLDERTLHAGLPVASGEKWIATKWLRERPY 293
>gi|221068712|ref|ZP_03544817.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
gi|220713735|gb|EED69103.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
Length = 299
Score = 134 bits (337), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 67/176 (38%), Positives = 98/176 (55%), Gaps = 23/176 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+ DN+SG ++ RTS+GMF + +++++ +E RIA P ENGE MQ+LHY G
Sbjct: 141 LTVDNQSGGEAVNDDRTSNGMFFQRGENDLICRVEQRIARLLNWPLENGEGMQVLHYRPG 200
Query: 61 QKYEPHFDFFRDK-----MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
+Y+PH+D+F + GG R+ T++MYL+ +GG T FP+
Sbjct: 201 AEYKPHYDYFAPNEPGTPTILKRGGQRVGTLVMYLNEPARGGATTFPDV----------- 249
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G V P +G+A+ F PD +T +LHG PV+EGEKW ATKW+ R F
Sbjct: 250 -----GLQVVPRRGNAVFFSYNRPDPATK--TLHGGAPVLEGEKWIATKWLREREF 298
>gi|264677094|ref|YP_003277000.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
gi|262207606|gb|ACY31704.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
Length = 306
Score = 133 bits (335), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 66/176 (37%), Positives = 99/176 (56%), Gaps = 23/176 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+ DN+SG ++ RTS+GMF + ++++++ +E RIA P ENGE MQ+LHY G
Sbjct: 148 LTVDNQSGGEAVNDDRTSNGMFFQRGENDLISLVEQRIARLLNWPLENGEGMQVLHYRPG 207
Query: 61 QKYEPHFDFFRDK-----MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
+Y+PH+D+F + GG R+ T++MYL+ +GG T FP+
Sbjct: 208 AEYKPHYDYFAPNEPGTPTILKRGGQRVGTLVMYLNEPARGGATTFPDV----------- 256
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G + P +G+A+ F PD +T +LHG PV+EGEKW ATKW+ R F
Sbjct: 257 -----GLQIVPRRGNAVFFSYNRPDPATK--TLHGGAPVLEGEKWIATKWLREREF 305
>gi|30681957|ref|NP_850038.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|330252315|gb|AEC07409.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 274
Score = 133 bits (335), Expect = 5e-29, Method: Compositional matrix adjust.
Identities = 65/143 (45%), Positives = 92/143 (64%), Gaps = 1/143 (0%)
Query: 30 IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMY 89
++A+IE +IA T P + E+ IL Y+ GQKY+ H+D F L R+ T L++
Sbjct: 133 VLAAIEEKIALATRFPKDYYESFNILRYQLGQKYDSHYDAFHSAEYGPLISQRVVTFLLF 192
Query: 90 LSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLH 149
LS VE+GGET+FP E ++ +G + G VKP +GDA+ F++L P+ + D TSLH
Sbjct: 193 LSSVEEGGETMFP-FENGRNMNGRYDYEKCVGLKVKPRQGDAIFFYNLFPNGTIDQTSLH 251
Query: 150 GSCPVIEGEKWSATKWIHVRNFD 172
GSCPVI+GEKW ATKWI + +D
Sbjct: 252 GSCPVIKGEKWVATKWIRDQTYD 274
>gi|302844249|ref|XP_002953665.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300261074|gb|EFJ45289.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 245
Score = 133 bits (334), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 68/151 (45%), Positives = 92/151 (60%), Gaps = 6/151 (3%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G+ + E+RTS GMF+ + D ++ IE RI+ WT LP E+ E +Q+L Y HGQ Y H+
Sbjct: 96 GEGVVDEIRTSYGMFIRRLADPVITRIEKRISLWTHLPIEHQEDIQVLRYAHGQTYGAHY 155
Query: 68 DFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV-----SQSRDGNWSECARRGY 122
D DK N+ R+AT LMYLS VE+GGET FP + V R G SECA+
Sbjct: 156 D-SGDKSNEPGPKWRLATFLMYLSDVEEGGETAFPQNSVWYDPTIPERIGPVSECAKGHV 214
Query: 123 AVKPMKGDALLFFSLHPDASTDSTSLHGSCP 153
A KP GDA+LF+S +P+ + D ++H CP
Sbjct: 215 AAKPKAGDAVLFYSFYPNLTMDPAAMHTGCP 245
>gi|418530659|ref|ZP_13096582.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
gi|371452378|gb|EHN65407.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
Length = 299
Score = 133 bits (334), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 66/176 (37%), Positives = 99/176 (56%), Gaps = 23/176 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+ DN+SG ++ RTS+GMF + ++++++ +E RIA P ENGE MQ+LHY G
Sbjct: 141 LTVDNQSGGEAVNDDRTSNGMFFQRGENDLISRVEQRIARLLNWPLENGEGMQVLHYRPG 200
Query: 61 QKYEPHFDFFRDK-----MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
+Y+PH+D+F + GG R+ T++MYL+ +GG T FP+
Sbjct: 201 AEYKPHYDYFAPNEPGTPTILKRGGQRVGTLVMYLNEPARGGATTFPDV----------- 249
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G V P +G+A+ F P+ +T +LHG PV+EGEKW ATKW+ R F
Sbjct: 250 -----GLQVVPRRGNAVFFSYNRPEPATK--TLHGGAPVLEGEKWIATKWLREREF 298
>gi|194290782|ref|YP_002006689.1| prolyl 4-hydroxylase subunit alpha [Cupriavidus taiwanensis LMG
19424]
gi|193224617|emb|CAQ70628.1| putative Prolyl 4-hydroxylase alpha subunit [Cupriavidus
taiwanensis LMG 19424]
Length = 296
Score = 133 bits (334), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 72/175 (41%), Positives = 100/175 (57%), Gaps = 21/175 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G + RTS G A+ ++A IEARIAA T +P ++GE +QIL+Y+ G
Sbjct: 134 VVNPDTGDENLIDARTSMGAMFQVAEHALIARIEARIAAVTGVPADHGEGLQILNYKPGG 193
Query: 62 KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PHFD+F R +QL GG RIAT+++YL+ E GG T FP
Sbjct: 194 EYQPHFDYFNPQRPGEARQLSVGGQRIATLVIYLNTPEAGGATAFP-------------- 239
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
R G V P+KG+A+ F L PD + D +LH PV GEKW ATKW+ R +
Sbjct: 240 --RVGLEVAPVKGNAVYFSYLLPDGTLDDRTLHAGLPVAAGEKWIATKWLRERPY 292
>gi|428170517|gb|EKX39441.1| hypothetical protein GUITHDRAFT_114401 [Guillardia theta CCMP2712]
Length = 322
Score = 133 bits (334), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 80/237 (33%), Positives = 119/237 (50%), Gaps = 38/237 (16%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
++ + K + S RT+ +L QD++V +E +IA T PE GE +Q+LHY
Sbjct: 112 LITPYGTNKLVESTTRTNKQAWLDFQQDDVVKRVEDKIAKLTKTTPEQGENLQVLHYAKS 171
Query: 61 QKYEPHFDFFRDKM----NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
Q++ H D+F N + GG+R+ TV++YL E+GGET F + +
Sbjct: 172 QQFTEHHDYFDPATDPPENYEKGGNRLITVIVYLQAAEEGGETHFGAANLK--------- 222
Query: 117 CARRGYAVKPMKGDALLFFSLH------PDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
+ KGDA++F++L D +LH P I+GEKW ATKWIH R
Sbjct: 223 -------LTAAKGDAVMFYNLKHGCDGIDPTCVDKQTLHAGLPPIKGEKWVATKWIHERG 275
Query: 171 FDKPEKEPEDDDCVDEDLNCVVWA--KAGECKKNPLYMVGSKSSRGYCRKSCKVCKP 225
+ + C D+ C WA ECK NP++M SK+ CR+SCK+C+P
Sbjct: 276 Y----QSETSGGCFDKHPKCTYWAGKTPTECKLNPVWM--SKN----CRRSCKICQP 322
>gi|340787855|ref|YP_004753320.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
[Collimonas fungivorans Ter331]
gi|340553122|gb|AEK62497.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit
[Collimonas fungivorans Ter331]
Length = 289
Score = 132 bits (332), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 67/170 (39%), Positives = 96/170 (56%), Gaps = 21/170 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+++G + E RTSSG F + +A I+ R+AA +P +GE +QIL+Y+ G
Sbjct: 130 VVDHQTGNTKLHEHRTSSGTFFHRGTTPFIAMIDKRLAALMQVPESHGEGLQILNYQMGG 189
Query: 62 KYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y PH+D+FR + GG R AT+++YL+ V+ GGET+FP
Sbjct: 190 EYRPHYDYFRPDAPGSAKHLARGGQRTATLIIYLNDVDGGGETIFP-------------- 235
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
R G ++ P KG A+ F + + DS S HG PVIEGEKW ATKW+
Sbjct: 236 --RNGLSIVPAKGSAIYFSYTNAENQLDSLSFHGGSPVIEGEKWIATKWV 283
>gi|319943342|ref|ZP_08017624.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
gi|319743157|gb|EFV95562.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
Length = 311
Score = 131 bits (330), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 71/177 (40%), Positives = 97/177 (54%), Gaps = 21/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ESG S S VR S G + ++E+V IEAR++A LP GE +QILHY G
Sbjct: 151 VVDRESGGSYESSVRKSEGSHFERGENELVRRIEARLSALVDLPVNRGEPLQILHYGPGG 210
Query: 62 KYEPHFDFFRDK-----MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+ H DFF K + ++GG RI TV+MYL+ V +GGET FP+
Sbjct: 211 EYKAHQDFFEPKDPGSAVLTRVGGQRIGTVVMYLNDVPEGGETAFPDI------------ 258
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
G++ KP+KG A+ F + D D LH PVI G+KW TKW+ R +++
Sbjct: 259 ----GFSAKPIKGSAVYFEYQNADGQLDYRCLHAGMPVIRGDKWIMTKWLRERPYEQ 311
>gi|113869198|ref|YP_727687.1| prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
gi|113527974|emb|CAJ94319.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
Length = 297
Score = 131 bits (329), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 72/175 (41%), Positives = 99/175 (56%), Gaps = 21/175 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G + RTS G A+ ++ IEARIAA T +P E+GE +QIL+Y+ G
Sbjct: 135 VVNPDTGDENLIDARTSMGAMFQVAEHPLITRIEARIAAVTGVPAEHGEGLQILNYKPGG 194
Query: 62 KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PHFD+F R +QL GG RIAT+++YL+ E GG T FP
Sbjct: 195 EYQPHFDYFNPQRPGEARQLSVGGQRIATLVIYLNTPEAGGATAFP-------------- 240
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
R G V P+KG+A+ F L PD + D +LH PV GEKW ATKW+ R +
Sbjct: 241 --RVGLEVAPVKGNAVYFSYLLPDGALDERTLHAGLPVAFGEKWIATKWLRERPY 293
>gi|344169181|emb|CCA81504.1| putative Prolyl 4-hydroxylase alpha subunit [blood disease
bacterium R229]
Length = 289
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 69/177 (38%), Positives = 98/177 (55%), Gaps = 21/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + E+G+ RTS G + ++A IEARIA T +P E+GE Q+LHY+ G
Sbjct: 127 VVNPETGEENLISARTSQGAMFQVGEHPLIARIEARIAQATGVPVEHGEGFQVLHYQPGG 186
Query: 62 KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PHFD+F R +QL GG R+AT+++YL+ V+ GG T FP
Sbjct: 187 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP-------------- 232
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
+ G V P+KG+A+ F PD + D +LH PV GEKW ATKW+ R + +
Sbjct: 233 --KLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRERPYRR 287
>gi|300690371|ref|YP_003751366.1| prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum PSI07]
gi|299077431|emb|CBJ50057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
PSI07]
Length = 289
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 69/177 (38%), Positives = 98/177 (55%), Gaps = 21/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + E+G+ RTS G + ++A IEARIA T +P E+GE Q+LHY+ G
Sbjct: 127 VVNPETGEENLISARTSQGAMFQVGEHPLIARIEARIAQATGVPVEHGEGFQVLHYQPGG 186
Query: 62 KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PHFD+F R +QL GG R+AT+++YL+ V+ GG T FP
Sbjct: 187 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP-------------- 232
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
+ G V P+KG+A+ F PD + D +LH PV GEKW ATKW+ R + +
Sbjct: 233 --KLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRERPYRR 287
>gi|344172475|emb|CCA85118.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
Length = 289
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 69/177 (38%), Positives = 98/177 (55%), Gaps = 21/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + E+G+ RTS G + ++A IEARIA T +P E+GE Q+LHY+ G
Sbjct: 127 VVNPETGEENLISARTSQGAMFQVGEHPLIARIEARIAQATGVPVEHGEGFQVLHYQPGG 186
Query: 62 KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PHFD+F R +QL GG R+AT+++YL+ V+ GG T FP
Sbjct: 187 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP-------------- 232
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
+ G V P+KG+A+ F PD + D +LH PV GEKW ATKW+ R + +
Sbjct: 233 --KLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRERPYRR 287
>gi|17547533|ref|NP_520935.1| hypothetical protein RSc2814 [Ralstonia solanacearum GMI1000]
gi|17429837|emb|CAD16521.1| putative prolyl 4-hydroxylase alpha subunit homologue
oxidoreductase protein [Ralstonia solanacearum GMI1000]
Length = 289
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 70/177 (39%), Positives = 97/177 (54%), Gaps = 21/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + E+G+ RTS G + +VA IEARIA T +P E+GE Q+LHY+ G
Sbjct: 127 VVNPETGEENLISARTSQGAMFQVGEHPLVARIEARIAQATGVPVEHGEGFQVLHYQPGG 186
Query: 62 KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PHFD+F R +QL GG R+AT+++YL+ V GG T FP
Sbjct: 187 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVPAGGATGFP-------------- 232
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
+ G V P+KG+A+ F PD + D +LH PV GEKW ATKW+ R + +
Sbjct: 233 --KLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRERPYRR 287
>gi|386332363|ref|YP_006028532.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
gi|334194811|gb|AEG67996.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
Length = 292
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 70/177 (39%), Positives = 97/177 (54%), Gaps = 21/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + E+G+ RTS G + +VA IEARIA T +P E+GE Q+LHY G
Sbjct: 130 VVNPETGEENLISARTSEGAMFQVGEHPLVARIEARIAQATGVPVEHGEGFQVLHYHPGG 189
Query: 62 KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PHFD+F R +QL GG R+AT+++YL+ V+ GG T FP
Sbjct: 190 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP-------------- 235
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
+ G V P+KG+A+ F PD + D +LH PV GEKW ATKW+ R + +
Sbjct: 236 --KLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRERPYRR 290
>gi|83746819|ref|ZP_00943867.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
gi|83726588|gb|EAP73718.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
Length = 289
Score = 130 bits (328), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 70/177 (39%), Positives = 97/177 (54%), Gaps = 21/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + E+G+ RTS G + +VA IEARIA T +P E+GE Q+LHY G
Sbjct: 127 VVNPETGEENLISARTSEGAMFQVGEHPLVARIEARIAQATGVPVEHGEGFQVLHYHPGG 186
Query: 62 KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PHFD+F R +QL GG R+AT+++YL+ V+ GG T FP
Sbjct: 187 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP-------------- 232
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
+ G V P+KG+A+ F PD + D +LH PV GEKW ATKW+ R + +
Sbjct: 233 --KLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRERPYRR 287
>gi|207744371|ref|YP_002260763.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum IPO1609]
gi|206595776|emb|CAQ62703.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum IPO1609]
Length = 280
Score = 130 bits (328), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 70/177 (39%), Positives = 97/177 (54%), Gaps = 21/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + E+G+ RTS G + +VA IEARIA T +P E+GE Q+LHY G
Sbjct: 118 VVNPETGEENLISARTSEGAMFQVGEHPLVARIEARIAQATGVPVEHGEGFQVLHYHPGG 177
Query: 62 KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PHFD+F R +QL GG R+AT+++YL+ V+ GG T FP
Sbjct: 178 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP-------------- 223
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
+ G V P+KG+A+ F PD + D +LH PV GEKW ATKW+ R + +
Sbjct: 224 --KLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRERPYRR 278
>gi|159481038|ref|XP_001698589.1| predicted protein [Chlamydomonas reinhardtii]
gi|158282329|gb|EDP08082.1| predicted protein [Chlamydomonas reinhardtii]
Length = 258
Score = 130 bits (327), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 83/227 (36%), Positives = 112/227 (49%), Gaps = 51/227 (22%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+V D+++G+S ++RTS G + +D ++A++E RIA WT LPPE GE MQIL Y G
Sbjct: 44 LVVDSKTGQSKLDDIRTSYGAAFGRGEDPVIAAVEERIAEWTHLPPEYGEPMQILRYVDG 103
Query: 61 QKYEPHFDFFRDKMNQQL---GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
QKY+ H+D+F D ++ G+R ATVL+YLS VE GGET P ++
Sbjct: 104 QKYDAHWDWFDDPVHHAAYLHEGNRYATVLLYLSGVEGGGETNLPLAD------------ 151
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF-DKPEK 176
P+ +A +G KW+ATKWIH + + K +
Sbjct: 152 --------PIDKEA------------------------QGMKWTATKWIHNKPYMGKYDP 179
Query: 177 EPEDDDCVDEDLNCVVWAKAGECKKNPLYMVGSKSSRGYCRKSCKVC 223
C D NC A AGEC N MVG G CRKSC C
Sbjct: 180 LRTAGRCADTGGNCAARAAAGECTSNMDKMVGPA---GECRKSCNDC 223
>gi|91779740|ref|YP_554948.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
gi|91692400|gb|ABE35598.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
Length = 296
Score = 130 bits (327), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 69/175 (39%), Positives = 97/175 (55%), Gaps = 21/175 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D +G+++ + R+S GMF + ++A +EARIA T LP ENGE +Q+LHYE G
Sbjct: 132 VVDPVTGRNVVAGHRSSDGMFFRLGETPLIARLEARIAELTGLPVENGEGLQLLHYEAGA 191
Query: 62 KYEPHFDFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+ PH D+ ++ + G R+ T+LMYL+ VE GGET+FP +
Sbjct: 192 ESTPHVDYLIAGNPANRESIARSGQRVGTLLMYLNDVEGGGETMFPQT------------ 239
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G++V P +G AL F + D +SLH S P+ GEKW ATKWI R F
Sbjct: 240 ----GWSVVPRRGQALYFEYGNRFGLADPSSLHTSTPLRAGEKWVATKWIRTRRF 290
>gi|385206010|ref|ZP_10032880.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
gi|385185901|gb|EIF35175.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
Length = 296
Score = 130 bits (327), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 71/175 (40%), Positives = 98/175 (56%), Gaps = 21/175 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D +G+++ + R+S GMF + ++A +EARIA T LP ENGE +Q+LHYE G
Sbjct: 132 VVDPVTGRNVVAGHRSSDGMFFRLGETPLIARLEARIAELTGLPVENGEGLQLLHYEVGA 191
Query: 62 KYEPHFDFF--RDKMNQQ---LGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+ PH D+ + NQ+ G R+ T+LMYL+ VE GGET+FP +
Sbjct: 192 ESTPHVDYLIAGNPANQESIARSGQRVGTLLMYLNDVEGGGETMFPQT------------ 239
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G++V P +G AL F + D +SLH S P+ GEKW ATKWI R F
Sbjct: 240 ----GWSVVPRRGQALYFEYGNRFGLADPSSLHTSTPLRVGEKWVATKWIRTRRF 290
>gi|413963357|ref|ZP_11402584.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
gi|413929189|gb|EKS68477.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
Length = 286
Score = 130 bits (327), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 71/175 (40%), Positives = 97/175 (55%), Gaps = 21/175 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D +SGK I E R S G F++ + D +V +I+ RIA P ENGE + IL Y G
Sbjct: 124 VVDPDSGKEITIEERRSEGAFVNASTDALVETIDRRIAELFRQPVENGEDLHILRYGMGG 183
Query: 62 KYEPHFDFFRD-----KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y PH+D+F + K + Q GG RIATV++YL+ VE+GG+T FP+
Sbjct: 184 EYRPHYDYFPEEQAGSKHHMQRGGQRIATVILYLNEVEQGGDTTFPDI------------ 231
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G A+ P +G AL F ++ +D +LH PV +GEKW ATKWI F
Sbjct: 232 ----GLAIHPRRGSALYFEYVNELGQSDPKTLHAGTPVEKGEKWIATKWIRRGRF 282
>gi|421890664|ref|ZP_16321519.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
K60-1]
gi|378964031|emb|CCF98267.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
K60-1]
Length = 288
Score = 130 bits (327), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 70/177 (39%), Positives = 97/177 (54%), Gaps = 21/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + E+G+ RTS G + +VA IEARIA T +P E+GE Q+LHY G
Sbjct: 126 VVNPETGEENLISARTSEGAMFQVGEHPLVARIEARIAQATGVPVEHGEGFQVLHYHPGG 185
Query: 62 KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PHFD+F R +QL GG R+AT+++YL+ V+ GG T FP
Sbjct: 186 EYQPHFDYFNPGRSGEARQLDVGGQRVATLVIYLNSVQAGGATGFP-------------- 231
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
+ G V P+KG+A+ F PD + D +LH PV GEKW ATKW+ R + +
Sbjct: 232 --KLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRERPYRR 286
>gi|300702992|ref|YP_003744594.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum
CFBP2957]
gi|299070655|emb|CBJ41950.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
CFBP2957]
Length = 289
Score = 130 bits (327), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 70/177 (39%), Positives = 97/177 (54%), Gaps = 21/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + E+G+ RTS G + +VA IEARIA T +P E+GE Q+LHY G
Sbjct: 127 VVNPETGEENLISARTSEGAMFQVGEHPLVARIEARIAQATGVPVEHGEGFQVLHYHPGG 186
Query: 62 KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PHFD+F R +QL GG R+AT+++YL+ V+ GG T FP
Sbjct: 187 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFP-------------- 232
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
+ G V P+KG+A+ F PD + D +LH PV GEKW ATKW+ R + +
Sbjct: 233 --KLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKWIATKWLRERPYRR 287
>gi|124267278|ref|YP_001021282.1| hypothetical protein Mpe_A2091 [Methylibium petroleiphilum PM1]
gi|124260053|gb|ABM95047.1| conserved hypothetical protein [Methylibium petroleiphilum PM1]
Length = 289
Score = 130 bits (326), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 70/174 (40%), Positives = 94/174 (54%), Gaps = 23/174 (13%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
D +G S + RTS GMF ++ + + A EARIAA P ENGE +Q+LHY G +Y
Sbjct: 134 DTATGASEVNAARTSDGMFFTRGEHPVCARFEARIAALLNWPVENGEGLQVLHYRPGAEY 193
Query: 64 EPHFDFFRDKMN-----QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
+PH+D+F + GG R+AT++ YL+ +GG T FP+
Sbjct: 194 KPHYDYFDPDQPGTPAVLRRGGQRVATLVTYLNTPTRGGGTTFPDI-------------- 239
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
G V P+KG A+ F P ST SLHG PV+EG+KW ATKW+ V FD
Sbjct: 240 --GLEVTPLKGHAVFFSYDRPHPST--RSLHGGAPVLEGDKWVATKWLRVGRFD 289
>gi|337280547|ref|YP_004620019.1| hypothetical protein Rta_28970 [Ramlibacter tataouinensis TTB310]
gi|334731624|gb|AEG94000.1| conserved hypothetical protein [Ramlibacter tataouinensis TTB310]
Length = 286
Score = 129 bits (325), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 69/177 (38%), Positives = 95/177 (53%), Gaps = 23/177 (12%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+ ++G +E RTSSGMF + ++E+VA IEARIA P ENGE +Q+LHY G
Sbjct: 128 LTVATKTGGEEVNEDRTSSGMFFQRGENELVARIEARIARLVNWPVENGEGLQVLHYRPG 187
Query: 61 QKYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
+Y+PH+D+F + GG R+ T++MYL EKGG T FP+ +
Sbjct: 188 AEYKPHYDYFDPAEPGTPTILKRGGQRVGTLVMYLGEPEKGGGTTFPDVHLE-------- 239
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
V P +G + F P ST +LHG PV+ GEKW ATKW+ R F+
Sbjct: 240 --------VAPKRGHGVFFSYERPHPST--RTLHGGAPVLAGEKWIATKWLRERRFE 286
>gi|319652187|ref|ZP_08006306.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
gi|317396176|gb|EFV76895.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
Length = 283
Score = 129 bits (325), Expect = 7e-28, Method: Compositional matrix adjust.
Identities = 72/172 (41%), Positives = 94/172 (54%), Gaps = 17/172 (9%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+V D SG+ A RTS M ++E+V IE RIA T P ENGE +QIL+Y G
Sbjct: 125 LVVDRGSGEERAGSGRTSKSMAFRLKENELVERIETRIAELTGYPAENGEGLQILNYGLG 184
Query: 61 QKYEPHFDFFRDKM-NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
++Y+PHFDFF M + GG R+ T L+YL+ VE GGETVF ++
Sbjct: 185 EEYKPHFDFFPPHMADASKGGQRVGTFLIYLNDVEDGGETVF----------------SK 228
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G + P KG A+ F + D S+H S PV +GEKW+ATKWI N
Sbjct: 229 AGLSFVPKKGAAIYFHYGNAQGQLDRLSVHSSVPVRKGEKWAATKWIRESNI 280
>gi|297825201|ref|XP_002880483.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297326322|gb|EFH56742.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 272
Score = 129 bits (325), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 64/143 (44%), Positives = 92/143 (64%), Gaps = 1/143 (0%)
Query: 30 IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMY 89
I+A+IE +IA T +P + E+ IL Y+ GQKY+ H+D F R+ T +++
Sbjct: 131 ILAAIEEKIALATRIPIDYYESFNILRYQLGQKYDSHYDAFHPAEYGPQISQRVVTFILF 190
Query: 90 LSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLH 149
LS VE+GGET+FP E ++ +G + G VKP +GDA+ F++L P+ + D TSLH
Sbjct: 191 LSSVEEGGETMFP-FENGRNMNGRYDYETCIGLRVKPRQGDAIFFYNLLPNRTIDQTSLH 249
Query: 150 GSCPVIEGEKWSATKWIHVRNFD 172
GSCPVI+GEKW ATKWI + +D
Sbjct: 250 GSCPVIKGEKWVATKWIRDQTYD 272
>gi|421895470|ref|ZP_16325871.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum MolK2]
gi|206586635|emb|CAQ17221.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum MolK2]
Length = 283
Score = 129 bits (324), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 67/177 (37%), Positives = 95/177 (53%), Gaps = 21/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + E+G+ RTS G + +VA IEARIA T +P E+GE Q+LHY G
Sbjct: 121 VVNPETGEENLISARTSEGAMFQVGEHPLVARIEARIAQATGVPVEHGEGFQVLHYHPGG 180
Query: 62 KYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PHFD+F + ++GG R+AT+++YL+ V+ GG T FP
Sbjct: 181 EYQPHFDYFNPGRGGEARQLEVGGQRVATLVIYLNSVQAGGATGFP-------------- 226
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
+ G V P+KG+A+ F PD D +LH PV GEKW ATKW+ R + +
Sbjct: 227 --KLGLEVAPVKGNAVFFVYKRPDGMLDDNTLHAGLPVERGEKWIATKWLRERPYRR 281
>gi|299065638|emb|CBJ36810.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
CMR15]
Length = 289
Score = 129 bits (324), Expect = 9e-28, Method: Compositional matrix adjust.
Identities = 69/177 (38%), Positives = 97/177 (54%), Gaps = 21/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + E+G+ RTS G + ++A IEARIA T +P E+GE Q+LHY+ G
Sbjct: 127 VVNPETGEENLISARTSQGAMFQVGEHPLIARIEARIAQATGVPVEHGEGFQVLHYQPGG 186
Query: 62 KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PHFD+F R +QL GG R+AT+++YL+ V GG T FP
Sbjct: 187 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVPAGGATGFP-------------- 232
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
+ G V P+KG+A+ F PD + D +LH PV GEKW ATKW+ R + +
Sbjct: 233 --KLGLEVAPVKGNAVFFVYKRPDGTLDDKTLHAGLPVERGEKWIATKWLRERPYRR 287
>gi|388567209|ref|ZP_10153646.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
gi|388265592|gb|EIK91145.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
Length = 296
Score = 129 bits (323), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 68/161 (42%), Positives = 90/161 (55%), Gaps = 23/161 (14%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK-- 73
RTSSGMF ++ Q V ++E RIA P ENGE +Q+LHY G +Y+PH+D+F K
Sbjct: 153 RTSSGMFFTRGQTPEVTAVERRIARLVGWPVENGEGLQVLHYRPGAEYKPHYDYFDPKEA 212
Query: 74 ---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
+ GG R+AT++MYL+ +GG T FP+ G V P+KG
Sbjct: 213 GTPTILKRGGQRVATLVMYLNEPARGGGTTFPDV----------------GLEVAPVKGS 256
Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
A+ F P +T SLHG PV+EGEKW ATKW+ R F
Sbjct: 257 AVFFSYDRPHPTTR--SLHGGAPVLEGEKWVATKWLREREF 295
>gi|299115886|emb|CBN75895.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Ectocarpus siliculosus]
Length = 404
Score = 127 bits (320), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 65/168 (38%), Positives = 102/168 (60%), Gaps = 6/168 (3%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
+ D++ GK + RTS+ F+ +D ++ I+ R+ +T +P + E +Q+L Y+ GQ
Sbjct: 232 LMDHDKGKP-DTNWRTSTTYFMPSTRDPLLQGIDRRVEEFTRVPKSHQEQVQVLKYDKGQ 290
Query: 62 KYEPHFDFFRDKMNQQLGG---HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
+Y H DF ++ + + G +R+ TV YLS VE+GGET+FP R ++S+C
Sbjct: 291 RYTAHHDFLDERTMRNMDGGRKNRMITVFWYLSDVEEGGETIFPRYGGRTGRV-DFSDCT 349
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
G VKP++G +F+SL PD D SLHG+CPVI G+KW+A KW+
Sbjct: 350 T-GLKVKPVEGKVAMFYSLKPDGQFDDFSLHGACPVITGQKWAANKWV 396
>gi|377811809|ref|YP_005044249.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
gi|357941170|gb|AET94726.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
Length = 283
Score = 127 bits (318), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 68/175 (38%), Positives = 95/175 (54%), Gaps = 21/175 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D +SG + + R S G F++ + D +VA+I+ RIA P ENGE + IL Y G
Sbjct: 121 VVDPDSGGEVLIDARKSEGAFVNGSTDPLVATIDRRIAELVQQPVENGEDLHILRYGAGG 180
Query: 62 KYEPHFDFFRD-----KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y PHFD+F + K + Q GG RIAT+++YL+ VE+GG+T FP+
Sbjct: 181 EYRPHFDYFPEEQAGSKHHMQRGGQRIATLILYLNQVEEGGDTTFPDI------------ 228
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G + P +G AL F ++ TD +LH PV GEKW ATKW+ F
Sbjct: 229 ----GLTIHPRRGAALYFEYVNALGQTDPRTLHAGMPVERGEKWIATKWMRRGRF 279
>gi|219121927|ref|XP_002181308.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407294|gb|EEC47231.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 226
Score = 127 bits (318), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 71/175 (40%), Positives = 103/175 (58%), Gaps = 14/175 (8%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
+ D + G+ AS+ RTS F+ D I+ I+ R A+ +P + E +Q+L Y+ +
Sbjct: 46 LMDKDQGRP-ASDFRTSQSAFIRAHDDAILTDIDYRTASLVRIPRRHQEDVQVLRYDVTE 104
Query: 62 KYEPHFDFF------RDKMNQQL--GGHR--IATVLMYLSHVEKGGETVFPNSEVSQSRD 111
KY+ H D+F +DK L GHR +ATV YLS VEKGGETVFP +Q +
Sbjct: 105 KYDSHADYFDPALYTKDKRTLALIRNGHRNRMATVFWYLSDVEKGGETVFPRFNGAQ--E 162
Query: 112 GNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ +C + G VKP KG ++F+S+ PD + D SLHG+CPV +G KW+A KW+
Sbjct: 163 TSMKDC-KTGLKVKPEKGKVIIFYSMTPDGALDEYSLHGACPVQKGTKWAANKWV 216
>gi|147834798|emb|CAN75013.1| hypothetical protein VITISV_039948 [Vitis vinifera]
Length = 282
Score = 127 bits (318), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 74/188 (39%), Positives = 101/188 (53%), Gaps = 38/188 (20%)
Query: 15 VRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGE---------------------- 50
+R SG+F+S ++D+ + IE +IA +P +GE
Sbjct: 90 IRLCSGVFISASEDKTGTLDLIEQKIARVIMIPRTHGEIKPKENCLNWLGQVPPFEFVVM 149
Query: 51 -----------AMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGET 99
A IL YE GQ+Y H+D F HRIAT L+YLS VE+GGET
Sbjct: 150 KRFLTDVVYHVAFNILRYEIGQRYNSHYDAFDPAEYGPQKSHRIATFLVYLSDVEEGGET 209
Query: 100 VFP-NSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGE 158
+FP + ++ +D ++ C G VKP +GD LLF+S+ P+ + D TSLHGSCPVI+GE
Sbjct: 210 MFPFENGLNMDKDYDFQRCI--GLKVKPHQGDGLLFYSMFPNGTIDPTSLHGSCPVIKGE 267
Query: 159 KWSATKWI 166
KW ATKWI
Sbjct: 268 KWVATKWI 275
>gi|403238305|ref|ZP_10916891.1| procollagen-proline dioxygenase [Bacillus sp. 10403023]
Length = 296
Score = 127 bits (318), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 69/173 (39%), Positives = 95/173 (54%), Gaps = 17/173 (9%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G+ A+ RTS GM ++E + +E RIA P ENGE +Q+L+Y G+
Sbjct: 139 VIDPKTGEEKAATGRTSKGMSFYLQENEFIKKVEKRIAELIEFPVENGEGLQVLNYGIGE 198
Query: 62 KYEPHFDFF-RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
+Y+ HFD+F + K+ + GG R+ T L+YL+ V GGETVFP +
Sbjct: 199 EYKSHFDYFPQSKVVPEKGGQRVGTFLIYLNDVPAGGETVFPKA---------------- 242
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
G ++ P KG A+ F + D SLH S PV EGEKW ATKWI N K
Sbjct: 243 GVSIVPKKGSAVYFQYGNSKGEVDRMSLHSSIPVSEGEKWVATKWIRQENIYK 295
>gi|428182311|gb|EKX51172.1| hypothetical protein GUITHDRAFT_92735 [Guillardia theta CCMP2712]
Length = 190
Score = 126 bits (317), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 75/180 (41%), Positives = 100/180 (55%), Gaps = 18/180 (10%)
Query: 3 ADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQK 62
A NE+ + S RTSS +LSK D +VA I R+A LP E E MQ+LHY Q
Sbjct: 9 AGNEAKNGVGS-ARTSSTAWLSKTADPLVAKIRTRVAELVKLPMELAEDMQVLHYSKNQH 67
Query: 63 YEPHFDFFRDKMNQQL----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
Y H DFF + + G +R TV YLS VE+GGETVFP + R ++++C+
Sbjct: 68 YWAHHDFFDPNIYRGFVTSPGQNRFITVFFYLSDVEEGGETVFPFANGDDRRVTDFADCS 127
Query: 119 RRGYAVKPMKGDALLFFSLH---------PD---ASTDSTSLHGSCPVIEGEKWSATKWI 166
RG VKP G+A++F+S+ PD + D SLHG C VI+G+KW+A WI
Sbjct: 128 -RGLKVKPKAGNAIIFYSMLAKRQQEICPPDDLGCNLDVRSLHGGCDVIKGDKWAANYWI 186
>gi|120609859|ref|YP_969537.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
gi|120588323|gb|ABM31763.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
Length = 309
Score = 126 bits (316), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 67/176 (38%), Positives = 98/176 (55%), Gaps = 24/176 (13%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VA G+ + + RTS+GMF + ++ +VA +EARIA P ENGE +Q+LHY G
Sbjct: 153 VATRTGGEEVNDD-RTSNGMFFQREENPVVARLEARIARLVNWPLENGEGLQVLHYRPGA 211
Query: 62 KYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PH+D+F + GG R+AT+++YL+ EKGG T FP+ +
Sbjct: 212 EYKPHYDYFDPAEPGTPTILRRGGQRVATIVIYLNDPEKGGGTTFPDVHLE--------- 262
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
V P +G+A+ F P ST +LHG PV+ G+KW ATKW+ R F+
Sbjct: 263 -------VAPRRGNAVFFSYERPHPST--RTLHGGAPVVAGDKWIATKWLRERRFE 309
>gi|209522122|ref|ZP_03270769.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
gi|209497434|gb|EDZ97642.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
Length = 296
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 69/175 (39%), Positives = 94/175 (53%), Gaps = 21/175 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D +G+++ + R+S GMF + ++ IEARIAA T P ENGE +Q+LHYE G
Sbjct: 132 VVDPVTGRNVVAGHRSSHGMFFRLGETPLIVRIEARIAALTGTPVENGEGLQMLHYEEGA 191
Query: 62 KYEPHFDFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+ PH D+ ++ + G R+ T+LMYL VE GGETVFP
Sbjct: 192 ESTPHVDYLITGNEANRESIARSGQRMGTLLMYLKDVEGGGETVFPQI------------ 239
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G++V P +G AL F + D +SLH S P+ G+KW ATKWI R F
Sbjct: 240 ----GWSVAPQRGHALYFEYGNRFGLCDPSSLHASTPLRVGDKWVATKWIRTRRF 290
>gi|295700439|ref|YP_003608332.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
gi|295439652|gb|ADG18821.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
Length = 296
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 69/175 (39%), Positives = 95/175 (54%), Gaps = 21/175 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D +G+ + + R+S GMF + ++A IEARIA T P ENGE +Q+LHYE G
Sbjct: 132 VVDPVTGRDVIATHRSSHGMFFRLGETPLIARIEARIAELTATPVENGEGLQMLHYEEGA 191
Query: 62 KYEPHFDFFR--DKMNQQ---LGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+ PH D+ ++ N++ G R+ T+LMYL VE GGETVFP
Sbjct: 192 ESTPHVDYLMTGNEANRESIARSGQRMGTLLMYLKDVEGGGETVFPQV------------ 239
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G+++ P +G AL F + D +SLH S P+ G+KW ATKWI R F
Sbjct: 240 ----GWSIVPQRGHALYFEYGNRYGMCDPSSLHASTPLRTGDKWVATKWIRTRRF 290
>gi|326316001|ref|YP_004233673.1| procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
ATCC 19860]
gi|323372837|gb|ADX45106.1| Procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
ATCC 19860]
Length = 298
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 67/176 (38%), Positives = 98/176 (55%), Gaps = 24/176 (13%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VA G+ + + RTS+GMF + ++ +VA +EARIA P ENGE +Q+LHY G
Sbjct: 142 VATRTGGEEVNDD-RTSNGMFFQREENPMVAKLEARIARLVNWPLENGEGLQVLHYRPGA 200
Query: 62 KYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PH+D+F + GG R+AT+++YL+ EKGG T FP+ +
Sbjct: 201 EYKPHYDYFDPTEPGTPTILRRGGQRVATIVIYLNDPEKGGGTTFPDVHLE--------- 251
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
V P +G+A+ F P ST +LHG PV+ G+KW ATKW+ R F+
Sbjct: 252 -------VAPRRGNAVFFSYERPHPST--RTLHGGAPVVAGDKWIATKWLRERRFE 298
>gi|241664232|ref|YP_002982592.1| procollagen-proline dioxygenase [Ralstonia pickettii 12D]
gi|309783051|ref|ZP_07677770.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
gi|404397139|ref|ZP_10988932.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
gi|240866259|gb|ACS63920.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12D]
gi|308918159|gb|EFP63837.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
gi|348610674|gb|EGY60360.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
Length = 288
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 67/177 (37%), Positives = 97/177 (54%), Gaps = 21/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G+ RTS G + ++A IEARIA +P E+GE Q+L+Y+ G
Sbjct: 126 VVNPDTGEENLISARTSQGGMFQVGEHPLIAKIEARIAQAVGVPVEHGEGFQVLNYQPGG 185
Query: 62 KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PHFDFF R +QL GG R+AT+++YL+ V+ GG T FP
Sbjct: 186 EYQPHFDFFNPGRSGEARQLEVGGQRVATMVIYLNSVQAGGATGFP-------------- 231
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
+ G V P+KG+A+ F PD + D +LH PV GEKW ATKW+ R + +
Sbjct: 232 --KLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVERGEKWIATKWLRERPYRR 286
>gi|319652240|ref|ZP_08006358.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
gi|317396063|gb|EFV76783.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
Length = 216
Score = 125 bits (314), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 62/153 (40%), Positives = 94/153 (61%), Gaps = 17/153 (11%)
Query: 14 EVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK 73
E+RTSS F + ++EIVA IE RI+ +P E+GE +QIL+Y+ GQ+Y+ HFDFF
Sbjct: 76 ELRTSSSTFFHEGENEIVARIEKRISQIMNIPVEHGEGLQILNYKIGQEYKAHFDFF-SS 134
Query: 74 MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALL 133
++ RI+T++MYL+ VE+GGET FP + ++V P KG A+
Sbjct: 135 TSRAASNPRISTLVMYLNDVEQGGETYFP----------------KLNFSVSPQKGMAVY 178
Query: 134 FFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
F + D + + +LHG PV+ G+KW+AT+W+
Sbjct: 179 FEYFYNDQNLNDLTLHGGAPVVMGDKWAATQWM 211
>gi|186474111|ref|YP_001861453.1| procollagen-proline dioxygenase [Burkholderia phymatum STM815]
gi|184196443|gb|ACC74407.1| Procollagen-proline dioxygenase [Burkholderia phymatum STM815]
Length = 305
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 65/171 (38%), Positives = 92/171 (53%), Gaps = 21/171 (12%)
Query: 6 ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
ESG+ ++RTS G + + +D + ++ RI+A P E+GE +QILHY G +Y P
Sbjct: 150 ESGREDVIQLRTSEGFWFQRCEDAFIERLDRRISALMNWPLEHGEGLQILHYTKGGEYRP 209
Query: 66 HFDFFRDK-----MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
HFD+F ++ GG R+AT+++YLS V GGETVFPN+
Sbjct: 210 HFDYFPPSQSGSVLHTSRGGQRVATLIVYLSDVAGGGETVFPNA---------------- 253
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G AV +G A+ F L+ D +LHG PV GEKW TKW+ R +
Sbjct: 254 GLAVMARQGGAIYFRYLNGHRQLDPLTLHGGAPVTNGEKWIMTKWMRERPY 304
>gi|372266874|ref|ZP_09502922.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
[Alteromonas sp. S89]
Length = 294
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 69/175 (39%), Positives = 93/175 (53%), Gaps = 21/175 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + + G RTS G ++ + ++A IEARIA+ +P +GE +QILHY
Sbjct: 133 VVNTQHGAFELKPSRTSGGTHFARGETPLIADIEARIASLLKVPEAHGEPLQILHYPVSG 192
Query: 62 KYEPHFDFFRDKM--NQQL---GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y PH+DFF + NQ++ GG R+ T++MYLS VE GG TVFP
Sbjct: 193 EYRPHYDFFDPEKPGNQEVLAAGGQRVGTLIMYLSDVESGGATVFP-------------- 238
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
R G V+P KG AL F + D SLHG PV+ GEKW ATKW+ +
Sbjct: 239 --RVGLEVQPQKGAALFFSYVGEHGKLDLQSLHGGSPVLAGEKWIATKWLRAAEY 291
>gi|149180354|ref|ZP_01858859.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
gi|148852546|gb|EDL66691.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
Length = 212
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 60/154 (38%), Positives = 94/154 (61%), Gaps = 21/154 (13%)
Query: 13 SEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+++RTSS F+ + + E+V +E RI+ +P ENGE +QIL+Y+ GQ+Y+ HFDFF++
Sbjct: 74 NDMRTSSSTFMEEGESEVVTRVEKRISQIMNIPYENGEGLQILNYKIGQEYKAHFDFFKN 133
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
N RI+T++MYL+ VE+GGET FP + ++V P KG A+
Sbjct: 134 ASNP-----RISTLVMYLNDVEEGGETYFP----------------KLNFSVSPQKGMAV 172
Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
F + + + +LHG PVI G+KW+AT+W+
Sbjct: 173 YFEYFYDNQELNDLTLHGGAPVIIGDKWAATQWM 206
>gi|357417854|ref|YP_004930874.1| procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
gi|355335432|gb|AER56833.1| Procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
Length = 283
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 66/176 (37%), Positives = 98/176 (55%), Gaps = 23/176 (13%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G+ + RTS GMF + + + A +EARIAA P ENGE +Q+L Y G
Sbjct: 126 VFDPDTGQDQQHQARTSEGMFFGRGANPLCARVEARIAALLNWPLENGEGLQVLRYGPGA 185
Query: 62 KYEPHFDFFRD-----KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+YEPH+D+F ++ + GG R+A++++YL+ +GG T FP++ +
Sbjct: 186 QYEPHYDYFDPARPGAEVALRRGGQRVASLVIYLNTPTQGGATTFPDAHLE--------- 236
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
V P+KG+A+ F P T +LHG PV+EGEKW ATKW+ R D
Sbjct: 237 -------VAPIKGNAVYFSYDRPHPMTG--TLHGGAPVVEGEKWVATKWLRERRHD 283
>gi|351731158|ref|ZP_08948849.1| 2OG-Fe(II) oxygenase [Acidovorax radicis N35]
Length = 303
Score = 125 bits (313), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 69/175 (39%), Positives = 92/175 (52%), Gaps = 24/175 (13%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VA G+ I + RTS GMF + Q ++ IE RIA P ENGE +Q+LHY G
Sbjct: 147 VATKTGGEEINDD-RTSDGMFFQRGQSPLIQRIEERIARLLNWPIENGEGLQVLHYRPGA 205
Query: 62 KYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PH+D+F + GG R+ T++MYL+ EKGG T FP+ V
Sbjct: 206 EYKPHYDYFDPAEPGTPTIVKRGGQRVGTLVMYLNTPEKGGGTTFPDVHVE--------- 256
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
V P +G+A+ F P ST +LHG PV+ GEKW ATKW+ R F
Sbjct: 257 -------VAPQRGNAVFFSYERPHPST--RTLHGGAPVLAGEKWIATKWLREREF 302
>gi|365090417|ref|ZP_09328465.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
gi|363416516|gb|EHL23626.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
Length = 302
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 70/177 (39%), Positives = 94/177 (53%), Gaps = 28/177 (15%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VA G+ I + RTS GMF + Q ++ IE RIA P ENGE +Q+LHY G
Sbjct: 146 VATKTGGEEINDD-RTSDGMFFQRGQSPLIQRIEERIARLLNWPIENGEGLQVLHYRPGA 204
Query: 62 KYEPHFDFFRDK-------MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW 114
+Y+PH+D+F +N+ GG R+ T++MYL+ EKGG T FP+ +
Sbjct: 205 EYKPHYDYFDPAEPGTPSIVNR--GGQRVGTLVMYLNTPEKGGGTTFPDVHLE------- 255
Query: 115 SECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
V P +G+A+ F P ST +LHG PVI GEKW ATKW+ R F
Sbjct: 256 ---------VAPQRGNAVFFSYERPHPST--RTLHGGAPVIAGEKWIATKWLREREF 301
>gi|330821584|ref|YP_004350446.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
gladioli BSR3]
gi|327373579|gb|AEA64934.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
gladioli BSR3]
Length = 302
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 67/175 (38%), Positives = 92/175 (52%), Gaps = 21/175 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D +G++I + R+S GMF + +++ IE RIAA T P ENGE +Q+LHYE G
Sbjct: 132 VVDPVTGRNIVAGHRSSDGMFFRLGETPLISRIEQRIAALTGFPVENGEGLQMLHYEAGA 191
Query: 62 KYEPHFDFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+ PH D+ + + G R+ T+LMYL+ VE GGET+FP
Sbjct: 192 ESTPHVDYLVPGNPANAESIARSGQRVGTLLMYLNDVESGGETLFPQV------------ 239
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G +V P +G A F + +D SLH S P+ G+KW ATKWI R F
Sbjct: 240 ----GCSVVPRRGQAFYFEYGNGSGRSDPASLHASSPIGSGDKWVATKWIRTRRF 290
>gi|187930127|ref|YP_001900614.1| procollagen-proline dioxygenase [Ralstonia pickettii 12J]
gi|187727017|gb|ACD28182.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12J]
Length = 288
Score = 124 bits (311), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 66/177 (37%), Positives = 96/177 (54%), Gaps = 21/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G+ RTS G + ++A IE RIA +P E+GE Q+L+Y+ G
Sbjct: 126 VVNPDTGEENLISARTSQGGMFQVGEHPLIAKIEVRIAQAVGVPVEHGEGFQVLNYQPGG 185
Query: 62 KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PHFDFF R +QL GG R+AT+++YL+ V+ GG T FP
Sbjct: 186 EYQPHFDFFNPGRSGEARQLEVGGQRVATMVIYLNSVQAGGATGFP-------------- 231
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
+ G V P+KG+A+ F PD + D +LH PV GEKW ATKW+ R + +
Sbjct: 232 --KLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVERGEKWIATKWLRERPYRR 286
>gi|228990015|ref|ZP_04149988.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
12442]
gi|228769681|gb|EEM18271.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
12442]
Length = 219
Score = 124 bits (310), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 65/158 (41%), Positives = 94/158 (59%), Gaps = 19/158 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++ EI IE RIA+ +P +GE +QIL Y GQ+Y+ H+DFF
Sbjct: 78 TNDIRTSSGAFLEES--EITTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFFV 135
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ + +R++T++MYL+HVE+GGET FP +S V P KG A
Sbjct: 136 EN-SAAASNNRMSTLVMYLNHVEEGGETFFPKLNLS----------------VSPKKGMA 178
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
+ F + D S + +LHG PVI+GEKW AT+W+ R
Sbjct: 179 VYFEYFYQDESINKLTLHGGAPVIKGEKWVATQWMRRR 216
>gi|229002593|ref|ZP_04160640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
gi|229003816|ref|ZP_04161625.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
gi|228757417|gb|EEM06653.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
gi|228758520|gb|EEM07660.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
Length = 219
Score = 124 bits (310), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 65/158 (41%), Positives = 94/158 (59%), Gaps = 19/158 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++ EI IE RIA+ +P +GE +QIL Y GQ+Y+ H+DFF
Sbjct: 78 TNDIRTSSGAFLEES--EITTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFFV 135
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ + +R++T++MYL+HVE+GGET FP +S V P KG A
Sbjct: 136 EN-SAAASNNRMSTLVMYLNHVEEGGETFFPKLNLS----------------VSPKKGMA 178
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
+ F + D S + +LHG PVI+GEKW AT+W+ R
Sbjct: 179 VYFEYFYQDESINKLTLHGGAPVIKGEKWVATQWMRRR 216
>gi|377810637|ref|YP_005043077.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
YI23]
gi|357939998|gb|AET93554.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
YI23]
Length = 297
Score = 124 bits (310), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 69/175 (39%), Positives = 92/175 (52%), Gaps = 21/175 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D +G+ +A+ R+S G F A+ +VA +E RIAA T L ENGE +Q+L Y+ G
Sbjct: 132 VVDPVTGRDVAAGHRSSDGTFFRLAETPLVARLEMRIAALTGLAAENGEGLQLLRYQPGA 191
Query: 62 KYEPHFDFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+ PH D+ ++ + G R+ T+LMYL+ VE GGETVFP
Sbjct: 192 ESTPHVDYLVAGNETNRESIARSGQRVGTLLMYLNDVEGGGETVFPQV------------ 239
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G +V P +G AL F + D SLH S P+ GEKW ATKWI R F
Sbjct: 240 ----GCSVVPRRGQALYFEYCNRAGVCDPASLHASTPLRSGEKWVATKWIRARRF 290
>gi|340357957|ref|ZP_08680560.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
gi|339616017|gb|EGQ20677.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
Length = 211
Score = 123 bits (309), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 61/154 (39%), Positives = 95/154 (61%), Gaps = 18/154 (11%)
Query: 13 SEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+E+RTSS MF+ ++ IV ++ RI+A +P E+GE +QIL Y GQ+Y+ H DFF
Sbjct: 70 NELRTSSSMFIEDDENLIVTRVKKRISAIMKIPMEHGEGLQILRYTPGQQYKAHHDFFSS 129
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
+ ++ +RI+T++MYL+ VE+GGET FP+ + ++V P KG A+
Sbjct: 130 --DSKITNNRISTLVMYLNDVEQGGETFFPHLK----------------FSVSPRKGMAV 171
Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
F + D + + +LHG PV+EGEKW AT+W+
Sbjct: 172 YFEYFYSDQTLNDFTLHGGAPVVEGEKWVATQWM 205
>gi|319792090|ref|YP_004153730.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
gi|315594553|gb|ADU35619.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
Length = 280
Score = 123 bits (309), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 65/176 (36%), Positives = 91/176 (51%), Gaps = 23/176 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+ + +G + + RTS GMF + ++EIVA +E R+A P E GE +QIL Y G
Sbjct: 122 LTVETRTGGEVLNVDRTSDGMFFERGENEIVARLEQRLAMLLRWPLEYGEGLQILRYAPG 181
Query: 61 QKYEPHFDFFRDK-----MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
+Y PH+D+F + GG R+AT++MYL E+GG T FP+
Sbjct: 182 AQYRPHYDYFDPNEPGTPTILKRGGQRVATLVMYLQEPEQGGATTFPDV----------- 230
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G V P++G + F PD T +LHG PV+ GEKW ATKW+ R F
Sbjct: 231 -----GLEVAPVRGTGVFFSYDRPDPVT--RTLHGGAPVLAGEKWVATKWLREREF 279
>gi|159485424|ref|XP_001700744.1| hypothetical protein CHLREDRAFT_187378 [Chlamydomonas reinhardtii]
gi|158281243|gb|EDP06998.1| predicted protein [Chlamydomonas reinhardtii]
Length = 253
Score = 123 bits (309), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 75/177 (42%), Positives = 103/177 (58%), Gaps = 12/177 (6%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+V N+S + +RTS + + ++VA IE RIA WT LP + E M++L Y +G
Sbjct: 74 LVVGNKSDE--VDPIRTSYSASIGYNETDVVADIEGRIARWTHLPRSHQEPMEVLRYING 131
Query: 61 QKYEPHFDFF-RDKMNQQLGGHRIATVLMYLSHVE--KGGETVFP-----NSEVSQSRDG 112
QKY+ H+D+F + GG+R+AT LMYLS +E GGET P + EV
Sbjct: 132 QKYDAHWDWFDETETGGTGGGNRMATALMYLSDMEPAAGGETALPLAQPLDWEVQGVEGR 191
Query: 113 NWSECA-RRGYAVKPMKGDALLFFSLHPDA-STDSTSLHGSCPVIEGEKWSATKWIH 167
+SECA + G +V+P KGD LLF+ + P D +LH SCP G KW+ATKWIH
Sbjct: 192 GYSECASKMGISVRPKKGDVLLFWDMEPGGREPDRHALHASCPTFSGTKWTATKWIH 248
>gi|239814309|ref|YP_002943219.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
gi|239800886|gb|ACS17953.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
Length = 279
Score = 123 bits (308), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 65/176 (36%), Positives = 92/176 (52%), Gaps = 23/176 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+ + +G + + RTS GMF + +++IVA +E RIAA P E GE +QIL Y G
Sbjct: 121 LTVETRTGGEVLNVDRTSEGMFFERGENDIVARLEQRIAALLRWPVEFGEGLQILRYAPG 180
Query: 61 QKYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
+Y PH+D+F + GG R+AT++MYL +GG T FP+
Sbjct: 181 AQYRPHYDYFDPGEPGTPTILKRGGQRVATLVMYLQEPGQGGATTFPDV----------- 229
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G V P++G + F PD +T +LHG PV+ GEKW ATKW+ R F
Sbjct: 230 -----GLEVAPVRGTGVFFSYEEPDPAT--RTLHGGAPVLAGEKWVATKWLREREF 278
>gi|241767624|ref|ZP_04765273.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
gi|241361463|gb|EER57922.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
Length = 318
Score = 123 bits (308), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 94/176 (53%), Gaps = 23/176 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+ ++G ++ RTS GMF + + +V IE RIA+ P ENGE +Q+LHY G
Sbjct: 160 LTVATQTGGEEVNDDRTSHGMFFQRGESPLVQRIEERIASLLNWPIENGEGLQVLHYRPG 219
Query: 61 QKYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
+Y+PH+D+F Q GG R+ T++MYL+ E+GG T FP++++
Sbjct: 220 AEYKPHYDYFDPAEPGTPTVIQRGGQRVGTLVMYLNTPEQGGGTTFPDAQIE-------- 271
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
V P +G+A F P ST +LHG PV+ G+KW ATKW+ R F
Sbjct: 272 --------VAPQRGNAAFFSYERPTPSTR--TLHGGAPVLAGDKWIATKWLREREF 317
>gi|295699617|ref|YP_003607510.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
gi|295438830|gb|ADG17999.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
Length = 286
Score = 123 bits (308), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 61/170 (35%), Positives = 94/170 (55%), Gaps = 21/170 (12%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
+G+ RTS G++ + +D+++A +E RIA+ T P ENGE +Q+LHY +Y PH
Sbjct: 132 TGREDVIRNRTSEGVWYRRGEDQLIARVERRIASLTNWPLENGEGLQVLHYGTSGEYSPH 191
Query: 67 FDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
FDFF ++ GG R+AT+++YL+ V GGETVFP + G
Sbjct: 192 FDFFAPDQPGSAVHTTQGGQRVATLIIYLNDVADGGETVFPTA----------------G 235
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+V G A+ F ++ + D ++LHG PV+ G+KW TKW+ R +
Sbjct: 236 LSVAAQAGGAVYFRYMNAERQLDPSTLHGGAPVLAGDKWIMTKWMRERAY 285
>gi|403234403|ref|ZP_10912989.1| Procollagen-proline dioxygenase [Bacillus sp. 10403023]
Length = 217
Score = 123 bits (308), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 62/152 (40%), Positives = 92/152 (60%), Gaps = 17/152 (11%)
Query: 15 VRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM 74
+RTSS F+ + ++ IV+ IE RI+ +P E GE +QIL+Y+ GQ+Y+ HFDFF
Sbjct: 77 MRTSSSTFIEENENIIVSRIEKRISQIMNIPTEYGEGLQILNYQVGQEYKSHFDFFSSPH 136
Query: 75 NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLF 134
N + RI+T++MYLS VE+GGET FP + ++V P KG A+ F
Sbjct: 137 N-AINNPRISTLVMYLSDVEQGGETYFP----------------KLHFSVSPQKGMAVYF 179
Query: 135 FSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ D + + +LHG PVI G+KW+AT+W+
Sbjct: 180 EYFYNDQTLNELTLHGGAPVIVGDKWAATQWM 211
>gi|354334983|gb|AER23925.1| procollagen-proline dioxygenase [Variovorax sp. HH01]
Length = 280
Score = 123 bits (308), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 67/176 (38%), Positives = 90/176 (51%), Gaps = 23/176 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+ + +G + + RTS GMF + ++EIVA +E RIAA P E GE +QIL Y G
Sbjct: 122 LTVETRTGGEVLNVDRTSDGMFFERGENEIVARVEQRIAALLRWPLEFGEGLQILRYAPG 181
Query: 61 QKYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
+Y PH+D+F + GG R+AT++MYL E GG T FP+
Sbjct: 182 AQYRPHYDYFDPSEPGTPTILKRGGQRVATLVMYLQEPEGGGATTFPDV----------- 230
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G V P +G + F PD T +LHG PV+ GEKW ATKW+ R F
Sbjct: 231 -----GLEVAPARGCGVFFSYDRPDPVT--RTLHGGAPVLAGEKWVATKWLREREF 279
>gi|255633460|gb|ACU17088.1| unknown [Glycine max]
Length = 207
Score = 122 bits (307), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 56/92 (60%), Positives = 72/92 (78%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+E+GKS S VRTSSG FL++ +D+IV IE RIA ++F+P E+GE +Q+LHYE GQ
Sbjct: 116 VVDSETGKSKDSRVRTSSGTFLARGRDKIVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQ 175
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHV 93
KYEPH+D+F D N + GG RIATVLMYL+ V
Sbjct: 176 KYEPHYDYFLDDFNTKNGGQRIATVLMYLTDV 207
>gi|389770666|ref|ZP_10192118.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
gi|388429637|gb|EIL86932.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
Length = 286
Score = 122 bits (307), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 64/175 (36%), Positives = 91/175 (52%), Gaps = 21/175 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
+ D +GK R+S G F D+ +A ++ RI+A LP ++GE +QILHY G
Sbjct: 125 IVDPTTGKHETIADRSSEGTFFEINADDFIARLDRRISALMNLPVDHGEGLQILHYGPGG 184
Query: 62 KYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PHFDFF + GG R++T++MYL+ VE GG T+FP
Sbjct: 185 EYKPHFDFFPPGDPGSAVQMATGGQRVSTLVMYLNEVEDGGATIFPEL------------ 232
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G +V P KG A+ F + D +LHG PV+ GEKW TKW+ R +
Sbjct: 233 ----GLSVLPKKGSAVYFEYTNSRGQLDPRTLHGGAPVLRGEKWIVTKWMRQRRY 283
>gi|421749438|ref|ZP_16186877.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
gi|409771699|gb|EKN53918.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
Length = 319
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 66/171 (38%), Positives = 96/171 (56%), Gaps = 21/171 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G + RTS G + ++ +EARIAA T +P E+GE +QIL+Y+ G
Sbjct: 157 VVNPDTGDENLIDARTSMGAMFQVGEHPLIERLEARIAAVTGVPVEHGEGLQILNYKPGA 216
Query: 62 KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PH+DFF R +QL GG R+AT+++YL+ V GG T FP
Sbjct: 217 EYQPHYDFFNPQRPGEARQLRVGGQRMATLVIYLNDVPAGGATAFP-------------- 262
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
+ G V P++G+A+ F L D S D +LH PV +GEKW ATKW+
Sbjct: 263 --KLGLRVNPVQGNAVFFAYLGEDGSLDERTLHAGLPVEQGEKWIATKWLR 311
>gi|352086439|ref|ZP_08953941.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
gi|389799401|ref|ZP_10202396.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
gi|351679404|gb|EHA62545.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
gi|388442818|gb|EIL98985.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
Length = 284
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 70/181 (38%), Positives = 100/181 (55%), Gaps = 26/181 (14%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+ D+E G+ RTS GMF + + +V IE R+AA +P +GE +QILHY G
Sbjct: 124 LTVDSE-GRQQVDRRRTSEGMFFTLNEVPLVGRIEQRLAALLRVPASHGEGLQILHYLPG 182
Query: 61 QKYEPHFDFFRDKMN-----QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
Q+YEPHFD+F + +GG RIA+V+MYL+ +GG T FP ++ +
Sbjct: 183 QEYEPHFDWFDPEQPGYGAITAVGGQRIASVVMYLNTPARGGGTAFPELGLTVT------ 236
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
ARRG AV +F+ D +SLH PV++GEKW ATKW+ R + +P+
Sbjct: 237 --ARRGSAV---------YFAYE---GGDPSSLHAGLPVLDGEKWIATKWLRERPYKRPK 282
Query: 176 K 176
K
Sbjct: 283 K 283
>gi|390570433|ref|ZP_10250698.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
gi|389937613|gb|EIM99476.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
Length = 285
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 63/171 (36%), Positives = 92/171 (53%), Gaps = 21/171 (12%)
Query: 6 ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
E+G ++RTS G + + +D + ++ RI+A P E+GE +QILHY G +Y P
Sbjct: 130 ENGSEDVIQLRTSEGFWFQRCEDAFIERLDHRISALMNWPLEHGEGLQILHYRQGGEYRP 189
Query: 66 HFDFFRDKMNQQL-----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
HFD+F N + GG R+AT+++YLS VE GGETVFP++
Sbjct: 190 HFDYFPPGQNGSVLHTARGGQRVATLIVYLSDVEGGGETVFPDA---------------- 233
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G AV +G A+ F ++ D +LHG PV G+KW TKW+ R +
Sbjct: 234 GLAVMARQGGAIYFRYMNGRRQLDPLTLHGGAPVTSGDKWIMTKWMRERPY 284
>gi|302835042|ref|XP_002949083.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
nagariensis]
gi|300265828|gb|EFJ50018.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
nagariensis]
Length = 263
Score = 122 bits (306), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 76/181 (41%), Positives = 104/181 (57%), Gaps = 23/181 (12%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MV +S + ++RTS + + IV+SIE RIA WT +L Y +G
Sbjct: 95 MVVGTDS--DLIDDIRTSFSASIMYGETSIVSSIEERIARWT-----------VLRYVNG 141
Query: 61 QKYEPHFDFFRDKMNQQLGG-HRIATVLMYLSHVE--KGGETVFPNSEV----SQSRDGN 113
QKY+ H+D+F D + GG +R+ATVLMYLS V+ GGET P +E QS DG
Sbjct: 142 QKYDAHWDWFDDNEVAKAGGSNRMATVLMYLSDVDPAAGGETALPLAEPLDPHKQSVDGQ 201
Query: 114 -WSECA-RRGYAVKPMKGDALLFFSLHPDAST-DSTSLHGSCPVIEGEKWSATKWIHVRN 170
+S+CA R G +++P KGD LLF+ + P D +LH SCP G KW+ATKWIH +
Sbjct: 202 GYSQCAARMGISIRPRKGDVLLFWDMDPAGLIPDRHALHASCPTFSGTKWTATKWIHNKP 261
Query: 171 F 171
+
Sbjct: 262 Y 262
>gi|239816557|ref|YP_002945467.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
gi|239803134|gb|ACS20201.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
Length = 296
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 66/181 (36%), Positives = 95/181 (52%), Gaps = 21/181 (11%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
D +G++ R+S GMF ++ VA ++ R++ LP ENGE +Q+LHY G +
Sbjct: 131 DPLTGRNRLGAQRSSLGMFFRLRENAFVARLDERLSELMNLPVENGEGLQVLHYPAGAQS 190
Query: 64 EPHFDFF--RDKMNQ---QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
PHFDF + NQ Q G R++T++ YL+ VE+GGETVFP +
Sbjct: 191 LPHFDFLVPSNAANQASLQRSGQRVSTLVAYLNEVEEGGETVFPET-------------- 236
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
G++V P +G A+ F + D SLH PV+ GEKW ATKW+ R F + P
Sbjct: 237 --GWSVSPQRGGAVYFEYCNSLGQVDHASLHAGAPVLSGEKWVATKWMRQRRFVAAAQAP 294
Query: 179 E 179
Sbjct: 295 R 295
>gi|395003644|ref|ZP_10387769.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
gi|394318439|gb|EJE54870.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
Length = 299
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 69/175 (39%), Positives = 95/175 (54%), Gaps = 24/175 (13%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VA G+ + + RTS GMF + ++ +V IE RIA P ENGE +Q+LHY G
Sbjct: 143 VATKTGGEEVNDD-RTSDGMFFQRGENPVVQRIEERIARLLDWPIENGEGLQVLHYRPGA 201
Query: 62 KYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PH+D+F + GG R+ T++MYL+ EKGG T FP+ V
Sbjct: 202 EYKPHYDYFDPGEPGTPTILKRGGQRVGTLVMYLNTPEKGGGTTFPDVHVE--------- 252
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
V P +G+A +FFS + A + +LHG PVI GEKW ATKW+ R F
Sbjct: 253 -------VAPQRGNA-VFFS-YERAHPATRTLHGGAPVIAGEKWIATKWLREREF 298
>gi|407938132|ref|YP_006853773.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
gi|407895926|gb|AFU45135.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
Length = 303
Score = 122 bits (305), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 68/176 (38%), Positives = 94/176 (53%), Gaps = 26/176 (14%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VA G+ I ++ RTS GMF + Q ++ IE RIA P ENGE +Q+LHY G
Sbjct: 147 VATKTGGEEINAD-RTSDGMFFQRGQSPLIQRIEERIARLLQWPIENGEGLQVLHYRPGA 205
Query: 62 KYEPHFDFFRDKMNQ------QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
+Y+PH+D+F D + GG R+ T++MYL+ +KGG T FP+ +
Sbjct: 206 EYKPHYDYF-DPAEPGTPSIIKRGGQRVGTLVMYLNTPDKGGGTTFPDVHLE-------- 256
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
V P +G+A+ F P ST +LHG PVI G+KW ATKW+ R F
Sbjct: 257 --------VAPQRGNAVFFSYERPHPST--RTLHGGAPVIAGDKWIATKWLREREF 302
>gi|420246706|ref|ZP_14750139.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
gi|398073616|gb|EJL64785.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
Length = 282
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 63/171 (36%), Positives = 92/171 (53%), Gaps = 21/171 (12%)
Query: 6 ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
E+G ++RTS G + + +D + ++ RI+A P E+GE +QILHY G +Y P
Sbjct: 127 ENGSEDVIQLRTSEGFWFQRCEDAFIERLDHRISALMNWPLEHGEGLQILHYRQGGEYRP 186
Query: 66 HFDFFRDKMNQQL-----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
HFD+F N + GG R+AT+++YLS VE GGETVFP++
Sbjct: 187 HFDYFPPGQNGSVLHTARGGQRVATLIVYLSDVEGGGETVFPDA---------------- 230
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G AV +G A+ F ++ D +LHG PV G+KW TKW+ R +
Sbjct: 231 GLAVMARQGGAIYFRYMNGRRQLDPLTLHGGAPVTSGDKWIMTKWMRERPY 281
>gi|414870897|tpg|DAA49454.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 222
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 55/89 (61%), Positives = 68/89 (76%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +G S S VRTSSGMFL + QD+I+ +IE RIA +TF+P E GE +Q+LHYE GQ
Sbjct: 133 VVDSATGGSKDSRVRTSSGMFLRRGQDKIIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQ 192
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYL 90
KYEPHFD+F D N + GG RIAT+LMYL
Sbjct: 193 KYEPHFDYFHDDYNTKNGGQRIATLLMYL 221
>gi|357483927|ref|XP_003612250.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355513585|gb|AES95208.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 204
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 55/89 (61%), Positives = 72/89 (80%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+E+GKS S VRTSSG FL++ +D+IV +IE +IA +TF+P E+GE +Q+LHYE GQ
Sbjct: 115 VVDSETGKSKDSRVRTSSGTFLARGRDKIVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQ 174
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYL 90
KYEPH+D+F D+ N + GG RIATVLMYL
Sbjct: 175 KYEPHYDYFLDEFNTKNGGQRIATVLMYL 203
>gi|332526359|ref|ZP_08402485.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
gi|332110495|gb|EGJ10818.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
Length = 224
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 67/169 (39%), Positives = 91/169 (53%), Gaps = 25/169 (14%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
DN +G S + RTS GMF + + ++ IE RIA P E GE +Q+LHY G +Y
Sbjct: 69 DNSTGGSEVNAARTSDGMFFERGETPLIERIERRIAELVHWPVERGEGLQVLHYRPGAQY 128
Query: 64 EPHFDFFRDKMNQ------QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+PH DFF D + + GG R+ TV++YL+ GG T FP
Sbjct: 129 KPHHDFF-DPAHPGTANILRRGGQRVGTVVIYLNTPAGGGATTFPEV------------- 174
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
G V+P+KG+A+ F P AST +LHG PV++GEKW ATKW+
Sbjct: 175 ---GLEVQPIKGNAVFFSYERPLASTR--TLHGGAPVLDGEKWVATKWL 218
>gi|317127314|ref|YP_004093596.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
2522]
gi|315472262|gb|ADU28865.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
2522]
Length = 229
Score = 121 bits (304), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 59/160 (36%), Positives = 98/160 (61%), Gaps = 18/160 (11%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
S KS+ ++RTSS MF A++++V+++E R++ +P ++GE +QIL+Y GQ+Y+ H
Sbjct: 75 SNKSV-HDLRTSSSMFFDDAENDVVSTVEKRVSQIMKIPVDHGEGIQILNYAIGQEYKAH 133
Query: 67 FDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
+D+F N ++ RI+T++MYL+ VE GGET FP + + V P
Sbjct: 134 YDYFSSG-NSKVNNPRISTLVMYLNDVEAGGETYFP----------------KLNFYVAP 176
Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
KG A+ F + D + + +LHG PV+ G+KW+AT+W+
Sbjct: 177 KKGMAVYFEYFYNDTTLNELTLHGGAPVVIGDKWAATQWM 216
>gi|319763870|ref|YP_004127807.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
gi|330823866|ref|YP_004387169.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
gi|317118431|gb|ADV00920.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
gi|329309238|gb|AEB83653.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
Length = 284
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 68/170 (40%), Positives = 97/170 (57%), Gaps = 23/170 (13%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
SG ++ RTS GMF + ++E VA +E RIA P ENGE +Q+LHY G +Y+PH
Sbjct: 132 SGGEEVNKDRTSDGMFFQRGENEAVARLEERIARLVRWPVENGEGLQVLHYRPGAEYKPH 191
Query: 67 FDFF--RDKMNQQL---GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
+D+F + +L GG R+AT+++YL+ +GG T FP+ +
Sbjct: 192 YDYFDPAEPGTPRLLRRGGQRVATLVIYLNDPVRGGGTTFPDVPLE-------------- 237
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ P +G+A +FFS + A S +LHG PVIEGEKW ATKW+ R F
Sbjct: 238 --IGPRQGNA-VFFS-YGRAHPSSRTLHGGAPVIEGEKWIATKWLREREF 283
>gi|398808448|ref|ZP_10567311.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
gi|398087480|gb|EJL78066.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
Length = 280
Score = 121 bits (303), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 65/176 (36%), Positives = 90/176 (51%), Gaps = 23/176 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+ + +G + + RTS GMF + ++EIVA +E R+A P E GE +QIL Y G
Sbjct: 122 LTVETRTGGEVLNVDRTSDGMFFERGENEIVARLEQRLATLLRWPLEYGEGLQILRYAPG 181
Query: 61 QKYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
+Y PH+D+F + GG R+AT++MYL E GG T FP+
Sbjct: 182 AQYRPHYDYFDPGEPGTPTILKRGGQRVATLVMYLQEPEGGGATTFPDV----------- 230
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G V P++G + F PD T +LHG PV+ GEKW ATKW+ R F
Sbjct: 231 -----GLEVAPVRGCGVFFSYDRPDPVT--RTLHGGAPVLAGEKWVATKWLREREF 279
>gi|253575459|ref|ZP_04852796.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
gi|251845106|gb|EES73117.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
Length = 215
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 63/164 (38%), Positives = 94/164 (57%), Gaps = 20/164 (12%)
Query: 11 IASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF 70
+ SE+RTS GMF + ++ + IE RI+A +P E+ E +Q+LHY GQ+Y+ H+DFF
Sbjct: 64 VVSEIRTSRGMFFEEEENPFIHRIEKRISALMNVPIEHAEGLQVLHYGPGQEYQAHYDFF 123
Query: 71 RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
+ +RI+T+++YL+ VE GGETVFP ++ VKP +G
Sbjct: 124 GPN-SPSASNNRISTLIIYLNDVEAGGETVFPLLDLE----------------VKPERGS 166
Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI---HVRNF 171
AL F + ++ +LH S PV+ GEKW AT+W+ VR F
Sbjct: 167 ALYFEYFYRQQELNNLTLHSSVPVVRGEKWVATQWMRRQRVREF 210
>gi|307108817|gb|EFN57056.1| hypothetical protein CHLNCDRAFT_143796 [Chlorella variabilis]
Length = 334
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 64/147 (43%), Positives = 89/147 (60%), Gaps = 4/147 (2%)
Query: 28 DEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVL 87
D ++A IE ++AA T +P +GE +L YE Q Y+ H+D F ++ RIATVL
Sbjct: 185 DGVLAWIEDKLAAVTMIPAGHGEPFNVLRYEPSQHYDSHYDSFSEEEYGPQFSQRIATVL 244
Query: 88 MYLSHVEKGGETVF---PNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTD 144
+YL+ VE+GGETVF +++ ++ C G VKP +GDALLFFS+ + + D
Sbjct: 245 LYLADVEEGGETVFLLEGKGGLARLERIDYKAC-DTGIKVKPRQGDALLFFSVSVNGTLD 303
Query: 145 STSLHGSCPVIEGEKWSATKWIHVRNF 171
SLHG CPV+ G KW+ TKWI R F
Sbjct: 304 KHSLHGGCPVVAGTKWAMTKWIRNRCF 330
>gi|389809938|ref|ZP_10205598.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
gi|388441354|gb|EIL97635.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
Length = 284
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 71/181 (39%), Positives = 98/181 (54%), Gaps = 26/181 (14%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+ D+E G+ RTS GMF + + +V IE R+AA +P +GE +QILHY G
Sbjct: 124 LTVDSE-GRQQVDRRRTSEGMFFTLDEVPLVGRIERRVAALLDVPASHGEGLQILHYLPG 182
Query: 61 QKYEPHFDFFRD-----KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
Q YEPHFD+F + +GG RIA+V+MYL+ +GG T FP ++ +
Sbjct: 183 QAYEPHFDWFDPDQPGYETITAVGGQRIASVVMYLNTPARGGGTAFPALGLTVT------ 236
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
ARRG AV +F+ D +SLH PV+EGEKW ATKW+ R + +P
Sbjct: 237 --ARRGAAV---------YFAYE---GGDCSSLHAGLPVLEGEKWIATKWLRERPYRRPT 282
Query: 176 K 176
K
Sbjct: 283 K 283
>gi|393200372|ref|YP_006462214.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
gi|327439703|dbj|BAK16068.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
Length = 211
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 62/159 (38%), Positives = 91/159 (57%), Gaps = 17/159 (10%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
K S +RTSSGMF + ++ +++ IE RI++ LP E+ E +Q+LHYE GQ+++PHF
Sbjct: 61 AKKEISSIRTSSGMFFEENENPLISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKPHF 120
Query: 68 DFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPM 127
DFF + +RI T+++YL+ VE+GG T FPN G P
Sbjct: 121 DFFGPN-HPSSSNNRICTLVVYLNDVEEGGVTTFPN----------------LGIVNVPK 163
Query: 128 KGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
KG A+ F + D + +LH PVI+GEKW AT+W+
Sbjct: 164 KGTAVYFEYFYNDQKLNELTLHSGEPVIQGEKWVATQWM 202
>gi|91789558|ref|YP_550510.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
JS666]
gi|91698783|gb|ABE45612.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
JS666]
Length = 277
Score = 121 bits (303), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 65/161 (40%), Positives = 90/161 (55%), Gaps = 23/161 (14%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
RTS GMF ++ ++ +V +EARIA P + GE +Q+L Y G +Y+PH+D+F
Sbjct: 134 RTSQGMFFARGENPLVQRVEARIARLVGWPVDRGEGLQVLRYRQGAQYKPHYDYFDPAEP 193
Query: 72 -DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
Q GG R+AT++MYL+ E+GG TVFP+ G V P +G
Sbjct: 194 GTPAILQRGGQRVATLIMYLNEPEQGGATVFPDI----------------GLQVTPRRGT 237
Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
A +FFS +P A+ S + HG PV GEKW ATKW+ R F
Sbjct: 238 A-VFFS-YPAANPASLTRHGGEPVKAGEKWIATKWLREREF 276
>gi|430808003|ref|ZP_19435118.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
gi|429499635|gb|EKZ98045.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
Length = 293
Score = 120 bits (302), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 66/170 (38%), Positives = 92/170 (54%), Gaps = 21/170 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G + RTS G + ++ IEARIAA T P E+GE Q+L+Y+ G
Sbjct: 131 VVNPDTGDENLIDARTSMGAMFQVGEHALLQRIEARIAAVTGWPVEHGEGFQVLNYKPGG 190
Query: 62 KYEPHFDFFRDK-----MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PHFDFF K ++GG R+AT+++YL+ GG T FP
Sbjct: 191 EYQPHFDFFNPKRPGEARQLRVGGQRVATMVIYLNSPASGGATAFP-------------- 236
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
R G V P+KG+A+LF PD + D +LH PV GEKW ATKW+
Sbjct: 237 --RIGLEVAPVKGNAVLFSYGLPDGALDERTLHAGLPVEAGEKWIATKWL 284
>gi|94312029|ref|YP_585239.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
gi|93355881|gb|ABF09970.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
Length = 293
Score = 120 bits (302), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 66/170 (38%), Positives = 92/170 (54%), Gaps = 21/170 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G + RTS G + ++ IEARIAA T P E+GE Q+L+Y+ G
Sbjct: 131 VVNPDTGDENLIDARTSMGAMFQVGEHALLQRIEARIAAVTGWPVEHGEGFQVLNYKPGG 190
Query: 62 KYEPHFDFFRDK-----MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PHFDFF K ++GG R+AT+++YL+ GG T FP
Sbjct: 191 EYQPHFDFFNPKRPGEARQLRVGGQRVATMVIYLNSPASGGATAFP-------------- 236
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
R G V P+KG+A+LF PD + D +LH PV GEKW ATKW+
Sbjct: 237 --RIGLEVAPVKGNAVLFSYGLPDGALDERTLHAGLPVEAGEKWIATKWL 284
>gi|302830268|ref|XP_002946700.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300267744|gb|EFJ51926.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 186
Score = 120 bits (301), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 72/168 (42%), Positives = 91/168 (54%), Gaps = 7/168 (4%)
Query: 14 EVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK 73
+VRTS G FL + +E +IAA T LP NGE +L+Y+H Q Y+ H D F K
Sbjct: 19 QVRTSKGTFLGGDSSPALRWLEDKIAAVTLLPRTNGEFWNVLNYKHSQHYDSHMDSFDPK 78
Query: 74 MNQQLGGHRIATVLMYLSHVE-KGGETVFPNSEVSQSRD--GNWSEC-ARRGYAVKPMKG 129
RIATV++ LS GGETVF S NW++C A G KP G
Sbjct: 79 EYGPQYSQRIATVIVVLSDDGLMGGETVFKREGKSSINKPISNWTDCDADGGLKYKPRAG 138
Query: 130 DALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR---NFDKP 174
DA+LF+S PD D +LHGSCPV+ G KW A KW+ + + DKP
Sbjct: 139 DAVLFWSARPDGQLDPHALHGSCPVVTGNKWVAVKWLRNKGEYDHDKP 186
>gi|326518408|dbj|BAJ88233.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 276
Score = 120 bits (301), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 65/140 (46%), Positives = 91/140 (65%), Gaps = 9/140 (6%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
V D +GK + S+VRTSSGMF++ + + ++ +IE RI+ ++ +P ENGE +Q+L YE
Sbjct: 97 VVDVATGKGVKSDVRTSSGMFVNSEERKLPVIKAIEKRISVFSQIPVENGELIQVLRYEP 156
Query: 60 GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
Q Y PH D+F D N + GG R+AT+LMYL+ +GGET FP Q+ DG R
Sbjct: 157 NQYYRPHHDYFSDTFNLKRGGQRVATMLMYLTDGVEGGETHFP-----QAGDGECICGGR 211
Query: 120 --RGYAVKPMKGDALLFFSL 137
RG VKP KGDA+LF+S+
Sbjct: 212 LVRGLCVKPNKGDAVLFWSM 231
>gi|73542634|ref|YP_297154.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
eutropha JMP134]
gi|72120047|gb|AAZ62310.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
eutropha JMP134]
Length = 282
Score = 120 bits (301), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 64/170 (37%), Positives = 91/170 (53%), Gaps = 21/170 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G + RTS G + ++ IE RIAA +P ++GE +QIL+Y+ G
Sbjct: 120 VINPDTGDENLIDARTSMGAMFQVGEHTLIQRIEDRIAAVLGVPVDHGEGLQILNYKPGG 179
Query: 62 KYEPHFDFFRDK-----MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PHFDFF K ++GG R AT+++YL+ + GG T FP
Sbjct: 180 EYQPHFDFFNPKRPGEARQLRVGGQRTATLVIYLNTPQAGGATAFP-------------- 225
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
R G V P+KG+A+ F L PD D +LH PV GEKW ATKW+
Sbjct: 226 --RIGLEVAPVKGNAVYFSYLQPDGKLDERTLHAGLPVQSGEKWIATKWL 273
>gi|383757171|ref|YP_005436156.1| putative prolyl 4-hydroxylase alpha subunit [Rubrivivax gelatinosus
IL144]
gi|381377840|dbj|BAL94657.1| putative prolyl 4-hydroxylase alpha subunit homologue
oxidoreductase protein [Rubrivivax gelatinosus IL144]
Length = 279
Score = 120 bits (301), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 67/170 (39%), Positives = 90/170 (52%), Gaps = 25/170 (14%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
DN +G S + RTS GMF + + ++ IE RIA P E GE +Q+L Y G +Y
Sbjct: 124 DNSTGGSEVNAARTSDGMFFERGEKPLIERIERRIAELVRWPVERGEGLQVLRYRPGAQY 183
Query: 64 EPHFDFFRDKMNQ------QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+PH DFF D + + GG R+ TV+MYL+ GG T FP
Sbjct: 184 KPHHDFF-DPAHPGTANILRRGGQRVGTVVMYLNTPAGGGATTFPEV------------- 229
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
G V+P+KG+A+ F P AST +LHG PV++GEKW ATKW+
Sbjct: 230 ---GLEVQPVKGNAVFFSYERPLAST--RTLHGGAPVLDGEKWVATKWMR 274
>gi|159490898|ref|XP_001703410.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
gi|158280334|gb|EDP06092.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
Length = 429
Score = 120 bits (300), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 71/158 (44%), Positives = 91/158 (57%), Gaps = 6/158 (3%)
Query: 14 EVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK 73
+VRTS G FL + +E++IAA T +P +NGE +L+Y+H Q Y+ H D F K
Sbjct: 265 QVRTSKGTFLGGDSSPALTWLESKIAAVTDIPRQNGEFWNVLNYKHTQHYDSHMDSFDPK 324
Query: 74 MNQQLGGHRIATVLMYLS-HVEKGGETVFPNSEVSQSRD---GNWSEC-ARRGYAVKPMK 128
Q RIATV++ LS GGETVF E + D NW++C A G KP
Sbjct: 325 EYGQQYSQRIATVIVVLSDEGLVGGETVF-KREGKANIDKPITNWTDCDADGGLRYKPRA 383
Query: 129 GDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
GDA+LF+S PD D +LHGSCPV+ G KW A KWI
Sbjct: 384 GDAVLFWSAFPDGRLDQHALHGSCPVVTGNKWVAVKWI 421
>gi|389793983|ref|ZP_10197143.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
gi|388433014|gb|EIL89992.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
Length = 282
Score = 120 bits (300), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 69/178 (38%), Positives = 97/178 (54%), Gaps = 27/178 (15%)
Query: 5 NESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYE 64
+ GK + RTS GMF + +VA+IE R+A +P +GE +QILHY GQ+YE
Sbjct: 125 DSDGKQQIDQRRTSEGMFFRAGETPLVAAIEQRLAQLLGVPASHGEGLQILHYGPGQEYE 184
Query: 65 PHFDFF------RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
PH+D+F DK+ + G RIA+V+MYL+ E+GG T FP ++ + A
Sbjct: 185 PHYDWFDPALPGYDKLTAR-AGQRIASVVMYLNTPERGGGTAFPEIGLTVT--------A 235
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
RRG AV +F+ D +SLH PV++GEKW AT W+ R F + K
Sbjct: 236 RRGAAV---------YFAYE---GGDQSSLHAGLPVLQGEKWIATHWLRERPFGQGSK 281
>gi|398810140|ref|ZP_10568970.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
gi|398083831|gb|EJL74535.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
Length = 296
Score = 119 bits (299), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 64/173 (36%), Positives = 93/173 (53%), Gaps = 21/173 (12%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
D SG+ + E R+S GMF ++ +A ++ R++ LP ENGE +Q+L Y G +
Sbjct: 131 DPLSGRDLVGEQRSSLGMFFRLRENAFIARLDQRVSELMNLPVENGEGLQVLCYPAGAQS 190
Query: 64 EPHFDFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
PHFDF +K + G R++T++ YL+ VE+GGET+FP EC
Sbjct: 191 MPHFDFLVPSNAANKASLARSGQRVSTLVSYLNEVEEGGETIFP-------------EC- 236
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G++V P +G A+ F + D SLH PV+ GEKW ATKW+ R F
Sbjct: 237 --GWSVPPRRGSAVYFEYCNSLGQVDHASLHAGGPVLHGEKWVATKWMRQRRF 287
>gi|229086310|ref|ZP_04218488.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
gi|228697005|gb|EEL49812.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
Length = 220
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 62/157 (39%), Positives = 94/157 (59%), Gaps = 19/157 (12%)
Query: 15 VRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM 74
+RTSSG FL ++E VA IE R+++ +P E+GE + IL Y GQ+Y+ H+D+F +
Sbjct: 82 IRTSSGTFLE--ENETVAIIEKRVSSIMNIPVEHGEGLHILKYTPGQEYKAHYDYFAEH- 138
Query: 75 NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLF 134
++ +RI+T++MYL+ VE+GGET FP +S + P KG A+ F
Sbjct: 139 SRAAENNRISTLVMYLNDVEEGGETFFPKLNLS----------------IAPKKGSAVYF 182
Query: 135 FSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ D S + +LHG PVI+GEKW AT+W+ R+
Sbjct: 183 EYFYNDKSLNELTLHGGAPVIKGEKWVATQWMKRRSL 219
>gi|167519971|ref|XP_001744325.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777411|gb|EDQ91028.1| predicted protein [Monosiga brevicollis MX1]
Length = 492
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 62/159 (38%), Positives = 90/159 (56%), Gaps = 16/159 (10%)
Query: 14 EVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK 73
E R S+ +L D++V ++ RIA T L E EA+Q+ +Y G YE H+D +
Sbjct: 344 EFRISTAAWLQPDHDDVVTNLHTRIADATQLDLEFAEALQVSNYGIGGFYETHYDHHASR 403
Query: 74 MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALL 133
+ G RIAT ++YL+ VE+GG T FP R G AV+P GDA+
Sbjct: 404 ERELPEGDRIATFMIYLNQVEQGGYTAFP----------------RLGAAVEPGHGDAVF 447
Query: 134 FFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
+++L PD +D+ +LHG+CPV++G KW A KWIH + D
Sbjct: 448 WYNLLPDGESDNNTLHGACPVLQGSKWVANKWIHEKKND 486
>gi|195061068|ref|XP_001995918.1| GH14106 [Drosophila grimshawi]
gi|193891710|gb|EDV90576.1| GH14106 [Drosophila grimshawi]
Length = 511
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 67/171 (39%), Positives = 92/171 (53%), Gaps = 19/171 (11%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
+GK ++ VRTS G +L + + + IE R+ T L + EA I++Y G Y H
Sbjct: 350 NGKYVSRRVRTSKGAWLERDLNNLTRRIERRVVDMTELSMQGSEAYNIMNYGLGGHYAAH 409
Query: 67 FDFFRDKMNQQL-GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVK 125
+DFF Q G RIATVL YLS VE+GG TVFPN ++ AV
Sbjct: 410 YDFFNTTKQQTSETGDRIATVLFYLSDVEQGGATVFPNLKL----------------AVS 453
Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
P +G AL +++L + + D+ +LHG CPV+ G KW T WIH R F +P
Sbjct: 454 PERGMALFWYNLLDNGTGDTRTLHGGCPVLVGSKWVMTLWIHERAQLFTRP 504
>gi|406665340|ref|ZP_11073114.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
gi|405387266|gb|EKB46691.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
Length = 211
Score = 119 bits (297), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 61/159 (38%), Positives = 91/159 (57%), Gaps = 17/159 (10%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
K S +RTSSGMF + ++ +++ IE RI++ LP E+ E +Q+LHYE GQ+++ HF
Sbjct: 61 AKKEISSIRTSSGMFFEENENPLISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKAHF 120
Query: 68 DFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPM 127
DFF + +RI+T+++YL+ VE+GG T FPN G P
Sbjct: 121 DFFGPN-HPSSSNNRISTLVVYLNDVEEGGVTTFPN----------------LGIVNVPK 163
Query: 128 KGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
KG A+ F + D + +LH PVI+GEKW AT+W+
Sbjct: 164 KGTAVYFEYFYNDQKLNELTLHSGEPVIQGEKWVATQWM 202
>gi|413923982|gb|AFW63914.1| hypothetical protein ZEAMMB73_179176 [Zea mays]
Length = 222
Score = 118 bits (296), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 52/89 (58%), Positives = 70/89 (78%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +GKS S VRTSSGMFL + +D+++ IE RIA +TF+P ++GE +Q+LHYE GQ
Sbjct: 134 VVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQ 193
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYL 90
KYEPHFD+F D+ N + GG R+AT+LMYL
Sbjct: 194 KYEPHFDYFLDEFNTKNGGQRMATLLMYL 222
>gi|160900716|ref|YP_001566298.1| procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
gi|160366300|gb|ABX37913.1| Procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
Length = 294
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 65/175 (37%), Positives = 94/175 (53%), Gaps = 24/175 (13%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VA G+ I + RTS+GMF + + IV+ +E RIA P ++GE +Q+LHY G
Sbjct: 138 VATQSGGEEINDD-RTSNGMFFQRGETGIVSQLEERIARLLRWPLDHGEGLQVLHYGPGA 196
Query: 62 KYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PH D+F + GG R+ T+++YL+ E+GG T+FP +
Sbjct: 197 EYKPHHDYFAPGEPGTPTILKRGGQRVGTLVIYLNEPERGGATIFPEVPLQ--------- 247
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
V P +G+A+ F PD ST +LHG PV+ GEKW ATKW+ R F
Sbjct: 248 -------VVPRRGNAVFFSYERPDPST--RTLHGGAPVLAGEKWIATKWLREREF 293
>gi|333912984|ref|YP_004486716.1| procollagen-proline dioxygenase [Delftia sp. Cs1-4]
gi|333743184|gb|AEF88361.1| Procollagen-proline dioxygenase [Delftia sp. Cs1-4]
Length = 294
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 65/175 (37%), Positives = 94/175 (53%), Gaps = 24/175 (13%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VA G+ I + RTS+GMF + + IV+ +E RIA P ++GE +Q+LHY G
Sbjct: 138 VATQSGGEEINDD-RTSNGMFFQRGETGIVSQLEERIARLLRWPLDHGEGLQVLHYGPGA 196
Query: 62 KYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PH D+F + GG R+ T+++YL+ E+GG T+FP +
Sbjct: 197 EYKPHHDYFAPGEPGTPTILKRGGQRVGTLVIYLNEPERGGATIFPEVPLQ--------- 247
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
V P +G+A+ F PD ST +LHG PV+ GEKW ATKW+ R F
Sbjct: 248 -------VVPRRGNAVFFSYERPDPST--RTLHGGAPVLAGEKWIATKWLREREF 293
>gi|423604110|ref|ZP_17580003.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
gi|401245796|gb|EJR52149.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
Length = 216
Score = 118 bits (295), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 61/155 (39%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ A IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F H D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFHQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|148653656|ref|YP_001280749.1| procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
gi|148572740|gb|ABQ94799.1| Procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
Length = 268
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 63/170 (37%), Positives = 89/170 (52%), Gaps = 21/170 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E G + RTS+ + + +I+ +IEARIA P ++GE +Q+L YE G
Sbjct: 109 VVDPEDGSFVEHSARTSTSTGYHRGEIDIIKTIEARIADLINWPVDHGEGLQVLRYEDGG 168
Query: 62 KYEPHFDFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y PHFDFF ++ + GG R+ T LMYLS V+ GG T FPN
Sbjct: 169 EYRPHFDFFDPAKKSSRLVTKQGGQRVGTFLMYLSEVDSGGSTRFPN------------- 215
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ ++P KG AL F + + A + +LH PV EG K+ ATKW+
Sbjct: 216 ---LNFEIRPNKGSALYFANTNLKAEIEPLTLHAGMPVTEGVKYLATKWL 262
>gi|206978009|ref|ZP_03238895.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
H3081.97]
gi|423373947|ref|ZP_17351286.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
gi|206743809|gb|EDZ55230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
H3081.97]
gi|401094762|gb|EJQ02832.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
Length = 216
Score = 117 bits (294), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 61/155 (39%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL DE+ A IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLD--DDELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|423489423|ref|ZP_17466105.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
gi|402431659|gb|EJV63723.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
Length = 216
Score = 117 bits (293), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 94/155 (60%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++E+ + IE RI++ T +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--ENELTSKIEKRISSITNVPVSHGEGLHILNYEVDQEYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|398804098|ref|ZP_10563100.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
gi|398094921|gb|EJL85274.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
Length = 277
Score = 117 bits (293), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 66/178 (37%), Positives = 96/178 (53%), Gaps = 27/178 (15%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+ D +G + RTS GMF ++ ++E++ IEARIA P +NGE +Q+L Y G
Sbjct: 119 LTVDIRTGGEELNHDRTSHGMFYTRGENEVIRRIEARIARLLNWPVQNGEGLQVLRYRRG 178
Query: 61 QKYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
+Y+PH+D+F + GG R+A+++MYL +GG TVFP+
Sbjct: 179 AEYKPHYDYFDPGEPGTAAILRRGGQRVASLIMYLREPGEGGATVFPDI----------- 227
Query: 116 ECARRGYAVKPMKGDALLF-FSL-HPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G V+P +G A+ F ++L HP S +LHG PV GEKW ATKW+ R F
Sbjct: 228 -----GLKVRPQQGSAVFFSYALAHP----ASLTLHGGEPVKSGEKWIATKWLREREF 276
>gi|229019457|ref|ZP_04176278.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
gi|229025700|ref|ZP_04182104.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
gi|423417837|ref|ZP_17394926.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
gi|228735575|gb|EEL86166.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
gi|228741812|gb|EEL91991.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
gi|401107008|gb|EJQ14965.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
Length = 216
Score = 117 bits (293), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 94/155 (60%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++E+ + IE RI++ T +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--ENELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|222111817|ref|YP_002554081.1| procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
gi|221731261|gb|ACM34081.1| Procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
Length = 289
Score = 117 bits (293), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 62/161 (38%), Positives = 87/161 (54%), Gaps = 23/161 (14%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMN 75
RTS GMF + + +V +E RIA P +NGE +Q+LHY G +Y+PH+D+F
Sbjct: 146 RTSDGMFFQRGETPVVQRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQP 205
Query: 76 Q-----QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
+ GG R+AT+++YL++ KGG T FP+ + V P +G+
Sbjct: 206 GTSTIVRRGGQRVATLVIYLNNPRKGGGTTFPDVPLE----------------VAPRQGN 249
Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
A+ F P ST +LHG VIEGEKW ATKW+ R F
Sbjct: 250 AVFFSYERPHPST--RTLHGGASVIEGEKWIATKWLREREF 288
>gi|423598444|ref|ZP_17574444.1| hypothetical protein III_01246 [Bacillus cereus VD078]
gi|423660914|ref|ZP_17636083.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
gi|401236714|gb|EJR43171.1| hypothetical protein III_01246 [Bacillus cereus VD078]
gi|401300955|gb|EJS06544.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
Length = 216
Score = 117 bits (293), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 94/155 (60%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++E+ + IE RI++ T +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--ENELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|423389445|ref|ZP_17366671.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
gi|401641536|gb|EJS59253.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
Length = 216
Score = 117 bits (292), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 94/155 (60%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++E+ + IE RI++ T +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--ENELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|423521903|ref|ZP_17498376.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
gi|401176565|gb|EJQ83760.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
Length = 216
Score = 117 bits (292), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 94/155 (60%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++E+ + IE RI++ T +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--ENELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|163941996|ref|YP_001646880.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
gi|229013455|ref|ZP_04170592.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
gi|423495146|ref|ZP_17471790.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
gi|423498060|ref|ZP_17474677.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
gi|163864193|gb|ABY45252.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
gi|228747867|gb|EEL97733.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
gi|401151239|gb|EJQ58691.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
gi|401161347|gb|EJQ68714.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
Length = 216
Score = 117 bits (292), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 94/155 (60%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++E+ + IE RI++ T +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--ENELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|251794605|ref|YP_003009336.1| procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
gi|247542231|gb|ACS99249.1| Procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
Length = 209
Score = 117 bits (292), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 60/162 (37%), Positives = 92/162 (56%), Gaps = 23/162 (14%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
SEVRTSS MF ++++E + +EAR+A +P + E +Q+L Y+ G++Y PHFD+F
Sbjct: 68 VSEVRTSSSMFFEESENECIGQVEARVAELMNIPVSHAEPLQVLRYQPGEQYHPHFDYFT 127
Query: 72 D--KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKG 129
MN +RI+T++MYL+ VE+GGET FP+ ++V P KG
Sbjct: 128 QGSSMN-----NRISTLVMYLNDVEEGGETYFPSLH----------------FSVTPKKG 166
Query: 130 DALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
A+ F + D + +LH PV GEKW AT+W+ + +
Sbjct: 167 SAVYFEYFYNDTRLNELTLHAGHPVEAGEKWVATQWMRRQRY 208
>gi|91091610|ref|XP_969386.1| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
putative [Tribolium castaneum]
gi|270001037|gb|EEZ97484.1| hypothetical protein TcasGA2_TC011321 [Tribolium castaneum]
Length = 536
Score = 116 bits (291), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 70/188 (37%), Positives = 104/188 (55%), Gaps = 25/188 (13%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G+ ++ R S +L + + + +A + R++ T L E +Q+++Y G
Sbjct: 362 VQNTDTGELEIAQYRISKSAWLKEEEHKHIADVSQRVSDMTGLTMSTAEELQVVNYGIGG 421
Query: 62 KYEPHFDFFR-DKMN--QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R D+ N + LG G+RIATVL Y+S VE+GG TVFP+ +VS W
Sbjct: 422 HYEPHFDFARRDERNAFKSLGTGNRIATVLFYMSDVEQGGATVFPSIQVSL-----W--- 473
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP- 174
P KG A +++LHP D + H +CPV+ G KW + KWIH R F +P
Sbjct: 474 --------PQKGSAAFWYNLHPSGDGDKMTRHAACPVLTGSKWVSNKWIHERGQEFRRPC 525
Query: 175 --EKEPED 180
E+ ED
Sbjct: 526 TLERPSED 533
>gi|333981907|ref|YP_004511117.1| procollagen-proline dioxygenase [Methylomonas methanica MC09]
gi|333805948|gb|AEF98617.1| Procollagen-proline dioxygenase [Methylomonas methanica MC09]
Length = 286
Score = 116 bits (291), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 57/171 (33%), Positives = 91/171 (53%), Gaps = 21/171 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
+ D ++GK R+S G + + + +++ ++ RI+ P ++GE +QILHY G
Sbjct: 126 IVDPQTGKFQVIADRSSEGTYFQRGESPLISRLDRRISELMNWPEDHGEGIQILHYGVGA 185
Query: 62 KYEPHFDFFRDK-----MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PHFD+F + + G R+AT++MYL+ V +GGETVFP+
Sbjct: 186 QYKPHFDYFLENESGGALQMTQSGQRVATLVMYLNEVTEGGETVFPDV------------ 233
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
G ++ P +G A F + D +LHG PV+ GEKW ATKW+
Sbjct: 234 ----GISITPKRGSAAYFAYCNSLGQVDPATLHGGAPVLTGEKWIATKWMR 280
>gi|430751569|ref|YP_007214477.1| 2OG-Fe(II) oxygenase [Thermobacillus composti KWC4]
gi|430735534|gb|AGA59479.1| 2OG-Fe(II) oxygenase superfamily enzyme [Thermobacillus composti
KWC4]
Length = 215
Score = 116 bits (290), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 60/156 (38%), Positives = 90/156 (57%), Gaps = 17/156 (10%)
Query: 11 IASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF 70
+ S++RTS GMF + + + IE RIA +P E+ E +Q+LHY GQ+Y+ H DFF
Sbjct: 64 VVSDIRTSRGMFFEEEESPFIHRIERRIAQLMNVPIEHAEGLQVLHYGPGQEYKAHHDFF 123
Query: 71 RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
+ +RI+T+++YL+ VE+GGETVFP G A+KP +G
Sbjct: 124 APG-SPAARNNRISTLIVYLNDVEEGGETVFP----------------LLGIAMKPKRGA 166
Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
AL F + + + + +LH S PV+ GEKW AT+W+
Sbjct: 167 ALYFEYFYRNQALNDLTLHSSVPVVRGEKWVATQWM 202
>gi|196041590|ref|ZP_03108882.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NVH0597-99]
gi|218905373|ref|YP_002453207.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
AH820]
gi|225866219|ref|YP_002751597.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB102]
gi|423550018|ref|ZP_17526345.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
gi|196027578|gb|EDX66193.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NVH0597-99]
gi|218537435|gb|ACK89833.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH820]
gi|225786013|gb|ACO26230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB102]
gi|401189634|gb|EJQ96684.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
Length = 216
Score = 116 bits (290), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ A IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|301055727|ref|YP_003793938.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus biovar
anthracis str. CI]
gi|300377896|gb|ADK06800.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus biovar
anthracis str. CI]
Length = 216
Score = 116 bits (290), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ A IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|30264308|ref|NP_846685.1| prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. Ames]
gi|47529753|ref|YP_021102.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. 'Ames
Ancestor']
gi|65321616|ref|ZP_00394575.1| hypothetical protein Bant_01005109 [Bacillus anthracis str. A2012]
gi|165873278|ref|ZP_02217887.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0488]
gi|167634610|ref|ZP_02392930.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0442]
gi|167638693|ref|ZP_02396969.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0193]
gi|170687507|ref|ZP_02878724.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0465]
gi|170709341|ref|ZP_02899757.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0389]
gi|177655890|ref|ZP_02937082.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0174]
gi|190566156|ref|ZP_03019075.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Tsiankovskii-I]
gi|196034803|ref|ZP_03102210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
W]
gi|227817011|ref|YP_002817020.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
anthracis str. CDC 684]
gi|228929280|ref|ZP_04092307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pondicheriensis BGSC 4BA1]
gi|228935557|ref|ZP_04098373.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
andalousiensis BGSC 4AW1]
gi|229123754|ref|ZP_04252949.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
gi|229604260|ref|YP_002868528.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0248]
gi|254683996|ref|ZP_05147856.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. CNEVA-9066]
gi|254721830|ref|ZP_05183619.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A1055]
gi|254736344|ref|ZP_05194050.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Western North America USA6153]
gi|254741382|ref|ZP_05199069.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Kruger B]
gi|254753983|ref|ZP_05206018.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Vollum]
gi|254757854|ref|ZP_05209881.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Australia 94]
gi|386738126|ref|YP_006211307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
gi|421506493|ref|ZP_15953416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
gi|421638315|ref|ZP_16078911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
gi|30258953|gb|AAP28171.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Ames]
gi|47504901|gb|AAT33577.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. 'Ames Ancestor']
gi|164710995|gb|EDR16563.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0488]
gi|167513541|gb|EDR88911.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0193]
gi|167530062|gb|EDR92797.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0442]
gi|170125767|gb|EDS94678.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0389]
gi|170668702|gb|EDT19448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0465]
gi|172079923|gb|EDT65028.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0174]
gi|190563075|gb|EDV17041.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Tsiankovskii-I]
gi|195992342|gb|EDX56303.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
W]
gi|227005734|gb|ACP15477.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. CDC 684]
gi|228659889|gb|EEL15534.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
gi|228824095|gb|EEM69911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
andalousiensis BGSC 4AW1]
gi|228830570|gb|EEM76180.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pondicheriensis BGSC 4BA1]
gi|229268668|gb|ACQ50305.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0248]
gi|384387978|gb|AFH85639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
gi|401823486|gb|EJT22633.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
gi|403394741|gb|EJY91981.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
Length = 216
Score = 116 bits (290), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ A IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|196046329|ref|ZP_03113555.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB108]
gi|376268135|ref|YP_005120847.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
F837/76]
gi|196022799|gb|EDX61480.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB108]
gi|364513935|gb|AEW57334.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
F837/76]
Length = 216
Score = 116 bits (290), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ A IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|52141260|ref|YP_085568.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
gi|51974729|gb|AAU16279.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
Length = 232
Score = 116 bits (290), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ A IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 91 VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 148
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 149 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 191
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 192 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 226
>gi|49187135|ref|YP_030387.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. Sterne]
gi|228947951|ref|ZP_04110238.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
monterrey BGSC 4AJ1]
gi|49181062|gb|AAT56438.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Sterne]
gi|228811938|gb|EEM58272.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
monterrey BGSC 4AJ1]
Length = 232
Score = 116 bits (290), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ A IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 91 VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 148
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 149 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 191
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 192 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 226
>gi|389795384|ref|ZP_10198508.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
gi|388430823|gb|EIL87950.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
Length = 293
Score = 116 bits (290), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 61/161 (37%), Positives = 86/161 (53%), Gaps = 21/161 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R+S G F D+ +A ++ RIA P ENGE +Q+LHY G +Y+PHFD+F
Sbjct: 141 RSSEGTFFPVNADDFIARLDRRIAELMNCPVENGEGLQVLHYGEGGEYQPHFDYFSPGDP 200
Query: 72 -DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
+ +GG R++T+L+YL+ V +GG TVFP G V P KG
Sbjct: 201 GSEAQMVVGGQRVSTLLIYLNDVAQGGATVFPT----------------LGLRVLPRKGM 244
Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
A+ F + D D +LHG PV +GEKW TKW+ R++
Sbjct: 245 AVYFEYSNRDGQVDPLTLHGGEPVEKGEKWIITKWMRQRSY 285
>gi|428183249|gb|EKX52107.1| hypothetical protein GUITHDRAFT_150687 [Guillardia theta CCMP2712]
Length = 315
Score = 115 bits (289), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 62/165 (37%), Positives = 88/165 (53%), Gaps = 20/165 (12%)
Query: 11 IASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF 70
+ S RT++ +L Q +V +E +A T PENGE +QILHY+ Q+++ H D+F
Sbjct: 128 VESSTRTNTAAWLEYHQGPVVTKLENLLAKVTNTEPENGENLQILHYQTSQQFKEHHDYF 187
Query: 71 RDKM----NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
N + GG+R+AT ++YL + E+GGET F + VKP
Sbjct: 188 DPATDPPENFEPGGNRLATAIIYLQNAEEGGETDFMKIDTK----------------VKP 231
Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G A+LF+ L PD S D ++H P GEKW ATKWIH R +
Sbjct: 232 EAGSAVLFYDLKPDGSVDKLTIHSGNPPKGGEKWVATKWIHERRY 276
>gi|121595595|ref|YP_987491.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
gi|120607675|gb|ABM43415.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
Length = 289
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 62/161 (38%), Positives = 87/161 (54%), Gaps = 23/161 (14%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMN 75
RTS GMF + + +V +E RIA P +NGE +Q+LHY G +Y+PH+D+F
Sbjct: 146 RTSDGMFFQRGETPVVQRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQP 205
Query: 76 Q-----QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
+ GG R+AT+++YL++ KGG T FP+ + V P +G+
Sbjct: 206 GTSTIVRRGGQRVATLVIYLNNPLKGGGTTFPDVPLE----------------VAPRQGN 249
Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
A+ F P ST +LHG VIEGEKW ATKW+ R F
Sbjct: 250 AVFFSYERPHPST--RTLHGGASVIEGEKWIATKWLREREF 288
>gi|217961727|ref|YP_002340297.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus cereus AH187]
gi|222097680|ref|YP_002531737.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
Q1]
gi|229198365|ref|ZP_04325071.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
gi|375286242|ref|YP_005106681.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus cereus NC7401]
gi|423354732|ref|ZP_17332357.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
gi|423566803|ref|ZP_17543050.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
gi|423574080|ref|ZP_17550199.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
gi|217067199|gb|ACJ81449.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH187]
gi|221241738|gb|ACM14448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
Q1]
gi|228585065|gb|EEK43177.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
gi|358354769|dbj|BAL19941.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NC7401]
gi|401086280|gb|EJP94507.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
gi|401212649|gb|EJR19392.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
gi|401215318|gb|EJR22035.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
Length = 216
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ A IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|307725787|ref|YP_003909000.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
gi|307586312|gb|ADN59709.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
Length = 313
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 58/175 (33%), Positives = 92/175 (52%), Gaps = 21/175 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
+ D +G+ RTS G++ + +D ++ ++ RIA+ P ENGE +QILHY
Sbjct: 154 IVDPATGREDVIRNRTSEGIWYQRGEDALIERLDQRIASLMNWPLENGEGLQILHYGPSG 213
Query: 62 KYEPHFDFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y PHFD+F ++ GG R+AT+++YL+ V GGET+FP +
Sbjct: 214 EYRPHFDYFPPDQPGSAVHTARGGQRVATLVVYLNDVPDGGETIFPEA------------ 261
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G +V +G A+ F ++ D +LHG PV+ G+KW TKW+ R +
Sbjct: 262 ----GLSVAAQQGGAVYFRYMNGRRQLDPLTLHGGAPVLSGDKWIMTKWVRERPY 312
>gi|343172438|gb|AEL98923.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
gi|343172440|gb|AEL98924.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
Length = 120
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 57/116 (49%), Positives = 74/116 (63%), Gaps = 2/116 (1%)
Query: 51 AMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR 110
A +L YE GQKY H+D F RIA+ L+YLS VE+GGET+FP +
Sbjct: 1 AYNVLRYEVGQKYNSHYDAFHPAEYGPQKSQRIASFLLYLSDVEEGGETMFPYENDNIDS 60
Query: 111 DGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ ++ +C G VKP +GD LLF+SL + + D TS+HGSCPVI+GEKW ATKWI
Sbjct: 61 NYDYVQCI--GLKVKPRQGDGLLFYSLFSNGTIDPTSIHGSCPVIKGEKWVATKWI 114
>gi|423400914|ref|ZP_17378087.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
gi|401653904|gb|EJS71447.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
Length = 216
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ A IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|423478381|ref|ZP_17455096.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
gi|402428543|gb|EJV60640.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
Length = 216
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ A IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|254254263|ref|ZP_04947580.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
gi|124898908|gb|EAY70751.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
Length = 285
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 61/174 (35%), Positives = 89/174 (51%), Gaps = 21/174 (12%)
Query: 3 ADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQK 62
+ E+G RTS G + +D ++ IE R+AA P ENGE +Q+L Y G +
Sbjct: 127 VNAETGTQEVIRHRTSHGTWFQNGEDALIRRIETRLAALMNCPVENGEGLQVLRYTPGGE 186
Query: 63 YEPHFDFFRDKMNQQL-----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
Y H+D+F+ L GG R+AT+++YL+ V GGETVFP +
Sbjct: 187 YRSHYDYFQPTAAGSLTHVRTGGQRVATLIVYLNDVPSGGETVFPEA------------- 233
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G +V P +GDA+ F ++ D +LH PV +GEKW TKW+ R +
Sbjct: 234 ---GISVVPRRGDAVYFRYMNRLRQLDPATLHAGAPVRDGEKWIMTKWVRERPY 284
>gi|239792190|dbj|BAH72464.1| ACYPI007079 [Acyrthosiphon pisum]
Length = 249
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 62/170 (36%), Positives = 101/170 (59%), Gaps = 20/170 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G+ ++ R S +L + +D +VA++ R+ T L E E +Q+++Y G
Sbjct: 74 VQNYKTGELEFADYRISKSAWLKEHEDVVVANVAKRVEVMTGLTTETAEELQVVNYGVGG 133
Query: 62 KYEPHFDFFR-DKMN--QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
Y+PH+DF R +++N + LG G+RIATVL Y+S V +GG TVFP W
Sbjct: 134 HYDPHYDFARTEEINAFKSLGTGNRIATVLFYMSDVAQGGATVFP-----------WL-- 180
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
G A++P+KG A ++F+L+P + D + H +CPV++G KW KW+H
Sbjct: 181 ---GVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPVLQGSKWVCNKWLH 227
>gi|229140971|ref|ZP_04269515.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
gi|228642547|gb|EEK98834.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
Length = 232
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ A IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 91 VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA 148
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 149 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 191
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 192 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 226
>gi|384182063|ref|YP_005567825.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
finitimus YBT-020]
gi|324328147|gb|ADY23407.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
finitimus YBT-020]
Length = 216
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ A IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDRSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|205374182|ref|ZP_03226981.1| prolyl 4-hydroxylase alpha subunit [Bacillus coahuilensis m4-4]
Length = 210
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 58/154 (37%), Positives = 93/154 (60%), Gaps = 18/154 (11%)
Query: 13 SEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+++RTS+ +FL + E+V +E RI+ +P E+GE +Q+L+Y+ GQ+Y+ HFDFF
Sbjct: 69 NDIRTSTSVFLPEDASEVVQRVEKRISQIMNIPVEHGEGLQLLNYQIGQEYKAHFDFFSP 128
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
K + + RI+T+++YL+ VE+GG+T FPN ++S V P KG A+
Sbjct: 129 K--KLIENPRISTLVLYLNDVEEGGDTYFPNLKLS----------------VSPHKGMAV 170
Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
F + D + +LHG PV G+KW+AT W+
Sbjct: 171 YFEYFYDDPMLNELTLHGGAPVTIGDKWAATMWM 204
>gi|319786559|ref|YP_004146034.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
gi|317465071|gb|ADV26803.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
Length = 289
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 65/168 (38%), Positives = 91/168 (54%), Gaps = 23/168 (13%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
D ++G S RTS G F + + A+IEARIA P ENGE +Q+LHY G ++
Sbjct: 134 DAQTGGSQVHADRTSRGTFFERGAHPVCATIEARIARLLEWPVENGEGLQVLHYPPGAEF 193
Query: 64 EPHFDFFR-DKMNQQL----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
PH+D+F D+ ++ GG R+ATV+MYL+ +GG T FP++ +
Sbjct: 194 RPHYDYFDPDEPGAEVLLRQGGQRVATVVMYLNTPARGGATTFPDAHLE----------- 242
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
V +KG+A+ F P T +LHG PV EGEKW ATKW+
Sbjct: 243 -----VAAVKGNAVFFSYDRPHPMT--RTLHGGAPVTEGEKWIATKWL 283
>gi|89096248|ref|ZP_01169141.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
NRRL B-14911]
gi|89089102|gb|EAR68210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
NRRL B-14911]
Length = 217
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 17/155 (10%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
++RTSS MF + ++E+VA IE R++ +P E+GE +Q+L+Y GQ+Y+ HFDFF
Sbjct: 75 VDDIRTSSSMFFEEGENELVARIERRLSQIMNIPVEHGEGLQMLNYHIGQEYKAHFDFFS 134
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
RI+T++MYL+ VE+GGET FP + ++V P KG A
Sbjct: 135 SSSRAASNP-RISTLVMYLNDVEEGGETYFP----------------KLNFSVNPQKGSA 177
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + + + +LHG PVI+G KW+AT+W+
Sbjct: 178 VYFEYFYDNQDLNDLTLHGGAPVIKGSKWAATQWM 212
>gi|113682363|ref|NP_001038463.1| prolyl 4-hydroxylase, alpha polypeptide I a precursor [Danio rerio]
Length = 522
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/179 (36%), Positives = 93/179 (51%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++GK ++ R S +L+ + +V I RI T L + E +Q+ +Y G
Sbjct: 353 VHDPQTGKLTTAQYRVSKSAWLAAYEHPVVDRINQRIEDITGLNVKTAEELQVANYGVGG 412
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 413 QYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVAAGGATVFPEV------------- 459
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G AVKP+KG A+ +++L P D ++ H +CPV+ G KW + KWIH R F +P
Sbjct: 460 ---GAAVKPLKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWIHERGQEFRRP 515
>gi|229075940|ref|ZP_04208916.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
gi|229117732|ref|ZP_04247101.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
gi|407706764|ref|YP_006830349.1| alpha/beta fold family hydrolase [Bacillus thuringiensis MC28]
gi|423377905|ref|ZP_17355189.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
gi|423464099|ref|ZP_17440867.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
gi|423547540|ref|ZP_17523898.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
gi|423622677|ref|ZP_17598455.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
gi|228665709|gb|EEL21182.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
gi|228707255|gb|EEL59452.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
gi|401179261|gb|EJQ86434.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
gi|401260797|gb|EJR66965.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
gi|401636171|gb|EJS53925.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
gi|402420366|gb|EJV52637.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
gi|407384449|gb|AFU14950.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis MC28]
Length = 216
Score = 115 bits (288), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--ENELTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|423541303|ref|ZP_17517694.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
gi|401172491|gb|EJQ79712.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
Length = 216
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--ENELTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|229152436|ref|ZP_04280628.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
gi|228631044|gb|EEK87681.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
Length = 248
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 107 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 164
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 165 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSINELTLHGGAPVTKGEKWIATQWV 242
>gi|75760922|ref|ZP_00740932.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|423385740|ref|ZP_17362996.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
gi|423561293|ref|ZP_17537569.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
gi|74491592|gb|EAO54798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|401201550|gb|EJR08415.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
gi|401635796|gb|EJS53551.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
Length = 248
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 107 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 164
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 165 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 242
>gi|42783360|ref|NP_980607.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10987]
gi|42739288|gb|AAS43215.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
ATCC 10987]
Length = 216
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ A IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWM 210
>gi|423527903|ref|ZP_17504348.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
gi|402451566|gb|EJV83385.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
Length = 248
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 107 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 164
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 165 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 242
>gi|423437685|ref|ZP_17414666.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
gi|423503075|ref|ZP_17479667.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
gi|401120840|gb|EJQ28636.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
gi|402459296|gb|EJV91033.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
Length = 248
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 107 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 164
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 165 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 242
>gi|423358724|ref|ZP_17336227.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
gi|401084596|gb|EJP92842.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
Length = 248
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 107 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 164
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 165 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 242
>gi|423483822|ref|ZP_17460512.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
gi|401141373|gb|EJQ48928.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
Length = 216
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--ENELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|229104864|ref|ZP_04235524.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
gi|228678581|gb|EEL32798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
Length = 216
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--ENELTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|423558182|ref|ZP_17534484.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
gi|401191450|gb|EJQ98472.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
Length = 216
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--ENELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWVATQWV 210
>gi|423615424|ref|ZP_17591258.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
gi|401259961|gb|EJR66134.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
Length = 216
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--ENELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|228960501|ref|ZP_04122151.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pakistani str. T13001]
gi|229047930|ref|ZP_04193506.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
gi|423630961|ref|ZP_17606708.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
gi|423650103|ref|ZP_17625673.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
gi|228723387|gb|EEL74756.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
gi|228799198|gb|EEM46165.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pakistani str. T13001]
gi|401264328|gb|EJR70440.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
gi|401282521|gb|EJR88420.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
Length = 248
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 107 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 164
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 165 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 242
>gi|229163182|ref|ZP_04291137.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
gi|228620245|gb|EEK77116.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
Length = 229
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTS G FL +E+ A IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 88 VNDIRTSKGAFLD--DNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 145
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 146 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VNPRKGMA 188
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 189 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 223
>gi|193688213|ref|XP_001943683.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Acyrthosiphon pisum]
Length = 552
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 62/170 (36%), Positives = 101/170 (59%), Gaps = 20/170 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G+ ++ R S +L + +D +VA++ R+ T L E E +Q+++Y G
Sbjct: 377 VQNYKTGELEFADYRISKSAWLKEHEDVVVANVAKRVEVMTGLTTETAEELQVVNYGVGG 436
Query: 62 KYEPHFDFFR-DKMN--QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
Y+PH+DF R +++N + LG G+RIATVL Y+S V +GG TVFP W
Sbjct: 437 HYDPHYDFARTEEINAFKSLGTGNRIATVLFYMSDVAQGGATVFP-----------WL-- 483
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
G A++P+KG A ++F+L+P + D + H +CPV++G KW KW+H
Sbjct: 484 ---GVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPVLQGSKWVCNKWLH 530
>gi|402555628|ref|YP_006596899.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus FRI-35]
gi|401796838|gb|AFQ10697.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus FRI-35]
Length = 216
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 60/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ A IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWM 210
>gi|228902749|ref|ZP_04066896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
4222]
gi|228967277|ref|ZP_04128313.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
sotto str. T04001]
gi|402564350|ref|YP_006607074.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus thuringiensis HD-771]
gi|434377355|ref|YP_006611999.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-789]
gi|228792646|gb|EEM40212.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
sotto str. T04001]
gi|228856936|gb|EEN01449.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
4222]
gi|401793002|gb|AFQ19041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-771]
gi|401875912|gb|AFQ28079.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-789]
Length = 216
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|423657194|ref|ZP_17632493.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
gi|401289937|gb|EJR95641.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
Length = 248
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 107 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 164
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 165 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 242
>gi|328696638|ref|XP_003240086.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Acyrthosiphon pisum]
Length = 534
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 62/170 (36%), Positives = 101/170 (59%), Gaps = 20/170 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G+ ++ R S +L + +D +VA++ R+ T L E E +Q+++Y G
Sbjct: 359 VQNYKTGELEFADYRISKSAWLKEHEDVVVANVAKRVEVMTGLTTETAEELQVVNYGVGG 418
Query: 62 KYEPHFDFFR-DKMN--QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
Y+PH+DF R +++N + LG G+RIATVL Y+S V +GG TVFP W
Sbjct: 419 HYDPHYDFARTEEINAFKSLGTGNRIATVLFYMSDVAQGGATVFP-----------W--- 464
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
G A++P+KG A ++F+L+P + D + H +CPV++G KW KW+H
Sbjct: 465 --LGVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPVLQGSKWVCNKWLH 512
>gi|423452458|ref|ZP_17429311.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
gi|401140096|gb|EJQ47653.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
Length = 216
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--ENELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWVATQWV 210
>gi|228954520|ref|ZP_04116545.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. T03a001]
gi|449091198|ref|YP_007423639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. HD73]
gi|228805177|gb|EEM51771.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. T03a001]
gi|449024955|gb|AGE80118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. HD73]
Length = 216
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|228941395|ref|ZP_04103947.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
berliner ATCC 10792]
gi|228974327|ref|ZP_04134896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
thuringiensis str. T01001]
gi|228980919|ref|ZP_04141223.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|384188306|ref|YP_005574202.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
chinensis CT-43]
gi|410676625|ref|YP_006928996.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|452200698|ref|YP_007480779.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
thuringiensis serovar thuringiensis str. IS5056]
gi|228778855|gb|EEM27118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|228785377|gb|EEM33387.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
thuringiensis str. T01001]
gi|228818321|gb|EEM64394.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
berliner ATCC 10792]
gi|326942015|gb|AEA17911.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
chinensis CT-43]
gi|409175754|gb|AFV20059.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|452106091|gb|AGG03031.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
thuringiensis serovar thuringiensis str. IS5056]
Length = 216
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|218231188|ref|YP_002369041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
B4264]
gi|218159145|gb|ACK59137.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
B4264]
Length = 216
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSINELTLHGGAPVTKGEKWIATQWV 210
>gi|423512354|ref|ZP_17488885.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
gi|402449325|gb|EJV81162.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
Length = 216
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++E+ + IE RI++ T +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--ENELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWV 210
>gi|423368291|ref|ZP_17345723.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
gi|401081042|gb|EJP89322.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
Length = 216
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++E+ + IE RI++ T +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--ENELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWV 210
>gi|386712780|ref|YP_006179102.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
2266]
gi|384072335|emb|CCG43825.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
2266]
Length = 211
Score = 115 bits (287), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/160 (36%), Positives = 92/160 (57%), Gaps = 19/160 (11%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
S++RTSS FL +D++ IE R+A +P E+GE + IL+Y+ GQ+Y+ H+D+FR
Sbjct: 70 VSDIRTSSSTFL--PEDDLTNRIEKRVAQIMNVPVEHGEGLHILNYKQGQEYKAHYDYFR 127
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
K + RI+T+++YL+ VE+GGET FP+ +S + P KG A
Sbjct: 128 SKA-KAANNPRISTLVLYLNDVEEGGETYFPHMNLS----------------ISPHKGMA 170
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ F + D + +LHG PV GEKW+AT W+ + +
Sbjct: 171 VYFEYFYSDPLINERTLHGGSPVTSGEKWAATMWVRRKQY 210
>gi|423518940|ref|ZP_17495421.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
gi|401159995|gb|EJQ67374.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
Length = 216
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++E+ + IE RI++ T +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--ENELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPQLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWV 210
>gi|347964867|ref|XP_309164.4| AGAP000971-PA [Anopheles gambiae str. PEST]
gi|333466515|gb|EAA04901.5| AGAP000971-PA [Anopheles gambiae str. PEST]
Length = 553
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/179 (36%), Positives = 96/179 (53%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G+ + R S +L A+DE++ +I R+ T L E E +Q+++Y G
Sbjct: 379 VQNYKTGELEFANYRISKSAWLKDAEDEMIRTISQRVEDMTGLTMETAEELQVVNYGIGG 438
Query: 62 KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R + + LG G+RIATVL Y+S V +GG TVFP+ +
Sbjct: 439 HYEPHFDFARREERNAFKSLGTGNRIATVLFYMSDVTQGGATVFPSLNL----------- 487
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
A+ P KG A +F+LH D + H +CPV+ G KW + KWIH R F +P
Sbjct: 488 -----ALWPRKGTAAFWFNLHASGRGDYATRHAACPVLTGTKWVSNKWIHERGQEFRRP 541
>gi|375106426|ref|ZP_09752687.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
JOSHI_001]
gi|374667157|gb|EHR71942.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
JOSHI_001]
Length = 295
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 67/172 (38%), Positives = 90/172 (52%), Gaps = 23/172 (13%)
Query: 5 NESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYE 64
N SG S + RTS GMF + + + +IE RIAA P ENGE +Q+L Y G +Y+
Sbjct: 141 NGSGGSEVNAARTSDGMFFDRGEFPLCRTIEQRIAALVNWPVENGEGLQVLRYRPGSEYK 200
Query: 65 PHFDFFRDKMNQ-----QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
H D+F + GG R+ TV+MYL+H +GG T FP+
Sbjct: 201 AHHDYFDPAQPGTPTILKRGGQRVGTVVMYLNHPIRGGGTAFPDV--------------- 245
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G V P KG+A +FFS + A + +LH PV+EGEKW ATKW+ F
Sbjct: 246 -GLEVAPFKGNA-VFFS-YDRAHPMTRTLHAGTPVLEGEKWVATKWVREGEF 294
>gi|229135058|ref|ZP_04263863.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
gi|228648443|gb|EEL04473.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
Length = 216
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++E+ + IE RI++ T +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--ENELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWV 210
>gi|418523362|ref|ZP_13089380.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410699993|gb|EKQ58573.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
Length = 286
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 65/176 (36%), Positives = 91/176 (51%), Gaps = 23/176 (13%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
DN +G+ + RTS GM L QD + IEARIA P ++GE +Q+L Y G +Y
Sbjct: 128 DNANGEHLVHAARTSDGMCLRVGQDALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEY 187
Query: 64 EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
PH+D+F + Q GG R+A+++MYL+ E+GG T FP++ +
Sbjct: 188 RPHYDYFDPDAVGTPILLQAGGQRVASLVMYLNTPERGGATRFPDAHLD----------- 236
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
V +KG+A+ F P T SLH PV+ GEKW ATKW+ R P
Sbjct: 237 -----VAAVKGNAVFFSYDRPHPMT--RSLHAGAPVLAGEKWVATKWLRERAVRMP 285
>gi|423669823|ref|ZP_17644852.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
gi|423673973|ref|ZP_17648912.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
gi|401298950|gb|EJS04550.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
gi|401309524|gb|EJS14857.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
Length = 216
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++E+ + IE RI++ T +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--ENELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWV 210
>gi|423395462|ref|ZP_17372663.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
gi|401654873|gb|EJS72412.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
Length = 216
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTS G FL +E+ A IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSKGAFLD--DNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|281307110|pdb|3ITQ|A Chain A, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
Anthracis
gi|281307111|pdb|3ITQ|B Chain B, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
Anthracis
Length = 216
Score = 114 bits (286), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ A IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLD--DNELTAKIEKRISSIXNVPASHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++ YL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVXYLNDVEEGGETFFPKLNLS----------------VHPRKGXA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|229186477|ref|ZP_04313640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
gi|228596991|gb|EEK54648.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
Length = 216
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ A IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T+++YL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVIYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|327267604|ref|XP_003218589.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Anolis
carolinensis]
Length = 542
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 69/179 (38%), Positives = 93/179 (51%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++GK + R S +LS ++ IVA I RI T L E +Q+ +Y G
Sbjct: 373 VHDPQTGKLTTAHYRVSKSAWLSGYENPIVARINTRIQDLTGLDVSTAEELQVANYGVGG 432
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP EV
Sbjct: 433 QYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFP--EV----------- 479
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
G +V P KG A+ +++L P D ++ H +CPV+ G KW + KWIH R F +P
Sbjct: 480 ---GASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWIHERGQEFRRP 535
>gi|228916870|ref|ZP_04080433.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pulsiensis BGSC 4CC1]
gi|228842793|gb|EEM87878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pulsiensis BGSC 4CC1]
Length = 232
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTS G FL +E+ A IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 91 VNDIRTSKGAFLD--DNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 148
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 149 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 191
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 192 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 226
>gi|365158975|ref|ZP_09355162.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
gi|363625964|gb|EHL76973.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
Length = 248
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 107 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA 164
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 165 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 242
>gi|423406337|ref|ZP_17383486.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
gi|401660331|gb|EJS77813.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
Length = 216
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTS G FL +E+ A IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSKGAFLD--DNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|423426372|ref|ZP_17403403.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
gi|401111119|gb|EJQ19018.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
Length = 248
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 107 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA 164
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 165 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 242
>gi|229061929|ref|ZP_04199257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
gi|228717372|gb|EEL69042.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
Length = 216
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++E+ + IE RI++ T +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--ENELTSKIEKRISSITNVPVVHGEGLHILNYEVDQEYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWV 210
>gi|423470454|ref|ZP_17447198.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
gi|402436583|gb|EJV68613.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
Length = 216
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 93/155 (60%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--ENELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWVATQWM 210
>gi|423457579|ref|ZP_17434376.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
gi|401147963|gb|EJQ55456.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
Length = 216
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--DNELTVKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|402813396|ref|ZP_10862991.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
gi|402509339|gb|EJW19859.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
Length = 215
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 62/160 (38%), Positives = 93/160 (58%), Gaps = 19/160 (11%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG+F ++E VA IE RI+ +P E+G+ +Q+L Y GQ+Y+PHFDFF
Sbjct: 72 VNQIRTSSGVFCE--ENETVAKIEKRISQIMNIPIEHGDGLQVLLYAPGQEYKPHFDFFA 129
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 130 DT-SRASANNRISTLVMYLNDVEEGGETTFPMLNLS----------------VFPSKGMA 172
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ F + + + +LH PV +GEKW AT W+ + F
Sbjct: 173 VYFEYFYSNHELNERTLHAGAPVRKGEKWVATMWMRRQTF 212
>gi|229192445|ref|ZP_04319408.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
gi|228591022|gb|EEK48878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
Length = 216
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|206971296|ref|ZP_03232247.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH1134]
gi|229081494|ref|ZP_04213993.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
gi|423411965|ref|ZP_17389085.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
gi|423432249|ref|ZP_17409253.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
gi|206734068|gb|EDZ51239.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH1134]
gi|228701801|gb|EEL54288.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
gi|401104033|gb|EJQ12010.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
gi|401117005|gb|EJQ24843.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
Length = 216
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|118479416|ref|YP_896567.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis str. Al
Hakam]
gi|118418641|gb|ABK87060.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis str. Al
Hakam]
Length = 232
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ A IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 91 VNDIRTSSGAFLD--DNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 148
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T+++YL+ VE+GGET FP +S V P KG A
Sbjct: 149 EH-SRSAANNRISTLVIYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 191
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 192 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 226
>gi|228910069|ref|ZP_04073889.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
gi|228849586|gb|EEM94420.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
Length = 248
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 107 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 164
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ + +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 165 EHSRSAVN-NRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 242
>gi|423582447|ref|ZP_17558558.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
gi|401213326|gb|EJR20067.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
Length = 248
Score = 114 bits (285), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL + E+ IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 107 VNDIRTSSGAFLEDS--ELTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 164
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 165 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 242
>gi|229111709|ref|ZP_04241257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
gi|296504733|ref|YP_003666433.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis BMB171]
gi|423585282|ref|ZP_17561369.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
gi|423640681|ref|ZP_17616299.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
gi|228671703|gb|EEL26999.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
gi|296325785|gb|ADH08713.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis BMB171]
gi|401233925|gb|EJR40411.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
gi|401279742|gb|EJR85664.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
Length = 248
Score = 114 bits (285), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 107 VNDIRTSSGAFLE--DNEFTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 164
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 165 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 242
>gi|229180513|ref|ZP_04307855.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
gi|228602937|gb|EEK60416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
Length = 232
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 91 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA 148
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 149 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 191
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 192 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 226
>gi|218899396|ref|YP_002447807.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
G9842]
gi|218542449|gb|ACK94843.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
G9842]
Length = 216
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ + +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EHSRSAVN-NRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|325267002|ref|ZP_08133672.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
gi|324981502|gb|EGC17144.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
Length = 279
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 60/175 (34%), Positives = 91/175 (52%), Gaps = 21/175 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D +G+ + + RTS ++A+ ++A +EARIAA P ENGE MQ+L Y G
Sbjct: 121 VVDPATGEFVKHQDRTSMNAAFARAEHPLIARLEARIAAAIHWPAENGEGMQVLRYRSGG 180
Query: 62 KYEPHFDFFRD-----KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+ HFD+F + N Q GG R+ T L+YL V+ GG T FP
Sbjct: 181 EYKAHFDYFDTQSEGGRKNMQTGGQRVGTFLVYLCDVDAGGATRFP-------------- 226
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ ++P KG AL F + P+ + +LH PV+ G K+ A+KW+ + +
Sbjct: 227 --ALNFEIRPKKGMALFFANTLPNGEGNPLTLHAGVPVVSGVKYLASKWLREKPY 279
>gi|325915062|ref|ZP_08177391.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
ATCC 35937]
gi|325538760|gb|EGD10427.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
ATCC 35937]
Length = 286
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 64/177 (36%), Positives = 90/177 (50%), Gaps = 23/177 (12%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
DN +G + RTS M L QD + IEARIA P ENGE +Q+L Y G +Y
Sbjct: 128 DNANGAHVVHAARTSDSMCLQLGQDALCQRIEARIARLLDWPVENGEGLQVLRYGTGAEY 187
Query: 64 EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
+PH+D+F + Q GG R+A+++MYL+ ++GG T FP+ +
Sbjct: 188 QPHYDYFDPDAAGTPVLLQAGGQRVASLVMYLNTPDRGGATRFPDVHLD----------- 236
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPE 175
+ +KG+A+ F P T SLH PV+ GEKW ATKW+ R P+
Sbjct: 237 -----IAAIKGNAVFFSYDRPHPMT--RSLHAGAPVLAGEKWVATKWLRERAARMPD 286
>gi|49480949|ref|YP_038297.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis serovar
konkukian str. 97-27]
gi|49332505|gb|AAT63151.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis serovar
konkukian str. 97-27]
Length = 232
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 91 VNDIRTSSGAFLD--DNELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 148
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 149 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 191
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 192 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 226
>gi|423634936|ref|ZP_17610589.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
gi|401278922|gb|EJR84852.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
Length = 248
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL + E+ IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 107 VNDIRTSSGAFLEDS--ELTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 164
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 165 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 207
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 208 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 242
>gi|325922187|ref|ZP_08183974.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
19865]
gi|325547306|gb|EGD18373.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
19865]
Length = 285
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 65/176 (36%), Positives = 90/176 (51%), Gaps = 23/176 (13%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
DN++G I RTS M L QD + IEARIA P ++GE +Q+L Y G +Y
Sbjct: 128 DNDNGAQIVHAARTSDSMCLQLGQDALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEY 187
Query: 64 EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
+PH+D+F + Q GG R+A+++MYL+ E+GG T FP+ +
Sbjct: 188 QPHYDYFDPTAAGTPVLLQAGGQRLASLVMYLNTPERGGATRFPDVHLD----------- 236
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
V +KG+A+ F P T SLH PV+ GEKW ATKW+ R P
Sbjct: 237 -----VAAVKGNAVFFSYDRPHPMT--RSLHAGAPVLAGEKWVATKWLRERAVRLP 285
>gi|228922987|ref|ZP_04086280.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
huazhongensis BGSC 4BD1]
gi|228836620|gb|EEM81968.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
huazhongensis BGSC 4BD1]
Length = 216
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 91/155 (58%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL + E+ IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLEDS--ELTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|77761111|ref|YP_241833.2| hypothetical protein XC_0735 [Xanthomonas campestris pv. campestris
str. 8004]
Length = 288
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 65/176 (36%), Positives = 87/176 (49%), Gaps = 23/176 (13%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
DN G I RTS M L QD + IEARIA P E+GE +Q+L Y G +Y
Sbjct: 130 DNRDGSEIVHAARTSHSMALQPGQDALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQY 189
Query: 64 EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
PH+D+F + Q GG R+A+++MYL+ E+GG T FP+ +
Sbjct: 190 APHYDYFEPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRFPDVHLD----------- 238
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
V +KG+A+ F P T +LH PV+ GEKW ATKW+ R P
Sbjct: 239 -----VAAVKGNAVFFSYDRPHPMT--RTLHAGAPVLAGEKWVATKWLRERPLHAP 287
>gi|66572403|gb|AAY47813.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. 8004]
Length = 308
Score = 114 bits (284), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 65/176 (36%), Positives = 87/176 (49%), Gaps = 23/176 (13%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
DN G I RTS M L QD + IEARIA P E+GE +Q+L Y G +Y
Sbjct: 150 DNRDGSEIVHAARTSHSMALQPGQDALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQY 209
Query: 64 EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
PH+D+F + Q GG R+A+++MYL+ E+GG T FP+ +
Sbjct: 210 APHYDYFEPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRFPDVHLD----------- 258
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
V +KG+A+ F P T +LH PV+ GEKW ATKW+ R P
Sbjct: 259 -----VAAVKGNAVFFSYDRPHPMT--RTLHAGAPVLAGEKWVATKWLRERPLHAP 307
>gi|384429387|ref|YP_005638747.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
campestris pv. raphani 756C]
gi|341938490|gb|AEL08629.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
campestris pv. raphani 756C]
Length = 286
Score = 113 bits (283), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 65/176 (36%), Positives = 87/176 (49%), Gaps = 23/176 (13%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
DN G I RTS M L QD + IEARIA P E+GE +Q+L Y G +Y
Sbjct: 128 DNRDGSEIVHAARTSHSMALQPGQDALCQRIEARIARLLEWPVEHGEGLQVLRYATGAQY 187
Query: 64 EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
PH+D+F + Q GG R+A+++MYL+ E+GG T FP+ +
Sbjct: 188 APHYDYFEPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRFPDVHLD----------- 236
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
V +KG+A+ F P T +LH PV+ GEKW ATKW+ R P
Sbjct: 237 -----VAAVKGNAVFFSYDRPHPMT--RTLHAGAPVLAGEKWVATKWLRERPLHAP 285
>gi|156333122|ref|XP_001619372.1| hypothetical protein NEMVEDRAFT_v1g151555 [Nematostella vectensis]
gi|156202442|gb|EDO27272.1| predicted protein [Nematostella vectensis]
Length = 144
Score = 113 bits (283), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 63/155 (40%), Positives = 83/155 (53%), Gaps = 18/155 (11%)
Query: 22 FLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGH 81
+L +DE+V I R+ A++ L E +Q+++Y G YEPH+DF RDK G+
Sbjct: 5 WLRDEEDELVKRISYRVQAYSGLNMTTSEDLQVVNYGIGGHYEPHYDFARDKFTSLGTGN 64
Query: 82 RIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDA 141
RIAT L YLS VE GG TVF R G V P KGDA +++L
Sbjct: 65 RIATFLSYLSDVEAGGGTVF----------------TRVGATVWPQKGDAAFWYNLKRSG 108
Query: 142 STDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
DS++ H +CPV+ G KW A KWIH + F KP
Sbjct: 109 DGDSSTRHAACPVLVGSKWVANKWIHEVGQEFLKP 143
>gi|428175714|gb|EKX44602.1| hypothetical protein GUITHDRAFT_71994 [Guillardia theta CCMP2712]
Length = 244
Score = 113 bits (283), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 68/178 (38%), Positives = 90/178 (50%), Gaps = 14/178 (7%)
Query: 3 ADNESGK-SIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
D +G+ + EVRTS +L + IVA I R+ +P E MQ+L Y Q
Sbjct: 61 GDQSNGEEKVKDEVRTSETAWLMDKKVPIVAKIRQRVEELIRIPMSYAEDMQVLKYTFKQ 120
Query: 62 KYEPHFDFFRDKM---NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS---QSRDGNWS 115
Y H+DFF KM G +R+ TV YL+ VEKGGET+FP S + +W
Sbjct: 121 HYHVHYDFFDPKMYPGRWSSGHNRLVTVFFYLTSVEKGGETIFPFGNTSAEEHHKIQSWG 180
Query: 116 EC---ARRGYAVKPMKGDALLFFSLHPDAST----DSTSLHGSCPVIEGEKWSATKWI 166
C VKP++G A++F+ + P T D TSLHG C I GEKW+A WI
Sbjct: 181 PCENAVESSIKVKPVRGSAVIFYLMKPHGHTHGELDHTSLHGGCDPIVGEKWAANYWI 238
>gi|229071739|ref|ZP_04204954.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
gi|228711334|gb|EEL63294.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
Length = 232
Score = 113 bits (283), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 91 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA 148
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 149 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 191
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 192 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWM 226
>gi|30022316|ref|NP_833947.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
gi|229129515|ref|ZP_04258486.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
gi|29897873|gb|AAP11148.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
gi|228654120|gb|EEL09987.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
Length = 232
Score = 113 bits (282), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 58/155 (37%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +++ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 91 VNDIRTSSGAFLE--DNKLTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 148
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 149 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 191
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 192 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 226
>gi|395501518|ref|XP_003755140.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Sarcophilus
harrisii]
Length = 385
Score = 113 bits (282), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 66/179 (36%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS +D +V+ I RI T L E +Q+ +Y G
Sbjct: 216 VHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 275
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 276 QYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 322
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KWIH R F +P
Sbjct: 323 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWIHERGQEFRRP 378
>gi|229174912|ref|ZP_04302432.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
gi|228608580|gb|EEK65882.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
Length = 216
Score = 113 bits (282), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 58/155 (37%), Positives = 90/155 (58%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTS G FL +E+ IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSKGAFLD--DNELTVKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|229031885|ref|ZP_04187873.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
gi|228729503|gb|EEL80492.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
Length = 216
Score = 112 bits (281), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 58/155 (37%), Positives = 90/155 (58%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTS G FL +E+ IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSKGAFLD--DNELTTKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|319795182|ref|YP_004156822.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
gi|315597645|gb|ADU38711.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
Length = 296
Score = 112 bits (281), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 59/175 (33%), Positives = 93/175 (53%), Gaps = 21/175 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
+ D SG+ + S+ R S GMF ++++VA ++ R++A LP ENGE + +L+Y G
Sbjct: 130 LVDPMSGRDVVSDKRASWGMFFRLCENDLVARLDRRLSALMNLPLENGEGLHLLYYPTGA 189
Query: 62 KYEPHFDFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
EPH D+ ++ + G R++T++ YL+ +GG+TVFP
Sbjct: 190 GSEPHHDYLAPTNAANRESIARSGQRVSTLVTYLNDAPEGGQTVFP-------------- 235
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ G AV P++G+A F + D+ SLH S PV G+KW TKW+ R F
Sbjct: 236 --QLGLAVSPIRGNACYFEYCDGNGRVDARSLHASAPVTRGDKWVMTKWMRERRF 288
>gi|418515355|ref|ZP_13081536.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
gi|410708074|gb|EKQ66523.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
Length = 216
Score = 112 bits (281), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 90/176 (51%), Gaps = 23/176 (13%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
DN +G+ + RTS M L QD + IEARIA P ++GE +Q+L Y G +Y
Sbjct: 58 DNANGEHLVHAARTSDSMCLRVGQDALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEY 117
Query: 64 EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
PH+D+F + Q GG R+A+++MYL+ E+GG T FP++ +
Sbjct: 118 RPHYDYFDPDAVGTPILLQAGGQRVASLVMYLNTPERGGATRFPDAHLD----------- 166
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
V +KG+A+ F P T SLH PV+ GEKW ATKW+ R P
Sbjct: 167 -----VAAVKGNAVFFSYDRPHPMT--RSLHAGAPVLAGEKWVATKWLRERAVRMP 215
>gi|229168980|ref|ZP_04296697.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
gi|423591765|ref|ZP_17567796.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
gi|228614572|gb|EEK71680.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
gi|401231898|gb|EJR38400.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
Length = 216
Score = 112 bits (281), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++E+ IE RI++ T +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--ENELTWKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQLLNELTLHGGAPVTKGEKWIATQWV 210
>gi|294666178|ref|ZP_06731433.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292604043|gb|EFF47439.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 296
Score = 112 bits (281), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 90/176 (51%), Gaps = 23/176 (13%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
DN +G+ + RTS M L QD + IEARIA P ++GE +Q+L Y G +Y
Sbjct: 138 DNANGEHVVHAARTSDSMCLRVGQDALCQRIEARIARLLDWPVDHGEGLQVLRYGTGAEY 197
Query: 64 EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
PH+D+F + Q GG R+A+++MYL+ E+GG T FP++ +
Sbjct: 198 RPHYDYFDPDAAGTPVLLQAGGQRVASLVMYLNTPERGGATRFPDAHLD----------- 246
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
V +KG+A+ F P T SLH PV+ GEKW ATKW+ R P
Sbjct: 247 -----VAAVKGNAVFFSYDRPHPMT--RSLHAGAPVLAGEKWVATKWLRERAVRMP 295
>gi|423612451|ref|ZP_17588312.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
gi|401246040|gb|EJR52392.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
Length = 254
Score = 112 bits (281), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 59/155 (38%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++E + IE RI++ T +P +GE + IL+Y Q+Y+ H+D+F
Sbjct: 113 VNDIRTSSGAFLE--ENEFTSKIEKRISSITNVPVAHGEGLHILNYAVDQEYKAHYDYFA 170
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 171 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 213
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 214 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWM 248
>gi|294627644|ref|ZP_06706226.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|292597996|gb|EFF42151.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
Length = 296
Score = 112 bits (281), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 90/176 (51%), Gaps = 23/176 (13%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
DN +G+ + RTS M L QD + IEARIA P ++GE +Q+L Y G +Y
Sbjct: 138 DNANGEHVVHAARTSDSMCLRVGQDALCQRIEARIARLLDWPVDHGEGLQVLRYGTGAEY 197
Query: 64 EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
PH+D+F + Q GG R+A+++MYL+ E+GG T FP++ +
Sbjct: 198 RPHYDYFDPDAAGTPVLLQAGGQRVASLVMYLNTPERGGATRFPDAHLD----------- 246
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
V +KG+A+ F P T SLH PV+ GEKW ATKW+ R P
Sbjct: 247 -----VAAVKGNAVFFSYDRPHPMT--RSLHAGAPVLAGEKWVATKWLRERAVRMP 295
>gi|47567794|ref|ZP_00238502.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
gi|47555471|gb|EAL13814.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
Length = 216
Score = 112 bits (281), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 58/155 (37%), Positives = 90/155 (58%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTS G FL +E+ IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSKGAFLD--DNELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|229157835|ref|ZP_04285910.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
gi|228625792|gb|EEK82544.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
Length = 232
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 58/155 (37%), Positives = 90/155 (58%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTS G FL +E+ IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 91 VNDIRTSKGAFLD--DNELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 148
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 149 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 191
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 192 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 226
>gi|187920106|ref|YP_001889137.1| procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
gi|187718544|gb|ACD19767.1| Procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
Length = 295
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 58/171 (33%), Positives = 90/171 (52%), Gaps = 21/171 (12%)
Query: 6 ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
E+GK RTS G++ + +D + ++ RI++ P ENGE +QILHY +Y P
Sbjct: 140 ETGKEDVIRNRTSEGIWYQRGEDAFIERMDRRISSLMNWPVENGEGLQILHYGTTGEYRP 199
Query: 66 HFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
HFD+F ++ GG R+AT+++YL+ V GGET+FP +
Sbjct: 200 HFDYFPPDQPGSAVHTAQGGQRVATLVIYLNDVPDGGETIFPEA---------------- 243
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G +V +G A+ F ++ D +LHG PV+ G+KW TKW+ R +
Sbjct: 244 GISVAARQGGAVYFRYMNGQRQLDPLTLHGGAPVLGGDKWIMTKWMRERAY 294
>gi|433460968|ref|ZP_20418587.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
gi|432190746|gb|ELK47751.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
Length = 211
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 60/160 (37%), Positives = 90/160 (56%), Gaps = 19/160 (11%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
S++RTSS FL DE+ IE R+A +P E+GE + ILHY+ GQ+Y+ H D+FR
Sbjct: 70 VSDIRTSSSAFL--PDDELTGRIEKRLAQIMNVPVEHGEGIHILHYKPGQEYKAHHDYFR 127
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
++ RI+T+++YL+ VE+GGET FP ++ V P KG A
Sbjct: 128 -STSRAAKNPRISTLVLYLNDVEEGGETYFPEMNLT----------------VSPHKGMA 170
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ F + D + + +LHG PV GEKW+AT W+ + +
Sbjct: 171 VYFEYFYNDPAINERTLHGGSPVTAGEKWAATMWVRRQQY 210
>gi|228987427|ref|ZP_04147547.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
tochigiensis BGSC 4Y1]
gi|228772399|gb|EEM20845.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
tochigiensis BGSC 4Y1]
Length = 232
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 58/155 (37%), Positives = 90/155 (58%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTS G FL +E+ IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 91 VNDIRTSKGAFLD--DNELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 148
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 149 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 191
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW AT+W+
Sbjct: 192 VYFEYFYQDQSLNELTLHGGAPVTKGEKWIATQWV 226
>gi|229093299|ref|ZP_04224414.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
gi|228690082|gb|EEL43879.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
Length = 232
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 58/155 (37%), Positives = 90/155 (58%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 91 VNDIRTSSGAFLD--DNELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 148
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 149 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 191
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG PV +GEKW T+W+
Sbjct: 192 VYFEYFYQDQSLNELTLHGGAPVTKGEKWITTQWV 226
>gi|229098707|ref|ZP_04229647.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
gi|423441025|ref|ZP_17417931.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
gi|423533441|ref|ZP_17509859.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
gi|228684786|gb|EEL38724.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
gi|402417686|gb|EJV49986.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
gi|402463660|gb|EJV95360.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
Length = 216
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 58/155 (37%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--ENELTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG V +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGASVTKGEKWIATQWV 210
>gi|423448819|ref|ZP_17425698.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
gi|401129413|gb|EJQ37096.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
Length = 216
Score = 112 bits (280), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 58/155 (37%), Positives = 92/155 (59%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL ++E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--ENELTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETFFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + D S + +LHG V +GEKW AT+W+
Sbjct: 176 VYFEYFYQDQSLNELTLHGGASVTKGEKWIATQWV 210
>gi|354483223|ref|XP_003503794.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Cricetulus griseus]
Length = 534
Score = 112 bits (279), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 67/179 (37%), Positives = 93/179 (51%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS +D +V+ I RI T L E +Q+ +Y G
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424
Query: 62 KYEPHFDFFRD---KMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R Q+LG G+RIAT L Y+S V GG TVFP
Sbjct: 425 QYEPHFDFARKDEPDAFQELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|21106803|gb|AAM35580.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 306
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 90/176 (51%), Gaps = 23/176 (13%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
DN +G+ + RTS M L QD + IEARIA P ++GE +Q+L Y G +Y
Sbjct: 148 DNANGEHMVHAARTSDSMCLRVGQDALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEY 207
Query: 64 EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
PH+D+F + Q GG R+A+++MYL+ E+GG T FP++ +
Sbjct: 208 RPHYDYFDPDAAGTPILLQAGGQRVASLVMYLNTPERGGATRFPDAHLD----------- 256
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
V +KG+A+ F P T SLH PV+ GEKW ATKW+ R P
Sbjct: 257 -----VAAVKGNAVFFSYDRPHPMT--RSLHAGAPVLAGEKWVATKWLRERAVRMP 305
>gi|334314085|ref|XP_001363658.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
[Monodelphis domestica]
Length = 537
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 66/179 (36%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS +D +V+ I RI T L E +Q+ +Y G
Sbjct: 368 VHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 427
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 428 QYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 474
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KWIH R F +P
Sbjct: 475 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWIHERGQEFRRP 530
>gi|74148153|dbj|BAE36242.1| unnamed protein product [Mus musculus]
Length = 454
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 67/179 (37%), Positives = 93/179 (51%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS +D +V+ I RI T L E +Q+ +Y G
Sbjct: 285 VHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 344
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP EV
Sbjct: 345 QYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFP--EV----------- 391
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 392 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 447
>gi|77748547|ref|NP_641044.2| hypothetical protein XAC0691 [Xanthomonas axonopodis pv. citri str.
306]
gi|381169877|ref|ZP_09879039.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|380689647|emb|CCG35526.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
Length = 286
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 90/176 (51%), Gaps = 23/176 (13%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
DN +G+ + RTS M L QD + IEARIA P ++GE +Q+L Y G +Y
Sbjct: 128 DNANGEHMVHAARTSDSMCLRVGQDALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEY 187
Query: 64 EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
PH+D+F + Q GG R+A+++MYL+ E+GG T FP++ +
Sbjct: 188 RPHYDYFDPDAAGTPILLQAGGQRVASLVMYLNTPERGGATRFPDAHLD----------- 236
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
V +KG+A+ F P T SLH PV+ GEKW ATKW+ R P
Sbjct: 237 -----VAAVKGNAVFFSYDRPHPMT--RSLHAGAPVLAGEKWVATKWLRERAVRMP 285
>gi|78046308|ref|YP_362483.1| 2OG-Fe(II) oxygenase [Xanthomonas campestris pv. vesicatoria str.
85-10]
gi|78034738|emb|CAJ22383.1| putative 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas
campestris pv. vesicatoria str. 85-10]
Length = 296
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/176 (35%), Positives = 90/176 (51%), Gaps = 23/176 (13%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
DN +G+ + RTS M L QD + IEARIA P ++GE +Q+L Y G +Y
Sbjct: 138 DNANGEHVVHAARTSDSMCLRLGQDALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEY 197
Query: 64 EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
PH+D+F + Q GG R+A+++MYL+ E+GG T FP++ +
Sbjct: 198 RPHYDYFDPDAAGTPVLVQAGGQRVASLVMYLNTPERGGATRFPDAHLD----------- 246
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
V +KG+A+ F P T SLH PV+ G+KW ATKW+ R P
Sbjct: 247 -----VAAVKGNAVFFSYDRPHPMT--RSLHAGAPVLAGDKWVATKWLRERAVRMP 295
>gi|295704991|ref|YP_003598066.1| 2OG-Fe(II) oxygenase [Bacillus megaterium DSM 319]
gi|294802650|gb|ADF39716.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium DSM 319]
Length = 219
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 59/152 (38%), Positives = 88/152 (57%), Gaps = 17/152 (11%)
Query: 15 VRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM 74
+RTSSGMF ++++E+V IE R++ E E +QIL Y Q+Y+ H D+F
Sbjct: 78 IRTSSGMFFDESENELVHQIERRLSKIMGPSIEYAEGLQILKYLPDQEYKAHHDYFTSA- 136
Query: 75 NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLF 134
++ +RI+T++MYL+ VE+GGET FP + G +V P KG A+ F
Sbjct: 137 SKASKNNRISTLVMYLNDVEEGGETYFP----------------KLGLSVSPTKGMAVYF 180
Query: 135 FSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ DA + +LHG PVI+GEKW AT+W+
Sbjct: 181 EYFYSDAELNDRTLHGGAPVIKGEKWVATQWM 212
>gi|229146822|ref|ZP_04275187.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
gi|228636650|gb|EEK93115.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
Length = 216
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 58/155 (37%), Positives = 91/155 (58%), Gaps = 19/155 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+++RTSSG FL +E+ + IE RI++ +P +GE + IL+YE Q+Y+ H+D+F
Sbjct: 75 VNDIRTSSGAFLE--DNELTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA 132
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGET FP +S V P KG A
Sbjct: 133 EH-SRSAANNRISTLVMYLNDVEEGGETYFPKLNLS----------------VHPRKGMA 175
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ F + S + +LHG PV +GEKW AT+W+
Sbjct: 176 VYFEYFYQGQSLNELTLHGGAPVTKGEKWIATQWV 210
>gi|212530|gb|AAA49002.1| prolyl 4-hydroxylase, alpha subunit (EC 1.14.11.2), partial [Gallus
gallus]
Length = 489
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/179 (36%), Positives = 90/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK + R S +LS + +V+ I RI T L E +Q+ +Y G
Sbjct: 320 VHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGG 379
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 380 QYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 426
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
G +V P KG A+ +++L P D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 427 ---GASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 482
>gi|345305838|ref|XP_001508476.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Ornithorhynchus
anatinus]
Length = 493
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 66/179 (36%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS +D +V+ I RI T L E +Q+ +Y G
Sbjct: 324 VHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 383
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 384 QYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 430
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KWIH R F +P
Sbjct: 431 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWIHERGQEFRRP 486
>gi|312032356|ref|NP_001185665.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Gallus
gallus]
Length = 536
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 67/179 (37%), Positives = 92/179 (51%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK + R S +LS + +V+ I RI T L E +Q+ +Y G
Sbjct: 367 VHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGG 426
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP EV
Sbjct: 427 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFP--EV----------- 473
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
G +V P KG A+ +++L P D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 474 ---GASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 529
>gi|51036657|ref|NP_742059.2| prolyl 4-hydroxylase subunit alpha-1 precursor [Rattus norvegicus]
gi|90111077|sp|P54001.2|P4HA1_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|50927553|gb|AAH78703.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [Rattus norvegicus]
gi|149038787|gb|EDL93076.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_a
[Rattus norvegicus]
Length = 534
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/179 (36%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS +D +V+ I RI T L E +Q+ +Y G
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 425 QYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|836898|gb|AAC52197.1| prolyl 4-hydroxylase alpha(I)-subunit, partial [Mus musculus]
gi|1096887|prf||2112362A Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=I
Length = 526
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 67/179 (37%), Positives = 93/179 (51%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS +D +V+ I RI T L E +Q+ +Y G
Sbjct: 357 VHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 416
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP EV
Sbjct: 417 QYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFP--EV----------- 463
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 464 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 519
>gi|325925807|ref|ZP_08187179.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
91-118]
gi|325543793|gb|EGD15204.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
91-118]
Length = 286
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/176 (35%), Positives = 90/176 (51%), Gaps = 23/176 (13%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
DN +G+ + RTS M L QD + IEARIA P ++GE +Q+L Y G +Y
Sbjct: 128 DNANGEHVVHAARTSDSMCLRLGQDALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEY 187
Query: 64 EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
PH+D+F + Q GG R+A+++MYL+ E+GG T FP++ +
Sbjct: 188 RPHYDYFDPDAAGTPVLVQAGGQRVASLVMYLNTPERGGATRFPDAHLD----------- 236
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
V +KG+A+ F P T SLH PV+ G+KW ATKW+ R P
Sbjct: 237 -----VAAVKGNAVFFSYDRPHPMT--RSLHAGAPVLAGDKWVATKWLRERAVRMP 285
>gi|346723630|ref|YP_004850299.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346648377|gb|AEO41001.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 286
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 63/176 (35%), Positives = 90/176 (51%), Gaps = 23/176 (13%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
DN +G+ + RTS M L QD + IEARIA P ++GE +Q+L Y G +Y
Sbjct: 128 DNANGEHVVHAARTSDSMCLRLGQDALCQRIEARIARLLDWPVDHGEGLQVLRYATGAEY 187
Query: 64 EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
PH+D+F + Q GG R+A+++MYL+ E+GG T FP++ +
Sbjct: 188 RPHYDYFDPDAAGTPVLVQAGGQRVASLVMYLNTPERGGATRFPDAHLD----------- 236
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
V +KG+A+ F P T SLH PV+ G+KW ATKW+ R P
Sbjct: 237 -----VAAVKGNAVFFSYDRPHPMT--RSLHAGAPVLAGDKWVATKWLRERAVRMP 285
>gi|26336999|dbj|BAC32183.1| unnamed protein product [Mus musculus]
gi|148700261|gb|EDL32208.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_b [Mus
musculus]
Length = 534
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/179 (36%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS +D +V+ I RI T L E +Q+ +Y G
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 425 QYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|474940|emb|CAA55546.1| gamma-butyrobetaine,2-oxoglutarate dioxygenase [Rattus norvegicus]
Length = 534
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/179 (36%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS +D +V+ I RI T L E +Q+ +Y G
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 425 QYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|449280261|gb|EMC87600.1| Prolyl 4-hydroxylase subunit alpha-1 [Columba livia]
Length = 536
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/179 (36%), Positives = 90/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK + R S +LS + +V+ I RI T L E +Q+ +Y G
Sbjct: 367 VHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGG 426
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 427 QYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 473
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
G +V P KG A+ +++L P D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 474 ---GASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 529
>gi|384046522|ref|YP_005494539.1| prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
gi|345444213|gb|AEN89230.1| Prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
Length = 219
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 58/152 (38%), Positives = 88/152 (57%), Gaps = 17/152 (11%)
Query: 15 VRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM 74
+RTSSGMF ++++E+V IE R++ E E +QIL Y Q+Y+ H D+F
Sbjct: 78 IRTSSGMFFEESENELVHQIERRLSKIMGPSIEYAEGLQILKYLPDQEYKAHHDYFTSA- 136
Query: 75 NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLF 134
++ +RI+T++MYL+ VE+GGET FP + G ++ P KG A+ F
Sbjct: 137 SKASKNNRISTLVMYLNDVEEGGETYFP----------------KLGLSISPTKGMAVYF 180
Query: 135 FSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ DA + +LHG PVI+GEKW AT+W+
Sbjct: 181 EYFYSDAELNDRTLHGGAPVIKGEKWVATQWM 212
>gi|407708877|ref|YP_006792741.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
gi|407237560|gb|AFT87758.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
Length = 300
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 58/175 (33%), Positives = 90/175 (51%), Gaps = 21/175 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
+ D +G+ RTS G++ + +D + ++ RIA+ P ENGE +QILHY
Sbjct: 141 IVDPATGQEGVIRNRTSEGIWYQRGEDAFIERLDRRIASLMNWPVENGEGLQILHYGPTG 200
Query: 62 KYEPHFDFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y PHFD+F ++ GG R+AT+++YL+ V GGET+FP +
Sbjct: 201 EYRPHFDYFPPDQPGSAVHTARGGQRVATLVVYLNDVADGGETIFPAA------------ 248
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G +V +G A+ F ++ D +LHG PV G+KW TKW+ R +
Sbjct: 249 ----GLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGAPVRAGDKWIMTKWMRERAY 299
>gi|129365|sp|P16924.1|P4HA1_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1
Length = 516
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 67/179 (37%), Positives = 92/179 (51%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK + R S +LS + +V+ I RI T L E +Q+ +Y G
Sbjct: 347 VHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGG 406
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP EV
Sbjct: 407 QYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFP--EV----------- 453
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
G +V P KG A+ +++L P D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 454 ---GASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 509
>gi|326436053|gb|EGD81623.1| p4ha2 protein [Salpingoeca sp. ATCC 50818]
Length = 548
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 64/162 (39%), Positives = 84/162 (51%), Gaps = 18/162 (11%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G+ E R S+ +L D IV I RI T + E EA+QI +Y G YEPHF
Sbjct: 372 GRFQPVEFRISTAAWLQPDHDAIVKRIHGRIEDATQVDIEYAEALQISNYGMGGFYEPHF 431
Query: 68 DFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPM 127
D N G R+AT ++YL+ V++GG T FP R G AV+P
Sbjct: 432 DHSSRGTNPD--GERLATFMIYLNPVKQGGFTAFP----------------RLGAAVQPG 473
Query: 128 KGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
GDA+ +++L P D +LHG+CPV+ G KW A KWIH R
Sbjct: 474 YGDAVFWYNLQPSGVGDPLTLHGACPVLRGSKWVANKWIHER 515
>gi|357605723|gb|EHJ64752.1| prolyl 4-hydroxylase alpha subunit [Danaus plexippus]
Length = 235
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 59/168 (35%), Positives = 87/168 (51%), Gaps = 18/168 (10%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D +G+ + + R S +L + +VA + R+A T L E +Q+++Y G
Sbjct: 52 VHDPATGELVPAHYRISKSAWLKDEESAVVARVSRRVADITGLSMTTAEELQVVNYGIGG 111
Query: 62 KYEPHFDFFRDKMN--QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
Y+PHFDF R + N ++ G+RIATVL Y+S V +GG TVF
Sbjct: 112 HYDPHFDFARKEENAFEKFNGNRIATVLFYMSDVAQGGATVF----------------TE 155
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
G +V P +G A+ + +LHP D + H +CPV+ G KW KWIH
Sbjct: 156 LGLSVFPRRGSAVFWLNLHPSGEGDLATRHAACPVLRGSKWVCNKWIH 203
>gi|363814557|ref|NP_001242754.1| uncharacterized protein LOC100794585 [Glycine max]
gi|255628535|gb|ACU14612.1| unknown [Glycine max]
Length = 238
Score = 111 bits (278), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 58/118 (49%), Positives = 80/118 (67%), Gaps = 3/118 (2%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
V D ++GK I S+VRTSSGMFL+ + + +V +IE RI+ ++ +P ENGE MQ+L YE
Sbjct: 118 VVDTKTGKGIKSDVRTSSGMFLNSKERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEK 177
Query: 60 GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNS-EVSQSRDGNWSE 116
Q Y+PH D+F D N + GG RIAT+LMYLS + GET FP + V+ + GN S+
Sbjct: 178 NQYYKPHHDYFSDTFNLKRGGQRIATMLMYLSDNIERGETYFPLAGSVNAAVVGNLSK 235
>gi|77747935|ref|NP_638775.2| hypothetical protein XCC3429 [Xanthomonas campestris pv. campestris
str. ATCC 33913]
Length = 288
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 86/176 (48%), Gaps = 23/176 (13%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
DN G I RTS M L QD + IEARIA P E+GE +Q+L Y G +Y
Sbjct: 130 DNRDGSEIVHAARTSHSMALQPGQDALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQY 189
Query: 64 EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
PH+D+F + Q GG R+A+++MYL+ E+GG T P+ +
Sbjct: 190 APHYDYFEPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRVPDVHLD----------- 238
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
V +KG+A+ F P T +LH PV+ GEKW ATKW+ R P
Sbjct: 239 -----VAAVKGNAVFFSYDRPHPMT--RTLHAGAPVLAGEKWVATKWLRERPLHAP 287
>gi|312032354|ref|NP_001185664.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Gallus
gallus]
Length = 536
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 67/179 (37%), Positives = 92/179 (51%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK + R S +LS + +V+ I RI T L E +Q+ +Y G
Sbjct: 367 VHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGG 426
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP EV
Sbjct: 427 QYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFP--EV----------- 473
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
G +V P KG A+ +++L P D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 474 ---GASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 529
>gi|170690448|ref|ZP_02881615.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
gi|170144883|gb|EDT13044.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
Length = 307
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 58/175 (33%), Positives = 90/175 (51%), Gaps = 21/175 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
+ D +G+ RTS G++ + +D + ++ RIA+ P ENGE +QILHY
Sbjct: 148 IVDPATGQEDVIRNRTSEGIWYQRGEDAFIERLDQRIASLMNWPVENGEGLQILHYGPTG 207
Query: 62 KYEPHFDFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y PHFD+F ++ GG R+AT+++YL+ V GGET+FP +
Sbjct: 208 EYRPHFDYFPPDQPGSMVHTARGGQRVATLVIYLNDVPDGGETIFPEA------------ 255
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G +V +G A+ F ++ D +LHG PV G+KW TKW+ R +
Sbjct: 256 ----GLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGAPVRAGDKWIMTKWMRERAY 306
>gi|156370129|ref|XP_001628324.1| predicted protein [Nematostella vectensis]
gi|156215298|gb|EDO36261.1| predicted protein [Nematostella vectensis]
Length = 541
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 67/173 (38%), Positives = 89/173 (51%), Gaps = 21/173 (12%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
+GK + R S +L +DE+V I R+ A++ L E +Q+++Y G YEPH
Sbjct: 367 TGKLEFANYRISKSGWLRDEEDELVKRISYRVQAYSGLNMTTSEDLQVVNYGIGGHYEPH 426
Query: 67 FDFFRD---KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
+DF RD K G+RIAT L YLS VE GG TVF R G
Sbjct: 427 YDFARDGEDKFTSLGTGNRIATFLSYLSDVEAGGGTVF----------------TRVGAT 470
Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKP 174
V P KGDA +++L DS++ H +CPV+ G KW A KWIH + F KP
Sbjct: 471 VWPQKGDAAFWYNLKRSGDGDSSTRHAACPVLVGSKWVANKWIHEVGQEFRKP 523
>gi|21114687|gb|AAM42699.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. ATCC 33913]
Length = 308
Score = 111 bits (277), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 64/176 (36%), Positives = 86/176 (48%), Gaps = 23/176 (13%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
DN G I RTS M L QD + IEARIA P E+GE +Q+L Y G +Y
Sbjct: 150 DNRDGSEIVHAARTSHSMALQPGQDALCQRIEARIAQLLEWPVEHGEGLQVLRYATGAQY 209
Query: 64 EPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
PH+D+F + Q GG R+A+++MYL+ E+GG T P+ +
Sbjct: 210 APHYDYFEPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRVPDVHLD----------- 258
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
V +KG+A+ F P T +LH PV+ GEKW ATKW+ R P
Sbjct: 259 -----VAAVKGNAVFFSYDRPHPMT--RTLHAGAPVLAGEKWVATKWLRERPLHAP 307
>gi|326923463|ref|XP_003207955.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Meleagris gallopavo]
Length = 536
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 67/179 (37%), Positives = 92/179 (51%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK + R S +LS + +V+ I RI T L E +Q+ +Y G
Sbjct: 367 VHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGG 426
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP EV
Sbjct: 427 QYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFP--EV----------- 473
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
G +V P KG A+ +++L P D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 474 ---GASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 529
>gi|294499597|ref|YP_003563297.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
gi|294349534|gb|ADE69863.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
Length = 219
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 58/152 (38%), Positives = 88/152 (57%), Gaps = 17/152 (11%)
Query: 15 VRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM 74
+RTSSGMF ++++E+V IE R++ E E +Q+L Y Q+Y+ H D+F
Sbjct: 78 IRTSSGMFFEESENELVHQIERRLSKIMGPSIEYAEGLQVLKYLPDQEYKAHHDYFTSA- 136
Query: 75 NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLF 134
++ +RI+T++MYL+ VE+GGET FP + G +V P KG A+ F
Sbjct: 137 SKASKNNRISTLVMYLNDVEEGGETYFP----------------KLGLSVSPTKGMAVYF 180
Query: 135 FSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ DA + +LHG PVI+GEKW AT+W+
Sbjct: 181 EYFYSDAELNDRTLHGGAPVIKGEKWVATQWM 212
>gi|389728965|ref|ZP_10189244.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
gi|388441204|gb|EIL97500.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
Length = 285
Score = 111 bits (277), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 65/174 (37%), Positives = 87/174 (50%), Gaps = 25/174 (14%)
Query: 6 ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
E G E RTS GMF + ++ IEARIAA +P ++GE +Q+LHY GQ+YEP
Sbjct: 128 EDGAQQIDEHRTSDGMFFGLGEQPLIERIEARIAALLGIPVDHGEGLQVLHYLPGQQYEP 187
Query: 66 HFDFFRDKMN-----QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
H D+F GG RIA++++YL+ + GG T FP
Sbjct: 188 HQDWFDPTQPGYAAITATGGQRIASLVIYLNTPDAGGGTAFPEI---------------- 231
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
G V ++G A+ F S D SLH PV GEKW ATKW+ R + +P
Sbjct: 232 GLTVTALRGSAVCFTY----ESGDVFSLHAGLPVTRGEKWIATKWLRERPYREP 281
>gi|344274272|ref|XP_003408941.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
[Loxodonta africana]
Length = 534
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 92/179 (51%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS ++ +V+ I RI T L E +Q+ +Y G
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP+
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPDV------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|323528042|ref|YP_004230194.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
gi|323385044|gb|ADX57134.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
Length = 300
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 58/175 (33%), Positives = 90/175 (51%), Gaps = 21/175 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
+ D +G+ RTS G++ + +D + ++ RIA+ P ENGE +QILHY
Sbjct: 141 IVDPATGQEGVIRNRTSEGIWYQRGEDAFIERLDQRIASLMNWPVENGEGLQILHYGPTG 200
Query: 62 KYEPHFDFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y PHFD+F ++ GG R+AT+++YL+ V GGET+FP +
Sbjct: 201 EYRPHFDYFPPDQPGSAVHTARGGQRVATLVVYLNDVADGGETIFPAA------------ 248
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G +V +G A+ F ++ D +LHG PV G+KW TKW+ R +
Sbjct: 249 ----GLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGAPVHAGDKWIMTKWMRERAY 299
>gi|224001336|ref|XP_002290340.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220973762|gb|EED92092.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 483
Score = 110 bits (276), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 69/176 (39%), Positives = 98/176 (55%), Gaps = 14/176 (7%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
+ D + GK +SE RTS FLS DE++ I+ R+A+ T +P + E +Q+L Y G+
Sbjct: 300 LKDADKGKD-SSEWRTSQSAFLSARDDEVLTEIDHRVASLTRIPRNHQEYVQVLRYGAGE 358
Query: 62 KYEPHFDFF------RDKMNQQL----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD 111
KY+ H D+F DK +L +R ATV YL+ V GGET+FP + +
Sbjct: 359 KYDSHHDYFDPSAYRSDKSTLRLIENGKKNRYATVFWYLTDVHDGGETIFPRYGGAPAPR 418
Query: 112 GNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGE-KWSATKWI 166
+ +C+ G VKP KG ++F+SL D SLHG+CPV E KW+A KWI
Sbjct: 419 SH-KDCS-IGLKVKPQKGKVVIFYSLDASGEMDPFSLHGACPVGENNLKWAANKWI 472
>gi|226314793|ref|YP_002774689.1| hypothetical protein BBR47_52080 [Brevibacillus brevis NBRC 100599]
gi|226097743|dbj|BAH46185.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599]
Length = 215
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 60/160 (37%), Positives = 91/160 (56%), Gaps = 19/160 (11%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+ +RTSSG+F Q E + IE RI+ +P E+G+ +Q+L Y GQ+Y+PH+DFF
Sbjct: 72 VNSIRTSSGVFCE--QTETITRIEKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFFA 129
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGETVFP +S V P KG A
Sbjct: 130 ET-SRASTNNRISTLVMYLNDVEQGGETVFPLLHLS----------------VFPTKGMA 172
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ F + + + +LH VI GEKW AT W+ ++F
Sbjct: 173 VYFEYFYRNQEVNEFTLHAGAQVIHGEKWVATMWMRRQSF 212
>gi|90085216|dbj|BAE91349.1| unnamed protein product [Macaca fascicularis]
Length = 244
Score = 110 bits (276), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS ++ +V+ I RI T L E +Q+ +Y G
Sbjct: 75 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 134
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 135 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 181
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 182 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 237
>gi|224052167|ref|XP_002191912.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Taeniopygia
guttata]
Length = 536
Score = 110 bits (275), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 65/179 (36%), Positives = 89/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK + R S +LS + +V+ I RI T L E +Q+ +Y G
Sbjct: 367 VHDPETGKLTTAHYRVSKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGG 426
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 427 QYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 473
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
G +V P KG A+ +++L P D ++ H +CPV+ G KW KW+H R F +P
Sbjct: 474 ---GASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVFNKWLHERGQEFRRP 529
>gi|91778899|ref|YP_554107.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
gi|91691559|gb|ABE34757.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
Length = 292
Score = 110 bits (275), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 57/170 (33%), Positives = 89/170 (52%), Gaps = 21/170 (12%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
+GK RTS G++ + +D + ++ RI++ P ENGE +QILHY +Y PH
Sbjct: 138 TGKEDVIRNRTSEGIWYQRGEDPFIERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPH 197
Query: 67 FDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
FD+F ++ GG R+AT+++YL+ V GGET+FP + G
Sbjct: 198 FDYFPPDQPGSAVHTAQGGQRVATLVIYLNDVPDGGETIFPEA----------------G 241
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+V +G A+ F ++ D +LHG PV+ G+KW TKW+ R +
Sbjct: 242 MSVAASQGGAVYFRYMNDRRQLDPLTLHGGAPVLAGDKWIMTKWMRERAY 291
>gi|410914996|ref|XP_003970973.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Takifugu
rubripes]
Length = 538
Score = 110 bits (275), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 90/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D +SG + R S +L +D I+A + RI T L + E +Q+ +Y G
Sbjct: 369 VRDPKSGVLTTASYRVSKSAWLEGEEDPIIARVNQRIEDLTGLTVKTAELLQVANYGVGG 428
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+R+AT L Y+S VE GG TVFP+
Sbjct: 429 QYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDF------------- 475
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KWIH R F +P
Sbjct: 476 ---GAAIWPRKGTAVFWYNLFKSGEGDYRTRHAACPVLVGNKWVSNKWIHERGQEFRRP 531
>gi|349604936|gb|AEQ00344.1| Prolyl 4-hydroxylase subunit alpha-1-like protein, partial [Equus
caballus]
Length = 302
Score = 110 bits (275), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 66/179 (36%), Positives = 93/179 (51%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS ++ +V+ I RI T L E +Q+ +Y G
Sbjct: 133 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 192
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP EV
Sbjct: 193 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFP--EV----------- 239
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 240 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 295
>gi|398818543|ref|ZP_10577128.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
gi|398027481|gb|EJL21031.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
Length = 220
Score = 110 bits (275), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 60/160 (37%), Positives = 91/160 (56%), Gaps = 19/160 (11%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+ +RTSSG+F Q E + IE RI+ +P E+G+ +Q+L Y GQ+Y+PH+DFF
Sbjct: 77 VNSIRTSSGVFCE--QTETITRIEKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFFA 134
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ +RI+T++MYL+ VE+GGETVFP +S V P KG A
Sbjct: 135 ET-SRASTNNRISTLVMYLNDVEQGGETVFPLLHLS----------------VFPTKGMA 177
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ F + + + +LH VI GEKW AT W+ ++F
Sbjct: 178 VYFEYFYSNQELNDFTLHAGTQVIHGEKWVATMWMRRQSF 217
>gi|402880501|ref|XP_003903839.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like, partial
[Papio anubis]
Length = 379
Score = 110 bits (274), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS ++ +V+ I RI T L E +Q+ +Y G
Sbjct: 210 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 269
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 270 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 316
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 317 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 372
>gi|195505207|ref|XP_002099404.1| GE23380 [Drosophila yakuba]
gi|194185505|gb|EDW99116.1| GE23380 [Drosophila yakuba]
Length = 540
Score = 110 bits (274), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 61/173 (35%), Positives = 91/173 (52%), Gaps = 20/173 (11%)
Query: 6 ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
+SG S +E+RTS +L + +A I+ R+ T L E+ E +Q+++Y G +YEP
Sbjct: 368 QSGNSTTTEIRTSQNTWLWYDANPWLAKIKQRLEDVTGLSTESAEPLQLVNYGIGGQYEP 427
Query: 66 HFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
HFDF D + G G+R+AT L YL+ V GG T FP + A
Sbjct: 428 HFDFMEDDGQKVFGWKGNRLATALFYLNDVALGGATAFPFLRL----------------A 471
Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKP 174
V P+KG L++++LH D + H CPV++G KW +W HV + F +P
Sbjct: 472 VPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKWICNEWFHVGAQEFRRP 524
>gi|389775678|ref|ZP_10193553.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
gi|388437120|gb|EIL93940.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
Length = 284
Score = 110 bits (274), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 65/174 (37%), Positives = 90/174 (51%), Gaps = 25/174 (14%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G + + RTS GMF + + +V IE R+A +P +GE +QILHY GQ+YEPHF
Sbjct: 130 GSNQVDQRRTSEGMFFTLNELPLVGRIEQRLATLLGMPVSHGEGLQILHYLPGQEYEPHF 189
Query: 68 DFFRDKMN-----QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGY 122
D+F + +GG R+A+V+MYL+ +GG T FP ++ + ARRG
Sbjct: 190 DWFDPQQPGYDTITAVGGQRVASVVMYLNTPAQGGGTAFPELGLTVT--------ARRGA 241
Query: 123 AVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK 176
AV +F+ D SLH PV GEKW ATKW+ R + K
Sbjct: 242 AV---------YFAYE---GGDQQSLHAGLPVQRGEKWIATKWLRERPYGHSHK 283
>gi|190786|gb|AAA36534.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
Length = 534
Score = 110 bits (274), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS ++ +V+ I RI T L E +Q+ +Y G
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|395820524|ref|XP_003783614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Otolemur
garnettii]
Length = 534
Score = 110 bits (274), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS ++ +V+ I RI T L E +Q+ +Y G
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|291404182|ref|XP_002718471.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 1
[Oryctolagus cuniculus]
Length = 534
Score = 110 bits (274), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS ++ +V+ I RI T L E +Q+ +Y G
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|348576112|ref|XP_003473831.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cavia
porcellus]
Length = 534
Score = 110 bits (274), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS ++ +V+ I RI T L E +Q+ +Y G
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|332244067|ref|XP_003271193.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-1 [Nomascus leucogenys]
Length = 502
Score = 109 bits (273), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 66/179 (36%), Positives = 93/179 (51%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS ++ +V+ I RI T L E +Q+ +Y G
Sbjct: 333 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 392
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP EV
Sbjct: 393 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFP--EV----------- 439
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 440 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 495
>gi|296220402|ref|XP_002756291.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Callithrix
jacchus]
Length = 534
Score = 109 bits (273), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS ++ +V+ I RI T L E +Q+ +Y G
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|63252886|ref|NP_000908.2| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Homo
sapiens]
gi|114631173|ref|XP_508168.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 13 [Pan
troglodytes]
gi|602676|gb|AAA59069.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
gi|62897481|dbj|BAD96680.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I variant [Homo
sapiens]
gi|119574852|gb|EAW54467.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_a [Homo
sapiens]
gi|119574853|gb|EAW54468.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_b [Homo
sapiens]
gi|410349609|gb|JAA41408.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410349613|gb|JAA41410.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 109 bits (273), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS ++ +V+ I RI T L E +Q+ +Y G
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|397490069|ref|XP_003816032.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Pan paniscus]
Length = 488
Score = 109 bits (273), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS ++ +V+ I RI T L E +Q+ +Y G
Sbjct: 319 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 378
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 379 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 425
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 426 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 481
>gi|432106758|gb|ELK32410.1| Prolyl 4-hydroxylase subunit alpha-1 [Myotis davidii]
Length = 534
Score = 109 bits (273), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS ++ +V+ I RI T L E +Q+ +Y G
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|156398644|ref|XP_001638298.1| predicted protein [Nematostella vectensis]
gi|156225417|gb|EDO46235.1| predicted protein [Nematostella vectensis]
Length = 495
Score = 109 bits (273), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 64/166 (38%), Positives = 85/166 (51%), Gaps = 21/166 (12%)
Query: 7 SGKSIASEVRTSSGMFLS-KAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
+G + R S +LS + E++ +E RIAA T L E E Q+ +Y +Y+P
Sbjct: 330 TGHLETAHYRISKNCWLSGREHGEVIDRVERRIAAMTRLNLETAEGFQVQNYGLAGQYDP 389
Query: 66 HFDFFRDKMNQQLG----GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
HFDF RD N LG G+RIATVL+++S VE GG TVFP G
Sbjct: 390 HFDFSRDLANSSLGSLGTGNRIATVLVWMSQVESGGATVFPYV----------------G 433
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
+ P KGDA+ + +L D + H CPV+ G KW A KWIH
Sbjct: 434 ARILPQKGDAVFWHNLLRSGDGDFRTRHAGCPVLSGIKWVANKWIH 479
>gi|380813208|gb|AFE78478.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
gi|384947330|gb|AFI37270.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
Length = 534
Score = 109 bits (273), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS ++ +V+ I RI T L E +Q+ +Y G
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|73952886|ref|XP_850682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Canis
lupus familiaris]
Length = 534
Score = 109 bits (273), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS ++ +V+ I RI T L E +Q+ +Y G
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|410295850|gb|JAA26525.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410295854|gb|JAA26527.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 109 bits (273), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS ++ +V+ I RI T L E +Q+ +Y G
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|301770069|ref|XP_002920453.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Ailuropoda
melanoleuca]
Length = 534
Score = 109 bits (273), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS ++ +V+ I RI T L E +Q+ +Y G
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|383418721|gb|AFH32574.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
Length = 534
Score = 109 bits (273), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS ++ +V+ I RI T L E +Q+ +Y G
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|410251924|gb|JAA13929.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 566
Score = 109 bits (273), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS ++ +V+ I RI T L E +Q+ +Y G
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEV------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|47218149|emb|CAG10069.1| unnamed protein product [Tetraodon nigroviridis]
Length = 595
Score = 109 bits (273), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 61/172 (35%), Positives = 89/172 (51%), Gaps = 20/172 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++GK ++ R S +L+ + ++ +I RI T L + E +Q+ +Y G
Sbjct: 426 VHDPQTGKLTTAQYRVSKSAWLTGYEHPVIETINQRIEDLTGLEVDTAEELQVANYGVGG 485
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP+
Sbjct: 486 QYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVAAGGATVFPDV------------- 532
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
G AV P KG A+ +++L D ++ H +CPV+ G KW + KWIH R
Sbjct: 533 ---GAAVWPQKGSAVFWYNLFTSGEGDYSTRHAACPVLVGNKWVSNKWIHER 581
>gi|151556370|gb|AAI47868.1| P4HA1 protein [Bos taurus]
Length = 534
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS ++ +V+ I RI T L E +Q+ +Y G
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVLAGGATVFPEV------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|260812289|ref|XP_002600853.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
gi|229286143|gb|EEN56865.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
Length = 281
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 64/161 (39%), Positives = 82/161 (50%), Gaps = 20/161 (12%)
Query: 11 IASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYEHGQKYEPHFD 68
+ S +R S +L DEIVA + RI T L P + E +Q+L+Y G +YEPH D
Sbjct: 121 VESHIRISQQAWLHDKDDEIVARVSKRIGLLTGLNTTPTSTELLQVLNYGLGGQYEPHHD 180
Query: 69 FF--RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
+ +KM + G+R+AT LMYLS V GG TVFP + V+ V
Sbjct: 181 YMTAEEKMWGTILGNRMATFLMYLSDVTAGGATVFPVANVT----------------VPV 224
Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
+K LLF L D SLH CPV+ G KW A KWIH
Sbjct: 225 VKNAGLLFMDLLRSGRGDVNSLHAGCPVVIGSKWIANKWIH 265
>gi|426255746|ref|XP_004021509.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Ovis
aries]
Length = 534
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS ++ +V+ I RI T L E +Q+ +Y G
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L Y+S V GG TVFP
Sbjct: 425 QYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVLAGGATVFPEV------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 472 ---GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|374370415|ref|ZP_09628419.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
gi|373098067|gb|EHP39184.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
Length = 454
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 63/174 (36%), Positives = 90/174 (51%), Gaps = 21/174 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G E RTS G + ++ IE IAA T + E GE +QIL+Y+ G
Sbjct: 162 VINPDTGDENLIEARTSLGAMFQVGEHPLIERIEDCIAAVTGIAAERGEGLQILNYKPGG 221
Query: 62 KYEPHFDFF---RDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y+PH+DFF R +QL GG R+ T+++YL+ GG T FP
Sbjct: 222 EYQPHYDFFNPQRPGEARQLKVGGQRVGTLVIYLNSPLAGGATAFP-------------- 267
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
+ G V P+KG+A+ F D + D +LH PV GEKW ATKW++ R
Sbjct: 268 --KLGLEVAPVKGNAVYFSYRKSDGALDERTLHAGLPVEAGEKWIATKWLNART 319
>gi|348518914|ref|XP_003446976.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Oreochromis
niloticus]
Length = 536
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 90/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L +D ++ + RI A T L E E +Q+ +Y G
Sbjct: 367 VRDPKTGVLTTANYRVSKSAWLEGEEDPVIDRVNQRIEAITGLTVETAELLQVANYGVGG 426
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+R+AT L Y+S VE GG TVFP+
Sbjct: 427 QYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDF------------- 473
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
G A+ P KG ++ +++L D + H +CPV+ G KW + KWIH R F +P
Sbjct: 474 ---GAAIWPRKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWIHERGQEFRRP 529
>gi|148226320|ref|NP_001087703.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
laevis]
gi|51703693|gb|AAH81114.1| MGC83530 protein [Xenopus laevis]
Length = 533
Score = 108 bits (271), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 58/170 (34%), Positives = 87/170 (51%), Gaps = 18/170 (10%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D ++ + +R+ A T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVANYRVSKSAWLEEYDDPVIGRVNSRMQAITGLTKDTAELLQVANYGMGG 425
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRRPFDSNLKTEGNRLATYLNYMSDVEAGGATVFPDF--------------- 470
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R
Sbjct: 471 -GAAIWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWFHER 519
>gi|270001038|gb|EEZ97485.1| hypothetical protein TcasGA2_TC011322 [Tribolium castaneum]
Length = 509
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 64/161 (39%), Positives = 86/161 (53%), Gaps = 20/161 (12%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMN 75
R S +L + + +A + R+A T L E Q+++Y G YEPHFDF + ++
Sbjct: 365 RISKVAWLEDQEHQHLAVVAQRVAHMTGLTLSTAEEFQVVNYGIGGHYEPHFDF-QSTVD 423
Query: 76 QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFF 135
+G RI TVL YLS VE+GG TVFP +VS V P KG A+++F
Sbjct: 424 PAIGS-RIETVLFYLSDVEQGGATVFPEIQVS----------------VWPQKGSAVVWF 466
Query: 136 SLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
+LHP D + H CPV+ G KW ATKWIH R F +P
Sbjct: 467 NLHPSGDGDQRTKHAGCPVLIGSKWIATKWIHERGQEFLRP 507
>gi|189241578|ref|XP_969458.2| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
putative [Tribolium castaneum]
Length = 515
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 64/161 (39%), Positives = 86/161 (53%), Gaps = 20/161 (12%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMN 75
R S +L + + +A + R+A T L E Q+++Y G YEPHFDF + ++
Sbjct: 371 RISKVAWLEDQEHQHLAVVAQRVAHMTGLTLSTAEEFQVVNYGIGGHYEPHFDF-QSTVD 429
Query: 76 QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFF 135
+G RI TVL YLS VE+GG TVFP +VS V P KG A+++F
Sbjct: 430 PAIGS-RIETVLFYLSDVEQGGATVFPEIQVS----------------VWPQKGSAVVWF 472
Query: 136 SLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
+LHP D + H CPV+ G KW ATKWIH R F +P
Sbjct: 473 NLHPSGDGDQRTKHAGCPVLIGSKWIATKWIHERGQEFLRP 513
>gi|194905290|ref|XP_001981166.1| GG11918 [Drosophila erecta]
gi|190655804|gb|EDV53036.1| GG11918 [Drosophila erecta]
Length = 525
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 62/173 (35%), Positives = 92/173 (53%), Gaps = 22/173 (12%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
SG++ + RTS + + + + + ARIA T E +Q+++Y G Y+ H
Sbjct: 364 SGRNEVVKTRTSKVAWFPDSYNPLTVRLNARIADMTGFNLYGSEMLQLMNYGLGGHYDQH 423
Query: 67 FDFFRDKMNQQL---GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
+DFF + +N L G RIATVL YL+ VE+GG TVFPN R+ A
Sbjct: 424 YDFF-NTINSNLTAMSGDRIATVLFYLTDVEQGGATVFPN--------------IRK--A 466
Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
V P +G +++++L + TD+ +LH +CPVI G KW KWI R F +P
Sbjct: 467 VFPQRGSVIMWYNLQDNGQTDNKTLHAACPVIVGSKWVCNKWIREREQIFSRP 519
>gi|414587755|tpg|DAA38326.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 244
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 53/117 (45%), Positives = 76/117 (64%), Gaps = 2/117 (1%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEH 59
V D +GK + S+VRTSSGMF++ + + +V +IE RI+ ++ +P ENGE +Q+L YE
Sbjct: 94 VVDVATGKGVKSDVRTSSGMFVNSEERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEA 153
Query: 60 GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
Q Y PH D+F D N + GG R+AT+LMYL+ GGET FP S + + WS+
Sbjct: 154 SQYYRPHHDYFSDTFNLKRGGQRVATMLMYLTDGVVGGETHFPQEMESAAVEETWSK 210
>gi|157111033|ref|XP_001651361.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
gi|108878552|gb|EAT42777.1| AAEL005714-PA, partial [Aedes aegypti]
Length = 522
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 63/181 (34%), Positives = 97/181 (53%), Gaps = 21/181 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
+A+ ++GK+ S+ R S + + +I R+A T L + E +Q+++Y G
Sbjct: 359 IANQQTGKAERSKDRVSKSSWFPDEYHSTIRTITKRVADMTGLSMDTAEELQVVNYGLGG 418
Query: 62 KYEPHFDFFR-DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
+Y+PHFDFF K+ + +RIATVL Y+S V GG TVFP +
Sbjct: 419 QYDPHFDFFHWGKLKEV---NRIATVLFYMSDVSIGGATVFP----------------KL 459
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEK-EPE 179
G ++ KG A +++LH D ++LHG+CPV+ GEKW A KWI R + K +P+
Sbjct: 460 GVTLEARKGTAAFWYNLHSSGELDYSTLHGACPVLIGEKWVANKWIRERGQEFRRKCDPK 519
Query: 180 D 180
D
Sbjct: 520 D 520
>gi|403183473|gb|EJY58123.1| AAEL017524-PA, partial [Aedes aegypti]
Length = 212
Score = 108 bits (269), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 65/175 (37%), Positives = 96/175 (54%), Gaps = 24/175 (13%)
Query: 1 MVADNESGKSIASEV---RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHY 57
M+ + G+S + EV RTS +L+ E+V + R T L ++ E++Q+ +Y
Sbjct: 37 MLKRSMVGESFSKEVSNERTSQNAWLADYDFELVKVLSLRTEDMTGLDRKSYESLQVNNY 96
Query: 58 EHGQKYEPHFDFFRDKMNQQ----LG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG 112
G Y PHFD+ R ++ +G G+RIAT++ YLS VE+GG TVFP
Sbjct: 97 GIGGFYLPHFDWVRTNGTEEPYKDMGLGNRIATLMYYLSDVEQGGATVFP---------- 146
Query: 113 NWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
+ G V P KG A+ +++L PD + D +LHG+CPV+ G KW A KWIH
Sbjct: 147 ------QIGVGVFPKKGSAIFWYNLLPDGTGDERTLHGACPVLLGSKWVANKWIH 195
>gi|301613004|ref|XP_002936004.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
(Silurana) tropicalis]
Length = 526
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 86/169 (50%), Gaps = 22/169 (13%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
++ R + +LS +D +VA + RI T L E +Q+ +Y G +YEPHFDF R
Sbjct: 367 TAQYRITKSAWLSGYEDPVVARLNRRIEGVTGLDMSTAEELQVANYGIGGQYEPHFDFLR 426
Query: 72 ----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPM 127
D + G+R+AT L Y+S VE GG TVFP G AV P
Sbjct: 427 KYEPDAFKKLGTGNRVATWLFYMSDVEAGGATVFPEV----------------GAAVYPK 470
Query: 128 KGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
KG A+ +++L D ++ H +CPV+ G KW + KWIH R F +P
Sbjct: 471 KGTAVFWYNLLESGEGDYSTRHAACPVLVGNKWVSNKWIHERGQEFRRP 519
>gi|195505190|ref|XP_002099397.1| GE10881 [Drosophila yakuba]
gi|194185498|gb|EDW99109.1| GE10881 [Drosophila yakuba]
Length = 487
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 64/185 (34%), Positives = 93/185 (50%), Gaps = 22/185 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++ +G + R S +L +D ++ ++ R A T L E+ E +Q+++Y G
Sbjct: 313 VQNSVTGALETANYRISKSAWLKTHEDRVIGTVVQRTADMTGLDMESAEELQVVNYGIGG 372
Query: 62 KYEPHFDFFRDKMNQQLGG----HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R + + G +RIATVL Y+S VE+GG TVF +
Sbjct: 373 HYEPHFDFARKEEERAFEGLNLGNRIATVLFYMSDVEQGGATVFTSLHT----------- 421
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKPE 175
A+ P KG A + +LH D D + H +CPV+ G KW + KWIH R F +P
Sbjct: 422 -----ALFPRKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKWVSNKWIHERGQEFRRPC 476
Query: 176 KEPED 180
ED
Sbjct: 477 DLEED 481
>gi|292619367|ref|XP_001922562.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Danio rerio]
Length = 541
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 62/165 (37%), Positives = 85/165 (51%), Gaps = 22/165 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R S +L+ + +V I RI T L + E +Q+ +Y G +YEPHFDF R
Sbjct: 386 RISKSAWLAAYEHPVVDRINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDFGRKDEP 445
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D + G+RIAT L Y+S V GG TVFP G AVKP+KG A
Sbjct: 446 DAFKELGTGNRIATWLFYMSDVAAGGATVFPEV----------------GAAVKPLKGTA 489
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
+ +++L P D ++ H +CPV+ G KW + KWIH R F +P
Sbjct: 490 VFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWIHERGQEFRRP 534
>gi|195159311|ref|XP_002020525.1| GL13465 [Drosophila persimilis]
gi|194117294|gb|EDW39337.1| GL13465 [Drosophila persimilis]
Length = 578
Score = 107 bits (268), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 56/167 (33%), Positives = 92/167 (55%), Gaps = 17/167 (10%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
M + + + + RTS+ ++L+ ++ ++ +E R+ T EN E Q+++Y G
Sbjct: 415 MTFNKQKLRPLIDSGRTSNSVWLTSHENAVMERLERRVGVMTNFEMENSEVYQLINYGIG 474
Query: 61 QKYEPHFDFFRD-KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
Y+PH D F ++ + GG RIATVL YLS V +GG T+FP +S
Sbjct: 475 GHYKPHTDHFETPQLEHRGGGDRIATVLFYLSDVPQGGATLFPRLNIS------------ 522
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
V+P +GDALL+++L+ + ++H SCP+I+G KW+ KWI
Sbjct: 523 ----VQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIKGSKWALVKWI 565
>gi|242018356|ref|XP_002429643.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
humanus corporis]
gi|212514628|gb|EEB16905.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
humanus corporis]
Length = 534
Score = 107 bits (267), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 65/179 (36%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++E+GK + R S +L VA + R+ T L E++Q+++Y G
Sbjct: 364 VQNSETGKLEVAHYRISKSAWLEDVDHPYVAKVSQRVEDITGLNMATAESLQVVNYGIGG 423
Query: 62 KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R + Q LG G+RIAT+L Y+S V +GG TVFP +VS W
Sbjct: 424 HYEPHFDFARKEEKNAFQSLGTGNRIATILFYMSDVSQGGATVFPGIKVSL-----W--- 475
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
P KG A +++L + D + H +CPV+ G KW KWIH R F +P
Sbjct: 476 --------PKKGTAAFWYNLRKNGEGDYLTRHAACPVLTGSKWVCNKWIHERGQEFRRP 526
>gi|390176896|ref|XP_002136934.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
gi|388858831|gb|EDY67492.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
Length = 513
Score = 107 bits (267), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 58/178 (32%), Positives = 95/178 (53%), Gaps = 19/178 (10%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
M + + + + RTS+ ++L+ ++ ++ +E R+ T EN E Q+++Y G
Sbjct: 352 MTFNKQKLRPLIDSGRTSNSVWLTSHENAVMERLERRVGVMTNFEMENSEVYQLINYGIG 411
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
Y+PH D F ++ GG RIATVL YLS V +GG T+FP +S
Sbjct: 412 GHYKPHTDHFETPQHRG-GGDRIATVLFYLSDVPQGGATLFPRLNIS------------- 457
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
V+P +GDALL+++L+ + ++H SCP+I+G KW+ KWI +P + P
Sbjct: 458 ---VQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIQGSKWALVKWID--ELSQPFRRP 510
>gi|321474898|gb|EFX85862.1| hypothetical protein DAPPUDRAFT_309117 [Daphnia pulex]
Length = 541
Score = 107 bits (267), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 92/177 (51%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++ +G S ++ R + FL ++ + + + RI T L E +Q+ +Y G
Sbjct: 372 VQNSLTGASEPTKYRIAKAAFLQNSEHDHIVKMTRRIGDVTGLDMTTAEELQVCNYGIGG 431
Query: 62 KYEPHFDFFRD-KMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
YEPH+D R ++ + G G+RIAT + Y+S VE GG TVFP +
Sbjct: 432 HYEPHYDHARKGEVQKDFGWGNRIATWMFYMSDVEAGGATVFPQINL------------- 478
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
A+ P KG A +F+LHP+ D + H +CPV+ G KW + KWIH RN F +P
Sbjct: 479 ---ALWPQKGSAAFWFNLHPNGEGDDLTQHAACPVLTGSKWVSNKWIHERNQEFRRP 532
>gi|147791524|emb|CAN70717.1| hypothetical protein VITISV_029140 [Vitis vinifera]
Length = 173
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 69/189 (36%), Positives = 94/189 (49%), Gaps = 50/189 (26%)
Query: 9 KSIASEVRTSSGMFLSKAQDE---------------------------IVASIEARIAAW 41
K I S+VRTSSGMFLS + +IE RI+ +
Sbjct: 6 KGIQSDVRTSSGMFLSPDDSTYPIVRVFVVPPMEGFWNSCGLSNSLCLFLQAIEKRISVY 65
Query: 42 TFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVF 101
+ +P ENGE +Q N + GG R+AT+L+YLS +GGET F
Sbjct: 66 SQVPVENGELIQF--------------------NLKRGGQRVATMLIYLSDNVEGGETYF 105
Query: 102 PNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWS 161
P + R G S RG +V P+KG+A+LF+S+ D +D S+HG C V+ GEKWS
Sbjct: 106 PMAGSGFCRCGGKSV---RGLSVAPVKGNAVLFWSMGLDGQSDPNSIHGGCEVLAGEKWS 162
Query: 162 ATKWIHVRN 170
ATKW+ R+
Sbjct: 163 ATKWMRQRS 171
>gi|385205097|ref|ZP_10031967.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
gi|385184988|gb|EIF34262.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
Length = 292
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 55/170 (32%), Positives = 88/170 (51%), Gaps = 21/170 (12%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
+GK RTS G++ + +D + ++ RI++ P ENGE +Q+L Y +Y PH
Sbjct: 138 TGKEDVIRNRTSEGIWYQRGEDPFIERMDRRISSLMNWPVENGEGLQLLRYGTTGEYRPH 197
Query: 67 FDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
FD+F ++ GG R+AT+++YL+ V GGET+FP + G
Sbjct: 198 FDYFPPDQPGSTVHTAQGGQRVATLVIYLNDVPDGGETIFPEA----------------G 241
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+V +G A+ F ++ D +LHG PV+ G+KW TKW+ R +
Sbjct: 242 MSVAASQGGAVYFRYMNGRRQLDPLTLHGGAPVLSGDKWIMTKWMRERAY 291
>gi|170064960|ref|XP_001867743.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
gi|167882146|gb|EDS45529.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
Length = 545
Score = 107 bits (266), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 93/179 (51%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G+ + R S +L + +V ++ R+ T L E +Q+++Y G
Sbjct: 372 VQNYKTGELEVANYRISKSAWLKDEEHSVVRTVGQRVEDMTGLTMTTAEELQVVNYGIGG 431
Query: 62 KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R + + LG G+RIATVL Y+S V +GG TVFP+ V
Sbjct: 432 HYEPHFDFARREEKNAFKSLGTGNRIATVLFYMSDVSQGGATVFPSIRV----------- 480
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
A++P KG A +++LH D + H +CPV+ G KW + KWIH R F +P
Sbjct: 481 -----ALRPKKGTAAFWYNLHASGHGDYATRHAACPVLTGTKWVSNKWIHERGQEFLRP 534
>gi|432949777|ref|XP_004084253.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Oryzias
latipes]
Length = 532
Score = 107 bits (266), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L D ++ + RI T L E E +Q+ +Y G
Sbjct: 365 VRDPKTGVLTTAPYRVSKSAWLEGEDDPVIDRVNQRIQDITGLTVETAELLQVANYGVGG 424
Query: 62 KYEPHFDFFRDKM--NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R N ++ G+R+AT L Y+S VE GG TVFP+
Sbjct: 425 QYEPHFDFSRRPFDSNLKVDGNRLATFLNYMSDVEAGGATVFPDF--------------- 469
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
G ++ P KG A+ +++L D + H +CPV+ G KW + KWIH R F +P
Sbjct: 470 -GASIWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWIHERGQEFRRP 525
>gi|239915958|ref|NP_001070123.2| prolyl 4-hydroxylase alpha II-like precursor [Danio rerio]
Length = 490
Score = 107 bits (266), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 63/166 (37%), Positives = 89/166 (53%), Gaps = 24/166 (14%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
+ +++G + S++RTS +FL + VA I RIA T L E+ E + + +Y G
Sbjct: 333 LTRSQTGWGVISDIRTSQSVFLEEVG--TVARISQRIADITGLSVESAEKLHVQNYGIGG 390
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
+Y PHFD D++N+ R AT L+Y+S VE GG TVF N G
Sbjct: 391 RYTPHFDT-GDEVNE-----RTATFLIYMSDVEVGGATVFTNV----------------G 428
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
AVKP KG A+ +++LH + D + H CPV+ G KW A KWIH
Sbjct: 429 VAVKPEKGSAVFWYNLHKNGELDLKTKHAGCPVLVGNKWVANKWIH 474
>gi|345481336|ref|XP_001600680.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Nasonia
vitripennis]
Length = 556
Score = 107 bits (266), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 65/190 (34%), Positives = 99/190 (52%), Gaps = 25/190 (13%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G+ + R S +L + + + V ++ R+ T + E E +Q+++Y G
Sbjct: 379 VQNYKTGELEIANYRISKSAWLQEHEHKHVRAVSQRVEHMTSMSIETAEELQVVNYGIGG 438
Query: 62 KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R + + LG G+RIATVL Y+S VE+GG TVF +S
Sbjct: 439 HYEPHFDFARREEKNAFKSLGTGNRIATVLYYMSDVEQGGGTVFTKINIS---------- 488
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP- 174
+ P KG A +++L P+ D + H +CPV+ G KW A KW+H R F +P
Sbjct: 489 ------LWPKKGSAAFWYNLKPNGEGDYKTRHAACPVLTGSKWVANKWLHERGQEFHRPC 542
Query: 175 --EKEPEDDD 182
E +P D D
Sbjct: 543 TLENQPADVD 552
>gi|194765194|ref|XP_001964712.1| GF22904 [Drosophila ananassae]
gi|190614984|gb|EDV30508.1| GF22904 [Drosophila ananassae]
Length = 547
Score = 106 bits (265), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 63/185 (34%), Positives = 93/185 (50%), Gaps = 22/185 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++ +G + R S +L +D ++ ++ R A T L ++ E +Q+++Y G
Sbjct: 373 VQNSVTGALETANYRISKSAWLKTEEDHVIGTVVQRTADMTGLDMDSAEELQVVNYGIGG 432
Query: 62 KYEPHFDFFRDKMNQQLGG----HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R + + G +RIATVL Y+S VE+GG TVF +
Sbjct: 433 HYEPHFDFARKEEKRAFEGLNLGNRIATVLFYMSDVEQGGATVFTSLHT----------- 481
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKPE 175
A+ P KG A + +LH D D + H +CPV+ G KW + KWIH R F +P
Sbjct: 482 -----ALFPKKGTAAFWMNLHRDGEGDVRTRHAACPVLTGTKWVSNKWIHERGQEFRRPC 536
Query: 176 KEPED 180
ED
Sbjct: 537 SMDED 541
>gi|410927705|ref|XP_003977281.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
rubripes]
Length = 531
Score = 106 bits (265), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 60/172 (34%), Positives = 87/172 (50%), Gaps = 20/172 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G+ + R S +L + +V I RI T L E +Q+ +Y G
Sbjct: 362 VHDPQTGQLTTAPYRVSKSAWLGAFEHPVVDRINQRIEDITGLDVSTAEDLQVANYGVGG 421
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPH+DF R D + G+RIAT L+Y+S V+ GG TVF +
Sbjct: 422 QYEPHYDFGRKDEPDAFKELGTGNRIATWLLYMSEVQAGGATVFTDI------------- 468
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
G +V P KG A+ +++LHP D + H +CPV+ G KW + KWIH R
Sbjct: 469 ---GASVSPKKGSAVFWYNLHPSGDGDYRTRHAACPVLLGNKWVSNKWIHER 517
>gi|92096574|gb|AAI15350.1| LOC557059 protein [Danio rerio]
Length = 508
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 63/166 (37%), Positives = 89/166 (53%), Gaps = 24/166 (14%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
+ +++G + S++RTS +FL + VA I RIA T L E+ E + + +Y G
Sbjct: 351 LTRSQTGWGVISDIRTSQSVFLEEVG--TVARISQRIADITGLSVESAEKLHVQNYGIGG 408
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
+Y PHFD D++N+ R AT L+Y+S VE GG TVF N G
Sbjct: 409 RYTPHFDT-GDEVNE-----RTATFLIYMSDVEVGGATVFTNV----------------G 446
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
AVKP KG A+ +++LH + D + H CPV+ G KW A KWIH
Sbjct: 447 VAVKPEKGSAVFWYNLHKNGELDLKTKHAGCPVLVGNKWVANKWIH 492
>gi|24651407|ref|NP_733371.1| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
gi|20269806|gb|AAM18058.1|AF495536_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]EFB
[Drosophila melanogaster]
gi|15292529|gb|AAK93533.1| SD05564p [Drosophila melanogaster]
gi|23172692|gb|AAF57053.2| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
gi|220946562|gb|ACL85824.1| PH4alphaEFB-PA [synthetic construct]
Length = 550
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++ +G + R S +L +D ++ ++ R A T L ++ E +Q+++Y G
Sbjct: 376 VQNSVTGALETANYRISKSAWLKTQEDRVIETVVQRTADMTGLDMDSAEELQVVNYGIGG 435
Query: 62 KYEPHFDFFRDKMNQQLGG----HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R + + G +RIATVL Y+S VE+GG TVF +
Sbjct: 436 HYEPHFDFARKEEQRAFEGLNLGNRIATVLFYMSDVEQGGATVFTSLHT----------- 484
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
A+ P KG A + +LH D D + H +CPV+ G KW + KWIH R F +P
Sbjct: 485 -----ALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKWVSNKWIHERGQEFRRP 538
>gi|195341536|ref|XP_002037362.1| GM12882 [Drosophila sechellia]
gi|194131478|gb|EDW53521.1| GM12882 [Drosophila sechellia]
Length = 550
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++ +G + R S +L +D ++ ++ R A T L ++ E +Q+++Y G
Sbjct: 376 VQNSVTGALETANYRISKSAWLKTQEDRVIETVVQRTADMTGLDMDSAEELQVVNYGIGG 435
Query: 62 KYEPHFDFFRDKMNQQLGG----HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R + + G +RIATVL Y+S VE+GG TVF +
Sbjct: 436 HYEPHFDFARKEEERAFEGINLGNRIATVLFYMSDVEQGGATVFTSLHT----------- 484
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
A+ P KG A + +LH D D + H +CPV+ G KW + KWIH R F +P
Sbjct: 485 -----ALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKWVSNKWIHERGQEFRRP 538
>gi|383864775|ref|XP_003707853.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Megachile
rotundata]
Length = 550
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 65/179 (36%), Positives = 94/179 (52%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G + R S +L + + + VA++ R+ T L E E +Q+++Y G
Sbjct: 373 VQNYKTGALEIANYRISKSAWLQEHEHKHVAAVSKRVEHMTSLNVETAEELQVVNYGIGG 432
Query: 62 KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R + + LG G+RIATVL Y+S VE+GG TVF +S W
Sbjct: 433 HYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTVFTAINISL-----W--- 484
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
P KG A +F+L P+ D + H +CPV+ G KW A KW+H R F +P
Sbjct: 485 --------PRKGSAAFWFNLKPNGEGDLRTRHAACPVLTGSKWVANKWLHERGQEFLRP 535
>gi|195575089|ref|XP_002105512.1| GD21521 [Drosophila simulans]
gi|194201439|gb|EDX15015.1| GD21521 [Drosophila simulans]
Length = 550
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++ +G + R S +L +D ++ ++ R A T L ++ E +Q+++Y G
Sbjct: 376 VQNSVTGALETANYRISKSAWLKTQEDRVIETVVQRTADMTGLDMDSAEELQVVNYGIGG 435
Query: 62 KYEPHFDFFRDKMNQQLGG----HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R + + G +RIATVL Y+S VE+GG TVF +
Sbjct: 436 HYEPHFDFARKEEERAFEGLNLGNRIATVLFYMSDVEQGGATVFTSLHT----------- 484
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
A+ P KG A + +LH D D + H +CPV+ G KW + KWIH R F +P
Sbjct: 485 -----ALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKWVSNKWIHERGQEFRRP 538
>gi|291190128|ref|NP_001167431.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
gi|223649060|gb|ACN11288.1| Prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
Length = 538
Score = 106 bits (265), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 88/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L +D ++ + RI T L + E +QI +Y G
Sbjct: 369 VRDPKTGVLTTANYRVSKSAWLEGEEDPVIERVNQRIEDITGLTTQTAELLQIANYGVGG 428
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D G+R+AT L Y+S VE GG TVFP+
Sbjct: 429 QYEPHFDFSRKDEPDAFKTLGTGNRVATFLNYMSDVEAGGATVFPDF------------- 475
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KWIH R F +P
Sbjct: 476 ---GAAIYPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWIHERGQEFRRP 531
>gi|125772807|ref|XP_001357662.1| GA15946 [Drosophila pseudoobscura pseudoobscura]
gi|54637394|gb|EAL26796.1| GA15946 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 106 bits (264), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 90/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++ +G + R S +L +D ++A + R A T L E+ E +Q+++Y G
Sbjct: 375 VQNSVTGALETANYRISKSAWLKTEEDSVIAKVVQRTADMTGLDMESAEELQVVNYGIGG 434
Query: 62 KYEPHFDFFRDKMNQQLGG----HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
Y PHFDF R + + G +RIATVL Y+S VE+GG TVF + R W
Sbjct: 435 HYAPHFDFARREEKRAFEGLNLGNRIATVLFYMSDVEQGGATVF-----TTLRTALW--- 486
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
P +G A + +LH D D + H +CPV+ G KW + KWIH R F +P
Sbjct: 487 --------PKRGTAAFWMNLHRDGEGDKRTQHAACPVLTGTKWVSNKWIHERGQEFRRP 537
>gi|195452726|ref|XP_002073473.1| GK14136 [Drosophila willistoni]
gi|194169558|gb|EDW84459.1| GK14136 [Drosophila willistoni]
Length = 550
Score = 106 bits (264), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 59/172 (34%), Positives = 89/172 (51%), Gaps = 20/172 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++ +G + R S +L +D+++ ++ R A T L ++ E +Q+++Y G
Sbjct: 376 VQNSVTGALETANYRISKSAWLKTEEDQVIGTVVQRTADMTGLDMDSAEELQVVNYGIGG 435
Query: 62 KYEPHFDFFRDKMNQQLGG----HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R + + G +RIATVL Y+S VE+GG TVF +
Sbjct: 436 HYEPHFDFARREEKRAFEGLNLGNRIATVLFYMSDVEQGGATVFTSLHA----------- 484
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
A+ P KG A + +LH D D + H +CPV+ G KW + KWIH R
Sbjct: 485 -----ALWPKKGTAAFWMNLHRDGEGDVRTRHAACPVLTGTKWVSNKWIHER 531
>gi|112984520|ref|NP_001037195.1| prolyl 4-hydroxylase alpha subunit precursor [Bombyx mori]
gi|37543673|gb|AAM21932.1| prolyl 4-hydroxylase alpha subunit [Bombyx mori]
Length = 550
Score = 106 bits (264), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 58/169 (34%), Positives = 87/169 (51%), Gaps = 18/169 (10%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+V D ++G+ + R S +L + ++A I R+ T L + E +Q+++Y G
Sbjct: 366 VVHDPKTGELTPAHYRISKSSWLRDEESPVIARITQRVTDMTGLSMLHAEELQVVNYGIG 425
Query: 61 QKYEPHFDFFRDKMN--QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
YEPHFDF R + N + GG+RIATVL Y+S V +GG TVF
Sbjct: 426 GHYEPHFDFARKRENPFTKFGGNRIATVLFYMSDVAQGGATVF----------------T 469
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
G ++ P+K A + +LH D + H +CPV+ G KW + KWIH
Sbjct: 470 ELGLSLFPIKRAAAFWLNLHASGEGDLATRHAACPVLRGSKWVSNKWIH 518
>gi|195159323|ref|XP_002020531.1| GL13463 [Drosophila persimilis]
gi|194117300|gb|EDW39343.1| GL13463 [Drosophila persimilis]
Length = 487
Score = 106 bits (264), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 90/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++ +G + R S +L +D ++A + R A T L E+ E +Q+++Y G
Sbjct: 313 VQNSVTGALETANYRISKSAWLKTEEDSVIAKVVQRTADMTGLDMESAEELQVVNYGIGG 372
Query: 62 KYEPHFDFFRDKMNQQLGG----HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
Y PHFDF R + + G +RIATVL Y+S VE+GG TVF + R W
Sbjct: 373 HYAPHFDFARREEKRAFEGLNLGNRIATVLFYMSDVEQGGATVF-----TTLRTALW--- 424
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
P +G A + +LH D D + H +CPV+ G KW + KWIH R F +P
Sbjct: 425 --------PKRGTAAFWMNLHRDGEGDKRTQHAACPVLTGTKWVSNKWIHERGQEFRRP 475
>gi|432926124|ref|XP_004080841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
latipes]
Length = 523
Score = 106 bits (264), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 62/172 (36%), Positives = 86/172 (50%), Gaps = 20/172 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++GK ++ R S +L + IV I RI T L E +Q+ +Y G
Sbjct: 354 VHDPQTGKLTTAQYRVSKSAWLGSHEHPIVDRINQRIEDITGLDVSTAEDLQVANYGVGG 413
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L+Y+S V+ GG TVF +
Sbjct: 414 QYEPHFDFGRKDEADAFEELGTGNRIATWLLYMSDVQAGGNTVFTDI------------- 460
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
G V P KG A+ +++LH D + H +CPV+ G KW + KWIH R
Sbjct: 461 ---GAVVWPKKGTAVFWYNLHRSGEGDYRTRHAACPVLVGNKWVSNKWIHER 509
>gi|387016440|gb|AFJ50339.1| Prolyl 4-hydroxylase subunit alpha-1-like [Crotalus adamanteus]
Length = 543
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 60/158 (37%), Positives = 81/158 (51%), Gaps = 20/158 (12%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R S +LS ++ +VA I RI T L E +Q+ +Y G +YEPHFDF R
Sbjct: 388 RISKSAWLSGYENPVVARINQRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEP 447
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D + G+RIAT L Y+S V GG TVFP G +V P KG A
Sbjct: 448 DAFKELGTGNRIATWLFYMSDVAAGGATVFPEV----------------GASVWPKKGTA 491
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
+ +++L P D ++ H +CPV+ G KW + KWIH R
Sbjct: 492 VFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWIHER 529
>gi|398806116|ref|ZP_10565064.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
gi|398089832|gb|EJL80333.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
Length = 294
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/173 (35%), Positives = 82/173 (47%), Gaps = 18/173 (10%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D A+ R++ L A E+V +EARI T P E +Q+ Y GQ
Sbjct: 125 VVDPHQDAVHAAHFRSNDSAQLPAAGSELVRRVEARIERLTGWPSAFCETLQLQRYAQGQ 184
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
Y PH+DFF M + GG R+AT+++YL E GG T F N G
Sbjct: 185 DYRPHYDFFGQDMVEAQGGQRLATLILYLRAPEAGGATYFAN----------------LG 228
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
+ P KG AL F +PD +S +LHG V+ GEKW AT+W R + P
Sbjct: 229 MRIAPRKGSALFF--TYPDPGNNSGTLHGGEAVLAGEKWIATQWFRDRAWRHP 279
>gi|195390835|ref|XP_002054073.1| GJ22993 [Drosophila virilis]
gi|194152159|gb|EDW67593.1| GJ22993 [Drosophila virilis]
Length = 525
Score = 105 bits (263), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 60/178 (33%), Positives = 95/178 (53%), Gaps = 21/178 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + + G++ + RTS +L+ + + + + RI+ T E +Q+++Y G
Sbjct: 359 VFNQKMGRNTVVKTRTSKVTWLTDSLNPLTVRLNRRISDMTGFDLYGSEMLQVMNYGLGG 418
Query: 62 KYEPHFDFFRDKMNQ---QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
Y+ HFD+F + + +L G RIATVL YL+ VE+GG TVFPN + Q
Sbjct: 419 HYDLHFDYFNATIAKDLTKLNGDRIATVLFYLTDVEQGGATVFPN--IKQ---------- 466
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI--HVRNFDKP 174
A+ P KG A+++++L + D +LH +CPVI G KW KWI H + F +P
Sbjct: 467 ----AIFPKKGTAVMWYNLRHNNDGDPQTLHAACPVIVGSKWVCNKWIREHQQLFRRP 520
>gi|307211752|gb|EFN87747.1| Prolyl 4-hydroxylase subunit alpha-1 [Harpegnathos saltator]
Length = 415
Score = 105 bits (262), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 94/179 (52%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G + R S +L + + + VA++ R+ T + E E +Q+++Y G
Sbjct: 238 VQNYKTGALEIANYRISKSAWLQEHEHKHVAAVSKRVEHMTSMSVETAEELQVVNYGIGG 297
Query: 62 KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R + + LG G+RIATVL Y+S VE+GG TVF +S
Sbjct: 298 HYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTVFTAINIS---------- 347
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
+ P KG A +++L P+ D + H +CPV+ G KW A KW+H R F +P
Sbjct: 348 ------LWPRKGSAAFWYNLKPNGEGDFKTRHAACPVLTGSKWVANKWLHERGQEFHRP 400
>gi|390989473|ref|ZP_10259770.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
pv. punicae str. LMG 859]
gi|372555742|emb|CCF66745.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
pv. punicae str. LMG 859]
Length = 152
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 61/164 (37%), Positives = 84/164 (51%), Gaps = 23/164 (14%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
RTS M L QD + IEARIA P ++GE +Q+L Y G +Y PH+D+F
Sbjct: 6 RTSDSMCLRVGQDALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAA 65
Query: 72 -DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
+ Q GG R+A+++MYL+ E+GG T FP++ + V +KG+
Sbjct: 66 GTPILLQAGGQRVASLVMYLNTPERGGATRFPDAHLD----------------VAAVKGN 109
Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKP 174
A+ F P T SLH PV+ GEKW ATKW+ R P
Sbjct: 110 AVFFSYDRPHPMT--RSLHAGAPVLTGEKWVATKWLRERAVRMP 151
>gi|115313004|gb|AAI24075.1| Zgc:152670 [Danio rerio]
Length = 235
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 66/168 (39%), Positives = 89/168 (52%), Gaps = 28/168 (16%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEH 59
+ +++G + SE+RTS +FL DE+ VA I RIA T L E+ E + + +Y
Sbjct: 78 LTRSQTGWGVISEIRTSQSVFL----DEVGTVARISQRIADITGLSVESAEKLHVQNYGI 133
Query: 60 GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
G +Y PHFD D +N+ R AT L+Y+S VE GG TVF N
Sbjct: 134 GGRYTPHFDAGGD-VNE-----RTATFLIYMSDVEVGGATVFTNV--------------- 172
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
G AVKP KG A+ + +LH + D + H CPV+ G KW A KWIH
Sbjct: 173 -GVAVKPEKGSAVFWNNLHKNGELDLKTKHAGCPVLVGNKWVANKWIH 219
>gi|307190793|gb|EFN74662.1| Prolyl 4-hydroxylase subunit alpha-2 [Camponotus floridanus]
Length = 476
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/179 (35%), Positives = 94/179 (52%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G + R S +L + + + VA++ R+ T + E E +Q+++Y G
Sbjct: 299 VQNYKTGALEIANYRISKSAWLQEHEHKHVAAVSKRVEHMTSMSIETAEELQVVNYGIGG 358
Query: 62 KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R + + LG G+RIATVL Y+S VE+GG TVF +S W
Sbjct: 359 HYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTVFTAINISL-----W--- 410
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
P KG A +++L P+ D + H +CPV+ G KW A KW+H R F +P
Sbjct: 411 --------PRKGSAAFWYNLKPNGEGDFKTRHAACPVLTGSKWVANKWLHERGQEFHRP 461
>gi|195425415|ref|XP_002061004.1| GK10713 [Drosophila willistoni]
gi|194157089|gb|EDW71990.1| GK10713 [Drosophila willistoni]
Length = 502
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 64/194 (32%), Positives = 100/194 (51%), Gaps = 26/194 (13%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYEH 59
+ D + ++ RTS+ +FL +V + R+A T L + + +Q+++Y
Sbjct: 317 IYDYDKEGNVPVNFRTSNSVFLLNNASYLVDILRQRVADMTHLNVFKNSSDDLQVMNYGL 376
Query: 60 GQKYEPHFDFF-RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
G Y HFDFF +D+ +L G RI TVL+Y++ V++GG TVFP ++
Sbjct: 377 GGYYRYHFDFFGKDESPNKLLGDRIITVLIYMTDVQQGGATVFPALRITNF--------- 427
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKP-- 174
P KG AL+F +L + S D ++LH CPV+ G KW+ATKWI+ + F KP
Sbjct: 428 -------PKKGSALIFRNLDNNISPDPSTLHAGCPVLFGSKWAATKWIYSAEQMFRKPCL 480
Query: 175 ---EKEPEDDDCVD 185
E P D ++
Sbjct: 481 PQNELRPYDTHVIE 494
>gi|198429625|ref|XP_002128613.1| PREDICTED: similar to procollagen-proline, 2-oxoglutarate
4-dioxygenase (proline 4-hydroxylase), alpha 1
polypeptide [Ciona intestinalis]
Length = 195
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 58/159 (36%), Positives = 87/159 (54%), Gaps = 21/159 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD--- 72
R S +L ++ + RI+ T L E E +QI +Y G +YEPHFD+ R
Sbjct: 36 RVSKSAWLKDEDHPVIKRVCQRISDVTGLSMETAEELQIANYGVGGQYEPHFDYSRKSDF 95
Query: 73 -KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
K + ++G +RIAT L Y+S+VE+GG TVF + G AV+P+KG A
Sbjct: 96 GKFDDEVG-NRIATFLTYMSNVEQGGSTVFLHP----------------GIAVRPIKGSA 138
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
+ +++L P + D + H +CPV+ G KW + KWIH R+
Sbjct: 139 VFWYNLLPSGAGDERTRHAACPVLTGVKWVSNKWIHERD 177
>gi|321474876|gb|EFX85840.1| hypothetical protein DAPPUDRAFT_309107 [Daphnia pulex]
Length = 528
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 61/181 (33%), Positives = 91/181 (50%), Gaps = 26/181 (14%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G+S+ + R + FL ++ ++ + R+ T L E +Q+ +Y G
Sbjct: 359 VTDRDTGRSMPVQYRIAKAAFLKDSEHNLIVKMSRRVGDITGLDMAASEDLQVCNYGIGG 418
Query: 62 KYEPHFDFFRDKMNQQLG------GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
Y PHFD+ R + G G+RIAT L Y+S VE GG TVFP
Sbjct: 419 HYVPHFDYARQ--GEIHGPRDLDWGNRIATWLFYMSDVEAGGATVFPAV----------- 465
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDK 173
G A+ P KG A +++L P+ + D +LH CPV+ G KW + KWIH R+ F +
Sbjct: 466 -----GAALWPQKGSAAFWYNLRPNGNGDEDTLHAGCPVLTGSKWVSNKWIHERSQEFRR 520
Query: 174 P 174
P
Sbjct: 521 P 521
>gi|334311009|ref|XP_001371555.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Monodelphis
domestica]
Length = 534
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/170 (34%), Positives = 86/170 (50%), Gaps = 18/170 (10%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G I R S +L + D I+A + R+ T L + E +Q+ +Y G
Sbjct: 367 VRDPKTGHLIVVSYRISKSSWLKEDDDPIIAQVNRRMQYITGLSVKTAELLQVSNYGMGG 426
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 427 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDF--------------- 471
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
G A+ P KG ++ +++L D + H +CPV+ G KW + KW H R
Sbjct: 472 -GAAIWPKKGTSVFWYNLFRSGECDYRTRHAACPVLVGSKWVSNKWFHER 520
>gi|397568865|gb|EJK46391.1| hypothetical protein THAOC_34939 [Thalassiosira oceanica]
Length = 488
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 69/178 (38%), Positives = 95/178 (53%), Gaps = 18/178 (10%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
+ D + G+ AS+ RTS F++ D I+ IE R A+ T +P + E +Q+L Y +
Sbjct: 306 LKDADKGRP-ASDWRTSQSTFVAAMGDPILRDIELRTASLTRVPVTHQEFVQVLRYGVTE 364
Query: 62 KYEPHFDFF------RDKMNQQL----GGHRIATVLMYLSHVEKGGETVFP-NSEVSQSR 110
KY+ H DFF D QL +R ATV YL+ V +GGET FP + R
Sbjct: 365 KYDAHHDFFDPSSYRSDPGTLQLIENGKKNRYATVFWYLTDVARGGETCFPRHGGAPPPR 424
Query: 111 DGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGE--KWSATKWI 166
D +S C G VKP KG ++F+SL D SLHG+CPV+ E KW+A KW+
Sbjct: 425 D--FSMCT--GLKVKPQKGKVIIFYSLDASGEMDPLSLHGACPVLGKEDIKWAANKWL 478
>gi|198477152|ref|XP_002136738.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
gi|198145043|gb|EDY71755.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
Length = 517
Score = 105 bits (261), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 58/180 (32%), Positives = 94/180 (52%), Gaps = 20/180 (11%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
++ +N + RTS+ ++L+ ++ ++ +E R+ T EN E Q+++Y G
Sbjct: 353 VMVNNLKVRPFIDSGRTSNSVWLASHENAVMERLERRVGVMTNFEMENSEVYQLINYGIG 412
Query: 61 QKYEPHFDFFRDKM--NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
Y+PH D F + GG RIATVL YLS V +GG T+FP +S
Sbjct: 413 GHYKPHTDHFETPQAPEHRGGGDRIATVLFYLSDVPQGGATLFPRLNIS----------- 461
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
V+P +GDALL+++L+ + ++H SCP+I+G KW+ KWI +P + P
Sbjct: 462 -----VQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIQGSKWALVKWID--ELSQPFRRP 514
>gi|432904500|ref|XP_004077362.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
latipes]
Length = 555
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 58/158 (36%), Positives = 81/158 (51%), Gaps = 20/158 (12%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R S +L+ +D +V I RI T L + E +Q+ +Y G +YEPHFDF R
Sbjct: 395 RISKSAWLTAYEDPVVEKINQRIEDLTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEP 454
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D + G+RIAT L Y+S V GG TVFP+ G +V P KG A
Sbjct: 455 DAFKELGTGNRIATWLFYMSDVSAGGATVFPDV----------------GASVGPQKGTA 498
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
+ +++L D ++ H +CPV+ G KW + KWIH R
Sbjct: 499 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWIHER 536
>gi|57525020|ref|NP_001006155.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Gallus gallus]
gi|82082587|sp|Q5ZLK5.1|P4HA2_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|53129464|emb|CAG31388.1| hypothetical protein RCJMB04_5l17 [Gallus gallus]
Length = 534
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 59/170 (34%), Positives = 86/170 (50%), Gaps = 18/170 (10%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 367 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVAKVNQRMQQITGLTVKTAELLQVANYGMGG 426
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 427 QYEPHFDFSRRPFDSTLKSEGNRLATFLNYMSDVEAGGATVFPDF--------------- 471
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R
Sbjct: 472 -GAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWFHER 520
>gi|340722330|ref|XP_003399560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
terrestris]
Length = 557
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/179 (35%), Positives = 94/179 (52%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G + R S +L + + E VA++ R+ T + + E +Q+++Y G
Sbjct: 380 VQNYKTGALEIANYRISKSAWLQEHEHEHVAAVSRRVEHMTSMTVDTAEELQVVNYGIGG 439
Query: 62 KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R + + LG G+RIATVL Y+S VE+GG TVF +S W
Sbjct: 440 HYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTVFTAINISL-----W--- 491
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
P KG A +++L P+ D + H +CPV+ G KW A KW+H R F +P
Sbjct: 492 --------PKKGSAAFWYNLKPNGEGDFKTRHAACPVLTGSKWVANKWLHERGQEFLRP 542
>gi|350416719|ref|XP_003491070.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
impatiens]
Length = 557
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 63/179 (35%), Positives = 94/179 (52%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G + R S +L + + E VA++ R+ T + + E +Q+++Y G
Sbjct: 380 VQNYKTGALEIANYRISKSAWLQEHEHEHVAAVSRRVEHMTSMTVDTAEELQVVNYGIGG 439
Query: 62 KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R + + LG G+RIATVL Y+S VE+GG TVF +S W
Sbjct: 440 HYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTVFTAINISL-----W--- 491
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
P KG A +++L P+ D + H +CPV+ G KW A KW+H R F +P
Sbjct: 492 --------PKKGSAAFWYNLKPNGEGDFKTRHAACPVLTGSKWVANKWLHERGQEFLRP 542
>gi|198449502|ref|XP_001357605.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
gi|198130635|gb|EAL26739.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
Length = 510
Score = 104 bits (260), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 57/165 (34%), Positives = 90/165 (54%), Gaps = 20/165 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+S RT+ G +L ++ + + I R+ + L E E MQ+++Y G Y PH D+F
Sbjct: 357 SSPTRTAMGAWLKRSSNALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKDWF- 415
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ G+R+ATVL YL+ VE+GG T+F +E + V P +G A
Sbjct: 416 -TQHPEVMGNRLATVLFYLTDVEQGGATMFNKAE----------------HKVLPRRGTA 458
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
L +++LH D D ++ H +CP+I G KW T+WI RN F +P
Sbjct: 459 LFWYNLHTDGEGDWSTTHAACPIIVGSKWVLTQWIRERNQIFIRP 503
>gi|326928728|ref|XP_003210527.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Meleagris
gallopavo]
Length = 535
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/170 (34%), Positives = 86/170 (50%), Gaps = 18/170 (10%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 368 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVAKVNQRMQQITGLTVKTAELLQVANYGMGG 427
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 428 QYEPHFDFSRRPFDSTLKSEGNRLATFLNYMSDVEAGGATVFPDF--------------- 472
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R
Sbjct: 473 -GAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWFHER 521
>gi|195341590|ref|XP_002037389.1| GM12139 [Drosophila sechellia]
gi|194131505|gb|EDW53548.1| GM12139 [Drosophila sechellia]
Length = 525
Score = 104 bits (260), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 58/172 (33%), Positives = 89/172 (51%), Gaps = 20/172 (11%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
SG++ + RTS + + + + ARI+ T E +Q+++Y G Y+ H
Sbjct: 364 SGRNEVVKTRTSKVAWFPDGYNPLTVRLNARISDMTGFNLYGSEMLQLMNYGLGGHYDQH 423
Query: 67 FDFFRDKMNQQ--LGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAV 124
+DFF + + + G RIATVL YL+ VE+GG TVFPN R+ AV
Sbjct: 424 YDFFNNTNSNMTAMSGDRIATVLFYLTDVEQGGATVFPN--------------IRK--AV 467
Query: 125 KPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
P +G +++++L + D+ +LH +CPVI G KW KWI R F +P
Sbjct: 468 FPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGSKWVCNKWIREREQIFSRP 519
>gi|195159144|ref|XP_002020442.1| GL13995 [Drosophila persimilis]
gi|194117211|gb|EDW39254.1| GL13995 [Drosophila persimilis]
Length = 535
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 57/165 (34%), Positives = 90/165 (54%), Gaps = 20/165 (12%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
+S RT+ G +L ++ + + I R+ + L E E MQ+++Y G Y PH D+F
Sbjct: 382 SSPTRTALGAWLKRSSNALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKDWF- 440
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
+ ++ G+R+ATVL YL+ VE+GG T+F +E + V P +G A
Sbjct: 441 -TQHPEVMGNRLATVLFYLTDVEQGGATMFNKAE----------------HKVLPRRGTA 483
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
L +++LH D D ++ H +CP+I G KW T+WI RN F +P
Sbjct: 484 LFWYNLHTDGEGDWSTTHAACPIIVGSKWVLTQWIRERNQIFIRP 528
>gi|291190274|ref|NP_001167096.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide precursor [Salmo
salar]
gi|223648100|gb|ACN10808.1| Prolyl 4-hydroxylase subunit alpha-1 precursor [Salmo salar]
Length = 545
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 59/158 (37%), Positives = 82/158 (51%), Gaps = 20/158 (12%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R S +L+ +D +V I RI T L + E +Q+ +Y G +YEPHFDF R
Sbjct: 390 RISKSAWLTAYEDPVVDKINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDFGRKDEP 449
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D + G+RIAT L+Y+S V GG TVF + G AV P KG A
Sbjct: 450 DAFKELGTGNRIATWLIYMSDVPSGGATVFTDV----------------GAAVWPKKGSA 493
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
+ +++L P D ++ H +CPV+ G KW + KWIH R
Sbjct: 494 VFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWIHER 531
>gi|312032360|ref|NP_001185667.1| prolyl 4-hydroxylase subunit alpha-1 isoform 4 precursor [Gallus
gallus]
Length = 536
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 63/174 (36%), Positives = 88/174 (50%), Gaps = 22/174 (12%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
+G + R S +LS + +V+ I RI T L E +Q+ +Y G +YEPH
Sbjct: 372 TGALETAHYRISKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPH 431
Query: 67 FDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGY 122
FDF R D + G+RIAT L Y+S V GG TVFP EV G
Sbjct: 432 FDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFP--EV--------------GA 475
Query: 123 AVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
+V P KG A+ +++L P D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 476 SVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 529
>gi|195575145|ref|XP_002105540.1| GD16902 [Drosophila simulans]
gi|194201467|gb|EDX15043.1| GD16902 [Drosophila simulans]
Length = 525
Score = 104 bits (259), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 61/178 (34%), Positives = 91/178 (51%), Gaps = 22/178 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V SG++ + RTS + + + + ARI+ T E +Q+++Y G
Sbjct: 359 VYQASSGRNEVVKTRTSKVAWFPDGYNPLTVRLNARISDMTGFNLYGSEMLQLMNYGLGG 418
Query: 62 KYEPHFDFFRDKMNQQL---GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
Y+ H+DFF +K N + G RIATVL YL+ VE+GG TVFPN
Sbjct: 419 HYDQHYDFF-NKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFPN--------------I 463
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
R+ AV P +G +++++L + D+ +LH +CPVI G KW KWI R F +P
Sbjct: 464 RK--AVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGSKWVCNKWIREREQIFSRP 519
>gi|224068121|ref|XP_002191580.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Taeniopygia
guttata]
Length = 539
Score = 104 bits (259), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 59/170 (34%), Positives = 86/170 (50%), Gaps = 18/170 (10%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 372 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVAKVNQRMQHITGLTVKTAELLQVANYGMGG 431
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 432 QYEPHFDFSRRPFDSTLKSEGNRLATFLNYMSDVEAGGATVFPDF--------------- 476
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R
Sbjct: 477 -GAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWFHER 525
>gi|24651477|ref|NP_733395.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
gi|20269812|gb|AAM18061.1|AF495539_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]PV [Drosophila
melanogaster]
gi|23172718|gb|AAN14252.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
Length = 525
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 61/178 (34%), Positives = 91/178 (51%), Gaps = 22/178 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V SG++ + RTS + + + + ARI+ T E +Q+++Y G
Sbjct: 359 VYQASSGRNEVVKTRTSKVAWFPDGYNPLTVRLNARISDMTGFNLYGSEMLQLMNYGLGG 418
Query: 62 KYEPHFDFFRDKMNQQL---GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
Y+ H+DFF +K N + G RIATVL YL+ VE+GG TVFPN
Sbjct: 419 HYDQHYDFF-NKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFPN--------------I 463
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
R+ AV P +G +++++L + D+ +LH +CPVI G KW KWI R F +P
Sbjct: 464 RK--AVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKWVCNKWIREREQIFSRP 519
>gi|21711777|gb|AAM75079.1| RE70601p [Drosophila melanogaster]
Length = 316
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 61/178 (34%), Positives = 91/178 (51%), Gaps = 22/178 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V SG++ + RTS + + + + ARI+ T E +Q+++Y G
Sbjct: 150 VYQASSGRNEVVKTRTSKVAWFPDGYNPLTVRLNARISDMTGFNLYGSEMLQLMNYGLGG 209
Query: 62 KYEPHFDFFRDKMNQQL---GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
Y+ H+DFF +K N + G RIATVL YL+ VE+GG TVFPN
Sbjct: 210 HYDQHYDFF-NKTNSNMTAMSGDRIATVLFYLTDVEQGGATVFPN--------------I 254
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
R+ AV P +G +++++L + D+ +LH +CPVI G KW KWI R F +P
Sbjct: 255 RK--AVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKWVCNKWIREREQIFSRP 310
>gi|312032358|ref|NP_001185666.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Gallus
gallus]
Length = 536
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 63/174 (36%), Positives = 88/174 (50%), Gaps = 22/174 (12%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
+G + R S +LS + +V+ I RI T L E +Q+ +Y G +YEPH
Sbjct: 372 TGALETAHYRISKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPH 431
Query: 67 FDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGY 122
FDF R D + G+RIAT L Y+S V GG TVFP EV G
Sbjct: 432 FDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFP--EV--------------GA 475
Query: 123 AVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
+V P KG A+ +++L P D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 476 SVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 529
>gi|74216495|dbj|BAE25162.1| unnamed protein product [Mus musculus]
Length = 187
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 20 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 79
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 80 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 124
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 125 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 180
>gi|326923461|ref|XP_003207954.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Meleagris gallopavo]
Length = 536
Score = 103 bits (258), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 63/174 (36%), Positives = 88/174 (50%), Gaps = 22/174 (12%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
+G + R S +LS + +V+ I RI T L E +Q+ +Y G +YEPH
Sbjct: 372 TGALETAHYRISKSAWLSGYESPVVSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPH 431
Query: 67 FDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGY 122
FDF R D + G+RIAT L Y+S V GG TVFP EV G
Sbjct: 432 FDFGRKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFP--EV--------------GA 475
Query: 123 AVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
+V P KG A+ +++L P D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 476 SVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 529
>gi|449267219|gb|EMC78185.1| Prolyl 4-hydroxylase subunit alpha-2 [Columba livia]
Length = 538
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 59/172 (34%), Positives = 86/172 (50%), Gaps = 20/172 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 369 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVAKVNQRMQQITGLTVKTAELLQVANYGMGG 428
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+R+AT L Y+S VE GG TVFP+
Sbjct: 429 QYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDF------------- 475
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R
Sbjct: 476 ---GAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWFHER 524
>gi|344254200|gb|EGW10304.1| Prolyl 4-hydroxylase subunit alpha-1 [Cricetulus griseus]
Length = 507
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 64/165 (38%), Positives = 87/165 (52%), Gaps = 22/165 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK-- 73
R S +LS +D +V+ I RI T L E +Q+ +Y G +YEPHFDF R
Sbjct: 352 RISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 411
Query: 74 -MNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
Q+LG G+RIAT L Y+S V GG TVFP EV G +V P KG A
Sbjct: 412 DAFQELGTGNRIATWLFYMSDVSAGGATVFP--EV--------------GASVWPKKGTA 455
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 456 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 500
>gi|190402274|gb|ACE77683.1| prolyl 4-hydroxylase subunit alpha-2 precursor (predicted) [Sorex
araneus]
Length = 533
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTTASYRVSKSSWLEETDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGG 425
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPD---------------- 469
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 470 LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526
>gi|148701598|gb|EDL33545.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_c [Mus
musculus]
gi|149052607|gb|EDM04424.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_d [Rattus norvegicus]
Length = 189
Score = 103 bits (258), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 22 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 81
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 82 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 126
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 127 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 182
>gi|345326417|ref|XP_001510155.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like
[Ornithorhynchus anatinus]
Length = 888
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 719 VRDPKTGVLTVANYRVSKSSWLEEEDDPVVAQVNRRMQYITGLTVKTAELLQVANYGMGG 778
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+R+AT L Y+S VE GG TVFP+
Sbjct: 779 QYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDF------------- 825
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 826 ---GAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 881
>gi|157114985|ref|XP_001658091.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
gi|108877086|gb|EAT41311.1| AAEL007038-PA [Aedes aegypti]
Length = 545
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 92/179 (51%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G+ + R S +L + + +I R+ T L E +Q+++Y G
Sbjct: 372 VQNYKTGELEVANYRISKSAWLKDHEHPYIKAIGERVEDMTGLTMSTAEELQVVNYGIGG 431
Query: 62 KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R + + LG G+RIATVL Y+S V +GG TVFP+ +
Sbjct: 432 HYEPHFDFARREETNAFKSLGTGNRIATVLFYMSDVTQGGATVFPSLRL----------- 480
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
A+ P KG A +F+LH D ++ H +CPV+ G KW + KWIH R F +P
Sbjct: 481 -----ALWPKKGAAAFWFNLHASGQGDYSTRHAACPVLTGTKWVSNKWIHERGQEFRRP 534
>gi|387016442|gb|AFJ50340.1| Prolyl 4-hydroxylase subunit alpha-2-like [Crotalus adamanteus]
Length = 533
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 59/170 (34%), Positives = 86/170 (50%), Gaps = 18/170 (10%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVANYRVSKSSWLEEEDDLVVARVNHRMEQITGLTTKTAELLQVANYGMGG 425
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRRPFDITLKTEGNRLATFLNYMSDVEAGGATVFPDF--------------- 470
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R
Sbjct: 471 -GAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWFHER 519
>gi|291387304|ref|XP_002710243.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 3 [Oryctolagus
cuniculus]
Length = 535
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 63/179 (35%), Positives = 92/179 (51%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA I R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARINRRMQHITGLTVKTAELLQVANYGMGG 425
Query: 62 KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R+ ++LG G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRNNERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 472
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 473 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528
>gi|54792285|emb|CAG28668.1| prolyl 4-hydroxylase alpha-2 subunit [Gallus gallus]
Length = 538
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 59/172 (34%), Positives = 86/172 (50%), Gaps = 20/172 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 370 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVAKVNQRMQQITGLTVKTAELLQVANYGMGG 429
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+R+AT L Y+S VE GG TVFP+
Sbjct: 430 QYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDF------------- 476
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R
Sbjct: 477 ---GAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWFHER 525
>gi|348501574|ref|XP_003438344.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
niloticus]
Length = 615
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 58/158 (36%), Positives = 81/158 (51%), Gaps = 20/158 (12%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R S +L++ D ++ I RI T L + E +Q+ +Y G +YEPHFDF R
Sbjct: 460 RISKSAWLTEYDDPMIEKINDRIEGVTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEP 519
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D + G+RIAT L Y+S V GG TVFP+ G AV P KG A
Sbjct: 520 DAFKELGTGNRIATWLFYMSDVSAGGATVFPDV----------------GAAVWPQKGTA 563
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
+ +++L D ++ H +CPV+ G KW + KWIH R
Sbjct: 564 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWIHER 601
>gi|291387300|ref|XP_002710241.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 1 [Oryctolagus
cuniculus]
Length = 533
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA I R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARINRRMQHITGLTVKTAELLQVANYGMGG 425
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPD---------------- 469
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 470 LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526
>gi|3297815|emb|CAA19873.1| putative protein [Arabidopsis thaliana]
gi|7270340|emb|CAB80108.1| putative protein [Arabidopsis thaliana]
Length = 257
Score = 103 bits (257), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 57/136 (41%), Positives = 80/136 (58%), Gaps = 5/136 (3%)
Query: 15 VRTSSGMFLSKAQDEIVAS--IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
RTSSG F+S +++ A +E +IA T +P +GE+ IL YE GQKY+ H+D F
Sbjct: 78 TRTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNP 137
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG-NWSECARRGYAVKPMKGDA 131
RIA+ L+YLS VE+GGET+FP S G ++ +C G VKP KGD
Sbjct: 138 TEYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGYDYKQCI--GLKVKPRKGDG 195
Query: 132 LLFFSLHPDASTDSTS 147
LLF+S+ P+ + D +
Sbjct: 196 LLFYSVFPNGTIDQVN 211
>gi|332026992|gb|EGI67088.1| Prolyl 4-hydroxylase subunit alpha-1 [Acromyrmex echinatior]
Length = 415
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 93/179 (51%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G + R S +L + + + VA++ R+ T + E E +Q+++Y G
Sbjct: 238 VQNYKTGALEIANYRISKSAWLQEHEHKHVAAVSKRVEHMTSMSVETAEELQVVNYGIGG 297
Query: 62 KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R + + LG G+RIATVL Y+S VE+GG TVF +S
Sbjct: 298 HYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTVFTAINIS---------- 347
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
+ P KG A + +L P+ D + H +CPV+ G KW A KW+H R F +P
Sbjct: 348 ------LWPRKGSAAFWHNLKPNGEGDFKTRHAACPVLTGSKWVANKWLHERGQEFHRP 400
>gi|148701600|gb|EDL33547.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_e [Mus
musculus]
Length = 593
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 426 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 485
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 486 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPD---------------- 529
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 530 LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 586
>gi|354483225|ref|XP_003503795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Cricetulus griseus]
Length = 534
Score = 103 bits (257), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 62/165 (37%), Positives = 85/165 (51%), Gaps = 22/165 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK-- 73
R S +LS +D +V+ I RI T L E +Q+ +Y G +YEPHFDF R
Sbjct: 379 RISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438
Query: 74 -MNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
Q+LG G+RIAT L Y+S V GG TVFP G +V P KG A
Sbjct: 439 DAFQELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|348557544|ref|XP_003464579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Cavia porcellus]
Length = 533
Score = 103 bits (256), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEEDDPVVARVNRRMQQITGLTVKTAELLQVANYGMGG 425
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 470
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 471 -GAALWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526
>gi|194905436|ref|XP_001981196.1| GG11753 [Drosophila erecta]
gi|190655834|gb|EDV53066.1| GG11753 [Drosophila erecta]
Length = 550
Score = 103 bits (256), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 62/185 (33%), Positives = 92/185 (49%), Gaps = 22/185 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++ +G + R S +L + ++ ++ R A T L ++ E +Q+++Y G
Sbjct: 376 VQNSVTGALETANYRISKSAWLKTPEHRVIETVVQRTADMTGLDMDSAEELQVVNYGIGG 435
Query: 62 KYEPHFDFFRDKMNQQLGG----HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R + + G +RIATVL Y+S VE+GG TVF +
Sbjct: 436 HYEPHFDFARKEEQRAFEGLNLGNRIATVLFYMSDVEQGGATVFTSLHT----------- 484
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKPE 175
A+ P KG A + +LH D D + H +CPV+ G KW + KWIH R F +P
Sbjct: 485 -----ALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKWVSNKWIHERGQEFRRPC 539
Query: 176 KEPED 180
ED
Sbjct: 540 SLEED 544
>gi|195113237|ref|XP_002001174.1| GI10637 [Drosophila mojavensis]
gi|193917768|gb|EDW16635.1| GI10637 [Drosophila mojavensis]
Length = 529
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 60/178 (33%), Positives = 90/178 (50%), Gaps = 21/178 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + +S ++ + RTS +L +++ + RI T E +Q+++Y G
Sbjct: 360 VFNQQSMRNHVVKTRTSKVTWLLDTLNQLTIRLNRRITDMTGFDMYGSEMLQVMNYGLGG 419
Query: 62 KYEPHFDFFRDKMN---QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
Y+ H+D+F + +L G RIATVL YL+ VE+GG TVFPN E
Sbjct: 420 HYDKHYDYFNSSVAADLTRLNGDRIATVLFYLTDVEQGGATVFPNIE------------- 466
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
AV P G A+++++L D + D +LH +CPVI G KW KWI R F +P
Sbjct: 467 ---KAVFPKSGTAVVWYNLRHDGNGDPQTLHAACPVIVGSKWVCNKWIRERQQVFRRP 521
>gi|355709025|gb|AES03456.1| prolyl 4-hydroxylase, alpha polypeptide II [Mustela putorius furo]
Length = 532
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGG 425
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPD---------------- 469
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 470 LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526
>gi|73970649|ref|XP_850109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Canis
lupus familiaris]
Length = 533
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGG 425
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPD---------------- 469
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 470 LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526
>gi|350014318|dbj|GAA37183.1| prolyl 4-hydroxylase [Clonorchis sinensis]
Length = 595
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 61/173 (35%), Positives = 85/173 (49%), Gaps = 21/173 (12%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
+GK + RTS +L DE+ + RI A T L E E +Q+ +Y G Y PH
Sbjct: 425 TGKLENAYYRTSKSAWLQDGLDEVTHRLNQRIHALTGLAMETAEDLQVGNYGIGGYYAPH 484
Query: 67 FDFFRDKMNQQL---GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
FDF R + G+RIAT++ YL+ V+ GG TVF R G +
Sbjct: 485 FDFGRKREKDAFEVENGNRIATIIFYLTDVKAGGATVF----------------NRFGAS 528
Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
VKP++G A +++LHP D + H +CPV+ G KW W H R F +P
Sbjct: 529 VKPVRGAAGFWYNLHPSGEGDLRTRHVACPVLVGSKWVMNVWFHERGQEFRRP 581
>gi|195505255|ref|XP_002099425.1| GE23368 [Drosophila yakuba]
gi|194185526|gb|EDW99137.1| GE23368 [Drosophila yakuba]
Length = 528
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 60/173 (34%), Positives = 87/173 (50%), Gaps = 22/173 (12%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
SG++ RTS + + + ARI T E +Q+++Y G Y+ H
Sbjct: 367 SGRNEVVRTRTSKVAWFPDGYSPLTVRLNARITDMTGFNLHGSEMLQLMNYGLGGHYDQH 426
Query: 67 FDFFRDKMNQQL---GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
+D+F + +N L G RIATVL YL+ VE+GG TVFPN R+ A
Sbjct: 427 YDYF-NTINSNLTAMSGDRIATVLFYLTDVEQGGATVFPN--------------IRK--A 469
Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
V P +G +++++L D D+ +LH +CPVI G KW KWI R F +P
Sbjct: 470 VFPQRGSVIMWYNLKDDGQIDTQTLHAACPVIVGSKWVCNKWIREREQLFRRP 522
>gi|321474953|gb|EFX85917.1| hypothetical protein DAPPUDRAFT_309108 [Daphnia pulex]
Length = 549
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 61/176 (34%), Positives = 87/176 (49%), Gaps = 25/176 (14%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++ +GK ++ R S FL + V + R+ A T L E +Q+ +Y G
Sbjct: 377 VMNSATGKLETAKYRISKAAFLKNKEHHHVLKMSRRVGAITGLDMSTAEDLQVCNYGIGG 436
Query: 62 KYEPHFDFFRDKMNQQLG-------GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW 114
YEPHFD+ R N+ +G +RIAT L Y+S VE GG TVFP V
Sbjct: 437 HYEPHFDYARK--NETIGFNKDSGWRNRIATWLFYMSDVEAGGATVFPALNV-------- 486
Query: 115 SECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
A+ P KG A +++L P+ + + H +CPV+ G KW A KWIH +N
Sbjct: 487 --------ALWPQKGSAAFWYNLFPNGEGNELTRHAACPVLTGSKWVANKWIHEKN 534
>gi|426229221|ref|XP_004008689.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Ovis aries]
Length = 487
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 320 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGG 379
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 380 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPD---------------- 423
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 424 LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 480
>gi|74353841|gb|AAI03334.1| Prolyl 4-hydroxylase, alpha polypeptide II [Bos taurus]
Length = 487
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 320 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGG 379
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 380 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPD---------------- 423
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 424 LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 480
>gi|344264849|ref|XP_003404502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Loxodonta africana]
Length = 534
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 367 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVAQVNRRMQHITGLTVKTAELLQVANYGMGG 426
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 427 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 471
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 472 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 527
>gi|395736139|ref|XP_003776705.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 575
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 408 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 467
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 468 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 512
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 513 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 568
>gi|334314087|ref|XP_003339988.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
[Monodelphis domestica]
Length = 537
Score = 103 bits (256), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 61/165 (36%), Positives = 83/165 (50%), Gaps = 22/165 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R S +LS +D +V+ I RI T L E +Q+ +Y G +YEPHFDF R
Sbjct: 382 RISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEP 441
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D + G+RIAT L Y+S V GG TVFP G +V P KG A
Sbjct: 442 DAFKELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 485
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
+ +++L D ++ H +CPV+ G KW + KWIH R F +P
Sbjct: 486 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWIHERGQEFRRP 530
>gi|226874885|ref|NP_001029465.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Bos
taurus]
gi|296485623|tpg|DAA27738.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Bos taurus]
Length = 533
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGG 425
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPD---------------- 469
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 470 LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526
>gi|74225936|dbj|BAE28745.1| unnamed protein product [Mus musculus]
Length = 561
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 62/165 (37%), Positives = 85/165 (51%), Gaps = 22/165 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R S +LS +D +V+ I RI T L E +Q+ +Y G +YEPHFDF R
Sbjct: 379 RISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D + G+RIAT L Y+S V GG TVFP EV G +V P KG A
Sbjct: 439 DAFRELGTGNRIATWLFYMSDVSAGGATVFP--EV--------------GASVWPKKGTA 482
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|149038788|gb|EDL93077.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_b
[Rattus norvegicus]
Length = 534
Score = 103 bits (256), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 60/165 (36%), Positives = 83/165 (50%), Gaps = 22/165 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R S +LS +D +V+ I RI T L E +Q+ +Y G +YEPHFDF R
Sbjct: 379 RISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D + G+RIAT L Y+S V GG TVFP G +V P KG A
Sbjct: 439 DAFRELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|209862961|ref|NP_001129548.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Mus
musculus]
gi|17390970|gb|AAH18411.1| P4ha2 protein [Mus musculus]
gi|18073922|emb|CAC85690.1| Prolyl 4-hydroxylase alpha IIa subunit [Mus musculus]
gi|74211515|dbj|BAE26490.1| unnamed protein product [Mus musculus]
Length = 535
Score = 102 bits (255), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 368 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 427
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 428 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 472
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 473 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528
>gi|440912197|gb|ELR61789.1| Prolyl 4-hydroxylase subunit alpha-2, partial [Bos grunniens mutus]
Length = 535
Score = 102 bits (255), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 368 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGG 427
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 428 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPD---------------- 471
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 472 LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528
>gi|33859596|ref|NP_035160.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Mus musculus]
gi|20455506|sp|Q60715.2|P4HA1_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|16307134|gb|AAH09654.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide [Mus musculus]
gi|74144306|dbj|BAE36020.1| unnamed protein product [Mus musculus]
gi|74146660|dbj|BAE41331.1| unnamed protein product [Mus musculus]
gi|148700260|gb|EDL32207.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_a [Mus
musculus]
Length = 534
Score = 102 bits (255), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 60/165 (36%), Positives = 83/165 (50%), Gaps = 22/165 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R S +LS +D +V+ I RI T L E +Q+ +Y G +YEPHFDF R
Sbjct: 379 RISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D + G+RIAT L Y+S V GG TVFP G +V P KG A
Sbjct: 439 DAFRELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|328790718|ref|XP_392392.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Apis mellifera]
Length = 415
Score = 102 bits (255), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 94/179 (52%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G + R S +L + + + VA++ R+ T + + E +Q+++Y G
Sbjct: 238 VQNYKTGALEIANYRISKSAWLQEHEHKHVAAVSRRVEHMTSMTVDTAEELQVVNYGIGG 297
Query: 62 KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R + + LG G+RIATVL Y+S VE+GG TVF +
Sbjct: 298 HYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTVFTAINI----------- 346
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
A+ P KG A +++L P+ D + H +CPV+ G KW A KW+H R F +P
Sbjct: 347 -----ALWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLTGSKWVANKWLHERGQEFLRP 400
>gi|218665910|ref|YP_002425647.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
gi|218518123|gb|ACK78709.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
ferrooxidans ATCC 23270]
Length = 248
Score = 102 bits (255), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 59/166 (35%), Positives = 87/166 (52%), Gaps = 17/166 (10%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G+ +A R S + + I+ S+ IA T +P + E +QILHY G
Sbjct: 93 VTDEQTGQEVAHGERVSEMAWPKRDDYPILQSLAEGIAQLTGIPIDCQEPLQILHYRPGG 152
Query: 62 KYEPHFD-FFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
+Y+PH+D F D + GG+R AT+++YL+ VE+GGET FP
Sbjct: 153 EYKPHYDAFAADAPTLRQGGNRQATLILYLNAVEEGGETAFPE----------------L 196
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
G V P+ G + F +L+ + SLH PV +GEKW AT+WI
Sbjct: 197 GLQVSPIPGGGVFFRNLNEEGQRHPLSLHAGLPVRKGEKWIATQWI 242
>gi|410948132|ref|XP_003980795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Felis
catus]
gi|410948136|ref|XP_003980797.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Felis
catus]
Length = 533
Score = 102 bits (255), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 425
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 470
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 471 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526
>gi|348557542|ref|XP_003464578.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Cavia porcellus]
Length = 535
Score = 102 bits (255), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEEDDPVVARVNRRMQQITGLTVKTAELLQVANYGMGG 425
Query: 62 KYEPHFDFF----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF RD + G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRSHERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 472
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 473 ---GAALWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528
>gi|148701597|gb|EDL33544.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_b [Mus
musculus]
Length = 506
Score = 102 bits (255), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 337 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 396
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+R+AT L Y+S VE GG TVFP+
Sbjct: 397 QYEPHFDFSRSDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 443
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 444 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 499
>gi|74224984|dbj|BAE38205.1| unnamed protein product [Mus musculus]
Length = 534
Score = 102 bits (255), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 60/165 (36%), Positives = 83/165 (50%), Gaps = 22/165 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R S +LS +D +V+ I RI T L E +Q+ +Y G +YEPHFDF R
Sbjct: 379 RISKSAWLSGYEDPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D + G+RIAT L Y+S V GG TVFP G +V P KG A
Sbjct: 439 DAFRELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|301754231|ref|XP_002912939.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Ailuropoda
melanoleuca]
Length = 535
Score = 102 bits (255), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGG 425
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRKNEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPD-------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 472 --LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528
>gi|354474415|ref|XP_003499426.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Cricetulus griseus]
Length = 533
Score = 102 bits (255), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 425
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPD---------------- 469
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 470 LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526
>gi|195110919|ref|XP_002000027.1| GI24860 [Drosophila mojavensis]
gi|193916621|gb|EDW15488.1| GI24860 [Drosophila mojavensis]
Length = 487
Score = 102 bits (255), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 60/179 (33%), Positives = 89/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + +G + R S +L A+ ++ ++ R A T L ++ E +Q+++Y G
Sbjct: 313 VQNAVTGALETANYRISKSAWLKTAEHRVIGTVVQRTADMTGLDMDSAEELQVVNYGIGG 372
Query: 62 KYEPHFDFFRDKMNQQLGG----HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R + + G +RIATVL Y+S VE+GG TVF +
Sbjct: 373 HYEPHFDFARREEIRAFEGLNLGNRIATVLFYMSDVEQGGATVFTSLHA----------- 421
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
+KP KG A + +LH D + H +CPV+ G KW + KWIH R F +P
Sbjct: 422 -----VLKPKKGTAAFWMNLHRSGEGDVRTRHAACPVLTGSKWVSNKWIHERGQEFRRP 475
>gi|281206564|gb|EFA80750.1| putative prolyl 4-hydroxylase alpha subunit [Polysphondylium
pallidum PN500]
Length = 251
Score = 102 bits (255), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 87/162 (53%), Gaps = 24/162 (14%)
Query: 16 RTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR-- 71
R+ G+F+ + +++ + +I R+ + L E+ E MQ++ Y G++ HFD+F
Sbjct: 101 RSGWGLFMKEGEEDHPVTQNIFNRMKTFVNLT-ESSEVMQVIRYNPGEETSAHFDYFNPL 159
Query: 72 ---DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMK 128
M L G RI T+LMYL+ VE+GGET FP V VKP+K
Sbjct: 160 TTNGAMKIGLYGQRICTILMYLADVEEGGETSFPEVNVK----------------VKPIK 203
Query: 129 GDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
GDA+LF++ P+ D SLH PVI+G KW A K ++ +N
Sbjct: 204 GDAVLFYNCKPNGEVDPLSLHQGDPVIKGTKWIAIKLVNQKN 245
>gi|56118630|ref|NP_001007975.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
(Silurana) tropicalis]
gi|51513259|gb|AAH80485.1| p4ha2 protein [Xenopus (Silurana) tropicalis]
Length = 527
Score = 102 bits (255), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 58/171 (33%), Positives = 87/171 (50%), Gaps = 20/171 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D ++A + R+ A T L + E +Q+ +Y G
Sbjct: 368 VRDPKTGVLSVANYRVSKSAWLEENDDPVIARVNLRMQAITGLTVDTAELLQVANYGMGG 427
Query: 62 KYEPHFDFFRDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 428 QYEPHFDFSRRPFDSNLKTDGNRLATFLNYMSDVEAGGATVFPDF--------------- 472
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
G A+ P KG A+ +++L D + H +CPV+ G KW KW H ++
Sbjct: 473 -GAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKWG--KWTHTQD 520
>gi|328876967|gb|EGG25330.1| putative prolyl 4-hydroxylase alpha subunit [Dictyostelium
fasciculatum]
Length = 244
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/159 (37%), Positives = 84/159 (52%), Gaps = 24/159 (15%)
Query: 16 RTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR-- 71
R+ G+F+ + +++ +V I R+ L EN E MQ++ Y G++ H+D+F
Sbjct: 70 RSGWGLFMKEGEEDHDVVKKIFQRMKMLVNLT-ENCEVMQVIRYHPGEETSAHYDYFNPL 128
Query: 72 ---DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMK 128
M L G R+ T+LMYLS VE+GGET FP G VKP+K
Sbjct: 129 TTNGAMKIGLYGQRVCTILMYLSEVEEGGETSFP----------------EVGVKVKPVK 172
Query: 129 GDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
GDA+LF++ P+ D SLH PVI+G KW A K I+
Sbjct: 173 GDAVLFYNCKPNGEVDPLSLHQGDPVIKGTKWVAIKLIN 211
>gi|426229219|ref|XP_004008688.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Ovis aries]
Length = 535
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGG 425
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRKDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPD-------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 472 --LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528
>gi|327265288|ref|XP_003217440.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Anolis
carolinensis]
Length = 554
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/172 (34%), Positives = 86/172 (50%), Gaps = 20/172 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 385 VRDPKTGVLTVANYRVSKSSWLEEEDDLVVAKVNQRMEHITGLTVKTAELLQVANYGMGG 444
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+R+AT L Y+S VE GG TVFP+
Sbjct: 445 QYEPHFDFSRKEEPDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDF------------- 491
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R
Sbjct: 492 ---GAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKWVSNKWFHER 540
>gi|149052606|gb|EDM04423.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_c [Rattus norvegicus]
Length = 506
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 337 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 396
Query: 62 KYEPHFDFF----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF RD + G+R+AT L Y+S VE GG TVFP+
Sbjct: 397 QYEPHFDFSRSDERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 443
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 444 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 499
>gi|380025232|ref|XP_003696381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Apis florea]
Length = 537
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 94/179 (52%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G + R S +L + + + VA++ R+ T + + E +Q+++Y G
Sbjct: 360 VQNYKTGALEIANYRISKSAWLQEHEHKHVAAVSRRVEHMTSMTVDTAEELQVVNYGIGG 419
Query: 62 KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R + + LG G+RIATVL Y+S VE+GG TVF +
Sbjct: 420 HYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVEQGGGTVFTAINI----------- 468
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
A+ P KG A +++L P+ D + H +CPV+ G KW A KW+H R F +P
Sbjct: 469 -----ALWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLTGSKWVANKWLHERGQEFLRP 522
>gi|157818741|ref|NP_001101745.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Rattus norvegicus]
gi|149052604|gb|EDM04421.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_a [Rattus norvegicus]
Length = 535
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 425
Query: 62 KYEPHFDFF----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF RD + G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRSDERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 472
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 473 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528
>gi|344264847|ref|XP_003404501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Loxodonta africana]
Length = 536
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 367 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVAQVNRRMQHITGLTVKTAELLQVANYGMGG 426
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+R+AT L Y+S VE GG TVFP+
Sbjct: 427 QYEPHFDFSRSHEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 473
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 474 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 529
>gi|332221662|ref|XP_003259982.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Nomascus
leucogenys]
Length = 556
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 389 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 448
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 449 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPD---------------- 492
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 493 LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 549
>gi|351706369|gb|EHB09288.1| Prolyl 4-hydroxylase subunit alpha-2 [Heterocephalus glaber]
Length = 535
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 92/179 (51%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQYITGLTVQTAELLQVANYGMGG 425
Query: 62 KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R+ ++LG G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRNHERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 472
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 473 ---GAALWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528
>gi|226874889|ref|NP_001152881.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Bos
taurus]
gi|296485624|tpg|DAA27739.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Bos taurus]
Length = 535
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGG 425
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRKDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPD-------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 472 --LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528
>gi|226874876|ref|NP_035161.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Mus
musculus]
gi|148701601|gb|EDL33548.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_f [Mus
musculus]
Length = 537
Score = 102 bits (255), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 368 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 427
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+R+AT L Y+S VE GG TVFP+
Sbjct: 428 QYEPHFDFSRSDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 474
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 475 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 530
>gi|2498741|sp|Q60716.1|P4HA2_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|836900|gb|AAC52198.1| prolyl 4-hydroxylase alpha(II)-subunit [Mus musculus]
gi|18073923|emb|CAC85691.1| Prolyl 4-hydroxylase alpha IIb subunit [Mus musculus]
gi|1096888|prf||2112362B Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=II
Length = 537
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 368 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 427
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+R+AT L Y+S VE GG TVFP+
Sbjct: 428 QYEPHFDFSRSDDEDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 474
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 475 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 530
>gi|410948134|ref|XP_003980796.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Felis
catus]
Length = 535
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 425
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRKNEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 472
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 473 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528
>gi|354474413|ref|XP_003499425.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Cricetulus griseus]
Length = 535
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 425
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRSDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPD-------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 472 --LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528
>gi|395736141|ref|XP_003776706.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 577
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 88/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 408 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 467
Query: 62 KYEPHFDFF----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF RD G+R+AT L Y+S VE GG TVFP+
Sbjct: 468 QYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 514
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 515 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 570
>gi|403255937|ref|XP_003920661.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Saimiri
boliviensis boliviensis]
gi|403255939|ref|XP_003920662.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Saimiri
boliviensis boliviensis]
gi|403255943|ref|XP_003920664.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Saimiri
boliviensis boliviensis]
Length = 533
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 470
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 471 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526
>gi|443709454|gb|ELU04126.1| hypothetical protein CAPTEDRAFT_167710 [Capitella teleta]
Length = 535
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 63/178 (35%), Positives = 89/178 (50%), Gaps = 21/178 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++ +G+ ++ R S +L + VA I R +A T L E +QI +Y G
Sbjct: 367 VVNSVTGELEFAKYRISKSGWLKDEEHPTVAKISNRCSALTNLSLSTVEELQIANYGIGG 426
Query: 62 KYEPHFDFFRDKMNQQLG---GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
YEPHFD+ R G+RI TV+ YLS VE GG TVF +
Sbjct: 427 HYEPHFDYSRLAEVTSFDHWRGNRILTVIFYLSDVEAGGGTVFMTA-------------- 472
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
G ++P KG A ++++LHPD + D + H +CPV+ G KW A KW H R F +P
Sbjct: 473 --GTKLRPEKGAAAVWYNLHPDGTGDDETKHAACPVLTGNKWVANKWFHERGQEFTRP 528
>gi|335283456|ref|XP_003354320.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Sus scrofa]
Length = 535
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 425
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRKDEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPD-------------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 472 --LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528
>gi|57997558|emb|CAI46066.1| hypothetical protein [Homo sapiens]
Length = 533
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 470
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 471 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526
>gi|63252891|ref|NP_001017973.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|63252893|ref|NP_001017974.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|217272861|ref|NP_001136070.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|18073925|emb|CAC85688.1| Prolyl 4-hydroxylase alpha IIa subunit [Homo sapiens]
gi|23274221|gb|AAH35813.1| Prolyl 4-hydroxylase, alpha polypeptide II [Homo sapiens]
gi|37183058|gb|AAQ89329.1| P4HA2 [Homo sapiens]
gi|119582745|gb|EAW62341.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_a
[Homo sapiens]
gi|119582750|gb|EAW62346.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_a
[Homo sapiens]
gi|123983232|gb|ABM83357.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II [synthetic
construct]
gi|157928048|gb|ABW03320.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II [synthetic
construct]
Length = 533
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 470
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 471 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526
>gi|332221656|ref|XP_003259979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Nomascus
leucogenys]
gi|332221658|ref|XP_003259980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Nomascus
leucogenys]
Length = 535
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 368 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 427
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 428 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 472
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 473 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528
>gi|116283554|gb|AAH17062.1| P4HA2 protein [Homo sapiens]
Length = 504
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 337 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 396
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 397 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 441
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 442 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 497
>gi|386780652|ref|NP_001247763.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Macaca mulatta]
gi|383422579|gb|AFH34503.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
mulatta]
gi|384939466|gb|AFI33338.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
mulatta]
Length = 533
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 470
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 471 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526
>gi|119582749|gb|EAW62345.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_d
[Homo sapiens]
Length = 488
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 321 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 380
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 381 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 425
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 426 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 481
>gi|268536692|ref|XP_002633481.1| C. briggsae CBR-PHY-2 protein [Caenorhabditis briggsae]
gi|94442973|emb|CAJ98659.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
Length = 539
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 89/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++++G+ + R S +L D ++ + RI +T L E +Q+ +Y G
Sbjct: 355 VQNSKTGELEHATYRISKSAWLKGDLDPVIDRVNRRIEDFTGLNQATSEELQVANYGLGG 414
Query: 62 KYEPHFDFFRDKMNQQL----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
Y+PHFDF R + G+RIATVL Y+S E+GG TVF +
Sbjct: 415 HYDPHFDFARKEEKNAFKTLNTGNRIATVLFYMSQPERGGATVFNHL------------- 461
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G AV P K DAL +++L D D + H +CPV+ G KW + KWIH R F +P
Sbjct: 462 ---GTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIHERGQEFTRP 517
>gi|119582748|gb|EAW62344.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_c
[Homo sapiens]
Length = 565
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 398 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 457
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 458 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 502
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 503 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 558
>gi|114601548|ref|XP_001162501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 9 [Pan
troglodytes]
gi|114601562|ref|XP_001162805.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 16 [Pan
troglodytes]
gi|114601564|ref|XP_517917.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 19 [Pan
troglodytes]
gi|397518354|ref|XP_003829356.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Pan
paniscus]
gi|397518356|ref|XP_003829357.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
paniscus]
gi|397518360|ref|XP_003829359.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Pan
paniscus]
gi|410215942|gb|JAA05190.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410255606|gb|JAA15770.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331277|gb|JAA34585.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331281|gb|JAA34587.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
Length = 533
Score = 102 bits (254), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 470
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 471 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526
>gi|395509387|ref|XP_003758979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Sarcophilus harrisii]
Length = 534
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 56/170 (32%), Positives = 85/170 (50%), Gaps = 18/170 (10%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D ++A + R+ T L + E +Q+ +Y G
Sbjct: 367 VRDPKTGVLTVANYRVSKSSWLEEGDDPVIAQLNRRMHYITGLSVKTAELLQVANYGMGG 426
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 427 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDF--------------- 471
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
G + P KG ++ +++L D + H +CPV+ G KW + KW H R
Sbjct: 472 -GATIWPKKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWFHER 520
>gi|297675927|ref|XP_002815905.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pongo
abelii]
gi|395736137|ref|XP_003776704.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 533
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 470
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 471 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 526
>gi|321474877|gb|EFX85841.1| hypothetical protein DAPPUDRAFT_208740 [Daphnia pulex]
Length = 545
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 59/173 (34%), Positives = 91/173 (52%), Gaps = 20/173 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++ +G + R S +L + + V + R+ T L E +Q+++Y G
Sbjct: 374 VQNSVTGNLEPANYRISKSAWLKSEEHDHVFKVTRRVGDVTGLDMATAEDLQVVNYGIGG 433
Query: 62 KYEPHFDFFR-DKMN--QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFD+ R +++N + LG G+R+AT L Y+S VE GG TVFP
Sbjct: 434 HYEPHFDYARKEEVNAFKDLGWGNRVATWLFYMSEVEAGGATVFP--------------- 478
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
+ A+ P KG A +++LHP+ + + H +CPV+ G KW + KWIH RN
Sbjct: 479 -KLNLALWPQKGSAAFWYNLHPNGEGNELTRHAACPVLTGSKWVSNKWIHERN 530
>gi|426349879|ref|XP_004042513.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Gorilla gorilla
gorilla]
Length = 565
Score = 102 bits (253), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 398 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 457
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 458 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFPDL--------------- 502
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 503 -GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 558
>gi|344274274|ref|XP_003408942.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
[Loxodonta africana]
Length = 534
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 84/165 (50%), Gaps = 22/165 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R S +LS ++ +V+ I RI T L E +Q+ +Y G +YEPHFDF R
Sbjct: 379 RISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D + G+RIAT L Y+S V GG TVFP+ G +V P KG A
Sbjct: 439 DAFKELGTGNRIATWLFYMSDVSAGGATVFPDV----------------GASVWPKKGTA 482
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|195452742|ref|XP_002073480.1| GK13123 [Drosophila willistoni]
gi|194169565|gb|EDW84466.1| GK13123 [Drosophila willistoni]
Length = 540
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/170 (34%), Positives = 87/170 (51%), Gaps = 19/170 (11%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G S SEVRTS +L Q + +++ R+ T L E+ E +Q+++Y G YEPH+
Sbjct: 366 GNSTVSEVRTSQNTWLWYEQQPWLKNLKLRLEDITGLGMESAEPLQLVNYGIGGHYEPHY 425
Query: 68 DFFRDKMNQ-QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
DF DK+ G+R+ T L+YL+ V GG T FP ++ AV P
Sbjct: 426 DFVEDKVTTFGWKGNRLLTALLYLNEVPMGGATAFPYLKL----------------AVPP 469
Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
+KG L++++LH D + H CPV+ G KW +W H + F +P
Sbjct: 470 VKGSLLVWYNLHRSLDPDFRTKHAGCPVLMGSKWVCNEWFHEGAQEFRRP 519
>gi|329913962|ref|ZP_08276011.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
IMCC9480]
gi|327545257|gb|EGF30515.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
IMCC9480]
Length = 280
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 57/173 (32%), Positives = 89/173 (51%), Gaps = 21/173 (12%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
DN SG + + RTS + + + E++A I+AR+AA + P ++GE +Q+ Y+ G +Y
Sbjct: 124 DNASGINRFDDSRTSESAHIQRGETELIARIDARLAALSGWPVDHGEPLQLQKYQAGNEY 183
Query: 64 EPHFDFFRDKM-----NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
PHFD+F + + + G R+AT+++YL+ VE+GG T FP
Sbjct: 184 RPHFDWFDPALAGTAKHLEKSGQRLATIILYLTDVEEGGGTSFPGI-------------- 229
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G V P KG AL F + P D + H PV +G K A KW+ + +
Sbjct: 230 --GLDVHPQKGGALFFRNTTPYGVPDRKTQHAGLPVEKGTKIIANKWLREKPY 280
>gi|395817618|ref|XP_003782262.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Otolemur
garnettii]
Length = 538
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/177 (34%), Positives = 89/177 (50%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 371 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNHRMQHITGLSVKTAELLQVANYGVGG 430
Query: 62 KYEPHFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
+YEPHFDF R + L G+R+AT L Y+S VE GG TVFP+
Sbjct: 431 QYEPHFDFSRRPFDSGLKTEGNRVATFLNYMSDVEAGGATVFPD---------------- 474
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 475 LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 531
>gi|321474875|gb|EFX85839.1| hypothetical protein DAPPUDRAFT_309105 [Daphnia pulex]
Length = 545
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 62/177 (35%), Positives = 87/177 (49%), Gaps = 20/177 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ESG+ S R + +L + + V+ I R+ T L E +Q+ +Y G
Sbjct: 376 VQKKESGEREFSRYRIAKSAWLKHEEHDYVSDINFRVGDITGLDMATSEDLQVCNYGIGG 435
Query: 62 KYEPHFDFFRD-KMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
YEPH+D+ R ++ Q G G RIAT L Y+S VE GG TVFP +S
Sbjct: 436 HYEPHYDYARKGEVQQDFGWGGRIATWLFYMSDVEAGGATVFPKLNLS------------ 483
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
+ P KG A +F+L+P+ + + H CPV+ G KW A WIH R F +P
Sbjct: 484 ----LWPQKGSAAFWFNLYPNGEGNEMTQHAGCPVLTGSKWVANYWIHERGQEFRRP 536
>gi|347972274|ref|XP_001237637.3| AGAP004611-PA [Anopheles gambiae str. PEST]
gi|333469330|gb|EAU76664.3| AGAP004611-PA [Anopheles gambiae str. PEST]
Length = 514
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 60/174 (34%), Positives = 87/174 (50%), Gaps = 23/174 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
MV D+ + S+ RTSS +L +V ++ R T L E +Q+ +Y G
Sbjct: 345 MVGDDH--EKAVSKTRTSSNAWLDDVMHPVVRTLSQRTEDMTNLAMTAAERLQVGNYGIG 402
Query: 61 QKYEPHFDFFRDKMNQQL-----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
Y PH+D+ + +++ G+RIATV+ YLS V GG TVFP
Sbjct: 403 GHYLPHYDYAVAEEGKEVYPSIGKGNRIATVMYYLSDVAIGGATVFP------------- 449
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
+ G V P KG A+ +++LH + + D +LHG+CPV G KW KWIH R
Sbjct: 450 ---QLGLGVFPQKGSAIFWYNLHANGTVDHRTLHGACPVFVGSKWVGNKWIHER 500
>gi|416009427|ref|ZP_11561250.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
gi|339836568|gb|EGQ64151.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
Length = 196
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/166 (34%), Positives = 86/166 (51%), Gaps = 17/166 (10%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G+ +A R S + + I+ S+ IA T +P + E +QILHY G
Sbjct: 41 VTDEQTGQEVAHGERVSEMAWPKRDDHPILQSLAEGIAQLTGIPIDCQEPLQILHYRPGG 100
Query: 62 KYEPHFD-FFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
+Y+PH+D F D + GG+R T+++YL+ VE+GGET FP
Sbjct: 101 EYKPHYDAFAADAPTLRQGGNRQGTLILYLNAVEEGGETAFPE----------------L 144
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
G V P+ G + F +L+ + SLH PV +GEKW AT+WI
Sbjct: 145 GLQVSPIPGGGVFFRNLNEEGQRHPLSLHAGLPVRKGEKWIATQWI 190
>gi|332221664|ref|XP_003259983.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Nomascus
leucogenys]
Length = 558
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 88/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 389 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 448
Query: 62 KYEPHFDFF----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF RD G+R+AT L Y+S VE GG TVFP+
Sbjct: 449 QYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPD-------------- 494
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 495 --LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 551
>gi|194765138|ref|XP_001964684.1| GF23317 [Drosophila ananassae]
gi|190614956|gb|EDV30480.1| GF23317 [Drosophila ananassae]
Length = 520
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 56/163 (34%), Positives = 83/163 (50%), Gaps = 17/163 (10%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
GK+ + RTS + + + + + ARI T E +Q+++Y G Y+ H+
Sbjct: 363 GKNEVVKTRTSKVAWFPDSYNSLTLRLNARIHDMTGFDLSGSEMLQLMNYGLGGHYDKHY 422
Query: 68 DFFR-DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
DFF + + L G RIATVL Y+S VE+GG TVFPN + V P
Sbjct: 423 DFFNATEKSSSLTGDRIATVLFYMSDVEQGGATVFPNIYKT----------------VYP 466
Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
+G A+++++L D D +LH +CPV+ G KW KWI R
Sbjct: 467 QRGTAVMWYNLKDDGQPDEQTLHAACPVLVGSKWVCNKWIRER 509
>gi|156370133|ref|XP_001628326.1| predicted protein [Nematostella vectensis]
gi|156215300|gb|EDO36263.1| predicted protein [Nematostella vectensis]
Length = 526
Score = 102 bits (253), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/170 (35%), Positives = 82/170 (48%), Gaps = 20/170 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + E+G+ + R S +LS + +IV I R+ T L GE +Q+ +Y G
Sbjct: 356 VNNLETGEIEDVDYRISQIAWLSDSDGDIVRRINRRVGFITGLNTNTGECLQVNNYGVGG 415
Query: 62 KYEPHFDFFRDKMNQQLG----GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFD D N + G+RIAT + YLS VE GG TVF
Sbjct: 416 HYEPHFDHSLDMENSPIASLGQGNRIATFMFYLSEVEAGGSTVF---------------- 459
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
+ G P KG A+ +++L D SLH CPV+ G KW A KW+H
Sbjct: 460 IKTGVKTNPFKGGAVFWYNLKKSGEGDWDSLHAGCPVLIGNKWVANKWLH 509
>gi|403255941|ref|XP_003920663.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Saimiri
boliviensis boliviensis]
gi|403255945|ref|XP_003920665.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Saimiri
boliviensis boliviensis]
Length = 535
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425
Query: 62 KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R+ + LG G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRNDERDAFKHLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 472
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 473 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528
>gi|195159313|ref|XP_002020526.1| GL14040 [Drosophila persimilis]
gi|194117295|gb|EDW39338.1| GL14040 [Drosophila persimilis]
Length = 549
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/172 (33%), Positives = 88/172 (51%), Gaps = 19/172 (11%)
Query: 6 ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
++G S SE+RTS +L + +A I+ R+ T L + E +Q+++Y G +YEP
Sbjct: 378 QTGNSTVSEIRTSQNTWLWYENNPWLADIKQRLEDITGLSTDTAEPLQLVNYGIGGQYEP 437
Query: 66 HFDFFRD-KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAV 124
HFDF D + N G+R+ T L YL+ V GG T FP + AV
Sbjct: 438 HFDFMDDAEKNFGWKGNRLLTALFYLNDVPLGGATAFPFLHL----------------AV 481
Query: 125 KPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
P+KG L++++LH D + H CPV++G KW +W H + F +P
Sbjct: 482 PPVKGSLLVWYNLHRSLHKDFRTKHAGCPVLKGSKWICNEWFHEAAQEFRRP 533
>gi|332221660|ref|XP_003259981.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Nomascus
leucogenys]
Length = 537
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 368 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 427
Query: 62 KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R+ + LG G+R+AT L Y+S VE GG TVFP+
Sbjct: 428 QYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 474
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 475 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 530
>gi|114601566|ref|XP_001162222.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
troglodytes]
gi|114601568|ref|XP_001162843.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 17 [Pan
troglodytes]
gi|397518358|ref|XP_003829358.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pan
paniscus]
gi|397518362|ref|XP_003829360.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Pan
paniscus]
gi|410215944|gb|JAA05191.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410255608|gb|JAA15771.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331279|gb|JAA34586.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
Length = 535
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 88/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425
Query: 62 KYEPHFDFF----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF RD G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 472
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 473 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528
>gi|4758868|ref|NP_004190.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
sapiens]
gi|217272863|ref|NP_001136071.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
sapiens]
gi|20455169|sp|O15460.1|P4HA2_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|2439985|gb|AAB71339.1| prolyl 4-hydroxylase alpha (II) subunit [Homo sapiens]
gi|18073926|emb|CAC85689.1| Prolyl 4-hydroxylase alpha IIb subunit [Homo sapiens]
gi|119582746|gb|EAW62342.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_b
[Homo sapiens]
gi|119582747|gb|EAW62343.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_b
[Homo sapiens]
Length = 535
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 88/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425
Query: 62 KYEPHFDFF----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF RD G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 472
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 473 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528
>gi|297675929|ref|XP_002815906.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pongo
abelii]
Length = 535
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/179 (34%), Positives = 88/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425
Query: 62 KYEPHFDFF----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF RD G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 472
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 473 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528
>gi|291230950|ref|XP_002735430.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saccoglossus
kowalevskii]
Length = 533
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 58/172 (33%), Positives = 86/172 (50%), Gaps = 20/172 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
+ ++ +G +E R S +LS+ ++V + RI +T L + E +Q+ +Y G
Sbjct: 363 IQNSVTGNLEFAEYRISKSAWLSEDDGDVVHRLNHRIEQYTGLTMDTAEELQVANYGLGG 422
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R + G+RIAT L Y+S VE GG TVFP
Sbjct: 423 HYEPHFDFARKEEINAFKSLNTGNRIATFLFYMSDVEAGGATVFPQV------------- 469
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
G + P KG A +++L + D ++ H +CPV+ G KW + KWIH R
Sbjct: 470 ---GARLIPEKGSAAFWYNLLKNGEGDYSTRHAACPVLVGSKWVSNKWIHER 518
>gi|355691582|gb|EHH26767.1| hypothetical protein EGK_16829 [Macaca mulatta]
gi|355750162|gb|EHH54500.1| hypothetical protein EGM_15360 [Macaca fascicularis]
gi|384939464|gb|AFI33337.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Macaca
mulatta]
Length = 535
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425
Query: 62 KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R+ + LG G+R+AT L Y+S VE GG TVFP+
Sbjct: 426 QYEPHFDFSRNDERHTFKHLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 472
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 473 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 528
>gi|410900628|ref|XP_003963798.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
rubripes]
Length = 548
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 57/158 (36%), Positives = 81/158 (51%), Gaps = 20/158 (12%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R S +L+ + ++ I RI T L + E +Q+ +Y G +YEPHFDF R
Sbjct: 393 RISKSAWLTGYEHPVIEIINQRIEDLTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEP 452
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D + G+RIAT L Y+S V GG TVFP+ G AV P KG A
Sbjct: 453 DAFKELGTGNRIATWLFYMSDVAAGGATVFPDV----------------GAAVWPQKGTA 496
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
+ +++L + D ++ H +CPV+ G KW + KWIH R
Sbjct: 497 VFWYNLFANGEGDYSTRHAACPVLVGNKWVSNKWIHER 534
>gi|119582752|gb|EAW62348.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_f
[Homo sapiens]
Length = 567
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 398 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 457
Query: 62 KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R+ + LG G+R+AT L Y+S VE GG TVFP+
Sbjct: 458 QYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 504
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 505 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 560
>gi|195391754|ref|XP_002054525.1| GJ24502 [Drosophila virilis]
gi|194152611|gb|EDW68045.1| GJ24502 [Drosophila virilis]
Length = 487
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 59/179 (32%), Positives = 90/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++ +G + R S +L A+ ++ ++ R A T L ++ E +Q+++Y G
Sbjct: 313 VQNSVTGALETANYRISKSAWLKTAEHRVIGTVVQRTADMTGLDMDSAEELQVVNYGIGG 372
Query: 62 KYEPHFDFFRDKMNQQLGG----HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R + + G +RIAT+L Y+S VE+GG TVF +
Sbjct: 373 HYEPHFDFARREEKRAFEGLNLGNRIATMLFYMSDVEQGGATVFTSLHA----------- 421
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
A+ P KG A + +LH D + H +CPV+ G KW + KWIH R F +P
Sbjct: 422 -----ALWPKKGTAAFWMNLHRSGEGDVRTRHAACPVLTGSKWVSNKWIHERGQEFRRP 475
>gi|449673565|ref|XP_002167120.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
magnipapillata]
Length = 571
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 61/169 (36%), Positives = 86/169 (50%), Gaps = 19/169 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
+ D +GK ++ R S +LS + + ++EAR A T L E +Q+ +Y G
Sbjct: 403 IQDPITGKLRHADYRISKSAWLSTNKYNFLQALEARTQATTGLDLSYAEQLQVANYGLGG 462
Query: 62 KYEPHFDFFR---DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
YEPHFD R D+ G+RIATVL YLS VE GG TVF +
Sbjct: 463 HYEPHFDHSRENEDRFTDLGMGNRIATVLFYLSDVEAGGATVFTVGKT------------ 510
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
AV P KGDA+ +F+L + + + H +CPV+ G+KW + WIH
Sbjct: 511 ----AVFPSKGDAVFWFNLKRNGKGNPNTRHAACPVLVGQKWVSNWWIH 555
>gi|195055779|ref|XP_001994790.1| GH14110 [Drosophila grimshawi]
gi|193892553|gb|EDV91419.1| GH14110 [Drosophila grimshawi]
Length = 487
Score = 101 bits (252), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 63/187 (33%), Positives = 95/187 (50%), Gaps = 26/187 (13%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++ +G + R S +L + EI+ ++ R A T L ++ E +Q+++Y G
Sbjct: 313 VQNSVTGALETANYRISKSAWLKTPEHEIIGTVVQRTADMTGLDMDSAEELQVVNYGIGG 372
Query: 62 KYEPHFDFFRDKMNQQLG------GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
YEPHFDF R + ++L G+RIAT+L Y+S V++GG TVF + R W
Sbjct: 373 HYEPHFDFARRE--EKLAFEGLNLGNRIATMLFYMSDVQQGGATVF-----TSLRTALW- 424
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDK 173
P KG A + +LH D+ + H +CPV+ G KW + KWIH R F +
Sbjct: 425 ----------PKKGTAAFWMNLHRSGEGDARTRHAACPVLTGSKWVSNKWIHERGQEFRR 474
Query: 174 PEKEPED 180
P ED
Sbjct: 475 PCALEED 481
>gi|395509389|ref|XP_003758980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Sarcophilus harrisii]
Length = 536
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 56/172 (32%), Positives = 84/172 (48%), Gaps = 20/172 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D ++A + R+ T L + E +Q+ +Y G
Sbjct: 367 VRDPKTGVLTVANYRVSKSSWLEEGDDPVIAQLNRRMHYITGLSVKTAELLQVANYGMGG 426
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D G+R+AT L Y+S VE GG TVFP+
Sbjct: 427 QYEPHFDFSRKGEQDAFKHLGTGNRVATFLNYMSDVEAGGATVFPDF------------- 473
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
G + P KG ++ +++L D + H +CPV+ G KW + KW H R
Sbjct: 474 ---GATIWPKKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWFHER 522
>gi|195061074|ref|XP_001995919.1| GH14105 [Drosophila grimshawi]
gi|193891711|gb|EDV90577.1| GH14105 [Drosophila grimshawi]
Length = 513
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 53/165 (32%), Positives = 82/165 (49%), Gaps = 16/165 (9%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D + +++ + RTS +L A + + RI + E +Q+++Y G
Sbjct: 351 VVDQVTHRNMMVKERTSKVTWLGDATNAFTMRLNKRIEDMSGFTMYGSEMLQVMNYGLGG 410
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
Y H+DF +L G RIATV+ YLS VE+GG TVFP +
Sbjct: 411 HYASHYDFLNATSKTRLNGDRIATVMFYLSDVEQGGATVFPKIQ---------------- 454
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
AV P +G A+++++L + D+ ++H +CPVI G KW KWI
Sbjct: 455 KAVFPQRGTAIIWYNLKENGDFDTNTIHAACPVIVGSKWVCNKWI 499
>gi|395817620|ref|XP_003782263.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Otolemur
garnettii]
Length = 540
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 92/179 (51%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 371 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNHRMQHITGLSVKTAELLQVANYGVGG 430
Query: 62 KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R+ ++LG G+R+AT L Y+S VE GG TVFP+
Sbjct: 431 QYEPHFDFSRNHERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPDL------------- 477
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ P KG A+ +++L D + H +CPV+ G KW + KW H R F +P
Sbjct: 478 ---GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFHERGQEFLRP 533
>gi|395820526|ref|XP_003783615.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Otolemur
garnettii]
Length = 534
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 83/165 (50%), Gaps = 22/165 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R S +LS ++ +V+ I RI T L E +Q+ +Y G +YEPHFDF R
Sbjct: 379 RISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D + G+RIAT L Y+S V GG TVFP G +V P KG A
Sbjct: 439 DAFKELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|190788|gb|AAA36535.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
Length = 534
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 83/165 (50%), Gaps = 22/165 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R S +LS ++ +V+ I RI T L E +Q+ +Y G +YEPHFDF R
Sbjct: 379 RISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D + G+RIAT L Y+S V GG TVFP G +V P KG A
Sbjct: 439 DAFKELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|380813206|gb|AFE78477.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
gi|384947328|gb|AFI37269.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
Length = 534
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 83/165 (50%), Gaps = 22/165 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R S +LS ++ +V+ I RI T L E +Q+ +Y G +YEPHFDF R
Sbjct: 379 RISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D + G+RIAT L Y+S V GG TVFP G +V P KG A
Sbjct: 439 DAFKELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|348523976|ref|XP_003449499.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
niloticus]
Length = 594
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 58/172 (33%), Positives = 87/172 (50%), Gaps = 20/172 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
+++ +G + R S +L + +V I I T L + E +Q+ +Y G
Sbjct: 425 ISNPVTGVLETAHYRISKSAWLGAYEHPVVDKINQLIEDVTGLNVKTAEDLQVANYGLGG 484
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+RIAT L+Y++ V+ GG TVF +
Sbjct: 485 QYEPHFDFGRKDEPDAFEELGTGNRIATWLLYMTDVQAGGATVFTDI------------- 531
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
G AVKP KG A+ +++L+P D + H +CPV+ G KW + KWIH R
Sbjct: 532 ---GAAVKPKKGTAVFWYNLYPSGEGDYRTRHAACPVLLGNKWVSNKWIHER 580
>gi|291404184|ref|XP_002718472.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 2
[Oryctolagus cuniculus]
Length = 534
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 83/165 (50%), Gaps = 22/165 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R S +LS ++ +V+ I RI T L E +Q+ +Y G +YEPHFDF R
Sbjct: 379 RISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D + G+RIAT L Y+S V GG TVFP G +V P KG A
Sbjct: 439 DAFKELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|63252888|ref|NP_001017962.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
sapiens]
gi|197099666|ref|NP_001125733.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Pongo abelii]
gi|217272849|ref|NP_001136067.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
sapiens]
gi|114631177|ref|XP_001140234.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Pan
troglodytes]
gi|114631181|ref|XP_001140652.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 7 [Pan
troglodytes]
gi|2507090|sp|P13674.2|P4HA1_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|75061858|sp|Q5RAG8.1|P4HA1_PONAB RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|602675|gb|AAA59068.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
gi|23271226|gb|AAH34998.1| Prolyl 4-hydroxylase, alpha polypeptide I [Homo sapiens]
gi|55729010|emb|CAH91242.1| hypothetical protein [Pongo abelii]
gi|56403853|emb|CAI29712.1| hypothetical protein [Pongo abelii]
gi|119574854|gb|EAW54469.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_c [Homo
sapiens]
gi|119574855|gb|EAW54470.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_d [Homo
sapiens]
gi|123981532|gb|ABM82595.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [synthetic
construct]
gi|123996359|gb|ABM85781.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [synthetic
construct]
gi|261861532|dbj|BAI47288.1| prolyl 4-hydroxylase, alpha polypeptide I [synthetic construct]
gi|410295852|gb|JAA26526.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410349611|gb|JAA41409.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 101 bits (251), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 83/165 (50%), Gaps = 22/165 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R S +LS ++ +V+ I RI T L E +Q+ +Y G +YEPHFDF R
Sbjct: 379 RISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D + G+RIAT L Y+S V GG TVFP G +V P KG A
Sbjct: 439 DAFKELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|445499353|ref|ZP_21466208.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
gi|444789348|gb|ELX10896.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
Length = 272
Score = 100 bits (250), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 59/170 (34%), Positives = 81/170 (47%), Gaps = 21/170 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V G S+ E RTS F+ + + E+ IE R+AA P E E Q+ Y+ Q
Sbjct: 113 VTGEADGSSMVHEGRTSEMAFIQRGEAEVAERIERRLAALAHWPAECSEPFQLQKYDATQ 172
Query: 62 KYEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
+Y PH+D+ + + GG R+AT ++YLS VE+GG TVFP
Sbjct: 173 EYRPHYDWLDPDSSGHRSHLARGGQRLATFILYLSDVEQGGGTVFPG------------- 219
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
G V P KG AL F + + D +LHG PV+ G K A KW+
Sbjct: 220 ---LGLEVYPKKGSALWFLNTDINHQPDKRTLHGGAPVVRGTKIIANKWL 266
>gi|355562502|gb|EHH19096.1| hypothetical protein EGK_19739 [Macaca mulatta]
gi|355782842|gb|EHH64763.1| hypothetical protein EGM_18071 [Macaca fascicularis]
gi|383418719|gb|AFH32573.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
Length = 534
Score = 100 bits (250), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 83/165 (50%), Gaps = 22/165 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R S +LS ++ +V+ I RI T L E +Q+ +Y G +YEPHFDF R
Sbjct: 379 RISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D + G+RIAT L Y+S V GG TVFP G +V P KG A
Sbjct: 439 DAFKELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|47550697|ref|NP_999856.1| prolyl 4-hydroxylase, alpha polypeptide I b precursor [Danio rerio]
gi|28277826|gb|AAH45890.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [Danio rerio]
Length = 536
Score = 100 bits (250), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 60/165 (36%), Positives = 82/165 (49%), Gaps = 22/165 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R S +LS + + I RI T L + E +Q+ +Y G +YEPHFDF R
Sbjct: 381 RISKSAWLSGYEHSTIERINQRIEDVTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEP 440
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D + G+RIAT L Y+S V GG TVF + G AV P KG A
Sbjct: 441 DAFKELGTGNRIATWLFYMSDVSAGGATVFTDV----------------GAAVWPKKGTA 484
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
+ +++L P D ++ H +CPV+ G KW + KWIH R F +P
Sbjct: 485 VFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWIHERGQEFRRP 529
>gi|410251926|gb|JAA13930.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 566
Score = 100 bits (250), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 83/165 (50%), Gaps = 22/165 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R S +LS ++ +V+ I RI T L E +Q+ +Y G +YEPHFDF R
Sbjct: 379 RISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D + G+RIAT L Y+S V GG TVFP G +V P KG A
Sbjct: 439 DAFKELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|326914688|ref|XP_003203656.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Meleagris
gallopavo]
Length = 539
Score = 100 bits (250), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 64/188 (34%), Positives = 97/188 (51%), Gaps = 26/188 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
+VA E + + E R S +L D +V ++E R+AA T L P E +Q+++Y
Sbjct: 370 VVASGEKQQKV--EYRISKSAWLKDTADPVVRALELRMAAITGLDLRPPYAEYLQVVNYG 427
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + + G+RIATV++YLS VE GG T F +
Sbjct: 428 LGGHYEPHFDHATSRKSPLYRMKSGNRIATVMIYLSAVEAGGSTAFIYAN---------- 477
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
++V +K AL +++L + D +LH CPV+ G+KW A KWIH + F +
Sbjct: 478 ------FSVPVVKNAALFWWNLRRNGDGDGDTLHAGCPVLAGDKWVANKWIHEYGQEFRR 531
Query: 174 P-EKEPED 180
P ++P D
Sbjct: 532 PCSRDPRD 539
>gi|363729586|ref|XP_417248.3| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Gallus gallus]
Length = 542
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 64/188 (34%), Positives = 97/188 (51%), Gaps = 26/188 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
+VA E + + E R S +L D +V ++E R+AA T L P E +Q+++Y
Sbjct: 373 VVASGEKQQKV--EYRISKSAWLKDTADPVVQALELRMAAITGLDLRPPYAEYLQVVNYG 430
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + + G+RIATV++YLS VE GG T F +
Sbjct: 431 LGGHYEPHFDHATSRKSPLYRMKSGNRIATVMIYLSAVEAGGSTAFIYAN---------- 480
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
++V +K AL +++L + D +LH CPV+ G+KW A KWIH + F +
Sbjct: 481 ------FSVPVVKNAALFWWNLRRNGDGDGDTLHAGCPVLAGDKWVANKWIHEYGQEFRR 534
Query: 174 P-EKEPED 180
P ++P D
Sbjct: 535 PCSRDPRD 542
>gi|194905294|ref|XP_001981167.1| GG11919 [Drosophila erecta]
gi|190655805|gb|EDV53037.1| GG11919 [Drosophila erecta]
Length = 533
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 58/161 (36%), Positives = 79/161 (49%), Gaps = 25/161 (15%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMN 75
RT+ G +L K +E+ I RI T + E Q+++Y G Y HFD+F +
Sbjct: 368 RTAKGYWLKKESNEMTRRITRRIVDMTGFDLADSEDFQVINYGIGGHYSLHFDYFGFASS 427
Query: 76 QQLG---------GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
G G RIATVL YL+ VE+GG TVF N GY+V P
Sbjct: 428 NYTGERSHHSIVLGDRIATVLFYLTDVEQGGATVFGNV----------------GYSVYP 471
Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
G A+ +++L D + D + H SCPV+ G KW T+WIH
Sbjct: 472 QAGTAIFWYNLDTDGNGDPLTRHASCPVVVGSKWVMTEWIH 512
>gi|198449643|ref|XP_001357664.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
gi|198130698|gb|EAL26798.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 100 bits (250), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 57/172 (33%), Positives = 88/172 (51%), Gaps = 19/172 (11%)
Query: 6 ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
++G S S++RTS +L + +A I+ R+ T L + E +Q+++Y G +YEP
Sbjct: 378 QTGNSTVSDIRTSQNTWLWYENNPWLADIKQRLEDITGLSTDTAEPLQLVNYGIGGQYEP 437
Query: 66 HFDFFRD-KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAV 124
HFDF D + N G+R+ T L YL+ V GG T FP + AV
Sbjct: 438 HFDFMDDAEKNFGWKGNRLLTALFYLNDVPLGGATAFPFLHL----------------AV 481
Query: 125 KPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
P+KG L++++LH D + H CPV++G KW +W H + F +P
Sbjct: 482 PPVKGSLLVWYNLHRSLHKDFRTKHAGCPVLKGSKWICNQWFHEAAQEFRRP 533
>gi|321474952|gb|EFX85916.1| hypothetical protein DAPPUDRAFT_45616 [Daphnia pulex]
Length = 537
Score = 100 bits (249), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 55/170 (32%), Positives = 90/170 (52%), Gaps = 20/170 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
+ ++++G+ + R S +L + + + + R+ T L E +Q+++Y G
Sbjct: 366 IRNSKTGELEPANYRISKSAWLKSEEHDHILKVTRRVGDITGLDMSTAEDLQVVNYGIGG 425
Query: 62 KYEPHFDFFRDKMNQ---QLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFD+ R + + +LG G+RIAT L Y+S VE GG TVFP +
Sbjct: 426 HYEPHFDYARTETTEAFKELGWGNRIATWLFYMSDVEAGGATVFPPT------------- 472
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
G AV P KG A +++L+P+ + + H +CPV+ G KW + +WIH
Sbjct: 473 ---GAAVWPRKGSAAFWYNLYPNGKGNELTRHAACPVLSGSKWVSNRWIH 519
>gi|240974259|ref|XP_002401836.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215491070|gb|EEC00711.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 490
Score = 100 bits (249), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 89/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + +SG+ + R S +L + ++A + RI T L + E +Q+++Y G
Sbjct: 321 VQNYKSGELEVANYRISKSAWLRNEEHGVIARVTRRIEHITGLSADTAEELQVVNYGIGG 380
Query: 62 KYEPHFDFFRDKMN---QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YEPHFDF R + Q LG G+RIAT L Y+S V GG TVFP Q R W
Sbjct: 381 HYEPHFDFARREEKNAFQSLGTGNRIATWLNYMSDVPAGGATVFP-----QLRLTLW--- 432
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
P KG A +++LH D + H +CPV+ G KW + KW H R F +P
Sbjct: 433 --------PEKGAAAFWYNLHRSGEGDMLTRHAACPVLAGSKWVSNKWFHERGQEFTRP 483
>gi|115495019|ref|NP_001069238.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
gi|122144801|sp|Q1RMU3.1|P4HA1_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|92097479|gb|AAI14709.1| Prolyl 4-hydroxylase, alpha polypeptide I [Bos taurus]
gi|296472132|tpg|DAA14247.1| TPA: prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
gi|440892721|gb|ELR45796.1| Prolyl 4-hydroxylase subunit alpha-1 [Bos grunniens mutus]
Length = 534
Score = 100 bits (249), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 83/165 (50%), Gaps = 22/165 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R S +LS ++ +V+ I RI T L E +Q+ +Y G +YEPHFDF R
Sbjct: 379 RISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D + G+RIAT L Y+S V GG TVFP G +V P KG A
Sbjct: 439 DAFKELGTGNRIATWLFYMSDVLAGGATVFPEV----------------GASVWPKKGTA 482
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|17541712|ref|NP_502317.1| Protein PHY-2 [Caenorhabditis elegans]
gi|32171589|sp|Q20065.1|P4HA2_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|3876769|emb|CAA93469.1| Protein PHY-2 [Caenorhabditis elegans]
Length = 539
Score = 100 bits (249), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 60/179 (33%), Positives = 89/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++++G+ + R S +L D ++ + RI +T L E +Q+ +Y G
Sbjct: 355 VQNSKTGELEHATYRISKSAWLKGDLDPVIDRVNRRIEDFTNLNQATSEELQVANYGLGG 414
Query: 62 KYEPHFDFFRDKMNQQL----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
Y+PHFDF R + G+RIATVL Y+S E+GG TVF +
Sbjct: 415 HYDPHFDFARKEEKNAFKTLNTGNRIATVLFYMSQPERGGATVFNHL------------- 461
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
G AV P K DAL +++L D D + H +CPV+ G KW + KWIH + F +P
Sbjct: 462 ---GTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIHEKGQEFTRP 517
>gi|426255744|ref|XP_004021508.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Ovis
aries]
Length = 534
Score = 100 bits (249), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 83/165 (50%), Gaps = 22/165 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R S +LS ++ +V+ I RI T L E +Q+ +Y G +YEPHFDF R
Sbjct: 379 RISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D + G+RIAT L Y+S V GG TVFP G +V P KG A
Sbjct: 439 DAFKELGTGNRIATWLFYMSDVLAGGATVFPEV----------------GASVWPKKGTA 482
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|148233143|ref|NP_001090904.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Sus scrofa]
gi|83778522|gb|ABC47142.1| procollagen-proline 2-oxoglutarate-4-dioxygenase [Sus scrofa]
Length = 534
Score = 100 bits (249), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 58/165 (35%), Positives = 83/165 (50%), Gaps = 22/165 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R S +LS ++ +V+ + RI T L E +Q+ +Y G +YEPHFDF R
Sbjct: 379 RISKSAWLSGYENPVVSRLNMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D + G+RIAT L Y+S V GG TVFP G +V P KG A
Sbjct: 439 DAFKELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 527
>gi|256083648|ref|XP_002578053.1| prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
gi|360044447|emb|CCD81995.1| putative prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
Length = 584
Score = 100 bits (249), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 61/164 (37%), Positives = 78/164 (47%), Gaps = 21/164 (12%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMN 75
RTS +L + EI I RI A T L E E +Q+ +Y G Y PHFDF R +
Sbjct: 423 RTSKSAWLPHSMSEITDQISQRIRAVTGLSLETAEDLQVGNYGLGGHYAPHFDFGRKREK 482
Query: 76 QQL---GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
G+RIAT++ YLS V+ GG TVF R G V P KG A
Sbjct: 483 DAFEVKNGNRIATIIFYLSDVQAGGATVF----------------NRIGTRVVPKKGAAG 526
Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
+F+L P+ D + H +CPV+ G KW W H R F +P
Sbjct: 527 FWFNLLPNGEGDLRTRHAACPVLAGSKWVMNLWFHERGQEFRRP 570
>gi|348555277|ref|XP_003463450.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cavia porcellus]
Length = 584
Score = 100 bits (248), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 65/186 (34%), Positives = 91/186 (48%), Gaps = 34/186 (18%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYEHGQKYEP 65
GK + E R S +L D ++ ++ RIAA T L P E +Q+++Y G YEP
Sbjct: 420 GKQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEP 479
Query: 66 HFD--------FFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
HFD FR K G+R+AT ++YLS VE GG T F +
Sbjct: 480 HFDHATSPSSPLFRMK-----SGNRVATFMIYLSSVEAGGATAFIYA------------- 521
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKP- 174
++V +K AL +++LH D +LH CPV+ G+KW A KWIH + F +P
Sbjct: 522 ---NFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRRPC 578
Query: 175 EKEPED 180
PED
Sbjct: 579 SSSPED 584
>gi|344175386|emb|CCA88057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
Length = 331
Score = 100 bits (248), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 63/175 (36%), Positives = 84/175 (48%), Gaps = 23/175 (13%)
Query: 2 VADNESGKSIASEVRTSS--GMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEH 59
V + ESG+ + +E S F + + I R A P + E + Y
Sbjct: 162 VIEYESGQEVVNEATRSCSCASFPPEEMSMLQKRIVERAARLVGQPGAHCEGVTFARYLP 221
Query: 60 GQKYEPHFDFFRDKM---NQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNW 114
G+++ PH D+FR + ++ +G GHRIATVL+YL+ VE GG T FPN
Sbjct: 222 GEQFRPHVDYFRGAVLNNDKIMGSSGHRIATVLLYLNEVEAGGATFFPNP---------- 271
Query: 115 SECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
G+ V+P KG AL F D S D TSLH C V +GEKW AT W R
Sbjct: 272 ------GFEVRPQKGGALYFAYQQADGSMDPTSLHEGCAVTQGEKWIATLWFRER 320
>gi|290243077|ref|YP_003494747.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
gi|288945582|gb|ADC73280.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
Length = 575
Score = 100 bits (248), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 63/184 (34%), Positives = 88/184 (47%), Gaps = 24/184 (13%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G S S+ RT S +L ++ + I R+A P E E +Q++HY H Q+Y PH+
Sbjct: 91 GSSGVSQGRTGSNCWLRYQEEPLARRIGERVAKRVGFPLEYAEPLQVIHYGHEQEYRPHY 150
Query: 68 DFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGY 122
D + R + GG R+ T L+YL+ VE+GG T FPN+ G
Sbjct: 151 DAYDLDTPRGLRCTRQGGQRMVTALLYLNEVEEGGATAFPNA----------------GV 194
Query: 123 AVKPMKGDALLFFSLHPD-ASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDD 181
V P KG +F ++ D SLHG PV GEKW+A+ W R E++P D
Sbjct: 195 EVAPRKGRIAIFNNVGADPGRPHPRSLHGGMPVKSGEKWAASIWFRARPAH--ERQPWFD 252
Query: 182 DCVD 185
D D
Sbjct: 253 DVED 256
>gi|196011902|ref|XP_002115814.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
gi|190581590|gb|EDV21666.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
Length = 534
Score = 100 bits (248), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 60/171 (35%), Positives = 88/171 (51%), Gaps = 21/171 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++GK + R S +L+ +V I I T L E+ EA+QI +Y G
Sbjct: 364 VHNKDTGKLEYATYRISKSAWLNDDDHPLVRRISTLIEDVTGLTMESAEALQIANYGIGG 423
Query: 62 KYEPHFDFF-----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
YEPHFD D GG+RIAT+L+YLS VE GG TVF ++
Sbjct: 424 HYEPHFDHADVRSGTDVFKTWKGGNRIATMLIYLSSVELGGATVFSSA------------ 471
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
G ++P +G A +++LH + + ++ + H +CPV+ G KW A KWIH
Sbjct: 472 ----GVRIEPRQGSAAFWYNLHRNGNGNNLTRHAACPVLIGSKWIANKWIH 518
>gi|443709455|gb|ELU04127.1| hypothetical protein CAPTEDRAFT_149240 [Capitella teleta]
Length = 532
Score = 100 bits (248), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 60/178 (33%), Positives = 92/178 (51%), Gaps = 21/178 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++ +G+ ++ R S +L +D ++A I R +A T L E +Q+++Y G
Sbjct: 366 VHNSATGQLEHAKYRISKSGWLRDEEDPLIARISERCSALTNLSLTTVEELQVVNYGIGG 425
Query: 62 KYEPHFDFFRD---KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
+YEPHFDF R ++ G+RI TV+ Y++ VE GG TVF ++
Sbjct: 426 QYEPHFDFSRRSEPTAFEKWRGNRILTVIYYMTDVEAGGATVFLDA-------------- 471
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G V P KG A ++ +L P D + H +CPV+ G KW A KW H R+ F +P
Sbjct: 472 --GVKVYPEKGSAAVWHNLLPSGEGDMRTRHAACPVLTGSKWVANKWFHERDQEFRRP 527
>gi|299115443|emb|CBN75608.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 548
Score = 100 bits (248), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 71/219 (32%), Positives = 107/219 (48%), Gaps = 39/219 (17%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPEN-----GEAMQILHYEHGQK 62
GK+I S+ RTS F++ +++ RI + L E + +Q+L Y Q
Sbjct: 254 GKAI-SKTRTSDNAFVTHTN--TAQALKRRI--FQLLGIEEYHETWADGLQVLRYNESQA 308
Query: 63 YEPHFDFFR-----DKMNQQLGGHRIATVLMYLSHVEKGGETVF---------------- 101
Y HFD+ D ++ LG +R ATV++Y + V +GGETVF
Sbjct: 309 YVAHFDYLESAEGHDFKSEGLGTNRFATVVLYFNDVREGGETVFTHAPGIDHHLVPDTKV 368
Query: 102 PNSEVSQSRD---GNWSECA----RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPV 154
P EV ++ D W E RR V P +G A+LF++ HPD D +S HG+CPV
Sbjct: 369 PVREVLENLDLPRSGWEEKLLLQCRRHMVVAPKRGQAVLFYNQHPDGRKDLSSEHGACPV 428
Query: 155 IEGEKWSATKWI-HVRNFDKPEKEPEDDDCVDEDLNCVV 192
I+G+KW+A W+ + + +PE VD+ N +V
Sbjct: 429 IDGQKWAANLWVWNGPRYGLSSVDPETGRTVDKAGNNIV 467
>gi|301093292|ref|XP_002997494.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110636|gb|EEY68688.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 324
Score = 100 bits (248), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 57/165 (34%), Positives = 91/165 (55%), Gaps = 12/165 (7%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
A++ RTS+ FLS ++ + I+ R+A T +P ++ E +Q+L YE QKY+ H D+F
Sbjct: 156 ATDWRTSTTYFLSSSKHSKLDEIDQRVADLTKVPVDHQEDVQVLRYEETQKYDHHTDYFP 215
Query: 72 DKMNQQLG----------GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
+ ++ +R+ TV Y+S V KGG T+FP + R + +C+ G
Sbjct: 216 VEHHKNSPHVLESIDYGYKNRMITVFWYMSDVAKGGHTIFPRAG-GAPRPQSMKDCST-G 273
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
V P K ++F+S+ P+ D SLHG CPV +G K+S KW+
Sbjct: 274 LKVSPKKRKVIVFYSMLPNGQGDPMSLHGGCPVEDGIKYSGNKWV 318
>gi|66820122|ref|XP_643703.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
gi|60471803|gb|EAL69758.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
Length = 221
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 55/159 (34%), Positives = 85/159 (53%), Gaps = 24/159 (15%)
Query: 16 RTSSGMFLSKAQD--EIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR-- 71
R+ G+F+ + ++ +I +I ++ ++ + E+ E MQ++ Y G++ HFD+F
Sbjct: 69 RSGWGLFMKEGEEDHQITKNIFNKMKSFVNIS-ESCEVMQVIRYNQGEETSSHFDYFNPL 127
Query: 72 ---DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMK 128
M L G R+ T+LMYL VE+GGET FP G VKP+K
Sbjct: 128 TTNGSMKIGLYGQRVCTILMYLCDVEEGGETTFPEV----------------GIKVKPIK 171
Query: 129 GDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
GDA+LF++ P+ D SLH PV++G KW A K I+
Sbjct: 172 GDAVLFYNCKPNGDVDPLSLHQGDPVLKGNKWVAIKLIN 210
>gi|48675383|ref|NP_001001598.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
gi|75053350|sp|Q75UG4.1|P4HA3_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|47115494|dbj|BAD18888.1| Collagen prolyl 4-hydroxylase alpha III subunit [Bos taurus]
gi|296479828|tpg|DAA21943.1| TPA: prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
Length = 544
Score = 99.8 bits (247), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 69/191 (36%), Positives = 97/191 (50%), Gaps = 32/191 (16%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
+VA E K + E R S +L D ++ +++ RIAA T L P E +Q+++Y
Sbjct: 375 VVASGE--KQLPVEYRISKSAWLKDTVDPVLVTLDHRIAALTGLDVQPPYAEYLQVVNYG 432
Query: 59 HGQKYEPHFDFFRD------KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG 112
G YEPHFD +MN G+R+AT ++YLS VE GG T F G
Sbjct: 433 IGGHYEPHFDHATSPSSPLYRMN---SGNRVATFMIYLSSVEAGGATAFIY--------G 481
Query: 113 NWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RN 170
N+S V +K AL +++LH D +LH +CPV+ G+KW A KWIH +
Sbjct: 482 NFS--------VPVVKNAALFWWNLHRSGEGDGDTLHAACPVLVGDKWVANKWIHEYGQE 533
Query: 171 FDKP-EKEPED 180
F +P PED
Sbjct: 534 FRRPCSSRPED 544
>gi|66772331|gb|AAY55477.1| IP03959p [Drosophila melanogaster]
gi|66772361|gb|AAY55492.1| IP03859p [Drosophila melanogaster]
Length = 541
Score = 99.8 bits (247), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 58/173 (33%), Positives = 86/173 (49%), Gaps = 20/173 (11%)
Query: 6 ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
+S S SEVR S +L + ++ I+ R+ T L E+ E +Q+++Y G +YEP
Sbjct: 369 QSENSTTSEVRISRNTWLWYDANPWLSKIKQRLEDVTGLSTESAEPLQLVNYGIGGQYEP 428
Query: 66 HFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
HFDF D G+R+ T L YL+ V GG T FP + A
Sbjct: 429 HFDFVEDDGQSVFSWKGNRLLTALFYLNDVALGGATAFPFLRL----------------A 472
Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKP 174
V P+KG L++++LH D + H CPV++G KW +W HV + F +P
Sbjct: 473 VPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKWICNEWFHVGAQEFRRP 525
>gi|426245942|ref|XP_004016760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Ovis
aries]
Length = 514
Score = 99.4 bits (246), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 69/191 (36%), Positives = 97/191 (50%), Gaps = 32/191 (16%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
+VA E K + E R S +L D ++ +++ RIAA T L P E +Q+++Y
Sbjct: 345 VVASGE--KQLPVEYRISKSAWLKDTVDPVLVTLDHRIAALTGLDVQPPYAEYLQVVNYG 402
Query: 59 HGQKYEPHFDFFRD------KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG 112
G YEPHFD +MN G+R+AT ++YLS VE GG T F G
Sbjct: 403 IGGHYEPHFDHATSPSSPLYRMN---SGNRVATFMIYLSSVEAGGATAFIY--------G 451
Query: 113 NWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RN 170
N+S V +K AL +++LH D +LH +CPV+ G+KW A KWIH +
Sbjct: 452 NFS--------VPVVKNAALFWWNLHRSGEGDGDTLHAACPVLVGDKWVANKWIHEYGQE 503
Query: 171 FDKP-EKEPED 180
F +P PED
Sbjct: 504 FRRPCSSRPED 514
>gi|24651424|ref|NP_733376.1| prolyl-4-hydroxylase-alpha SG1 [Drosophila melanogaster]
gi|23172697|gb|AAF57059.2| prolyl-4-hydroxylase-alpha SG1 [Drosophila melanogaster]
gi|66772443|gb|AAY55533.1| IP03659p [Drosophila melanogaster]
gi|220951214|gb|ACL88150.1| PH4alphaSG1-PA [synthetic construct]
gi|220959938|gb|ACL92512.1| PH4alphaSG1-PA [synthetic construct]
Length = 540
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 58/173 (33%), Positives = 86/173 (49%), Gaps = 20/173 (11%)
Query: 6 ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
+S S SEVR S +L + ++ I+ R+ T L E+ E +Q+++Y G +YEP
Sbjct: 368 QSENSTTSEVRISRNTWLWYDANPWLSKIKQRLEDVTGLSTESAEPLQLVNYGIGGQYEP 427
Query: 66 HFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
HFDF D G+R+ T L YL+ V GG T FP + A
Sbjct: 428 HFDFVEDDGQSVFSWKGNRLLTALFYLNDVALGGATAFPFLRL----------------A 471
Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKP 174
V P+KG L++++LH D + H CPV++G KW +W HV + F +P
Sbjct: 472 VPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKWICNEWFHVGAQEFRRP 524
>gi|195159142|ref|XP_002020441.1| GL13994 [Drosophila persimilis]
gi|194117210|gb|EDW39253.1| GL13994 [Drosophila persimilis]
Length = 493
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 58/171 (33%), Positives = 83/171 (48%), Gaps = 21/171 (12%)
Query: 9 KSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFD 68
+S + RTS + +E+ + RIA T E +Q ++Y G Y+ H+D
Sbjct: 332 RSEVVKTRTSKVAWFPDTFNELTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKHYD 391
Query: 69 FFRDKMN---QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVK 125
FF Q+ G RIATVL YL+ VE+GG TVFPN AV
Sbjct: 392 FFNASTAANLTQMNGDRIATVLFYLTDVEQGGATVFPNIR----------------KAVF 435
Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
P +G A+++++L D + +LH +CPV+ G KW KWI R F +P
Sbjct: 436 PQRGSAIIWYNLKDDGDPNPQTLHAACPVLVGSKWVCNKWIRERAQLFKRP 486
>gi|20269816|gb|AAM18063.1|AF495541_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]SG1
[Drosophila melanogaster]
Length = 540
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 58/173 (33%), Positives = 86/173 (49%), Gaps = 20/173 (11%)
Query: 6 ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
+S S SEVR S +L + ++ I+ R+ T L E+ E +Q+++Y G +YEP
Sbjct: 368 QSENSTTSEVRISRNTWLWYDANPWLSKIKQRLEDVTGLSTESAEPLQLVNYGIGGQYEP 427
Query: 66 HFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
HFDF D G+R+ T L YL+ V GG T FP + A
Sbjct: 428 HFDFVEDDGQSVFSWKGNRLLTALFYLNDVALGGATAFPFLRL----------------A 471
Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKP 174
V P+KG L++++LH D + H CPV++G KW +W HV + F +P
Sbjct: 472 VPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKWICNEWFHVGAQEFRRP 524
>gi|440899661|gb|ELR50930.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Bos grunniens mutus]
Length = 478
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 69/191 (36%), Positives = 97/191 (50%), Gaps = 32/191 (16%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
+VA E K + E R S +L D ++ +++ RIAA T L P E +Q+++Y
Sbjct: 309 VVASGE--KQLPVEYRISKSAWLKDTVDPVLVTLDHRIAALTGLDVQPPYAEYLQVVNYG 366
Query: 59 HGQKYEPHFDFFRD------KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG 112
G YEPHFD +MN G+R+AT ++YLS VE GG T F G
Sbjct: 367 IGGHYEPHFDHATSPSSPLYRMN---SGNRVATFMIYLSSVEAGGATAFIY--------G 415
Query: 113 NWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RN 170
N+S V +K AL +++LH D +LH +CPV+ G+KW A KWIH +
Sbjct: 416 NFS--------VPVVKNAALFWWNLHRSGEGDGDTLHAACPVLVGDKWVANKWIHEYGQE 467
Query: 171 FDKP-EKEPED 180
F +P PED
Sbjct: 468 FRRPCSSRPED 478
>gi|194213450|ref|XP_001495951.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Equus
caballus]
Length = 548
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 65/188 (34%), Positives = 95/188 (50%), Gaps = 26/188 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
+VA E K + E R S +L D ++ +++ RIAA T L P E +Q+++Y
Sbjct: 379 VVASGE--KQLPVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYG 436
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + G+R+AT ++YLS VE GG T F +
Sbjct: 437 IGGHYEPHFDHATSPTSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYA----------- 485
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
++V +K AL +++LH DS +LH CPV+ G+KW A KWIH + F +
Sbjct: 486 -----NFSVPVVKNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 540
Query: 174 P-EKEPED 180
P PED
Sbjct: 541 PCSSSPED 548
>gi|170591592|ref|XP_001900554.1| prolyl 4-hydroxylase [Brugia malayi]
gi|16415740|emb|CAC82616.1| prolyl 4-hydroxylase [Brugia malayi]
gi|21425621|emb|CAD19314.1| prolyl 4-hydroxylase [Brugia malayi]
gi|158592166|gb|EDP30768.1| prolyl 4-hydroxylase, putative [Brugia malayi]
Length = 541
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 94/179 (52%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++ +G+ + RTS +L + E+V I RI T L E E +Q+ +Y G
Sbjct: 362 VQNSITGELETASYRTSKSAWLKDEEHEVVHRINKRIDLMTNLEQETSEELQVGNYGIGG 421
Query: 62 KYEPHFDFFR-DKMN--QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
Y+PHFDF R +++N Q L G+R+AT+L Y++ E GG TVF +EV +
Sbjct: 422 HYDPHFDFARREEVNAFQSLNTGNRLATLLFYMTQPESGGATVF--TEVKTT-------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
V P K DAL +++L D + H +CPV+ G KW + KWIH R F +P
Sbjct: 472 ------VMPSKNDALFWYNLLRSGEGDLRTRHAACPVLTGTKWVSNKWIHERGQEFRRP 524
>gi|312080225|ref|XP_003142509.1| prolyl 4-hydroxylase 2 [Loa loa]
Length = 541
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 65/179 (36%), Positives = 94/179 (52%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++ +G+ + RTS +L + EIV I RI T L E E +Q+ +Y G
Sbjct: 362 VQNSITGELETASYRTSKSAWLKDEEHEIVHRINRRIDLMTNLEQETSEELQVGNYGIGG 421
Query: 62 KYEPHFDFFR-DKMN--QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
Y+PHFDF R +++N Q L G+R+AT+L Y++ E GG TVF +EV +
Sbjct: 422 HYDPHFDFARREEVNAFQSLNTGNRLATLLFYMTQPESGGATVF--TEVKTT-------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
V P K DAL +++L D + H +CPV+ G KW + KWIH R F +P
Sbjct: 472 ------VMPSKNDALFWYNLLRSGEGDLRTRHAACPVLIGSKWVSNKWIHERGQEFRRP 524
>gi|393909803|gb|EFO21561.2| prolyl 4-hydroxylase 2 [Loa loa]
Length = 542
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 65/179 (36%), Positives = 94/179 (52%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++ +G+ + RTS +L + EIV I RI T L E E +Q+ +Y G
Sbjct: 363 VQNSITGELETASYRTSKSAWLKDEEHEIVHRINRRIDLMTNLEQETSEELQVGNYGIGG 422
Query: 62 KYEPHFDFFR-DKMN--QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
Y+PHFDF R +++N Q L G+R+AT+L Y++ E GG TVF +EV +
Sbjct: 423 HYDPHFDFARREEVNAFQSLNTGNRLATLLFYMTQPESGGATVF--TEVKTT-------- 472
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
V P K DAL +++L D + H +CPV+ G KW + KWIH R F +P
Sbjct: 473 ------VMPSKNDALFWYNLLRSGEGDLRTRHAACPVLIGSKWVSNKWIHERGQEFRRP 525
>gi|402593814|gb|EJW87741.1| hypothetical protein WUBG_01349 [Wuchereria bancrofti]
Length = 541
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 94/179 (52%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++ +G+ + RTS +L + E+V I RI T L E E +Q+ +Y G
Sbjct: 362 VQNSITGELETASYRTSKSAWLKDEEHEVVHRINKRIDLMTNLEQETSEELQVGNYGIGG 421
Query: 62 KYEPHFDFFR-DKMN--QQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
Y+PHFDF R +++N Q L G+R+AT+L Y++ E GG TVF +EV +
Sbjct: 422 HYDPHFDFARREEVNAFQSLNTGNRLATLLFYMTQPESGGATVF--TEVKTT-------- 471
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
V P K DAL +++L D + H +CPV+ G KW + KWIH R F +P
Sbjct: 472 ------VMPSKNDALFWYNLLRSGEGDLRTRHAACPVLTGTKWVSNKWIHERGQEFRRP 524
>gi|224006596|ref|XP_002292258.1| hypothetical protein THAPSDRAFT_263536 [Thalassiosira pseudonana
CCMP1335]
gi|220971900|gb|EED90233.1| hypothetical protein THAPSDRAFT_263536 [Thalassiosira pseudonana
CCMP1335]
Length = 206
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 63/181 (34%), Positives = 90/181 (49%), Gaps = 29/181 (16%)
Query: 5 NESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENG-------EAMQILHY 57
N+ G + RTS F +I + RI F G + +QIL Y
Sbjct: 36 NQGGSNAKLTTRTSMNAF------DITTKLSFRIKRRAFRLLRMGAYKENLADGIQILRY 89
Query: 58 EHGQKYEPHFDFFRDKM-NQQL------GGHRIATVLMYLSHVEKGGETVFPNSEVSQSR 110
E GQ Y H D+F + N L G +R AT+ +YLS VE GG+T+ ++ V
Sbjct: 90 ELGQAYIAHHDYFPVRQSNDHLWDPSKGGSNRFATIFLYLSDVEVGGQTLEKDAGVDA-- 147
Query: 111 DGNW-----SECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKW 165
G+W +C + AV P +GDA+LF+S +PD D SLHG+CP+++G KW A W
Sbjct: 148 -GSWEDKLVDQCYSK-LAVPPRRGDAILFYSQYPDGHLDPNSLHGACPILKGTKWGANLW 205
Query: 166 I 166
+
Sbjct: 206 V 206
>gi|198449500|ref|XP_001357604.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
gi|198130634|gb|EAL26738.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
Length = 528
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 58/171 (33%), Positives = 83/171 (48%), Gaps = 21/171 (12%)
Query: 9 KSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFD 68
+S + RTS + +E+ + RIA T E +Q ++Y G Y+ H+D
Sbjct: 367 RSEVVKTRTSKVAWFPDTFNELTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKHYD 426
Query: 69 FFRDKMNQ---QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVK 125
FF Q+ G RIATVL YL+ VE+GG TVFPN AV
Sbjct: 427 FFNASTATNLTQMNGDRIATVLFYLTDVEQGGATVFPNIR----------------KAVF 470
Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
P +G A+++++L D + +LH +CPV+ G KW KWI R F +P
Sbjct: 471 PQRGSAIIWYNLKDDGDPNPQTLHAACPVLVGSKWVCNKWIRERAQLFKRP 521
>gi|402894624|ref|XP_003910453.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-3 [Papio anubis]
Length = 535
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 66/188 (35%), Positives = 94/188 (50%), Gaps = 26/188 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
+VA E K + E R S +L D ++ ++ RIAA T L P E +Q+++Y
Sbjct: 366 VVASGE--KQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYG 423
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + G+R+AT ++YLS VE GG T F + +S
Sbjct: 424 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLS-------- 475
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
V +K AL +++LH DS +LH CPV+ G+KW A KWIH + F +
Sbjct: 476 --------VPVVKNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 527
Query: 174 P-EKEPED 180
P PED
Sbjct: 528 PCSSSPED 535
>gi|335294484|ref|XP_003357239.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Sus scrofa]
Length = 545
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 67/188 (35%), Positives = 95/188 (50%), Gaps = 26/188 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
+VA E K + E R S +L D ++ +++ RIAA T L P E +Q+++Y
Sbjct: 376 LVASGE--KQLPVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYG 433
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + G+R+AT ++YLS VE GG T F GN+S
Sbjct: 434 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIY--------GNFS 485
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
V +K AL +++LH D +LH CPV+ G+KW A KWIH + F +
Sbjct: 486 --------VPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 537
Query: 174 P-EKEPED 180
P PED
Sbjct: 538 PCSSSPED 545
>gi|38454288|ref|NP_942070.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Rattus norvegicus]
gi|81870816|sp|Q6W3E9.1|P4HA3_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|36962768|gb|AAQ87605.1| collagen prolyl 4-hydroxylase alpha III subunit [Rattus norvegicus]
Length = 544
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 67/188 (35%), Positives = 96/188 (51%), Gaps = 26/188 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
+VA E K + E R S +L D ++ +++ RIAA T L P E +Q+++Y
Sbjct: 375 VVASGE--KQLQVEYRISKSAWLKDTVDPVLVTLDRRIAALTGLDIQPPYAEYLQVVNYG 432
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + G+R AT+++YLS VE GG T F GN+S
Sbjct: 433 IGGHYEPHFDHATSPSSPLYKMKSGNRAATLMIYLSSVEAGGATAFIY--------GNFS 484
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
V +K AL +++LH D +LH CPV+ G+KW A KWIH + F +
Sbjct: 485 --------VPVVKNAALFWWNLHRSGEGDDDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 536
Query: 174 P-EKEPED 180
P + PED
Sbjct: 537 PCDTNPED 544
>gi|405965633|gb|EKC30995.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
Length = 617
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 58/179 (32%), Positives = 90/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + +GK +E R S +L D ++ ++ RI+ T L E +QI +Y G
Sbjct: 448 VHNPRTGKLETAEYRVSKSAWLKDGDDPVIHNVNNRISDITGLSMATAEELQIANYGLGG 507
Query: 62 KYEPHFDFFRDKMNQQL----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R + + G+RIAT L Y+++V+ GG TVF +
Sbjct: 508 QYEPHFDFARREETEAFRDLGSGNRIATWLTYMTNVDAGGATVFTHI------------- 554
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
G + P+KG A +++L+ + H +CPV+ G+KW + KWIH R F +P
Sbjct: 555 ---GVKLFPIKGAAAFWYNLYRSGDGIFDTRHAACPVLVGQKWVSNKWIHERGQEFRRP 610
>gi|324511726|gb|ADY44875.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
Length = 550
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 64/179 (35%), Positives = 88/179 (49%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++++G+ + R S +L E++ I RI T L E E +QI +Y G
Sbjct: 359 VQNSKTGELETAAYRISKSAWLKGGDHELIDRINRRIELMTNLIQETSEELQIANYGVGG 418
Query: 62 KYEPHFDFFRD---KMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
Y+PHFDF R K + LG G+R+ATVL YL+ E GG TVF
Sbjct: 419 HYDPHFDFARKEEPKAFESLGTGNRLATVLFYLTEPEIGGGTVFTELRT----------- 467
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
AV P K AL +++L+ D + H +CPV+ G KW A KWIH R F +P
Sbjct: 468 -----AVMPSKNGALFWYNLYRSGEGDLRTRHAACPVLVGIKWVANKWIHERGQEFLRP 521
>gi|156352054|ref|XP_001622587.1| predicted protein [Nematostella vectensis]
gi|156209158|gb|EDO30487.1| predicted protein [Nematostella vectensis]
Length = 531
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 61/175 (34%), Positives = 91/175 (52%), Gaps = 25/175 (14%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDE----IVASIEARIAAWTFL--PPENGEAMQIL 55
V + ++G+ ++ R S +L +D+ I+ + R + T L P + EA+QI+
Sbjct: 357 VTNLKTGELEFADYRISKSGWLEDPRDDNEEKILNRVNRRTSIITGLDTTPRSAEALQIV 416
Query: 56 HYEHGQKYEPHFDFFRDKMNQQLG---GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG 112
+Y YEPHFD + ++ L G+RIATVL Y+S VE GG TVF ++E
Sbjct: 417 NYGAAGHYEPHFDHATEAVSSILKLGIGNRIATVLYYMSDVEAGGATVFVDAEA------ 470
Query: 113 NWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
VKP KGDA +++LH + D + H +CP+I G KW KWIH
Sbjct: 471 ----------IVKPSKGDAAFWYNLHKNGKGDERTRHAACPIIVGSKWVCNKWIH 515
>gi|308497208|ref|XP_003110791.1| CRE-DPY-18 protein [Caenorhabditis remanei]
gi|308242671|gb|EFO86623.1| CRE-DPY-18 protein [Caenorhabditis remanei]
Length = 559
Score = 99.0 bits (245), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 63/179 (35%), Positives = 91/179 (50%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +GK + + R S +L + + E+V + RI T L E E +QI +Y G
Sbjct: 358 VHDSATGKLVTATYRISKSAWLKEWEHEVVERVNKRIELMTNLEMETAEELQIANYGIGG 417
Query: 62 KYEPHFDFFR---DKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
Y+PHFD + K + LG G+RIATVL Y+S GG TVF +EV +
Sbjct: 418 HYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSHGGGTVF--TEVKST-------- 467
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
V P K DAL +++L + + H +CPV+ G KW + KWIH + F +P
Sbjct: 468 ------VLPTKNDALFWYNLFKQGDGNPDTRHAACPVLVGIKWVSNKWIHEKGNEFRRP 520
>gi|431838427|gb|ELK00359.1| Prolyl 4-hydroxylase subunit alpha-3 [Pteropus alecto]
Length = 483
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/172 (34%), Positives = 89/172 (51%), Gaps = 23/172 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
+VA E K + E R S +L D ++ +++ RIAA T L P E +Q+++Y
Sbjct: 314 VVASGE--KQLPVEYRISKSAWLKDTADPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYG 371
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + G+R+AT ++YLS VE GG T F +
Sbjct: 372 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYA----------- 420
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
++V +K AL +++LH DS +LH +CPV+ G+KW A KWIH
Sbjct: 421 -----NFSVPVVKNAALFWWNLHRSGEGDSDTLHAACPVLVGDKWVANKWIH 467
>gi|194905419|ref|XP_001981192.1| GG11932 [Drosophila erecta]
gi|190655830|gb|EDV53062.1| GG11932 [Drosophila erecta]
Length = 535
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 58/170 (34%), Positives = 83/170 (48%), Gaps = 19/170 (11%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
+G S A+ RTS G + +++ + + ++ L E E +Q+ +Y G YEPH
Sbjct: 362 NGDSTAAAFRTSQGASFNYSRNAATKLLSHHVGDFSGLNMEYAEDLQVANYGIGGHYEPH 421
Query: 67 FDFFRDKMNQQ---LGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
+D F D Q L G+RIAT + YLS VE GG T FP +
Sbjct: 422 WDSFPDNHVYQEGDLHGNRIATAIYYLSDVEAGGGTAFPFLPL----------------L 465
Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
V P +G L +++LHP D + H +CPV++G KW A WI RN D
Sbjct: 466 VTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQDN 515
>gi|221126103|ref|XP_002165259.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
magnipapillata]
Length = 533
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 59/179 (32%), Positives = 91/179 (50%), Gaps = 19/179 (10%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
V D +G I ++ R S ++++ D I A I R+ T L E +Q+ +Y
Sbjct: 362 FVHDMVTGDLIYADYRVSKNTWIAEDMDVIAAKIIRRVGDVTGLNMRYAEHLQVANYGIA 421
Query: 61 QKYEPHFDF---FRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFD R K + GG+RIAT+L+YLS V+ GG TVF N+
Sbjct: 422 GQYEPHFDHSTGTRPKHFDRWGGNRIATMLLYLSDVDWGGRTVFTNTA------------ 469
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKP 174
G P+KG + +++L + ++ + H CPV+ G+KW A WIH + F++P
Sbjct: 470 --PGVGTDPIKGAGVFWYNLLRNGKSNPKTQHAGCPVVLGQKWVANLWIHEHGQEFNRP 526
>gi|52139015|gb|AAH82538.1| P4ha3 protein [Mus musculus]
Length = 404
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 67/188 (35%), Positives = 95/188 (50%), Gaps = 26/188 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
+VA E K + E R S +L D ++ +++ RIAA T L P E +Q+++Y
Sbjct: 235 VVASGE--KQLQVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYG 292
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + G+R+AT ++YLS VE GG T F GN+S
Sbjct: 293 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIY--------GNFS 344
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
V +K AL +++LH D +LH CPV+ G+KW A KWIH + F +
Sbjct: 345 --------VPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 396
Query: 174 P-EKEPED 180
P PED
Sbjct: 397 PCSTNPED 404
>gi|268572523|ref|XP_002641343.1| C. briggsae CBR-DPY-18 protein [Caenorhabditis briggsae]
gi|94442971|emb|CAJ98658.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
Length = 559
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 61/170 (35%), Positives = 87/170 (51%), Gaps = 20/170 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +GK + + R S +L + E+V + RI T L E E +QI +Y G
Sbjct: 358 VHDSVTGKLVTATYRISKSAWLKAWEHEVVERVNKRIDLMTNLEMETAEELQIANYGIGG 417
Query: 62 KYEPHFDFFR---DKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
Y+PHFD + K + LG G+RIATVL Y+S GG TVF +EV +
Sbjct: 418 HYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSHGGGTVF--TEVKST-------- 467
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
V P K DAL +++L+ + + H +CPV+ G KW + KWIH
Sbjct: 468 ------VLPTKNDALFWYNLYKQGDGNPDTRHAACPVLVGIKWVSNKWIH 511
>gi|428178571|gb|EKX47446.1| hypothetical protein GUITHDRAFT_152114 [Guillardia theta CCMP2712]
Length = 262
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 60/185 (32%), Positives = 93/185 (50%), Gaps = 29/185 (15%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
M+ + K + S RT+ G +L QD++V +E + T P+ GE +Q+LHY +G
Sbjct: 94 MIMPYGTHKLVESTTRTNDGAWLDFLQDDVVRRLEETLGKLTKTTPQQGENLQVLHYSNG 153
Query: 61 -QKYEPHFDFF---RDKMNQ-QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
Q ++ H+D+F RD + GG+R TV++YL +GGET FP
Sbjct: 154 AQFFQEHYDYFDPARDPPESFEQGGNRYITVIVYLEAALEGGETHFPE------------ 201
Query: 116 ECARRGYAVKPMKGDALLFFSLH-------PDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
G + GDAL+F++L PD + ++H + P + GEKW A KWIH
Sbjct: 202 ----LGLKLTAQPGDALMFYNLKEHCSGTDPDC-VEKKTIHAALPPVRGEKWVAVKWIHE 256
Query: 169 RNFDK 173
+ + K
Sbjct: 257 KPYQK 261
>gi|403263105|ref|XP_003923900.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-3, partial [Saimiri boliviensis boliviensis]
Length = 534
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 66/188 (35%), Positives = 93/188 (49%), Gaps = 26/188 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
+VA E K + E R S +L D ++ ++ RIAA T L P E +Q+++Y
Sbjct: 365 VVASGE--KQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYG 422
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + G+R+AT ++YLS VE GG T F + +S
Sbjct: 423 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLS-------- 474
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
V +K AL +++LH DS +LH CPV+ G KW A KWIH + F +
Sbjct: 475 --------VPVVKNAALFWWNLHRSGEGDSDTLHAGCPVLVGNKWVANKWIHEYGQEFRR 526
Query: 174 P-EKEPED 180
P PED
Sbjct: 527 PCSSSPED 534
>gi|170064951|ref|XP_001867739.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
gi|167882142|gb|EDS45525.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
Length = 516
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/185 (35%), Positives = 97/185 (52%), Gaps = 24/185 (12%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENG-EAMQILHYEH 59
MV D + K S+ RTS +L+ +VA++ R E E++Q+ +Y
Sbjct: 349 MVGD--AAKKEVSKSRTSQNSWLTDYDHPVVAALSRRTKDMALGLDETAYESLQVNNYGI 406
Query: 60 GQKYEPHFDFFRDK--MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
G Y PH+D+ R++ + G+RIAT++ YLS VE+GG TVFP+
Sbjct: 407 GGHYLPHYDWSREENPYPELNTGNRIATLMFYLSDVEEGGATVFPH-------------- 452
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP- 174
G V P KG A+ +++L D +LHG+CPV+ G KW A KWIH R+ F +P
Sbjct: 453 --LGVGVFPKKGTAIFWYNLRASGKGDEKTLHGACPVLIGSKWVANKWIHERHQEFVRPC 510
Query: 175 EKEPE 179
+ +PE
Sbjct: 511 DPDPE 515
>gi|17552840|ref|NP_499464.1| Protein DPY-18 [Caenorhabditis elegans]
gi|20455505|sp|Q10576.2|P4HA1_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; AltName: Full=Protein dumpy-18; Flags:
Precursor
gi|3881011|emb|CAA21045.1| Protein DPY-18 [Caenorhabditis elegans]
gi|6900013|emb|CAB71298.1| prolyl 4-hydroxylase alpha subunit 1 [Caenorhabditis elegans]
Length = 559
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 57/170 (33%), Positives = 88/170 (51%), Gaps = 20/170 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +GK + + R S +L + + ++V ++ RI T L E E +QI +Y G
Sbjct: 358 VHDSVTGKLVTATYRISKSAWLKEWEGDVVETVNKRIGYMTNLEMETAEELQIANYGIGG 417
Query: 62 KYEPHFDFFR---DKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
Y+PHFD + K + LG G+RIATVL Y+S GG TVF ++ +
Sbjct: 418 HYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSHGGGTVFTEAKST---------- 467
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
+ P K DAL +++L+ + + H +CPV+ G KW + KWIH
Sbjct: 468 ------ILPTKNDALFWYNLYKQGDGNPDTRHAACPVLVGIKWVSNKWIH 511
>gi|395814850|ref|XP_003780953.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Otolemur
garnettii]
Length = 544
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 64/188 (34%), Positives = 96/188 (51%), Gaps = 26/188 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
+VA E K + + R S +L D ++ +++ RIAA T L P E +Q+++Y
Sbjct: 375 VVASGE--KQLQVDYRISKSAWLKDTVDPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYG 432
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + G+R+AT ++YLS VE GG T F +
Sbjct: 433 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYA----------- 481
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
++V +K AL +++LH + DS +LH CPV+ G+KW A KWIH + F +
Sbjct: 482 -----NFSVPVVKNAALFWWNLHRNGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 536
Query: 174 P-EKEPED 180
P PED
Sbjct: 537 PCSSSPED 544
>gi|81870817|sp|Q6W3F0.1|P4HA3_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|36962749|gb|AAQ87604.1| collagen prolyl 4-hydroxylase alpha III subunit [Mus musculus]
Length = 542
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 67/188 (35%), Positives = 95/188 (50%), Gaps = 26/188 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
+VA E K + E R S +L D ++ +++ RIAA T L P E +Q+++Y
Sbjct: 373 VVASGE--KQLQVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYG 430
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + G+R+AT ++YLS VE GG T F GN+S
Sbjct: 431 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIY--------GNFS 482
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
V +K AL +++LH D +LH CPV+ G+KW A KWIH + F +
Sbjct: 483 --------VPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 534
Query: 174 P-EKEPED 180
P PED
Sbjct: 535 PCSTNPED 542
>gi|296217074|ref|XP_002754870.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Callithrix
jacchus]
Length = 544
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 66/188 (35%), Positives = 93/188 (49%), Gaps = 26/188 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
+VA E K + E R S +L D ++ ++ RIAA T L P E +Q+++Y
Sbjct: 375 VVASGE--KQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYG 432
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + G+R+AT ++YLS VE GG T F + +S
Sbjct: 433 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLS-------- 484
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
V +K AL +++LH DS +LH CPV+ G KW A KWIH + F +
Sbjct: 485 --------VPVVKNAALFWWNLHRSGEGDSDTLHAGCPVLVGNKWVANKWIHEYGQEFRR 536
Query: 174 P-EKEPED 180
P PED
Sbjct: 537 PCSSSPED 544
>gi|227908832|ref|NP_796135.3| prolyl 4-hydroxylase subunit alpha-3 precursor [Mus musculus]
Length = 542
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 67/188 (35%), Positives = 95/188 (50%), Gaps = 26/188 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
+VA E K + E R S +L D ++ +++ RIAA T L P E +Q+++Y
Sbjct: 373 VVASGE--KQLQVEYRISKSAWLKDTVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYG 430
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + G+R+AT ++YLS VE GG T F GN+S
Sbjct: 431 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIY--------GNFS 482
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
V +K AL +++LH D +LH CPV+ G+KW A KWIH + F +
Sbjct: 483 --------VPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 534
Query: 174 P-EKEPED 180
P PED
Sbjct: 535 PCSTNPED 542
>gi|297689698|ref|XP_002822285.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pongo abelii]
Length = 544
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/188 (34%), Positives = 94/188 (50%), Gaps = 26/188 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
+VA E K + E R S +L D ++ ++ RIAA T L P E +Q+++Y
Sbjct: 375 VVASGE--KQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYG 432
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + G+R+AT ++YLS VE GG T F + +S
Sbjct: 433 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLS-------- 484
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
V ++ AL +++LH DS +LH CPV+ G+KW A KWIH + F +
Sbjct: 485 --------VPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 536
Query: 174 P-EKEPED 180
P PED
Sbjct: 537 PCSSSPED 544
>gi|332211329|ref|XP_003254773.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Nomascus
leucogenys]
Length = 544
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/188 (34%), Positives = 94/188 (50%), Gaps = 26/188 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
+VA E K + E R S +L D ++ ++ RIAA T L P E +Q+++Y
Sbjct: 375 VVASGE--KQLQVEYRISKSAWLKDTVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYG 432
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + G+R+AT ++YLS VE GG T F + +S
Sbjct: 433 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLS-------- 484
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
V ++ AL +++LH DS +LH CPV+ G+KW A KWIH + F +
Sbjct: 485 --------VPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 536
Query: 174 P-EKEPED 180
P PED
Sbjct: 537 PCSSSPED 544
>gi|607947|gb|AAA62207.1| prolyl 4-hydroxylase alpha subunit [Caenorhabditis elegans]
Length = 558
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 57/170 (33%), Positives = 88/170 (51%), Gaps = 20/170 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D+ +GK + + R S +L + + ++V ++ RI T L E E +QI +Y G
Sbjct: 357 VHDSVTGKLVTATYRISKSAWLKEWEGDVVETVNKRIGYMTNLEMETAEELQIANYGIGG 416
Query: 62 KYEPHFDFFR---DKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
Y+PHFD + K + LG G+RIATVL Y+S GG TVF ++ +
Sbjct: 417 HYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSHGGGTVFTEAKST---------- 466
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
+ P K DAL +++L+ + + H +CPV+ G KW + KWIH
Sbjct: 467 ------ILPTKNDALFWYNLYKQGDGNPDTRHAACPVLVGIKWVSNKWIH 510
>gi|395521232|ref|XP_003764722.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Sarcophilus
harrisii]
Length = 521
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 62/175 (35%), Positives = 88/175 (50%), Gaps = 29/175 (16%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
+VA E + + E R S +L D I+ S++ RIAA T L P E +Q+++Y
Sbjct: 352 VVASGEKQQQV--EYRISKSAWLKDTVDPILVSLDRRIAALTGLNVQPPYAEHLQVVNYG 409
Query: 59 HGQKYEPHFDFFRD------KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG 112
G YEPHFD +MN G+R+AT ++YLS VE GG T F +
Sbjct: 410 IGGHYEPHFDHATSPSSPLYRMNS---GNRVATFMIYLSSVEAGGSTAFIYAN------- 459
Query: 113 NWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
++V +K AL +++LH D +LH CPV+ G+KW A KWIH
Sbjct: 460 ---------FSVPVVKNAALFWWNLHRSGQGDGDTLHAGCPVLVGDKWVANKWIH 505
>gi|194765168|ref|XP_001964699.1| GF22909 [Drosophila ananassae]
gi|190614971|gb|EDV30495.1| GF22909 [Drosophila ananassae]
Length = 525
Score = 98.2 bits (243), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/201 (32%), Positives = 97/201 (48%), Gaps = 28/201 (13%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
+ +S+ S VRTS FL +D+++A+I+ R+A T E Q +Y G Y H
Sbjct: 335 TNESVVSNVRTSQFTFLPVTEDKVLATIDRRVADMTNFNMRYAEDHQFANYGIGGHYGQH 394
Query: 67 FD-FFRDKMNQQL-----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
D F++ + L G+RIATVL YLS V +GG T FP+ V
Sbjct: 395 MDWFYQPSFDAGLVSSPEMGNRIATVLFYLSDVTQGGGTAFPHLRV-------------- 440
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPED 180
+KP K A +++LH D + HG+CP+I G KW +WI R F + ++ P
Sbjct: 441 --LLKPKKYAAAFWYNLHASGVGDPRTQHGACPIISGSKWVQNRWI--REFIQSDRRP-- 494
Query: 181 DDCVDEDLNCVVWAKAGECKK 201
C+ D + A+ E +K
Sbjct: 495 --CLTWDDSLATLAEIRELEK 513
>gi|397643670|gb|EJK76008.1| hypothetical protein THAOC_02250 [Thalassiosira oceanica]
Length = 480
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 63/203 (31%), Positives = 91/203 (44%), Gaps = 42/203 (20%)
Query: 5 NESGKSIASEVRTSSGMF-LSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
N+ G RTS F ++ Q V R+ + +QIL Y+ GQ Y
Sbjct: 247 NQGGDGAVLTTRTSENAFDITTKQSFDVKKRAFRLLRMNGYQENMADGIQILRYKVGQAY 306
Query: 64 EPHFDFFRDKMNQQL-------GGHRIATVLMYLSHVEKGGETVFPNSE----------- 105
H D+F ++ G +R AT+ +YLS V GG+TVFPN E
Sbjct: 307 VAHHDYFPTHQSKDFNWDPLSGGSNRFATIFLYLSDVSYGGQTVFPNCEKLSAEKSPELV 366
Query: 106 ---------------VSQS--RDGNWSE-----CARRGYAVKPMKGDALLFFSLHPDAST 143
VS + +G+W + C + +AV P +GDA+LF+S PD
Sbjct: 367 ERLGESPSASELKEFVSNAGLMEGSWEDNLIHKCYEK-FAVPPRRGDAILFYSQRPDGLL 425
Query: 144 DSTSLHGSCPVIEGEKWSATKWI 166
D+ SLHG+CP++ G KW A W+
Sbjct: 426 DTNSLHGACPILNGTKWGANLWV 448
>gi|126327904|ref|XP_001367838.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Monodelphis
domestica]
Length = 559
Score = 98.2 bits (243), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 66/191 (34%), Positives = 96/191 (50%), Gaps = 32/191 (16%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
+VA E + + E R S +L D ++ S++ RIAA T L P E +Q+++Y
Sbjct: 390 VVASGEKQQQV--EYRISKSAWLKDTVDPMLVSLDHRIAALTGLNVQPPYAEHLQVVNYG 447
Query: 59 HGQKYEPHFDFFRD------KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDG 112
G YEPHFD +MN G+R+AT ++YLS VE GG T F +
Sbjct: 448 IGGHYEPHFDHATSPSSPLYRMNS---GNRVATFMIYLSSVEAGGSTAFIYAN------- 497
Query: 113 NWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RN 170
++V +K AL +++LH D +LH CPV+ G+KW A KWIH +
Sbjct: 498 ---------FSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQE 548
Query: 171 FDKP-EKEPED 180
F +P +PED
Sbjct: 549 FRRPCSAKPED 559
>gi|417402564|gb|JAA48127.1| Putative prolyl 4-hydroxylase alpha subunit [Desmodus rotundus]
Length = 544
Score = 97.8 bits (242), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 64/188 (34%), Positives = 94/188 (50%), Gaps = 26/188 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
+VA E K + E R S +L D ++ +++ RIAA T L P E +Q+++Y
Sbjct: 375 VVASGE--KQLPVEYRISKSAWLKDTVDPMLVTLDRRIAALTGLDTQPPYAEHLQVVNYG 432
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + G+R+AT ++YLS VE GG T F +
Sbjct: 433 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYA----------- 481
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
++V +K AL +++LH D +LH CPV+ G+KW A KWIH + F +
Sbjct: 482 -----NFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 536
Query: 174 P-EKEPED 180
P PED
Sbjct: 537 PCSSSPED 544
>gi|195341548|ref|XP_002037368.1| GM12149 [Drosophila sechellia]
gi|194131484|gb|EDW53527.1| GM12149 [Drosophila sechellia]
Length = 537
Score = 97.4 bits (241), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 57/169 (33%), Positives = 85/169 (50%), Gaps = 20/169 (11%)
Query: 10 SIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDF 69
S +EVR S +L + ++ I+ R+ T L E+ E +Q+++Y G +YEPHFDF
Sbjct: 369 STTTEVRISRNTWLWYDANPWLSKIKQRLEDVTGLSTESAEPLQLVNYGIGGQYEPHFDF 428
Query: 70 FRD--KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPM 127
D K G+R+ T L YL+ V GG T FP + AV P+
Sbjct: 429 VEDDGKTVFSWKGNRLLTALFYLNDVALGGATAFPFLRL----------------AVPPV 472
Query: 128 KGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKP 174
KG L++++LH D + H CPV++G KW +W HV + F +P
Sbjct: 473 KGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKWICNEWFHVAAQEFRRP 521
>gi|383642155|ref|ZP_09954561.1| hypothetical protein SeloA3_06917 [Sphingomonas elodea ATCC 31461]
Length = 327
Score = 97.4 bits (241), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 60/173 (34%), Positives = 82/173 (47%), Gaps = 22/173 (12%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVA-SIEARIAAWTFLPPENGEAMQILHYEH 59
V D SG+ I +RTS G + + +V +I RIAA T E GE++ +L Y
Sbjct: 169 FVLDPNSGRPIPHPIRTSDGGAIGPTNENLVVRAINLRIAAATGTAVEQGESLTVLRYAR 228
Query: 60 GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
GQ+Y H D NQ RIAT ++YL+ +GGET FP +
Sbjct: 229 GQEYRRHLDTIAGAENQ-----RIATFIVYLNDGFEGGETHFPLLNIQ------------ 271
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
V+P GDA+ F ++ PD + D +H PV G KW AT+WI D
Sbjct: 272 ----VRPRIGDAIRFDTIRPDGTPDPRLVHAGQPVRNGVKWIATRWIRREPVD 320
>gi|354504916|ref|XP_003514519.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cricetulus
griseus]
Length = 509
Score = 97.4 bits (241), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 61/180 (33%), Positives = 90/180 (50%), Gaps = 24/180 (13%)
Query: 9 KSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYEHGQKYEPH 66
K + E R S +L D ++ +++ RIAA T L P E +Q+++Y G YEPH
Sbjct: 346 KQLPVEYRISKSAWLKDTVDPMLGTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPH 405
Query: 67 FDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
FD + + G+R+AT ++YLS VE GG T F + ++
Sbjct: 406 FDHATSPSSPLYRMKSGNRVATFMIYLSAVEAGGATAFIYA----------------NFS 449
Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKP-EKEPED 180
V +K AL +++LH D +LH CPV+ G+KW A KWIH + F +P PED
Sbjct: 450 VPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRRPCSTNPED 509
>gi|195452778|ref|XP_002073496.1| GK13116 [Drosophila willistoni]
gi|194169581|gb|EDW84482.1| GK13116 [Drosophila willistoni]
Length = 521
Score = 97.4 bits (241), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 56/166 (33%), Positives = 83/166 (50%), Gaps = 21/166 (12%)
Query: 14 EVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK 73
+ RT+ + +++ + RI T E +Q+++Y G Y HFD+F
Sbjct: 367 KTRTAKVAWFLDTFNQLTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDYFNTT 426
Query: 74 MN---QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
N Q+ G RIATVL YL+ VE+GG TVFP E+ + AV P +G
Sbjct: 427 TNPHISQINGDRIATVLFYLNDVEQGGATVFP--EIKK--------------AVFPKRGS 470
Query: 131 ALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
A+++++L D + +LH +CPVI G KW KWI R F +P
Sbjct: 471 AIMWYNLKDDGEGNRDTLHAACPVIVGSKWVCNKWIREREQIFRRP 516
>gi|324507368|gb|ADY43128.1| Prolyl 4-hydroxylase subunit alpha-2 [Ascaris suum]
Length = 534
Score = 97.4 bits (241), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 62/179 (34%), Positives = 84/179 (46%), Gaps = 22/179 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + +G + R S +L + I RI T L E E +Q +Y G
Sbjct: 355 VQNARTGDLEYANYRISKSAWLKGTDHPAIDRINKRIDLMTNLNQETAEELQAQNYGIGG 414
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
Y+PHFDF R + G+RIAT+L+Y+S VE GG TVF +
Sbjct: 415 HYDPHFDFARKEDINAFKTLNTGNRIATILIYMSDVESGGATVFNH-------------- 460
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
G AV P K DAL +++L D D + H +CPV+ G KW + KWIH R F +P
Sbjct: 461 --LGNAVFPSKYDALFWYNLRRDGEGDLRTRHAACPVLTGIKWVSNKWIHDRGQEFRRP 517
>gi|391342914|ref|XP_003745760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Metaseiulus
occidentalis]
Length = 525
Score = 97.4 bits (241), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 59/172 (34%), Positives = 81/172 (47%), Gaps = 20/172 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + +SG+ + R S +L E+V + R T L E +Q+++Y G
Sbjct: 356 VQNAKSGELEVANYRISKSAWLKNHDHEVVERLSFRFEYLTGLTHLTAEELQVVNYGIGG 415
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
YE HFDF R D Q G+RIAT + Y+S V+ GG TVFP
Sbjct: 416 HYEAHFDFARRDEKDAFKQLGTGNRIATWINYMSDVKAGGATVFP--------------- 460
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
R G V P KG A +++LH D + H +CPV+ G KW + KW H R
Sbjct: 461 -RLGLTVWPEKGSAAFWWNLHRSGEGDILTRHAACPVLAGSKWVSNKWFHER 511
>gi|443712762|gb|ELU05926.1| hypothetical protein CAPTEDRAFT_153364 [Capitella teleta]
Length = 491
Score = 97.1 bits (240), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 64/187 (34%), Positives = 89/187 (47%), Gaps = 29/187 (15%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWT-----FLPPENG-EAMQILHYEHGQKYEP 65
+S+ R S +L D ++ + ARI T + P + EAMQ+++Y G +YEP
Sbjct: 320 SSDQRISKVGWLFDNVDTLIKKLSARIGDVTGLNTVYTPVRSPVEAMQVVNYGIGGQYEP 379
Query: 66 HFDFFRD-----KMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
H DF+ D +N L G RI+T L YLS V GG TVFP V
Sbjct: 380 HLDFYEDPEMLKNVNPSLQDTGDRISTFLFYLSRVHLGGATVFPKLNVR----------- 428
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
V P+K A +++ P+ D +LH CPV+ GEKW A KWI R + P
Sbjct: 429 -----VPPVKNGAAFWYNARPNGEHDKRTLHAGCPVVLGEKWVANKWIRERGQEFYRPCP 483
Query: 179 EDDDCVD 185
D + +D
Sbjct: 484 LDKEAID 490
>gi|195505251|ref|XP_002099423.1| GE23370 [Drosophila yakuba]
gi|194185524|gb|EDW99135.1| GE23370 [Drosophila yakuba]
Length = 534
Score = 97.1 bits (240), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 59/161 (36%), Positives = 80/161 (49%), Gaps = 27/161 (16%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF----- 70
RT+ G +L K +E+ I RI T + E Q+++Y G Y H D+F
Sbjct: 369 RTAKGHWLKKESNELTRRITRRIVDMTGFDLADSEDFQVINYGIGGHYFLHMDYFDYASS 428
Query: 71 -----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVK 125
R + ++ LG RIATVL YLS VE+GG TVF N GY+V
Sbjct: 429 NYTGPRSRQSKVLGD-RIATVLFYLSDVEQGGATVFGNV----------------GYSVY 471
Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
P G A+ +++L D + D + H SCPVI G KW T+WI
Sbjct: 472 PQAGTAIFWYNLDTDGNGDPLTRHASCPVIVGSKWVMTEWI 512
>gi|195110925|ref|XP_002000030.1| GI22756 [Drosophila mojavensis]
gi|193916624|gb|EDW15491.1| GI22756 [Drosophila mojavensis]
Length = 533
Score = 97.1 bits (240), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 56/170 (32%), Positives = 84/170 (49%), Gaps = 19/170 (11%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G+ + S R S +L + ++ + + T L E +Q+ +Y G YEPH+
Sbjct: 363 GQRMKSAFRVSKNAWLPYSTHPMMGRMLRDVGDATGLDMTYCEQLQVANYGVGGHYEPHW 422
Query: 68 DFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
DFFRD + G+RIAT + YLS VE+GG T FP +AV+P
Sbjct: 423 DFFRDSRHYPAAEGNRIATAIFYLSDVEQGGATAFPFL----------------NFAVRP 466
Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
G+ L +++LH + D + H CPV++G KW A WIH + F +P
Sbjct: 467 QLGNILFWYNLHRSSDEDYRTKHAGCPVLKGSKWIANIWIHEATQTFARP 516
>gi|194765174|ref|XP_001964702.1| GF23328 [Drosophila ananassae]
gi|190614974|gb|EDV30498.1| GF23328 [Drosophila ananassae]
Length = 542
Score = 97.1 bits (240), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 56/173 (32%), Positives = 91/173 (52%), Gaps = 21/173 (12%)
Query: 6 ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
+S + SE+RTS+ +L ++ ++ I+ R+ T L E+ E +Q+++Y G +YEP
Sbjct: 371 QSQNATTSEIRTSANTWLWYNENPWLSKIKQRLEDITGLSTESAEPLQLVNYGIGGQYEP 430
Query: 66 HFDFFRDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
HFDF + + G G+R+ T L Y++ V GG T FP ++ A
Sbjct: 431 HFDFVEEP-QKVFGWKGNRMLTALFYINDVALGGATAFPFLQL----------------A 473
Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
V P+KG L++++LH D + H CPVI+G KW +W H + F +P
Sbjct: 474 VPPVKGSLLVWYNLHRSLHKDFRTKHAGCPVIKGSKWICNEWFHEGTQVFKRP 526
>gi|340367965|ref|XP_003382523.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Amphimedon
queenslandica]
Length = 525
Score = 97.1 bits (240), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 61/176 (34%), Positives = 89/176 (50%), Gaps = 23/176 (13%)
Query: 6 ESGKSIASEVRTSSGMFLSKAQDEI--VASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
E+G+ + + R S +LS + D + V I+ RI T L E +Q+++Y G +Y
Sbjct: 359 ENGELLHATYRISKSGWLSGSDDPLGYVDRIDQRIEDVTGLTMSTAEQLQVVNYGIGGQY 418
Query: 64 EPHFDFFR---DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
EPH+DF R D G+RI+T+L+Y+S VEKGG TVFP
Sbjct: 419 EPHYDFARTGEDTFTSLGSGNRISTLLIYMSDVEKGGATVFPGV---------------- 462
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G + P+K A +++L D ++ H CPV+ G KW KWIH R F +P
Sbjct: 463 GARLVPIKRAAAYWWNLKRSGDGDYSTRHAGCPVLVGSKWVCNKWIHERGQEFRRP 518
>gi|195444366|ref|XP_002069834.1| GK11733 [Drosophila willistoni]
gi|194165919|gb|EDW80820.1| GK11733 [Drosophila willistoni]
Length = 517
Score = 97.1 bits (240), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 58/165 (35%), Positives = 88/165 (53%), Gaps = 23/165 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWT--FLPPENGEAMQILHYEHGQKYEPHFDFFRDK 73
RTS+ +F+ + ++ +I R A T ++ + E +Q+++Y G +Y PH D+F +
Sbjct: 368 RTSNSVFMEETGITLLETISQRAADMTDLYVTAISSEDLQVINYGLGGQYTPHCDYFDEN 427
Query: 74 MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALL 133
G R+ATVL YL+ V++GG TVFP +S P KG AL+
Sbjct: 428 AE---NGDRLATVLFYLTDVQQGGATVFPFLRLSYF----------------PKKGSALI 468
Query: 134 FFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
F +L S D S H +CPV+ G KW ATKWI+ +FD+ + P
Sbjct: 469 FRNLDNAMSGDKDSTHSACPVLFGNKWVATKWIY--HFDQMTRWP 511
>gi|348683507|gb|EGZ23322.1| hypothetical protein PHYSODRAFT_310730 [Phytophthora sojae]
Length = 417
Score = 97.1 bits (240), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 56/165 (33%), Positives = 87/165 (52%), Gaps = 12/165 (7%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
A++ RTS+ FL + I+ R++ T +P ++ E +Q+L YE QKY+ H D+F
Sbjct: 249 ATDWRTSTTYFLPSDAHPKIDEIDQRVSDLTKVPIDHQEDVQVLRYEKTQKYDHHTDYFP 308
Query: 72 DKMNQQLG----------GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
+ ++ +R+ TV Y+S V KGG T+FP + R + +C G
Sbjct: 309 VEHHKNAPHILESIDYGYKNRMITVFWYMSDVAKGGHTIFPRAG-GAPRPTSMKDCTT-G 366
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
V P K ++F+S+ P+ D SLHG CPV EG K+S KW+
Sbjct: 367 LNVPPKKRKVIVFYSMLPNGEGDPMSLHGGCPVEEGVKYSGNKWV 411
>gi|313229039|emb|CBY18191.1| unnamed protein product [Oikopleura dioica]
Length = 522
Score = 96.7 bits (239), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 55/166 (33%), Positives = 84/166 (50%), Gaps = 17/166 (10%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D + K + ++ R S +L + V RI+ T L E E +Q+ +Y G
Sbjct: 353 VFDPATHKLVNADYRVSKSAWLKDEDSDTVEKYNRRISRLTGLDLEYAEQLQMSNYGIGG 412
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
+YEPH+D+ R + + RIAT L YL+ VE+GG TVF G
Sbjct: 413 QYEPHYDYSRREWDI-YNNRRIATWLSYLTTVEQGGGTVF----------------TELG 455
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
++ +KG A+ +++L P+ S D + H +CPV+ G KW + KWIH
Sbjct: 456 LHIRSIKGSAVFWYNLLPNGSGDERTRHAACPVLRGNKWVSNKWIH 501
>gi|73988166|ref|XP_851718.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Canis lupus
familiaris]
Length = 544
Score = 96.7 bits (239), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 63/188 (33%), Positives = 93/188 (49%), Gaps = 26/188 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
+VA E K + E R S +L D ++ +++ RI A T L P E +Q+++Y
Sbjct: 375 VVASGE--KQLPVEYRISKSAWLKDTVDPLLVTLDHRIGALTGLDVQPPYAEYLQVVNYG 432
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + G+R+AT ++YLS VE GG T F +
Sbjct: 433 IGGHYEPHFDHATSPTSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYA----------- 481
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
++V +K AL +++LH D +LH CPV+ G+KW A KWIH + F +
Sbjct: 482 -----NFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 536
Query: 174 P-EKEPED 180
P PED
Sbjct: 537 PCSSRPED 544
>gi|33589818|ref|NP_878907.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Homo sapiens]
gi|114639354|ref|XP_001174896.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan troglodytes]
gi|397487266|ref|XP_003814725.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan paniscus]
gi|74738714|sp|Q7Z4N8.1|P4HA3_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|33188232|gb|AAP97874.1| prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
gi|36962719|gb|AAQ87603.1| collagen prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
gi|37182165|gb|AAQ88885.1| GPGA711 [Homo sapiens]
gi|109658570|gb|AAI17334.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
gi|119595341|gb|EAW74935.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide III, isoform CRA_b
[Homo sapiens]
gi|410219716|gb|JAA07077.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
gi|410248278|gb|JAA12106.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
gi|410336087|gb|JAA36990.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
Length = 544
Score = 96.7 bits (239), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 65/188 (34%), Positives = 93/188 (49%), Gaps = 26/188 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
+VA E K + E R S +L D + ++ RIAA T L P E +Q+++Y
Sbjct: 375 VVASGE--KQLQVEYRISKSAWLKDTVDPKLVTLNHRIAALTGLDVRPPYAEYLQVVNYG 432
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + G+R+AT ++YLS VE GG T F + +S
Sbjct: 433 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLS-------- 484
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
V ++ AL +++LH DS +LH CPV+ G+KW A KWIH + F +
Sbjct: 485 --------VPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 536
Query: 174 P-EKEPED 180
P PED
Sbjct: 537 PCSSSPED 544
>gi|195505199|ref|XP_002099401.1| GE23383 [Drosophila yakuba]
gi|194185502|gb|EDW99113.1| GE23383 [Drosophila yakuba]
Length = 535
Score = 96.7 bits (239), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 59/170 (34%), Positives = 83/170 (48%), Gaps = 19/170 (11%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
+G S A+ RTS G + ++ + + ++ L E E +Q+ +Y G YEPH
Sbjct: 362 NGGSTAAAFRTSQGASFNYSRSAATKLLSHHVGDFSGLNMEYAEDLQVANYGIGGHYEPH 421
Query: 67 FDFFRDKMNQQ---LGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
+D F + Q L G+RIAT + YLS VE GG T FP +
Sbjct: 422 WDSFPENHVYQEGDLHGNRIATGIYYLSDVEAGGGTAFPFLPL----------------L 465
Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
V P KG L +++LHP D + H +CPV++G KW A WI RN DK
Sbjct: 466 VTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQDK 515
>gi|59809017|gb|AAH89446.1| P4HA3 protein [Homo sapiens]
Length = 528
Score = 96.7 bits (239), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 65/188 (34%), Positives = 93/188 (49%), Gaps = 26/188 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
+VA E K + E R S +L D + ++ RIAA T L P E +Q+++Y
Sbjct: 359 VVASGE--KQLQVEYRISKSAWLKDTVDPKLVTLNHRIAALTGLDVRPPYAEYLQVVNYG 416
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + G+R+AT ++YLS VE GG T F + +S
Sbjct: 417 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLS-------- 468
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
V ++ AL +++LH DS +LH CPV+ G+KW A KWIH + F +
Sbjct: 469 --------VPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 520
Query: 174 P-EKEPED 180
P PED
Sbjct: 521 PCSSSPED 528
>gi|351696981|gb|EHA99899.1| Prolyl 4-hydroxylase subunit alpha-3 [Heterocephalus glaber]
Length = 572
Score = 96.7 bits (239), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 63/188 (33%), Positives = 95/188 (50%), Gaps = 26/188 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPEN--GEAMQILHYE 58
+VA E K + E R S +L D ++ +++ RIAA T L ++ E +Q+++Y
Sbjct: 403 VVASGE--KQLQVEYRISKSAWLKDTADPVLVTLDHRIAALTGLDVQHPYAEYLQVVNYG 460
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + G+R+AT ++YLS VE GG T F +
Sbjct: 461 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYA----------- 509
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
++V +K AL +++LH D +LH CPV+ G+KW A KWIH + F +
Sbjct: 510 -----NFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 564
Query: 174 P-EKEPED 180
P PED
Sbjct: 565 PCSSNPED 572
>gi|198449635|ref|XP_001357660.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
gi|198130694|gb|EAL26794.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 96.7 bits (239), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 63/202 (31%), Positives = 98/202 (48%), Gaps = 28/202 (13%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
+ +S S VRTS F++K + E++ +I+ R+A T L + E Q +Y G Y H
Sbjct: 361 TNQSTVSNVRTSQITFIAKTEHEVLQTIDRRVADMTNLNMDYAEDHQFANYGIGGHYGQH 420
Query: 67 FDFFRDK------MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
D+F + ++ G+RIATVL YLS V +GG T FP +
Sbjct: 421 MDWFTETTFDNGLVSSTEMGNRIATVLFYLSDVAQGGGTAFPYLKQH------------- 467
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPED 180
++P K A + +LH D+ + HG+CP+I G KW +WI R F + ++ P
Sbjct: 468 ---LRPKKYAAAFWHNLHAAGRGDARTQHGACPIIAGSKWVLNRWI--REFVQSDRRP-- 520
Query: 181 DDCVDEDLNCVVWAKAGECKKN 202
C+ D + +A+ E KN
Sbjct: 521 --CLLWDDSLATYAQIMELAKN 540
>gi|195113239|ref|XP_002001175.1| GI10638 [Drosophila mojavensis]
gi|193917769|gb|EDW16636.1| GI10638 [Drosophila mojavensis]
Length = 511
Score = 96.7 bits (239), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 81/151 (53%), Gaps = 19/151 (12%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMN 75
R S+G ++ + + + IE RIA L E E +++Y G +Y+ H+DFF
Sbjct: 361 RISAGTWVERKYNNLTWRIERRIADMVDLNLEGSEPFYVINYGIGGQYKAHWDFFG---A 417
Query: 76 QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFF 135
+ +R+ATVL Y++ VE+GG TVFP R G V+ +G+AL ++
Sbjct: 418 DTVEDNRLATVLFYMNDVEQGGATVFP----------------RLGQTVRAKRGNALFWY 461
Query: 136 SLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
++ + + D +LHG CP++ G KW T+WI
Sbjct: 462 NMQHNGTVDDRTLHGGCPILVGSKWIFTQWI 492
>gi|195390833|ref|XP_002054072.1| GJ22994 [Drosophila virilis]
gi|194152158|gb|EDW67592.1| GJ22994 [Drosophila virilis]
Length = 496
Score = 96.3 bits (238), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 84/162 (51%), Gaps = 19/162 (11%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK-M 74
RTS G ++ + + + IE RI L E Q+++Y G Y H DF D
Sbjct: 345 RTSKGTWIERDHNNLTKRIERRITDMVELDLRYSEPFQVMNYGLGGHYAAHEDFLGDTWA 404
Query: 75 NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLF 134
+++ RIATVL YL+ VE+GG TVF + ++Q AV P +G AL +
Sbjct: 405 DKKEEDDRIATVLFYLTDVEQGGATVF--TILNQ--------------AVSPKRGTALFW 448
Query: 135 FSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
++LH + + D+ +LHG CPV+ G KW T WI R F +P
Sbjct: 449 YNLHRNGTGDTRTLHGGCPVLVGSKWIMTLWIRERMQLFTRP 490
>gi|184185444|gb|ACC68850.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Rhinolophus ferrumequinum]
Length = 555
Score = 96.3 bits (238), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 61/199 (30%), Positives = 90/199 (45%), Gaps = 42/199 (21%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + +D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEETEDPVVARLNLRMQHITGLSVKTAELLQVANYGMGG 425
Query: 62 KYEPHFDFFRDKMNQQL------------------------GGHRIATVLMYLSHVEKGG 97
+YEPHFDF R + L G+R+AT L Y+S VE GG
Sbjct: 426 QYEPHFDFSRRPFDNGLKTEGNRLATFLNYNDEHDVFKHLGTGNRVATFLNYMSDVEAGG 485
Query: 98 ETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEG 157
TVFP+ G A+ P KG A+ +++L D + H +CPV+ G
Sbjct: 486 ATVFPD----------------LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 529
Query: 158 EKWSATKWIHVRN--FDKP 174
KW + KW H R F +P
Sbjct: 530 CKWVSNKWFHERGQEFLRP 548
>gi|410972729|ref|XP_003992809.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Felis catus]
Length = 533
Score = 96.3 bits (238), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 63/188 (33%), Positives = 93/188 (49%), Gaps = 26/188 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
+VA E K + E R S +L D ++ +++ RI A T L P E +Q+++Y
Sbjct: 364 VVASGE--KQLPVEYRISKSAWLKDTVDPLLVTLDHRIGALTGLDVQPPYAEYLQVVNYG 421
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + G+R+AT ++YLS VE GG T F +
Sbjct: 422 IGGHYEPHFDHATSPTSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYA----------- 470
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
++V +K AL +++LH D +LH CPV+ G+KW A KWIH + F +
Sbjct: 471 -----NFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 525
Query: 174 P-EKEPED 180
P PED
Sbjct: 526 PCSSSPED 533
>gi|426369750|ref|XP_004051847.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Gorilla
gorilla gorilla]
Length = 517
Score = 96.3 bits (238), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 65/188 (34%), Positives = 93/188 (49%), Gaps = 26/188 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
+VA E K + E R S +L D + ++ RIAA T L P E +Q+++Y
Sbjct: 348 VVASGE--KQLQVEYRISKSAWLKDTVDPKLVALNHRIAALTGLDVRPPYAEYLQVVNYG 405
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + G+R+AT ++YLS VE GG T F + +S
Sbjct: 406 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLS-------- 457
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
V ++ AL +++LH DS +LH CPV+ G+KW A KWIH + F +
Sbjct: 458 --------VPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 509
Query: 174 P-EKEPED 180
P PED
Sbjct: 510 PCSSSPED 517
>gi|291387302|ref|XP_002710242.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 2 [Oryctolagus
cuniculus]
gi|217273039|gb|ACK28132.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Oryctolagus cuniculus]
Length = 555
Score = 96.3 bits (238), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 63/199 (31%), Positives = 92/199 (46%), Gaps = 42/199 (21%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA I R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARINRRMQHITGLTVKTAELLQVANYGMGG 425
Query: 62 KYEPHFDFFRDKMN-----------------------QQLG-GHRIATVLMYLSHVEKGG 97
+YEPHFDF R + ++LG G+R+AT L Y+S VE GG
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYNNERDAFKRLGTGNRVATFLNYMSDVEAGG 485
Query: 98 ETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEG 157
TVFP+ G A+ P KG A+ +++L D + H +CPV+ G
Sbjct: 486 ATVFPDL----------------GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 529
Query: 158 EKWSATKWIHVRN--FDKP 174
KW + KW H R F +P
Sbjct: 530 CKWVSNKWFHERGQEFLRP 548
>gi|170064953|ref|XP_001867740.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
gi|167882143|gb|EDS45526.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
Length = 509
Score = 96.3 bits (238), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 57/165 (34%), Positives = 86/165 (52%), Gaps = 19/165 (11%)
Query: 6 ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTF-LPPENGEAMQILHYEHGQKYE 64
+ G+ S RTS +L D +V +++ R+ T L ++ E +Q+ +Y G Y
Sbjct: 341 DDGEPQVSNARTSQNAWLDAGDDRVVTTLDRRVGDMTGGLRQQSYEMLQVNNYGVGGHYV 400
Query: 65 PHFDFFRDKMNQQ--LGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGY 122
H D+ + + G+RIATV+ YLS VE GG TVFP + G
Sbjct: 401 AHHDWAMEAVPYAGLRVGNRIATVMFYLSDVEIGGATVFP----------------QLGL 444
Query: 123 AVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
AV P KG A+L+++L+ + D +LH +CPV+ G KW A +WIH
Sbjct: 445 AVFPRKGSAILWYNLYRNGKGDRRTLHAACPVLSGSKWVANQWIH 489
>gi|195110931|ref|XP_002000033.1| GI24862 [Drosophila mojavensis]
gi|193916627|gb|EDW15494.1| GI24862 [Drosophila mojavensis]
Length = 549
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 62/207 (29%), Positives = 96/207 (46%), Gaps = 35/207 (16%)
Query: 10 SIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDF 69
S+ S RTS FL K + +++ +I+ R+A T L E E Q+ +Y G Y H D+
Sbjct: 370 SVVSNARTSQFTFLPKTRHKVLRTIDQRVADMTDLHLEYAEDHQLANYGIGGHYAQHMDW 429
Query: 70 F------RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
F +++ G+RI TVL YLS VE+GG T FP +
Sbjct: 430 FYPITFETKQVSNPEMGNRIGTVLFYLSDVEQGGATAFPALK----------------QL 473
Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDDC 183
++P K A +++LH D+ ++HG+CP+I G KW +WI R F + ++ P
Sbjct: 474 LRPKKHAAAFWYNLHASGVGDARTMHGACPIIVGSKWVLNRWI--REFVQSDRRP----- 526
Query: 184 VDEDLNCVVWAKAGECKKNPLYMVGSK 210
C W + L + GS+
Sbjct: 527 ------CYQWDDSKLTLSQVLELTGSQ 547
>gi|194751829|ref|XP_001958226.1| GF23628 [Drosophila ananassae]
gi|190625508|gb|EDV41032.1| GF23628 [Drosophila ananassae]
Length = 484
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 46/137 (33%), Positives = 77/137 (56%), Gaps = 19/137 (13%)
Query: 34 IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYL 90
+ RI T + +QI ++ G +++PH+D+F ++ +N + G RIA+++ Y+
Sbjct: 348 LNLRIRDITGFNVDEIRGLQIANFGVGGQFKPHYDYFTERILRLNNTILGDRIASIIFYV 407
Query: 91 SHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHG 150
V GG+TVFP+ ++ AVKP KG +L +F+ DA+ D SLH
Sbjct: 408 GDVVHGGQTVFPDIQI----------------AVKPQKGSSLFWFNTFDDATPDPRSLHS 451
Query: 151 SCPVIEGEKWSATKWIH 167
CPV+ G++W+ TKW+H
Sbjct: 452 VCPVLIGDRWTITKWLH 468
>gi|198418585|ref|XP_002122034.1| PREDICTED: similar to Prolyl 4-hydroxylase subunit alpha-1 (4-PH
alpha-1)
(Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1) [Ciona intestinalis]
Length = 525
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 53/168 (31%), Positives = 86/168 (51%), Gaps = 23/168 (13%)
Query: 5 NESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYE 64
N +G I ++RTS + K V + RI+ T L E E +Q+ +Y +Y+
Sbjct: 355 NNTG--IVEDIRTSKVAWFKKNDFTAVKKLYTRISEMTGLSEETFEDLQVANYGLAGEYQ 412
Query: 65 PHFDFFRD-----KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
PHFD+ D + + G+RIAT+L+YL+ V++GG T F ++
Sbjct: 413 PHFDYTEDPSIYKREDGAEVGNRIATMLLYLNDVKEGGRTAFIEPKI------------- 459
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
KP+KG A+ +++L+P D + H SCPV+ G KW++ W+H
Sbjct: 460 ---VAKPIKGSAVFWYNLYPSGLGDPRTRHASCPVVIGNKWASNVWVH 504
>gi|47213360|emb|CAF90979.1| unnamed protein product [Tetraodon nigroviridis]
Length = 511
Score = 95.9 bits (237), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 60/179 (33%), Positives = 85/179 (47%), Gaps = 27/179 (15%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D +G+ + R S +L + IV I RI T L E +Q+ +Y G
Sbjct: 337 VHDPRTGQLTTAPYRVSKSAWLGAFEHPIVDQINQRIEDITGLDVSTAEDLQVANYGVGG 396
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMY-------LSHVEKGGETVFPNSEVSQSR 110
+YEPHFDF + D + G+RIAT L+Y +S V+ GG TVF +
Sbjct: 397 QYEPHFDFGQKDEPDAFEELGTGNRIATWLLYVSAAVLRMSDVQAGGATVFTDI------ 450
Query: 111 DGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
G +V P KG A+ +++L P D + H +CPV+ G KW + KWIH R
Sbjct: 451 ----------GASVLPQKGSAVFWYNLRPSGDGDYRTRHAACPVLLGNKWVSNKWIHER 499
>gi|47204411|emb|CAF95476.1| unnamed protein product [Tetraodon nigroviridis]
Length = 284
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/169 (33%), Positives = 87/169 (51%), Gaps = 31/169 (18%)
Query: 9 KSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPEN--GEAMQILHYEHGQKYEPH 66
K + +E R S +L + V+ ++ RI+ T L ++ GE +Q+++Y G YEPH
Sbjct: 121 KQVTAEYRISKSAWLKGSAQSAVSRLDQRISMLTGLNVQHPHGEYLQVVNYGIGGHYEPH 180
Query: 67 FD--------FFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
FD F+ K G+R+ATV++YLS VE GG T F +
Sbjct: 181 FDHATSPSSPVFKLKT-----GNRVATVMIYLSSVEAGGSTAFIYAN------------- 222
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
++V MK A+ +++LH + D +LH CPV+ G+KW A KWIH
Sbjct: 223 ---FSVPVMKNAAIFWWNLHRNGRGDPDTLHAGCPVLIGDKWVANKWIH 268
>gi|195055767|ref|XP_001994784.1| GH14132 [Drosophila grimshawi]
gi|193892547|gb|EDV91413.1| GH14132 [Drosophila grimshawi]
Length = 537
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 63/181 (34%), Positives = 91/181 (50%), Gaps = 24/181 (13%)
Query: 10 SIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDF 69
S S+ RTS +F++ + +++ +I+ R+A T L + E Q+ Y G Y HFD+
Sbjct: 371 STVSKKRTSQHIFIAATRHKVLRTIDQRVADMTNLNMQYAEDHQLADYGIGGHYSQHFDW 430
Query: 70 F--RDKMNQQLG--GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVK 125
F D N + G+RIATVL YLS V +GG T FP + +K
Sbjct: 431 FGNSDLANSKCDEMGNRIATVLFYLSDVAQGGGTAFPILK----------------QLLK 474
Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPED--DDC 183
P K A +++LH D +LHG CP+I G KW +WI R +D+ + P D DD
Sbjct: 475 PKKYAAAFWYNLHASGKGDWRNLHGGCPIIVGSKWVLNRWI--REYDQSDLRPCDLWDDS 532
Query: 184 V 184
V
Sbjct: 533 V 533
>gi|196011900|ref|XP_002115813.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
gi|190581589|gb|EDV21665.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
Length = 581
Score = 95.5 bits (236), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 58/169 (34%), Positives = 81/169 (47%), Gaps = 24/169 (14%)
Query: 4 DNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKY 63
D E G+ + R S +L K V I I L E E +QI +Y G Y
Sbjct: 387 DKEYGEE--TTYRISKTAWLDKEDHPAVKRITTLIGDIIGLTSETAEPLQIANYGIGGHY 444
Query: 64 EPHFDFFRDKMNQQLG------GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
EPH DF + + L G+RIATVL+YLS+VE GG TVFP
Sbjct: 445 EPHLDFIESEDKEALSEYTSRIGNRIATVLIYLSNVEAGGATVFP--------------- 489
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
+ G V+P +G A ++++H + + S+H +CPV+ G KW+A W
Sbjct: 490 -KAGVRVEPRQGSAAFWYNMHRNGEGNKLSVHAACPVLIGSKWAANLWF 537
>gi|195452776|ref|XP_002073495.1| GK13117 [Drosophila willistoni]
gi|194169580|gb|EDW84481.1| GK13117 [Drosophila willistoni]
Length = 487
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 56/167 (33%), Positives = 82/167 (49%), Gaps = 21/167 (12%)
Query: 13 SEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
S+ RT+ + +++ + RI T E +Q+++Y G Y HFD+F
Sbjct: 329 SKTRTAKLAWFLDTFNQLTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDYFNT 388
Query: 73 KMN---QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKG 129
Q+ G RIATVL YL+ VE+GG TVFP E+ + AV P +G
Sbjct: 389 TKGPHITQINGDRIATVLFYLNDVEQGGATVFP--EIKK--------------AVFPKRG 432
Query: 130 DALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
A+++++L D + +LH CPVI G KW KWI R F +P
Sbjct: 433 SAIMWYNLKDDGEGNRDTLHAGCPVIVGSKWVCNKWIREREQIFRRP 479
>gi|301759032|ref|XP_002915381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Ailuropoda
melanoleuca]
Length = 539
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 58/172 (33%), Positives = 86/172 (50%), Gaps = 23/172 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
+VA E K + E R S +L D ++ +++ RI A T L P E +Q+++Y
Sbjct: 370 VVASGE--KQLPVEYRISKSAWLKDTVDPLLVTLDHRIGALTGLDVQPPYAEYLQVVNYG 427
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + G+R+AT ++YLS VE GG T F +
Sbjct: 428 IGGHYEPHFDHATSPTSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYA----------- 476
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
++V +K AL +++LH D +LH CPV+ G+KW A KWIH
Sbjct: 477 -----NFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIH 523
>gi|289526401|gb|ADD01323.1| FI13021p [Drosophila melanogaster]
gi|373432715|gb|AEY70761.1| FI17809p1 [Drosophila melanogaster]
Length = 193
Score = 95.5 bits (236), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 56/161 (34%), Positives = 78/161 (48%), Gaps = 27/161 (16%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF----- 70
RT+ G +L K +E+ I RI T + E Q+++Y G Y H D+F
Sbjct: 28 RTAKGFWLKKESNELTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYFDFASS 87
Query: 71 -----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVK 125
R + + LG RIATVL YL+ VE+GG TVF + GY V
Sbjct: 88 NHTDTRSRYSIDLG-DRIATVLFYLTDVEQGGATVFGDV----------------GYYVS 130
Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
P G A+ +++L D + D + H +CPVI G KW T+WI
Sbjct: 131 PQAGTAIFWYNLDTDGNGDPRTRHAACPVIVGSKWVMTEWI 171
>gi|386368303|gb|AFJ06910.1| procollagen-proline dioxygenase [Mytilus galloprovincialis]
Length = 535
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 60/187 (32%), Positives = 95/187 (50%), Gaps = 23/187 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D +GK I ++ R + +L +V ++ RI A T L ++ +A+Q+ +Y G
Sbjct: 365 VHDPTTGKLIHAKYRITKTAWLDDRDHLVVDRVQNRIKAVTGLDLDSADALQVANYGIGG 424
Query: 62 KYEPHFDF-FRDKMN----QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSE 116
Y+PH+DF RD + ++ G+RIAT L+Y++ V+ GG TVFP +V
Sbjct: 425 HYDPHYDFSTRDDDDTSETEKRDGNRIATFLLYMTDVDAGGATVFPIIDVR--------- 475
Query: 117 CARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
V P KG A+ +++L + H +CPV+ G KW + KWI R F +P
Sbjct: 476 -------VLPKKGTAVFWYNLRRSGKGIMETRHAACPVLVGTKWVSNKWIRTRGQEFRRP 528
Query: 175 EKEPEDD 181
ED+
Sbjct: 529 CGLTEDE 535
>gi|47227817|emb|CAG08980.1| unnamed protein product [Tetraodon nigroviridis]
Length = 285
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 57/169 (33%), Positives = 87/169 (51%), Gaps = 31/169 (18%)
Query: 9 KSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPEN--GEAMQILHYEHGQKYEPH 66
K + +E R S +L + V+ ++ RI+ T L ++ GE +Q+++Y G YEPH
Sbjct: 122 KQVTAEYRISKSAWLKGSAQSAVSRLDQRISMLTGLNVQHPHGEYLQVVNYGIGGHYEPH 181
Query: 67 FD--------FFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
FD F+ K G+R+ATV++YLS VE GG T F +
Sbjct: 182 FDHATSPSSPVFKLKT-----GNRVATVMIYLSSVEAGGSTAFIYAN------------- 223
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
++V MK A+ +++LH + D +LH CPV+ G+KW A KWIH
Sbjct: 224 ---FSVPVMKNAAIFWWNLHRNGRGDPDTLHAGCPVLIGDKWVANKWIH 269
>gi|432109537|gb|ELK33711.1| Prolyl 4-hydroxylase subunit alpha-2 [Myotis davidii]
Length = 555
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/199 (31%), Positives = 91/199 (45%), Gaps = 42/199 (21%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGMGG 425
Query: 62 KYEPHFDFFRDKMN-----------------------QQLG-GHRIATVLMYLSHVEKGG 97
+YEPHFDF R + + LG G+R+AT L Y+S VE GG
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDEQDVFKHLGTGNRVATFLNYMSDVEAGG 485
Query: 98 ETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEG 157
TVFP+ G A+ P KG A+ +++L D + H +CPV+ G
Sbjct: 486 ATVFPD----------------LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 529
Query: 158 EKWSATKWIHVRN--FDKP 174
KW + KW H R F +P
Sbjct: 530 CKWVSNKWFHERGQEFLRP 548
>gi|308476969|ref|XP_003100699.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
gi|308264511|gb|EFP08464.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
Length = 573
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 90/198 (45%), Gaps = 42/198 (21%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V ++++G+ + R S +L ++ + RI +T L E +Q+ +Y G
Sbjct: 371 VQNSKTGELEHATYRISKSAWLKGDLHPVIERVNRRIEDFTGLYQGTSEELQVANYGLGG 430
Query: 62 KYEPHFDFFRDKMNQQLGGH-----------------------RIATVLMYLSHVEKGGE 98
Y+PHFDF R N LGGH RIATVL Y+S E+GG
Sbjct: 431 HYDPHFDFAR-IANYGLGGHYEPHYDMSLKEEKNAFKTLNTGNRIATVLFYMSQPERGGA 489
Query: 99 TVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGE 158
TVF + G AV P K DAL +++L D D + H +CPV+ G
Sbjct: 490 TVFNHL----------------GTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLLGV 533
Query: 159 KWSATKWIHVRN--FDKP 174
KW + KWIH R F +P
Sbjct: 534 KWVSNKWIHERGQEFTRP 551
>gi|307103831|gb|EFN52088.1| hypothetical protein CHLNCDRAFT_139357 [Chlorella variabilis]
Length = 1038
Score = 95.1 bits (235), Expect = 2e-17, Method: Composition-based stats.
Identities = 59/143 (41%), Positives = 85/143 (59%), Gaps = 11/143 (7%)
Query: 15 VRTSSGMFLSKAQDEIVASIEARIAAWTFLPPEN-GEAMQILHYEHGQKYEPHFDFFRDK 73
+RTS G FL++AQDE+V +IE R+A WT LP EN G +Q + +G ++ D D+
Sbjct: 10 IRTSWGTFLTRAQDEVVYAIEHRVANWTHLPVENAGGVLQGKRFHYGAHWD---DLDLDE 66
Query: 74 MNQQLGGH--RIATVLMYLSHVEKGGETVFPNS----EVSQSRDGNWSECARRGYAVKPM 127
LGG R+ATVL+YLS E+GGET FP+S + Q+ +S CA+ G A
Sbjct: 67 NPDGLGGGSVRVATVLIYLSDAEEGGETAFPHSRWLDKEKQTAGKAFSNCAKDGVAALAR 126
Query: 128 KGDALLFFSLHPDA-STDSTSLH 149
KG+A++F+ P + D S+H
Sbjct: 127 KGNAIMFWDAKPGSMRQDKWSMH 149
>gi|344296798|ref|XP_003420090.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Loxodonta
africana]
Length = 544
Score = 95.1 bits (235), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/188 (32%), Positives = 95/188 (50%), Gaps = 26/188 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
+VA E K + + R S +L + D ++ +++ RIAA T L P E +Q+++Y
Sbjct: 375 VVASGE--KQLQVDYRISKSAWLKDSVDPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYG 432
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + G+R+AT ++YLS VE GG T F +
Sbjct: 433 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSAVEAGGATAFIYA----------- 481
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
+++ +K AL +++LH D +LH CPV+ G+KW A KWIH + F +
Sbjct: 482 -----NFSMPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 536
Query: 174 P-EKEPED 180
P PED
Sbjct: 537 PCSSSPED 544
>gi|390459659|ref|XP_002806656.2| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-2 [Callithrix jacchus]
Length = 579
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/199 (31%), Positives = 91/199 (45%), Gaps = 42/199 (21%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 390 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 449
Query: 62 KYEPHFDFFRDKMN-----------------------QQLG-GHRIATVLMYLSHVEKGG 97
+YEPHFDF R + + LG G+R+AT L Y+S VE GG
Sbjct: 450 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDERDAFKHLGTGNRVATFLNYMSDVEAGG 509
Query: 98 ETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEG 157
TVFP+ G A+ P KG A+ +++L D + H +CPV+ G
Sbjct: 510 ATVFPD----------------LGAAIWPKKGTAVFWYNLLRSGXGDYRTRHAACPVLVG 553
Query: 158 EKWSATKWIHVRN--FDKP 174
KW + KW H R F +P
Sbjct: 554 CKWVSNKWFHERGQEFLRP 572
>gi|229368743|gb|ACQ63024.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Dasypus novemcinctus]
Length = 556
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/199 (31%), Positives = 91/199 (45%), Gaps = 42/199 (21%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 367 VRDPKTGVLTVASYRVSKSSWLEENDDPVVAQVNRRMEHITGLTVKTAELLQVANYGMGG 426
Query: 62 KYEPHFDFFRDKMN-----------------------QQLG-GHRIATVLMYLSHVEKGG 97
+YEPHFDF R + + LG G+R+AT L Y+S VE GG
Sbjct: 427 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYNHEQDVFKHLGTGNRVATFLNYMSDVEAGG 486
Query: 98 ETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEG 157
TVFP+ G A+ P KG A+ +++L D + H +CPV+ G
Sbjct: 487 ATVFPDL----------------GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 530
Query: 158 EKWSATKWIHVRN--FDKP 174
KW + KW H R F +P
Sbjct: 531 CKWVSNKWFHERGQEFLRP 549
>gi|116496629|gb|AAI26171.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
Length = 544
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 64/188 (34%), Positives = 93/188 (49%), Gaps = 26/188 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
+VA E K + E R S +L + + ++ RIAA T L P E +Q+++Y
Sbjct: 375 VVASGE--KQLQVEYRISKSAWLKDTVNPKLVTLNHRIAALTGLDVRPPYAEYLQVVNYG 432
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + G+R+AT ++YLS VE GG T F + +S
Sbjct: 433 IGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLS-------- 484
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDK 173
V ++ AL +++LH DS +LH CPV+ G+KW A KWIH + F +
Sbjct: 485 --------VPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKWVANKWIHEYGQEFRR 536
Query: 174 P-EKEPED 180
P PED
Sbjct: 537 PCSSSPED 544
>gi|312092237|ref|XP_003147267.1| hypothetical protein LOAG_11701 [Loa loa]
Length = 553
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 57/170 (33%), Positives = 82/170 (48%), Gaps = 18/170 (10%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + E+G + R S +L + E+V I R+ T L E +Q+ +Y G
Sbjct: 357 VHNVETGNLETASYRISKSAWLRSTEHEVVNRINRRLDLATNLEIATAEELQVQNYGIGG 416
Query: 62 KYEPHFDFFRDK--MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
YEPH D RD+ + G+RIAT+L+Y++ E GG TVF N + S
Sbjct: 417 HYEPHLDCSRDEDAFERTGTGNRIATILIYMTEPEIGGRTVFINLKAS------------ 464
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
V K AL +++L + D S H +CPV+ G KW+A KW H R
Sbjct: 465 ----VPCTKNAALFWYNLMRSGAVDMRSYHAACPVLTGTKWTANKWFHER 510
>gi|449284064|gb|EMC90646.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Columba livia]
Length = 174
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 59/172 (34%), Positives = 88/172 (51%), Gaps = 23/172 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
+VA E K +E R S +L +V ++E R+AA T L P E +Q+++Y
Sbjct: 5 VVASGE--KQQKAEYRISKSAWLKDTAHPVVQTLEKRMAAVTGLDLRPPYAEYLQVVNYG 62
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + + G+RIAT+++YLS V GG T F ++ +S
Sbjct: 63 LGGHYEPHFDHATSRKSPLYRMKSGNRIATLMIYLSAVGAGGSTAFVHANLS-------- 114
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
V +K AL +++L + D +LH CPV+ G+KW A KWIH
Sbjct: 115 --------VPVVKNAALFWWNLRRNGDGDGDTLHAGCPVLAGDKWVANKWIH 158
>gi|194905410|ref|XP_001981191.1| GG11931 [Drosophila erecta]
gi|190655829|gb|EDV53061.1| GG11931 [Drosophila erecta]
Length = 537
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 57/170 (33%), Positives = 82/170 (48%), Gaps = 19/170 (11%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G+ S R S +L+ +A + + T L E +Q+ +Y G YEPH+
Sbjct: 367 GQHKKSAFRVSKNAWLAYEAHPTMAGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHW 426
Query: 68 DFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
DFFRD + G+RIAT + YLS VE+GG T FP + +AVKP
Sbjct: 427 DFFRDPSHYPAAEGNRIATAIFYLSEVEQGGATAFPFLD----------------FAVKP 470
Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
G+ L +++LH D + H CPV++G KW WIH + F +P
Sbjct: 471 QLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIHEVTQTFARP 520
>gi|326435474|gb|EGD81044.1| hypothetical protein PTSG_10986 [Salpingoeca sp. ATCC 50818]
Length = 264
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 53/142 (37%), Positives = 80/142 (56%), Gaps = 18/142 (12%)
Query: 34 IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHV 93
+E IA LP EN E Q+L Y+ Q Y+ H D+ ++ QQ G R+AT +YL+ V
Sbjct: 133 MEEEIARIVRLPVENQEHFQVLQYQKNQYYKVHSDYIEEQ-RQQPCGIRVATFFLYLNDV 191
Query: 94 EKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDAS-TDSTSLHGSC 152
E+GG T FPN ++ V+P KG+A+L++S +P+ + DS + H +
Sbjct: 192 EEGGGTRFPNLNLT----------------VQPAKGNAVLWYSAYPNTTRMDSRTDHEAM 235
Query: 153 PVIEGEKWSATKWIHVRNFDKP 174
PV +G K+ A KWIH+ +F P
Sbjct: 236 PVAKGMKYGANKWIHIHDFVTP 257
>gi|195575097|ref|XP_002105516.1| GD17035 [Drosophila simulans]
gi|194201443|gb|EDX15019.1| GD17035 [Drosophila simulans]
Length = 535
Score = 94.7 bits (234), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 57/170 (33%), Positives = 83/170 (48%), Gaps = 19/170 (11%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
+G S A+ RTS G + +++ + + ++ L + E +Q+ +Y G YEPH
Sbjct: 362 NGGSTAAAFRTSQGASFNYSRNAATKLLSHHVGDFSGLNMDYAEDLQVANYGIGGHYEPH 421
Query: 67 FDFFRDKMNQQ---LGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
+D F + Q L G+RIAT + YLS VE GG T FP +
Sbjct: 422 WDSFPENHIYQEGDLHGNRIATGIYYLSDVEAGGGTAFPFLPL----------------L 465
Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
V P KG L +++LHP D + H +CPV++G KW A WI RN D
Sbjct: 466 VTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQDN 515
>gi|281183175|ref|NP_001162504.1| prolyl 4-hydroxylase subunit alpha-2 [Papio anubis]
gi|159461520|gb|ABW96795.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase, alpha
polypeptide II, isoform 1 (predicted) [Papio anubis]
Length = 578
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/199 (31%), Positives = 91/199 (45%), Gaps = 42/199 (21%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 389 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 448
Query: 62 KYEPHFDFFRDKMN-----------------------QQLG-GHRIATVLMYLSHVEKGG 97
+YEPHFDF R + + LG G+R+AT L Y+S VE GG
Sbjct: 449 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDERHTFKHLGTGNRVATFLNYMSDVEAGG 508
Query: 98 ETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEG 157
TVFP+ G A+ P KG A+ +++L D + H +CPV+ G
Sbjct: 509 ATVFPDL----------------GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 552
Query: 158 EKWSATKWIHVRN--FDKP 174
KW + KW H R F +P
Sbjct: 553 CKWVSNKWFHERGQEFLRP 571
>gi|355709028|gb|AES03457.1| prolyl 4-hydroxylase, alpha polypeptide III [Mustela putorius furo]
Length = 477
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 58/172 (33%), Positives = 86/172 (50%), Gaps = 23/172 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYE 58
+VA E K + E R S +L D ++ +++ RI A T L P E +Q+++Y
Sbjct: 309 VVASGE--KQLPVEYRISKSAWLKDTVDPLLVNLDHRIGALTGLDVQPPYAEYLQVVNYG 366
Query: 59 HGQKYEPHFDFFRDK---MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
G YEPHFD + + G+R+AT ++YLS VE GG T F +
Sbjct: 367 IGGHYEPHFDHATSPTSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYA----------- 415
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
++V +K AL +++LH D +LH CPV+ G+KW A KWIH
Sbjct: 416 -----NFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKWVANKWIH 462
>gi|427410040|ref|ZP_18900242.1| hypothetical protein HMPREF9718_02716 [Sphingobium yanoikuyae ATCC
51230]
gi|425712173|gb|EKU75188.1| hypothetical protein HMPREF9718_02716 [Sphingobium yanoikuyae ATCC
51230]
Length = 225
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 86/162 (53%), Gaps = 22/162 (13%)
Query: 13 SEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
++ RTS L + +D +V +I ARI A T L P++GE +Q Y GQ+Y+ H D+F
Sbjct: 77 ADYRTSHSCNLDR-EDPLVHAISARICAMTGLEPDHGETLQGQRYTQGQEYKVHCDYFPV 135
Query: 73 KMN-----QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPM 127
+ ++ GG R T ++YLS VE GGET FP E + V P+
Sbjct: 136 NASYWPDMRKTGGQRNWTAMIYLSPVEGGGETHFPRCE----------------FMVPPI 179
Query: 128 KGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
+G L++ +L PD + + SLH + PV +G K+ TKW R
Sbjct: 180 EGMILIWNNLKPDGAPNPYSLHAARPVAQGTKYVVTKWFRER 221
>gi|198417610|ref|XP_002125349.1| PREDICTED: similar to Prolyl 4-hydroxylase subunit alpha-1
precursor (4-PH alpha-1)
(Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1) [Ciona intestinalis]
Length = 527
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 54/142 (38%), Positives = 72/142 (50%), Gaps = 22/142 (15%)
Query: 31 VASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDF-----FRDKMNQQLGGHRIAT 85
VA I RI+ T L E +Q+ +Y G +Y PHFD RD + Q G RIAT
Sbjct: 378 VAKITERISDITGLTLNTSEEIQVANYGVGGEYPPHFDIPTTDEERDDLKSQ-DGERIAT 436
Query: 86 VLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDS 145
L+YLS VE GG T F N+ G + KP+KG A+ ++++ P D
Sbjct: 437 FLIYLSDVEVGGRTAFVNA----------------GVSAKPIKGSAVFWYNVFPSGEPDL 480
Query: 146 TSLHGSCPVIEGEKWSATKWIH 167
+ HG+CPV G KW+ KWI
Sbjct: 481 RTYHGACPVAFGNKWAGNKWIR 502
>gi|167045848|gb|ABZ10515.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Callithrix jacchus]
Length = 555
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/199 (31%), Positives = 91/199 (45%), Gaps = 42/199 (21%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425
Query: 62 KYEPHFDFFRDKMN-----------------------QQLG-GHRIATVLMYLSHVEKGG 97
+YEPHFDF R + + LG G+R+AT L Y+S VE GG
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDERDAFKHLGTGNRVATFLNYMSDVEAGG 485
Query: 98 ETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEG 157
TVFP+ G A+ P KG A+ +++L D + H +CPV+ G
Sbjct: 486 ATVFPDL----------------GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 529
Query: 158 EKWSATKWIHVRN--FDKP 174
KW + KW H R F +P
Sbjct: 530 CKWVSNKWFHERGQEFLRP 548
>gi|381200505|ref|ZP_09907642.1| procollagen-proline dioxygenase [Sphingobium yanoikuyae XLDN2-5]
Length = 221
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 58/162 (35%), Positives = 86/162 (53%), Gaps = 22/162 (13%)
Query: 13 SEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
++ RTS L + +D +V +I ARI A T L P++GE +Q Y GQ+Y+ H D+F
Sbjct: 73 ADYRTSHSCNLDR-EDPLVHAISARICAMTGLEPDHGETLQGQRYTQGQEYKVHCDYFPV 131
Query: 73 KMN-----QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPM 127
+ ++ GG R T ++YLS VE GGET FP E + V P+
Sbjct: 132 NASYWPEMRKTGGQRNWTAMIYLSPVEGGGETHFPRCE----------------FMVPPI 175
Query: 128 KGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
+G L++ +L PD + + SLH + PV +G K+ TKW R
Sbjct: 176 EGMILIWNNLKPDGAPNPYSLHAARPVAQGTKYVVTKWFRER 217
>gi|170649696|gb|ACB21278.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Callicebus moloch]
Length = 555
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 62/199 (31%), Positives = 91/199 (45%), Gaps = 42/199 (21%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGLTVKTAELLQVANYGVGG 425
Query: 62 KYEPHFDFFRDKMN-----------------------QQLG-GHRIATVLMYLSHVEKGG 97
+YEPHFDF R + + LG G+R+AT L Y+S VE GG
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDERDAFKHLGTGNRVATFLNYMSDVEAGG 485
Query: 98 ETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEG 157
TVFP+ G A+ P KG A+ +++L D + H +CPV+ G
Sbjct: 486 ATVFPDL----------------GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 529
Query: 158 EKWSATKWIHVRN--FDKP 174
KW + KW H R F +P
Sbjct: 530 CKWVSNKWFHERGQEFLRP 548
>gi|195341542|ref|XP_002037365.1| GM12152 [Drosophila sechellia]
gi|194131481|gb|EDW53524.1| GM12152 [Drosophila sechellia]
Length = 535
Score = 94.4 bits (233), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 57/170 (33%), Positives = 83/170 (48%), Gaps = 19/170 (11%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
+G S A+ RTS G + +++ + + ++ L + E +Q+ +Y G YEPH
Sbjct: 362 NGGSTAAAFRTSQGASFNYSKNAATKLLSHHVGDFSDLNMDYAEDLQVANYGIGGHYEPH 421
Query: 67 FDFFRDKMNQQ---LGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
+D F + Q L G+RIAT + YLS VE GG T FP +
Sbjct: 422 WDSFPENHIYQEGDLHGNRIATGIYYLSDVEAGGGTAFPFLPL----------------L 465
Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
V P KG L +++LHP D + H +CPV++G KW A WI RN D
Sbjct: 466 VTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQDN 515
>gi|195391760|ref|XP_002054528.1| GJ22757 [Drosophila virilis]
gi|194152614|gb|EDW68048.1| GJ22757 [Drosophila virilis]
Length = 534
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 56/170 (32%), Positives = 83/170 (48%), Gaps = 19/170 (11%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G+ S R S +L + + + ++ T L E +Q+ +Y G YEPH+
Sbjct: 364 GQRRKSAFRVSKNAWLPYSTHPTMGRMLRDVSDATGLDMTFCEQLQVANYGVGGHYEPHW 423
Query: 68 DFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
DFFRD + G+RIAT + YLS VE+GG T FP +AV+P
Sbjct: 424 DFFRDSRHYPAAEGNRIATAIFYLSDVEQGGATAFPFL----------------NFAVRP 467
Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
G+ L +++LH + D + H CPV++G KW A WIH + F +P
Sbjct: 468 QLGNILFWYNLHRSSDMDFRTKHAGCPVLKGSKWIANIWIHEATQTFARP 517
>gi|443730626|gb|ELU16050.1| hypothetical protein CAPTEDRAFT_114796, partial [Capitella teleta]
Length = 150
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 53/141 (37%), Positives = 75/141 (53%), Gaps = 24/141 (17%)
Query: 34 IEARIAAWTFLPPEN-GEAMQILHYEHGQKYEPHFDFFRDK------MNQQLGGHRIATV 86
+ R+++ T L E E Q+ Y G YEPHFDF + K +N+Q+G RIAT
Sbjct: 14 LSRRVSSATKLDAEKYAELFQVSTYGIGGHYEPHFDFSKVKYFTNPVLNEQMGD-RIATF 72
Query: 87 LMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDST 146
++YL+ VE GG TVFP R ++P+K A+ + +L D D
Sbjct: 73 MIYLNDVEAGGRTVFP----------------RLNLVIEPIKNSAVFWHNLLDDGQQDDR 116
Query: 147 SLHGSCPVIEGEKWSATKWIH 167
++HG+CPV+ G KW A KWIH
Sbjct: 117 TIHGACPVVLGRKWVANKWIH 137
>gi|281348666|gb|EFB24250.1| hypothetical protein PANDA_000722 [Ailuropoda melanoleuca]
Length = 505
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 57/169 (33%), Positives = 84/169 (49%), Gaps = 22/169 (13%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 355 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNLRMQHITGLTVKTAELLQVANYGMGG 414
Query: 62 KYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSEC 117
+YEPHFDF R D + G+R+AT L Y+S VE GG TVFP+
Sbjct: 415 QYEPHFDFSRKNEQDAFKRLGTGNRVATFLNYMSDVEAGGATVFPD-------------- 460
Query: 118 ARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
G A+ P KG A+ +++L D + H +CPV+ G KW KW+
Sbjct: 461 --LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWG--KWL 505
>gi|197215651|gb|ACH53042.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Otolemur garnettii]
Length = 555
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 62/199 (31%), Positives = 92/199 (46%), Gaps = 42/199 (21%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L + D +VA + R+ T L + E +Q+ +Y G
Sbjct: 366 VRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNHRMQHITGLSVKTAELLQVANYGVGG 425
Query: 62 KYEPHFDFFRDKMN-----------------------QQLG-GHRIATVLMYLSHVEKGG 97
+YEPHFDF R + ++LG G+R+AT L Y+S VE GG
Sbjct: 426 QYEPHFDFSRRPFDSGLKTEGNRVATFLNYNHERDAFKRLGTGNRVATFLNYMSDVEAGG 485
Query: 98 ETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEG 157
TVFP+ G A+ P KG A+ +++L D + H +CPV+ G
Sbjct: 486 ATVFPDL----------------GAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVG 529
Query: 158 EKWSATKWIHVRN--FDKP 174
KW + KW H R F +P
Sbjct: 530 CKWVSNKWFHERGQEFLRP 548
>gi|386771382|ref|NP_649044.3| CG18233 [Drosophila melanogaster]
gi|383291998|gb|AAF49254.3| CG18233 [Drosophila melanogaster]
Length = 515
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 49/139 (35%), Positives = 77/139 (55%), Gaps = 22/139 (15%)
Query: 34 IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM-----NQQLGGHRIATVLM 88
I RI+ T E A+Q+ ++ G ++PH+DF+ D++ N LG RI +++
Sbjct: 377 INQRISDMTGFKLEEFPAIQLANFGVGGYFKPHYDFYTDRLKEVDVNNTLGD-RIGSIIF 435
Query: 89 YLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSL 148
Y V +GG+TVFP+ +V AV+P KG+AL +F+ D++ D SL
Sbjct: 436 YAGEVSQGGQTVFPDLKV----------------AVEPKKGNALFWFNAFDDSTPDPRSL 479
Query: 149 HGSCPVIEGEKWSATKWIH 167
H CPV+ G +W+ TKW+H
Sbjct: 480 HSVCPVLVGSRWTITKWLH 498
>gi|85857698|gb|ABC86384.1| IP10964p [Drosophila melanogaster]
Length = 534
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 56/161 (34%), Positives = 78/161 (48%), Gaps = 27/161 (16%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF----- 70
RT+ G +L K +E+ I RI T + E Q+++Y G Y H D+F
Sbjct: 369 RTAKGFWLKKESNELTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYFDFASS 428
Query: 71 -----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVK 125
R + + LG RIATVL YL+ VE+GG TVF + GY V
Sbjct: 429 NHTDTRSRYSIDLGD-RIATVLFYLTDVEQGGATVFGDV----------------GYYVS 471
Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
P G A+ +++L D + D + H +CPVI G KW T+WI
Sbjct: 472 PQAGTAIFWYNLDTDGNGDPRTRHAACPVIVGSKWVMTEWI 512
>gi|255607134|ref|XP_002538686.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223510975|gb|EEF23697.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 318
Score = 94.0 bits (232), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 59/172 (34%), Positives = 89/172 (51%), Gaps = 23/172 (13%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+VA+ SG+ + + RTS G + +K ++ +VA+I+ RIA T P + E +QIL+Y G
Sbjct: 156 VVANRGSGEFV-DDTRTSYGAYFNKGENSLVATIQRRIAELTRWPLTHAEPLQILNYGLG 214
Query: 61 QKYEPHFDFFRDKM-----NQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
+Y PHFD+F + + GG RIATV+MYL+ VE GG T+FP+ +
Sbjct: 215 GEYLPHFDYFEPQQPGLPSPLESGGQRIATVVMYLNDVEAGGGTIFPHLNLE-------- 266
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
+P KG A ++FS + S + I KW AT+W
Sbjct: 267 --------TRPRKGGA-IYFSYQLAVARSIRSRCMAARRIARRKWIATQWFR 309
>gi|330799463|ref|XP_003287764.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
gi|325082219|gb|EGC35708.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
Length = 220
Score = 94.0 bits (232), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 83/159 (52%), Gaps = 24/159 (15%)
Query: 16 RTSSGMFLSKAQDE--IVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR-- 71
R+ G+F+ + ++E + +I ++ + + ++ E MQI+ Y G++ H+D+F
Sbjct: 69 RSGWGLFMKEGEEEHPVTKNIFNKMKNFVNIS-DSCEVMQIIRYNPGEETSAHYDYFNPL 127
Query: 72 ---DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMK 128
M L G RI T+LMYL VE+GGET FP G VKP++
Sbjct: 128 TTNGSMKIGLYGQRICTILMYLCDVEEGGETSFPEV----------------GIKVKPIR 171
Query: 129 GDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
GDA+LF++ P+ D SLH PV +G KW A K I+
Sbjct: 172 GDAVLFYNCKPNGDVDPLSLHQGDPVTKGTKWVAIKLIN 210
>gi|221460681|ref|NP_733394.3| CG31013 [Drosophila melanogaster]
gi|220903261|gb|AAF57073.4| CG31013 [Drosophila melanogaster]
Length = 534
Score = 94.0 bits (232), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 56/161 (34%), Positives = 78/161 (48%), Gaps = 27/161 (16%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF----- 70
RT+ G +L K +E+ I RI T + E Q+++Y G Y H D+F
Sbjct: 369 RTAKGFWLKKESNELTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYFDFASS 428
Query: 71 -----RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVK 125
R + + LG RIATVL YL+ VE+GG TVF + GY V
Sbjct: 429 NHTDTRSRYSIDLGD-RIATVLFYLTDVEQGGATVFGDV----------------GYYVS 471
Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
P G A+ +++L D + D + H +CPVI G KW T+WI
Sbjct: 472 PQAGTAIFWYNLDTDGNGDPRTRHAACPVIVGSKWVMTEWI 512
>gi|312383453|gb|EFR28539.1| hypothetical protein AND_03427 [Anopheles darlingi]
Length = 341
Score = 94.0 bits (232), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 61/202 (30%), Positives = 94/202 (46%), Gaps = 45/202 (22%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G+ + R S +L + E++ ++ R+ T L E +Q+++Y G
Sbjct: 144 VQNYKTGELEFANYRISKSAWLKDTEHEVIRTVNQRVEDMTGLTMATAEELQVVNYGIGG 203
Query: 62 KYEPHFDFFRDKMN---QQLG-GHRIATVLMY-----------------------LSHVE 94
YEPHFDF R + + LG G+RIATVL Y +S V
Sbjct: 204 HYEPHFDFARREERNAFKSLGTGNRIATVLFYVSDLCLCHTSHTNADFRFLSVGQMSDVT 263
Query: 95 KGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPV 154
+GG TVFP+ + A++P KG A + +LH + D + H +CPV
Sbjct: 264 QGGATVFPSLNL----------------ALRPRKGTAAFWHNLHASGNGDYATRHAACPV 307
Query: 155 IEGEKWSATKWIHVR--NFDKP 174
+ G KW + KWIH R F +P
Sbjct: 308 LTGTKWVSNKWIHERGQEFRRP 329
>gi|195591298|ref|XP_002085379.1| GD14755 [Drosophila simulans]
gi|194197388|gb|EDX10964.1| GD14755 [Drosophila simulans]
Length = 515
Score = 94.0 bits (232), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 49/139 (35%), Positives = 77/139 (55%), Gaps = 22/139 (15%)
Query: 34 IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM-----NQQLGGHRIATVLM 88
I RI+ T E A+Q+ ++ G ++PH+D++ D++ N LG RI +++
Sbjct: 377 INQRISDMTGFKLEEFPAIQLANFGVGGYFKPHYDYYTDRLKEVDVNNTLGD-RIGSIIF 435
Query: 89 YLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSL 148
Y V +GG+TVFP+ +V AV+P KG+AL +F+ D+S D +L
Sbjct: 436 YAGEVSQGGQTVFPDLKV----------------AVEPKKGNALFWFNAFDDSSPDPRTL 479
Query: 149 HGSCPVIEGEKWSATKWIH 167
H CPVI G +W+ TKW+H
Sbjct: 480 HSVCPVIVGSRWTITKWLH 498
>gi|37912909|gb|AAR05245.1| conserved hypothetical protein [uncultured marine proteobacterium
ANT32C12]
Length = 186
Score = 94.0 bits (232), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 57/158 (36%), Positives = 78/158 (49%), Gaps = 24/158 (15%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF----- 70
RT+S ++ EI+ + R + +P N E Q++HY G +Y+PHFD F
Sbjct: 40 RTNSYAWIQHDASEIIHEVSKRFSILVKMPINNAEQFQLVHYGPGTEYKPHFDAFDKSTE 99
Query: 71 RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
+ N GG R+ T L YL+ VE GG T FP+ VS VKP KGD
Sbjct: 100 EGRNNWFPGGQRMVTALAYLNDVEDGGATDFPDIHVS----------------VKPNKGD 143
Query: 131 ALLFFSLHPDASTD--STSLHGSCPVIEGEKWSATKWI 166
++F + D ++D SLHG PVI GEKW+ W
Sbjct: 144 VVVFHNC-KDGTSDINPNSLHGGSPVISGEKWAVNLWF 180
>gi|381200649|ref|ZP_09907785.1| Prolyl 4-hydroxylase alpha subunit [Sphingobium yanoikuyae XLDN2-5]
Length = 305
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 59/167 (35%), Positives = 81/167 (48%), Gaps = 22/167 (13%)
Query: 1 MVADNESGKSIASEVRTS-SGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEH 59
MV D SG+ + VRTS G+F +D ++ +I RIAA + GE + +L Y
Sbjct: 150 MVIDPRSGRPMPHPVRTSDGGIFGPAREDLVIQAINRRIAAASGTMLSGGEPLTLLRYAV 209
Query: 60 GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
GQ+Y H D NQ R T+L+YL+ GGET+FP R
Sbjct: 210 GQQYRQHHDCLPHVRNQ-----RAWTMLIYLNEGYAGGETIFP----------------R 248
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
G +VK KGDALLF + ++H PV+ G+KW T+WI
Sbjct: 249 LGLSVKGRKGDALLFRNTDAQGQAAEAAVHLGAPVMAGQKWLCTRWI 295
>gi|195505202|ref|XP_002099402.1| GE23382 [Drosophila yakuba]
gi|194185503|gb|EDW99114.1| GE23382 [Drosophila yakuba]
Length = 537
Score = 93.6 bits (231), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 56/170 (32%), Positives = 82/170 (48%), Gaps = 19/170 (11%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G+ S R S +L+ + + + T L E +Q+ +Y G YEPH+
Sbjct: 367 GQHKKSAFRVSKNAWLAYESHPTMVGMLRDLKEATGLDTTYCEQLQVANYGVGGHYEPHW 426
Query: 68 DFFRDKMNQ-QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
DFFRD + + G+RIAT + YLS VE+GG T FP ++ AVKP
Sbjct: 427 DFFRDPNHYPEEEGNRIATAIFYLSEVEQGGATAFPFLDI----------------AVKP 470
Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
G+ L +++LH D + H CPV++G KW WIH + F +P
Sbjct: 471 QLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIHEVTQTFARP 520
>gi|321463241|gb|EFX74258.1| hypothetical protein DAPPUDRAFT_22132 [Daphnia pulex]
Length = 523
Score = 93.2 bits (230), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 58/167 (34%), Positives = 81/167 (48%), Gaps = 28/167 (16%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPE----NGEAMQILHYEHGQKYEPHF 67
S VRTS +L + ++ + RI T L + E +Q+ +Y G Y PH
Sbjct: 359 VSNVRTSKTAWLPEGLHPLLNRLSRRIGLITGLKTDPIRDEAELLQVANYGIGGHYSPHH 418
Query: 68 DFF-RDKMNQQL-------GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
D+ +DK + + G RIAT + YL+ VE+GG T FP R
Sbjct: 419 DYLMKDKADFEYMHHRELQAGDRIATFMFYLNDVERGGSTAFP----------------R 462
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
G AVKP+KG A +F+L D +LHG+CPV+ G KW + KWI
Sbjct: 463 AGVAVKPVKGGAAFWFNLKRSGKPDPLTLHGACPVLLGHKWVSNKWI 509
>gi|51490656|emb|CAF31507.1| prolyl 4-hydroxylase 2 precursor [Brugia malayi]
Length = 551
Score = 93.2 bits (230), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 54/154 (35%), Positives = 78/154 (50%), Gaps = 18/154 (11%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR--DK 73
RTS +L + E+V I R+ T L E E +Q+ +Y G YEPH+D R +
Sbjct: 372 RTSQSSWLGSTEHEVVKRINKRLDLATNLETETAEELQVQNYGIGGHYEPHYDCSRRENV 431
Query: 74 MNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALL 133
+ G+RIAT+L+Y++ E GG TVF + + S S C K AL
Sbjct: 432 FEKTKNGNRIATILIYMTEPEIGGGTVFIDLKTSVS-------CT---------KNAALF 475
Query: 134 FFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
+++L + D S H +CPV+ G KW+A KW H
Sbjct: 476 WYNLMRSGAVDMRSYHAACPVLTGTKWTANKWFH 509
>gi|195391766|ref|XP_002054531.1| GJ24504 [Drosophila virilis]
gi|194152617|gb|EDW68051.1| GJ24504 [Drosophila virilis]
Length = 545
Score = 93.2 bits (230), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 57/176 (32%), Positives = 89/176 (50%), Gaps = 26/176 (14%)
Query: 10 SIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDF 69
S+ S RTS F+ K + +++ +I+ R+A T L E Q+ +Y G Y H D+
Sbjct: 366 SVVSNARTSQFTFIPKTRHKVLRTIDQRVADMTDLNMVFAEDHQLANYGIGGHYAQHMDW 425
Query: 70 F-------RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGY 122
F + N ++G +RIATVL YL+ VE+GG T FP +
Sbjct: 426 FSPNAFETKQVANSEMG-NRIATVLFYLTDVEQGGGTAFPVLK----------------Q 468
Query: 123 AVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
+KP K A +++LH + D ++HG+CP+I G KW +WI R F + ++ P
Sbjct: 469 LLKPKKYAAAFWYNLHASGAGDVRTMHGACPIIVGSKWVLNRWI--REFVQSDRRP 522
>gi|374620441|ref|ZP_09692975.1| 2OG-Fe(II) oxygenase superfamily enzyme [gamma proteobacterium
HIMB55]
gi|374303668|gb|EHQ57852.1| 2OG-Fe(II) oxygenase superfamily enzyme [gamma proteobacterium
HIMB55]
Length = 570
Score = 92.8 bits (229), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 57/164 (34%), Positives = 81/164 (49%), Gaps = 22/164 (13%)
Query: 13 SEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF-- 70
SE RT S +L +D++V S+ RI+ LP E E+MQI+HY Q+Y PHFD F
Sbjct: 60 SEGRTGSNHWLKYDEDDVVQSVGQRISDIVGLPLEYAESMQIIHYGPEQEYRPHFDAFNL 119
Query: 71 ---RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPM 127
+ + + GG R+ T L+YL+ VE GG T FP + G V +
Sbjct: 120 SLPKGQRAAKWGGQRLVTALVYLNKVEAGGATQFP----------------KLGITVPAL 163
Query: 128 KGDALLFFSLHPDAS-TDSTSLHGSCPVIEGEKWSATKWIHVRN 170
G ++F + D S SLH PV GEKW+ W +++
Sbjct: 164 PGRMVIFHNTTHDISGPHPLSLHAGMPVEAGEKWAFNMWFRLQD 207
>gi|194765178|ref|XP_001964704.1| GF23330 [Drosophila ananassae]
gi|190614976|gb|EDV30500.1| GF23330 [Drosophila ananassae]
Length = 537
Score = 92.8 bits (229), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 56/170 (32%), Positives = 82/170 (48%), Gaps = 19/170 (11%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G++ S R S +L+ + + + T L E +Q+ +Y G YEPH+
Sbjct: 367 GQNKKSAFRVSKNAWLAYESHPTMEGMLRDLKDATGLDTTYCEQLQVANYGVGGHYEPHW 426
Query: 68 DFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
DFFRD + G+RIAT + YLS VE+GG T FP + +AVKP
Sbjct: 427 DFFRDPNHYPAEEGNRIATAIFYLSDVEQGGATAFPFLD----------------FAVKP 470
Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
G+ L +++LH D + H CPV++G KW WIH + F +P
Sbjct: 471 QLGNVLFWYNLHRSLDMDYRTKHAGCPVLKGSKWIGNVWIHDMTQTFARP 520
>gi|4336512|gb|AAD17844.1| prolyl 4-hydroxylase alpha subunit [Drosophila melanogaster]
Length = 535
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 55/170 (32%), Positives = 83/170 (48%), Gaps = 19/170 (11%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
+G S A+ RTS G + +++ + + ++ L + E +Q+ +Y G YEPH
Sbjct: 362 NGGSTAAAFRTSQGASFNYSRNAATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPH 421
Query: 67 FDFFRDKMNQQ---LGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
+D F + Q L G+R+AT + YLS VE GG T FP +
Sbjct: 422 WDSFPENHIYQEGDLHGNRMATGIYYLSDVEAGGGTAFPFLPL----------------L 465
Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
V P +G L +++LHP D + H +CPV++G KW A WI RN D
Sbjct: 466 VTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQDN 515
>gi|198459366|ref|XP_002138685.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
gi|198136669|gb|EDY69243.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
Length = 448
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 55/156 (35%), Positives = 80/156 (51%), Gaps = 20/156 (12%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWT---FLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
RTS F + Q V + R+ T L + + + +L+Y +Y H D+F
Sbjct: 295 RTSMSAFQTDHQYTAVTKVNRRVMHMTGFEVLADGSSDELLVLNYATAAQYLTHSDYFGP 354
Query: 73 KMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
++ + G RIATVL YL+ VE+GG+TVFP R G PMKG A
Sbjct: 355 AYSEYIQRGDRIATVLFYLNDVEQGGKTVFP----------------RLGIFRSPMKGSA 398
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
++F++L+ D + HG CPV+ G KW+ATKWI+
Sbjct: 399 VVFYNLNSSLQGDPRTEHGGCPVLVGTKWAATKWIY 434
>gi|195172672|ref|XP_002027120.1| GL20071 [Drosophila persimilis]
gi|194112933|gb|EDW34976.1| GL20071 [Drosophila persimilis]
Length = 455
Score = 92.4 bits (228), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 54/156 (34%), Positives = 81/156 (51%), Gaps = 20/156 (12%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWT---FLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
RTS F + Q + V + R+ T L + + + +L+Y +Y H D+F
Sbjct: 302 RTSMSAFQTDHQYKAVTKVNRRVMHMTGFEVLADGSSDELLVLNYATAAQYLTHSDYFGP 361
Query: 73 KMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
++ + G RIATVL YL+ VE+GG+TVFP R G PMKG A
Sbjct: 362 AYSEYIQRGDRIATVLFYLNDVEQGGKTVFP----------------RLGIFRSPMKGSA 405
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
++F++++ D + HG CPV+ G KW+ATKWI+
Sbjct: 406 VVFYNMNSSLQGDPRTEHGGCPVLVGTKWAATKWIY 441
>gi|227553849|gb|ACP40552.1| IP22178p [Drosophila melanogaster]
Length = 467
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 56/170 (32%), Positives = 81/170 (47%), Gaps = 19/170 (11%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G+ S R S +L+ + + + T L E +Q+ +Y G YEPH+
Sbjct: 297 GQLKKSAFRVSKNAWLAYESHPTMVGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHW 356
Query: 68 DFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
DFFRD + G+RIAT + YLS VE+GG T FP ++ AVKP
Sbjct: 357 DFFRDPNHYPAEEGNRIATAIFYLSEVEQGGATAFPFLDI----------------AVKP 400
Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
G+ L +++LH D + H CPV++G KW WIH + F +P
Sbjct: 401 QLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIHEVTQTFARP 450
>gi|194905372|ref|XP_001981184.1| GG11758 [Drosophila erecta]
gi|190655822|gb|EDV53054.1| GG11758 [Drosophila erecta]
Length = 550
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/176 (32%), Positives = 86/176 (48%), Gaps = 24/176 (13%)
Query: 9 KSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFD 68
+S+ S VRTS F+ + +++++I+ R+A T L + E Q +Y G Y H D
Sbjct: 365 ESLVSNVRTSQFTFIPASAHKVLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHMD 424
Query: 69 -FFRDKMNQQL-----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGY 122
F++ + L G+RIATVL YLS V +GG T FP
Sbjct: 425 WFYQTTFDAGLVSSPEMGNRIATVLFYLSDVSQGGGTAFPQLRT---------------- 468
Query: 123 AVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
+KP K A + +LH D + HG+CP+I G KW +WI R FD+ ++ P
Sbjct: 469 LLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKWVQNRWI--REFDQSDRRP 522
>gi|66772633|gb|AAY55628.1| IP02961p [Drosophila melanogaster]
Length = 409
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/170 (31%), Positives = 83/170 (48%), Gaps = 19/170 (11%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
+G S A+ RTS G + +++ + + ++ L + E +Q+ +Y G YEPH
Sbjct: 236 NGGSTAAAFRTSQGASFNYSRNAATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPH 295
Query: 67 FDFFRDKMNQQLG---GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
+D F + Q G G+R+AT + YL+ VE GG T FP +
Sbjct: 296 WDSFPENHIYQEGDLHGNRMATGIYYLADVEAGGGTAFPFLPL----------------L 339
Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
V P +G L +++LHP D + H +CPV++G KW A WI RN D
Sbjct: 340 VTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQDN 389
>gi|323455897|gb|EGB11765.1| hypothetical protein AURANDRAFT_52419 [Aureococcus anophagefferens]
Length = 478
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 60/164 (36%), Positives = 79/164 (48%), Gaps = 45/164 (27%)
Query: 50 EAMQILHYEHGQKYEPHFDFF-----------RDKMNQQL-----GGHRIATVLMYLSHV 93
+ +Q+LHYE Q Y+PH D+F D + + G +R ATV +YL++
Sbjct: 232 DGLQVLHYERPQWYKPHVDYFTSRNAGGGGASEDAFSNAIPTANNGTNRFATVFLYLNNA 291
Query: 94 EKGGETVFPNS---EVSQS-----------------RDGNW-----SECARRGYAVKPMK 128
GGETVFP S E+ Q D W SE R V P
Sbjct: 292 GSGGETVFPLSTTHEIYQGGRLTQAGTNRTPGFIRDADAAWVCDTKSEALR----VTPRT 347
Query: 129 GDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
GD++LF+S DAS D SLHGSCP+ +GEKW+A W+ R D
Sbjct: 348 GDSVLFYSQRGDASLDGYSLHGSCPMGDGEKWAANLWVWNRPRD 391
>gi|20269818|gb|AAM18064.1| prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE1
[Drosophila melanogaster]
Length = 286
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 56/170 (32%), Positives = 81/170 (47%), Gaps = 19/170 (11%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G+ S R S +L+ + + + T L E +Q+ +Y G YEPH+
Sbjct: 116 GQLKKSAFRVSKNAWLAYESHPTMVGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHW 175
Query: 68 DFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
DFFRD + G+RIAT + YLS VE+GG T FP ++ AVKP
Sbjct: 176 DFFRDPNHYPAEEGNRIATAIFYLSEVEQGGATAFPFLDI----------------AVKP 219
Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
G+ L +++LH D + H CPV++G KW WIH + F +P
Sbjct: 220 QLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIHEVTQTFARP 269
>gi|301115862|ref|XP_002905660.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110449|gb|EEY68501.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 215
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 62/181 (34%), Positives = 94/181 (51%), Gaps = 19/181 (10%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR 71
A++ RTS+ +L + +V +I+ R A +P + E++Q+L YE Q Y+ H D+F
Sbjct: 44 ATDWRTSTTYWLDSSSHPVVQTIDKRTADLVKVPISHQESVQVLRYEPTQHYDQHLDYFS 103
Query: 72 --------DKMNQQLGGH--RIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
D + + G+ R+ TV Y+S V KGG T F S R + +C++ G
Sbjct: 104 AERHRNSPDVLKRIEYGYKNRMITVFWYMSDVAKGGHTNFARSG-GLPRPSSNKDCSQ-G 161
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDD 181
+V P K ++F+S+ P+ D SLH CPV EG K S KWI ++KP DD
Sbjct: 162 ISVAPKKRKVVVFYSMLPNGEGDPMSLHAGCPVEEGIKLSGNKWI----WNKPR---SDD 214
Query: 182 D 182
D
Sbjct: 215 D 215
>gi|195341544|ref|XP_002037366.1| GM12151 [Drosophila sechellia]
gi|194131482|gb|EDW53525.1| GM12151 [Drosophila sechellia]
Length = 537
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 68/128 (53%), Gaps = 19/128 (14%)
Query: 50 EAMQILHYEHGQKYEPHFDFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQ 108
E +Q+ +Y G YEPH+DFFRD + G+RIAT + YLS VE+GG T FP ++
Sbjct: 409 EQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGATAFPFLDI-- 466
Query: 109 SRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH- 167
AVKP G+ L +++LH D + H CPV++G KW WIH
Sbjct: 467 --------------AVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIHE 512
Query: 168 -VRNFDKP 174
+ F +P
Sbjct: 513 VTQTFARP 520
>gi|17861644|gb|AAL39299.1| GH17175p [Drosophila melanogaster]
Length = 187
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 56/170 (32%), Positives = 81/170 (47%), Gaps = 19/170 (11%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G+ S R S +L+ + + + T L E +Q+ +Y G YEPH+
Sbjct: 17 GQLKKSAFRVSKNAWLAYESHPTMVGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHW 76
Query: 68 DFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
DFFRD + G+RIAT + YLS VE+GG T FP ++ AVKP
Sbjct: 77 DFFRDPNHYPAEEGNRIATAIFYLSEVEQGGATAFPFLDI----------------AVKP 120
Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
G+ L +++LH D + H CPV++G KW WIH + F +P
Sbjct: 121 QLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIHEVTQTFARP 170
>gi|195575099|ref|XP_002105517.1| GD17024 [Drosophila simulans]
gi|194201444|gb|EDX15020.1| GD17024 [Drosophila simulans]
Length = 537
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 68/128 (53%), Gaps = 19/128 (14%)
Query: 50 EAMQILHYEHGQKYEPHFDFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQ 108
E +Q+ +Y G YEPH+DFFRD + G+RIAT + YLS VE+GG T FP ++
Sbjct: 409 EQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGATAFPFLDI-- 466
Query: 109 SRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH- 167
AVKP G+ L +++LH D + H CPV++G KW WIH
Sbjct: 467 --------------AVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIHE 512
Query: 168 -VRNFDKP 174
+ F +P
Sbjct: 513 VTQTFARP 520
>gi|24651420|ref|NP_733374.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
gi|7301952|gb|AAF57058.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
gi|363987308|gb|AEW43896.1| FI16820p1 [Drosophila melanogaster]
Length = 537
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 68/128 (53%), Gaps = 19/128 (14%)
Query: 50 EAMQILHYEHGQKYEPHFDFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQ 108
E +Q+ +Y G YEPH+DFFRD + G+RIAT + YLS VE+GG T FP ++
Sbjct: 409 EQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFYLSEVEQGGATAFPFLDI-- 466
Query: 109 SRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH- 167
AVKP G+ L +++LH D + H CPV++G KW WIH
Sbjct: 467 --------------AVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKWIGNVWIHE 512
Query: 168 -VRNFDKP 174
+ F +P
Sbjct: 513 VTQTFARP 520
>gi|323452216|gb|EGB08091.1| hypothetical protein AURANDRAFT_26622 [Aureococcus anophagefferens]
Length = 190
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/165 (32%), Positives = 82/165 (49%), Gaps = 22/165 (13%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPE------NGEAMQILHYEHGQ 61
G S+ RTS +L ++ I+ +I R + + N E +Q++ Y+ Q
Sbjct: 34 GGGFTSKTRTSENGWLRRSASPILENIYKRFGDVLGIDHDLLRSGKNAEELQVVRYDRSQ 93
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
+Y PH DF D QQ R T+L+Y+ E+GG T FP ++ DG G
Sbjct: 94 EYAPHHDFGDDGTPQQ----RFLTLLLYIQLPEEGGATSFP-----KANDG-------MG 137
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
V P +GDA+LF+S+ PD + D +LH PV +G+KW W+
Sbjct: 138 VQVVPARGDAVLFYSMLPDGNADDLALHAGMPVRKGQKWVCNLWV 182
>gi|444731524|gb|ELW71877.1| Prolyl 4-hydroxylase subunit alpha-3 [Tupaia chinensis]
Length = 562
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/167 (34%), Positives = 86/167 (51%), Gaps = 24/167 (14%)
Query: 22 FLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYEHGQKYEPHFDFFRDK---MNQ 76
+L D ++ +++ RIAA T L P E +Q+++Y G YEPHFD + +
Sbjct: 412 WLKDTVDPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYR 471
Query: 77 QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFS 136
G+R+AT ++YLS VE GG T F + ++V +K AL +++
Sbjct: 472 MKSGNRVATFMIYLSSVEAGGATAFIYA----------------NFSVPVVKNAALFWWN 515
Query: 137 LHPDASTDSTSLHGSCPVIEGEKWSATKWIHV--RNFDKP-EKEPED 180
LH +S +LH CPV+ G+KW A KWIH + F +P PED
Sbjct: 516 LHRSGEGNSDTLHAGCPVLVGDKWVANKWIHEYGQEFRRPCTSSPED 562
>gi|443705944|gb|ELU02240.1| hypothetical protein CAPTEDRAFT_227850 [Capitella teleta]
Length = 475
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/169 (34%), Positives = 82/169 (48%), Gaps = 27/169 (15%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPE------NGEAMQIL 55
V D+ G+S R SS F++ + D +VAS+ R++ T L E E++Q+L
Sbjct: 214 VLDDTGGESFFDVSRLSSTAFVNDSND-LVASLNRRVSKLTGLQTEVLDSFSESESLQVL 272
Query: 56 HYEHGQKYEPHFDFFRDKMNQ----QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRD 111
Y G Y PH+D + + Q G RIAT ++YL GG TVFP +S
Sbjct: 273 RYGPGGLYTPHYDTLGSEADLPPYIQHTGDRIATFILYLDIATAGGATVFPLLPMS---- 328
Query: 112 GNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKW 160
+ KG A +F+LHPD S D +LH +CPVI G KW
Sbjct: 329 ------------IPIQKGAAAFWFNLHPDGSLDRRTLHAACPVIRGTKW 365
>gi|195341588|ref|XP_002037388.1| GM12140 [Drosophila sechellia]
gi|194131504|gb|EDW53547.1| GM12140 [Drosophila sechellia]
Length = 534
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/171 (33%), Positives = 79/171 (46%), Gaps = 27/171 (15%)
Query: 6 ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
E G + RT+ G + K +E+ I RI T + E Q+++Y G Y
Sbjct: 359 EQGVPKKNRGRTAKGFWFKKESNELTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYLL 418
Query: 66 HFDFF----------RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
H D+F R + LG RIATVL YL+ VE+GG TVF
Sbjct: 419 HMDYFDFASSNHTDTRSSYSMDLGD-RIATVLFYLTDVEQGGATVF-------------- 463
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
A GY+V P G A+ +++L + D + H +CPVI G KW T+WI
Sbjct: 464 --ADVGYSVYPQAGTAIFWYNLDTNGKGDPRTKHAACPVIVGSKWVMTEWI 512
>gi|24651418|ref|NP_524594.2| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
gi|7301951|gb|AAF57057.1| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
gi|359807686|gb|AEV66559.1| FI17802p1 [Drosophila melanogaster]
Length = 535
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/170 (31%), Positives = 83/170 (48%), Gaps = 19/170 (11%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
+G S A+ RTS G + +++ + + ++ L + E +Q+ +Y G YEPH
Sbjct: 362 NGGSTAAAFRTSQGASFNYSRNAATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPH 421
Query: 67 FDFFRDKMNQQ---LGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
+D F + Q L G+R+AT + YL+ VE GG T FP +
Sbjct: 422 WDSFPENHIYQEGDLHGNRMATGIYYLADVEAGGGTAFPFLPL----------------L 465
Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
V P +G L +++LHP D + H +CPV++G KW A WI RN D
Sbjct: 466 VTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQDN 515
>gi|452752943|ref|ZP_21952682.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
proteobacterium JLT2015]
gi|451959765|gb|EMD82182.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
proteobacterium JLT2015]
Length = 314
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 58/171 (33%), Positives = 78/171 (45%), Gaps = 22/171 (12%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQ-DEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+ D ++G VRTS G LS + D +V + RIAA T GE + IL Y
Sbjct: 159 ILDPQTGARRPDPVRTSVGAALSPVEEDLVVGMLNRRIAAATGTDRMQGEPLHILRYSGA 218
Query: 61 QKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARR 120
Q+Y PH D NQ R T+++YL+ +GGET FP
Sbjct: 219 QEYRPHHDAVAGLENQ-----RSHTLIVYLTADYEGGETAFPEL---------------- 257
Query: 121 GYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
G+ ++ +GDALLF +L D D H P G KW AT+WI R +
Sbjct: 258 GFRLRGRQGDALLFANLREDGRPDLRMRHAGLPATSGAKWIATRWIRTRPY 308
>gi|195452746|ref|XP_002073482.1| GK14141 [Drosophila willistoni]
gi|194169567|gb|EDW84468.1| GK14141 [Drosophila willistoni]
Length = 541
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 57/173 (32%), Positives = 88/173 (50%), Gaps = 24/173 (13%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF- 70
S+VRTS F+ K + +++ +I+ R+A + L + E Q +Y G Y H D+F
Sbjct: 368 VSKVRTSQFTFIPKTRHKVLQTIDQRVADMSNLNMDYAELHQFANYGIGGHYAQHNDWFG 427
Query: 71 RDKMNQQL-----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVK 125
+D + +L G+RIATVL YLS V +GG T FP+ + ++
Sbjct: 428 QDAFDNELVSSPEMGNRIATVLFYLSDVAQGGGTAFPHLK----------------QLLQ 471
Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
P K A + +LH D +LHG+CP+I G KW +WI R F + ++ P
Sbjct: 472 PKKYAAAFWHNLHASGVGDLRTLHGACPIIAGSKWVQNRWI--REFIQADRRP 522
>gi|195391758|ref|XP_002054527.1| GJ22759 [Drosophila virilis]
gi|194152613|gb|EDW68047.1| GJ22759 [Drosophila virilis]
Length = 539
Score = 91.7 bits (226), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/165 (32%), Positives = 78/165 (47%), Gaps = 19/165 (11%)
Query: 11 IASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF 70
I++ RTS G I+ + +A + L + E +QI +Y G YEPH D F
Sbjct: 370 ISANFRTSQGTTFEYTDHPIMQKMSHHVAEISGLDMRSAEPLQIANYGIGGHYEPHMDSF 429
Query: 71 RDKMNQQLGGH---RIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPM 127
D + L + R+AT + YLS+VE GG T FP + V P
Sbjct: 430 PDSYDYSLNMYKTNRLATGIYYLSNVEAGGGTAFPFLPL----------------LVTPE 473
Query: 128 KGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFD 172
+G L +++LHP D + H +CPV++G KW A WI + N D
Sbjct: 474 RGSLLFWYNLHPSGDADYRTKHAACPVLQGSKWIANVWIRLSNQD 518
>gi|427410797|ref|ZP_18900999.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
51230]
gi|425710785|gb|EKU73805.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
51230]
Length = 322
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 57/167 (34%), Positives = 81/167 (48%), Gaps = 22/167 (13%)
Query: 1 MVADNESGKSIASEVRTS-SGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEH 59
MV D SG+ + +RTS G+F +D ++ +I RIAA + GE + +L Y
Sbjct: 167 MVIDPRSGRPMPHPIRTSDGGIFGPAREDLVIQAINRRIAAASGTMLSGGEPLTLLRYAV 226
Query: 60 GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
GQ+Y H D NQ R T+L+YL+ GGET+FP R
Sbjct: 227 GQQYRQHHDCLPHVRNQ-----RAWTMLIYLNEGYAGGETIFP----------------R 265
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
G +VK KG+ALLF + ++H PV+ G+KW T+WI
Sbjct: 266 LGLSVKGRKGNALLFRNTDAQGQAAEAAVHLGAPVMAGQKWLCTRWI 312
>gi|390352104|ref|XP_003727818.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like
[Strongylocentrotus purpuratus]
Length = 121
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 48/118 (40%), Positives = 65/118 (55%), Gaps = 17/118 (14%)
Query: 50 EAMQILHYEHGQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQS 109
E +QI +Y G Y PHFDF RD + G+RIA++L YLS V KGG+TVF ++
Sbjct: 5 EFLQIANYGLGGHYLPHFDFTRDVATHK-NGNRIASMLFYLSDVAKGGDTVFIDA----- 58
Query: 110 RDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
G +KP KG A+ +++L + D + H SCPVI G KW A W+H
Sbjct: 59 -----------GAKIKPEKGSAIFWYNLFKNGKVDERTKHASCPVISGSKWVANMWMH 105
>gi|410632646|ref|ZP_11343301.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
gi|410147883|dbj|GAC20168.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
Length = 480
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 52/164 (31%), Positives = 84/164 (51%), Gaps = 19/164 (11%)
Query: 13 SEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRD 72
+ RTSS L QD ++ I+ +I + + P E +Q HY+ GQ+++PH D+F
Sbjct: 133 QQFRTSSTCHLGNMQDPVIRKIDLQICQYLGIDPSYSEVIQGQHYQLGQQFKPHTDYFEP 192
Query: 73 KMNQQLG---GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKG 129
G G R T ++YL+ VE+GG+TVFP + G+ K KG
Sbjct: 193 YELAHYGGIQGQRTYTFMIYLNEVEQGGDTVFPELAI--------------GFKAK--KG 236
Query: 130 DALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
A+++ +++PD S + +LH PV +GEK TKW + ++
Sbjct: 237 MAVIWNNINPDGSVNYQTLHQGMPVQKGEKLIITKWFRQHSLEQ 280
>gi|194871359|ref|XP_001972833.1| GG13662 [Drosophila erecta]
gi|190654616|gb|EDV51859.1| GG13662 [Drosophila erecta]
Length = 515
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 52/148 (35%), Positives = 82/148 (55%), Gaps = 24/148 (16%)
Query: 34 IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM-----NQQLGGHRIATVLM 88
I RI+ T E A+Q+ ++ G ++PH+D++ +++ N LG R+A++++
Sbjct: 377 INDRISDMTGFKVEEFPAIQLANFGVGGYFKPHYDYYTERLKELDANNTLGD-RLASIII 435
Query: 89 YLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSL 148
Y V +GG+TVFP+ +V AV+P KG AL +F+ D+S D SL
Sbjct: 436 YAGEVSQGGQTVFPDIKV----------------AVEPKKGKALFWFNDFDDSSPDPRSL 479
Query: 149 HGSCPVIEGEKWSATKWIHV--RNFDKP 174
H CPVI G +W+ TKW+H + F KP
Sbjct: 480 HSVCPVIVGSRWTITKWLHYAPQMFVKP 507
>gi|410910256|ref|XP_003968606.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Takifugu
rubripes]
Length = 540
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 55/169 (32%), Positives = 86/169 (50%), Gaps = 31/169 (18%)
Query: 9 KSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPEN--GEAMQILHYEHGQKYEPH 66
K +E R S +L + V+ ++ +I+ T L ++ GE +Q+++Y G YEPH
Sbjct: 377 KQATAEYRISKSAWLKGSAHSTVSRLDQKISMLTGLNVQHPHGEYLQVVNYGIGGHYEPH 436
Query: 67 FD--------FFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECA 118
FD F+ K G+R+AT ++YLS VE GG T F +
Sbjct: 437 FDHATSPSSPVFKLKT-----GNRVATFMIYLSSVEAGGSTAFIYA-------------- 477
Query: 119 RRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
++V MK A+ +++LH + D+ +LH CPV+ G+KW A KWIH
Sbjct: 478 --NFSVPVMKNAAIFWWNLHRNGEGDADTLHAGCPVLIGDKWVANKWIH 524
>gi|297301157|ref|XP_001103971.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Macaca
mulatta]
Length = 512
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 56/175 (32%), Positives = 81/175 (46%), Gaps = 36/175 (20%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D E+GK ++ R S +LS ++ +V+ I RI T L E +Q+ +Y G
Sbjct: 365 VHDPETGKLTTAQYRVSKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGG 424
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
+YEPHFDF R +S V GG TVFP G
Sbjct: 425 QYEPHFDFAR------------------MSDVSAGGATVFPEV----------------G 450
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
+V P KG A+ +++L D ++ H +CPV+ G KW + KW+H R F +P
Sbjct: 451 ASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWLHERGQEFRRP 505
>gi|323453493|gb|EGB09364.1| hypothetical protein AURANDRAFT_15704, partial [Aureococcus
anophagefferens]
Length = 148
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 59/159 (37%), Positives = 79/159 (49%), Gaps = 15/159 (9%)
Query: 13 SEVRTSSGMFLSKA--QDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF 70
S RTS + + A + ++ ARI T +P EN E+ Q+L Y HGQ+Y H D
Sbjct: 1 STSRTSENAWCTGACESNRATRAVMARIEEVTGVPKENYESFQVLRYTHGQQYRAHHDMS 60
Query: 71 RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
R N G RI T MY S VEKGGET FP + + + P +G
Sbjct: 61 RGD-NALACGPRIYTFFMYFSDVEKGGETEFPMVKRPSGKT----------VKIAPKRGS 109
Query: 131 ALLFFSLHPDAST--DSTSLHGSCPVIEGEKWSATKWIH 167
ALL+ S+ D T D + H + PV+EG K++A WIH
Sbjct: 110 ALLWPSVTSDDPTAQDPRTRHAALPVVEGTKFAANAWIH 148
>gi|195575143|ref|XP_002105539.1| GD16913 [Drosophila simulans]
gi|194201466|gb|EDX15042.1| GD16913 [Drosophila simulans]
Length = 534
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 57/171 (33%), Positives = 79/171 (46%), Gaps = 27/171 (15%)
Query: 6 ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
E G + RT+ G + K +E+ I RI T + E Q+++Y G Y
Sbjct: 359 EQGVPKKNRGRTAKGFWFKKESNELTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYLL 418
Query: 66 HFDFF----------RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWS 115
H D+F R + LG RIATVL YL+ VE+GG TVF
Sbjct: 419 HMDYFDFASSNHTDTRSGYSMDLGD-RIATVLFYLTDVEQGGATVF-------------- 463
Query: 116 ECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
A GY+V P G A+ +++L + D + H +CPVI G KW T+WI
Sbjct: 464 --ADVGYSVYPQAGTAIFWYNLDTNGKGDPRTRHAACPVIVGSKWVMTEWI 512
>gi|334343683|ref|YP_004552235.1| procollagen-proline dioxygenase [Sphingobium chlorophenolicum L-1]
gi|334100305|gb|AEG47729.1| Procollagen-proline dioxygenase [Sphingobium chlorophenolicum L-1]
Length = 225
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 57/165 (34%), Positives = 88/165 (53%), Gaps = 22/165 (13%)
Query: 10 SIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDF 69
S ++ RTS+ LS D +V+++ RI A T + ++GE +Q Y GQ+Y+PH+D+
Sbjct: 74 SANADYRTSASCNLSP-WDPLVSAVSDRICALTGIAADHGETLQGQRYHPGQEYKPHWDY 132
Query: 70 FRDKMNQ-----QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAV 124
F N + GG R T ++YLS VE GGET FP+ E + V
Sbjct: 133 FPVTANYWPAMLKTGGQRSWTAMIYLSPVEAGGETHFPHCE----------------FMV 176
Query: 125 KPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
P++G L++ ++ D S + +SLH + PV +G K+ TKW R
Sbjct: 177 PPVEGMLLIWNNMDRDGSPNGSSLHAARPVEQGTKYVVTKWFRER 221
>gi|170064956|ref|XP_001867741.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
gi|167882144|gb|EDS45527.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
Length = 520
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 57/168 (33%), Positives = 87/168 (51%), Gaps = 23/168 (13%)
Query: 13 SEVRTSSGMFLSKAQDEIVASI--EARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF 70
S++R S + D IV ++ AR A P + E +Q+ +Y G Y H+D+
Sbjct: 363 SKIRISQNAWFENEHDPIVETLNQRARDMAGGLNEP-SYELLQVNNYGLGGFYSIHYDWS 421
Query: 71 R--DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMK 128
+ + G+RIAT++ YLS V++GG TVFP R AV+P K
Sbjct: 422 TSANPFPNKGMGNRIATLMFYLSDVQEGGSTVFP----------------RLNLAVRPRK 465
Query: 129 GDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN--FDKP 174
G A+ +++LH + + +LH +CPV+ G KW A KWIH R+ F +P
Sbjct: 466 GTAIFWYNLHRNGKGNKKTLHAACPVLIGSKWVANKWIHERHQEFVRP 513
>gi|195452734|ref|XP_002073476.1| GK13124 [Drosophila willistoni]
gi|194169561|gb|EDW84462.1| GK13124 [Drosophila willistoni]
Length = 536
Score = 90.5 bits (223), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 54/170 (31%), Positives = 83/170 (48%), Gaps = 19/170 (11%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G++ S R S +L+ + + ++ T L E +Q+ +Y G YEPH+
Sbjct: 366 GQNKKSSFRVSKNAWLAYETHPTMGKMLRDLSDTTGLDMTYCEQLQVANYGVGGHYEPHW 425
Query: 68 DFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
DFFR+ + G+RIAT + YLS VE+GG T FP +AV+P
Sbjct: 426 DFFRNPDHYPAEEGNRIATAIYYLSEVEQGGATAFP----------------FLNFAVRP 469
Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
G+ L +++LH + D + H CPV++G KW WIH + F +P
Sbjct: 470 QLGNVLFWYNLHRSSDMDYRTKHAGCPVLKGSKWIGNVWIHEVTQTFARP 519
>gi|224107311|ref|XP_002314441.1| predicted protein [Populus trichocarpa]
gi|222863481|gb|EEF00612.1| predicted protein [Populus trichocarpa]
Length = 84
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 49/87 (56%), Positives = 56/87 (64%), Gaps = 3/87 (3%)
Query: 137 LHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPEDDDCVDEDLNCVVWAKA 196
LHP A D +SLH CPVIEGEKWSATKWIHV +FDK + +C D++ +C WA
Sbjct: 1 LHPTAVPDISSLHAGCPVIEGEKWSATKWIHVDSFDKNVE--AGGNCTDQNESCERWAAL 58
Query: 197 GECKKNPLYMVGSKSSRGYCRKSCKVC 223
GE KN Y VGS GYCR S KVC
Sbjct: 59 GERTKNTEYTVGSPDLPGYCRSS-KVC 84
>gi|125772813|ref|XP_001357665.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
gi|54637397|gb|EAL26799.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
Length = 534
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 54/170 (31%), Positives = 83/170 (48%), Gaps = 19/170 (11%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G++ S R S +L+ + + + ++ T L E +Q+ +Y G YEPH+
Sbjct: 364 GQNKKSSFRVSKNAWLAYDSHPTMGGMLSDLSDATGLDMTFCEQLQVANYGVGGHYEPHW 423
Query: 68 DFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
DFFRD + G+R+AT + YLS VE+GG T FP +AVKP
Sbjct: 424 DFFRDPDHYPAEEGNRMATAIFYLSDVEQGGATAFPF----------------LNFAVKP 467
Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
G+ L ++++H D + H CPV++G KW WIH + F +P
Sbjct: 468 QLGNVLFWYNVHRSLDVDYRTKHAGCPVLKGSKWIGNVWIHEATQTFARP 517
>gi|195159317|ref|XP_002020528.1| GL14042 [Drosophila persimilis]
gi|194117297|gb|EDW39340.1| GL14042 [Drosophila persimilis]
Length = 534
Score = 90.5 bits (223), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 54/170 (31%), Positives = 83/170 (48%), Gaps = 19/170 (11%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G++ S R S +L+ + + + ++ T L E +Q+ +Y G YEPH+
Sbjct: 364 GQNKKSSFRVSKNAWLAYDSHPTMGGMLSDLSDATGLDMTFCEQLQVANYGVGGHYEPHW 423
Query: 68 DFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
DFFRD + G+R+AT + YLS VE+GG T FP +AVKP
Sbjct: 424 DFFRDPDHYPAEEGNRMATAIFYLSDVEQGGATAFPF----------------LNFAVKP 467
Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
G+ L ++++H D + H CPV++G KW WIH + F +P
Sbjct: 468 QLGNVLFWYNVHRSLDVDYRTKHAGCPVLKGSKWIGNVWIHEATQTFARP 517
>gi|432891690|ref|XP_004075614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oryzias
latipes]
Length = 517
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 90/177 (50%), Gaps = 33/177 (18%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLP--PENGEAMQILHYE 58
+VA E+ ++ E R S +L ++ IV ++ RI+ T L P E +Q+++Y
Sbjct: 348 VVASGENQATV--EYRISKSAWLKGSESCIVGKLDQRISMLTGLNVRPPYAEYLQVVNYG 405
Query: 59 HGQKYEPHFD--------FFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSR 110
G YEPHFD F+ K G+R+AT ++YLS VE GG T F +
Sbjct: 406 IGGHYEPHFDHATSPSSPVFKLKT-----GNRVATFMIYLSSVEAGGSTAFIYA------ 454
Query: 111 DGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
++V +K A+ +++LH + D+ +LH CPV+ G+KW A KW+H
Sbjct: 455 ----------NFSVPVLKKAAIFWWNLHRNGRGDAETLHAGCPVLIGDKWVANKWVH 501
>gi|66770649|gb|AAY54636.1| IP12415p [Drosophila melanogaster]
gi|66772017|gb|AAY55320.1| IP12615p [Drosophila melanogaster]
Length = 512
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 50/136 (36%), Positives = 67/136 (49%), Gaps = 19/136 (13%)
Query: 34 IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM---NQQLGGHRIATVLMYL 90
I RI T E +QI +Y G ++PHFD+ D N G R+A++L Y
Sbjct: 380 INQRIIDMTGFNFLKDEKLQIANYGLGTYFQPHFDYSSDGFETPNITTLGDRLASILFYA 439
Query: 91 SHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHG 150
S V +GG TVFP V+ V P KG L +F+LH D D SLH
Sbjct: 440 SEVPQGGATVFPEINVT----------------VFPQKGSMLYWFNLHDDGKPDIRSLHS 483
Query: 151 SCPVIEGEKWSATKWI 166
CPV+ G++W+ TKW+
Sbjct: 484 VCPVLNGDRWTLTKWV 499
>gi|339236271|ref|XP_003379690.1| prolyl 4-hydroxylase subunit alpha-1 [Trichinella spiralis]
gi|316977627|gb|EFV60702.1| prolyl 4-hydroxylase subunit alpha-1 [Trichinella spiralis]
Length = 558
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 62/207 (29%), Positives = 92/207 (44%), Gaps = 43/207 (20%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V + ++G+ + R S +L + E+V I RI T L E E +QI +Y G
Sbjct: 366 VHNADTGQLETASYRISKSAWLKDTEHEVVKRISDRIDMMTDLTMETAELLQIANYGIGG 425
Query: 62 KYEPHFDF--------FRDKMNQQL-----------------GGHRIATVLMYLSHVEKG 96
Y+PHFD + + ++ G+RIATVL Y+S E G
Sbjct: 426 HYDPHFDMSTRGESDPYEEGTGNRIATVLFYTNDPYSFESLNAGNRIATVLFYISQPEAG 485
Query: 97 GETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIE 156
G TVF + +++ V+P K DA +F++ D ++ H +CPV+
Sbjct: 486 GGTVFTSHKIT----------------VEPSKYDAAFWFNVLQGGEPDMSTRHAACPVLA 529
Query: 157 GEKWSATKWIHVR--NFDKPEKEPEDD 181
G KW A KWIH R F +P E D
Sbjct: 530 GTKWVANKWIHERGQEFRRPCSTKETD 556
>gi|15808767|gb|AAL08490.1|AF369789_1 prolyl-4-hydroxylase alpha subunit-like protein [Onchocerca
volvulus]
Length = 571
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 56/151 (37%), Positives = 80/151 (52%), Gaps = 20/151 (13%)
Query: 23 LSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFD--FFRDKMNQQLG- 79
L + E V I+ R+ T L E E + +L+Y G ++EPHFD D+ ++LG
Sbjct: 389 LRSTEYETVKRIDKRLELATNLEIETAEDLAVLNYGIGGQFEPHFDCALKGDQCFEKLGT 448
Query: 80 GHRIATVLMYLSHVEKGGETVFP-NSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLH 138
G+RIAT L+YL+ E GG TVF N ++S V +K AL +++L
Sbjct: 449 GNRIATFLIYLTEPEIGGRTVFTSNLKIS----------------VPCVKNAALFWYNLM 492
Query: 139 PDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
+ D+ SLH +CPV G KW+A KW H R
Sbjct: 493 RNGEVDTRSLHAACPVATGIKWTANKWFHER 523
>gi|66771935|gb|AAY55279.1| IP12715p [Drosophila melanogaster]
Length = 451
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 50/136 (36%), Positives = 67/136 (49%), Gaps = 19/136 (13%)
Query: 34 IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM---NQQLGGHRIATVLMYL 90
I RI T E +QI +Y G ++PHFD+ D N G R+A++L Y
Sbjct: 319 INQRIIDMTGFNFLKDEKLQIANYGLGTYFQPHFDYSSDGFETPNITTLGDRLASILFYA 378
Query: 91 SHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHG 150
S V +GG TVFP V+ V P KG L +F+LH D D SLH
Sbjct: 379 SEVPQGGATVFPEINVT----------------VFPQKGSMLYWFNLHDDGKPDIRSLHS 422
Query: 151 SCPVIEGEKWSATKWI 166
CPV+ G++W+ TKW+
Sbjct: 423 VCPVLNGDRWTLTKWV 438
>gi|281350467|gb|EFB26051.1| hypothetical protein PANDA_009188 [Ailuropoda melanoleuca]
Length = 511
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 53/149 (35%), Positives = 74/149 (49%), Gaps = 20/149 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFR---- 71
R S +LS ++ +V+ I RI T L E +Q+ +Y G +YEPHFDF R
Sbjct: 379 RISKSAWLSGYENPVVSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEP 438
Query: 72 DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDA 131
D + G+RIAT L Y+S V GG TVFP G +V P KG A
Sbjct: 439 DAFKELGTGNRIATWLFYMSDVSAGGATVFPEV----------------GASVWPKKGTA 482
Query: 132 LLFFSLHPDASTDSTSLHGSCPVIEGEKW 160
+ +++L D ++ H +CPV+ G KW
Sbjct: 483 VFWYNLFASGEGDYSTRHAACPVLVGNKW 511
>gi|221512818|ref|NP_730346.2| CG32201 [Drosophila melanogaster]
gi|220902638|gb|AAN11679.2| CG32201 [Drosophila melanogaster]
Length = 520
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 50/136 (36%), Positives = 67/136 (49%), Gaps = 19/136 (13%)
Query: 34 IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM---NQQLGGHRIATVLMYL 90
I RI T E +QI +Y G ++PHFD+ D N G R+A++L Y
Sbjct: 388 INQRIIDMTGFNFLKDEKLQIANYGLGTYFQPHFDYSSDGFETPNITTLGDRLASILFYA 447
Query: 91 SHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHG 150
S V +GG TVFP V+ V P KG L +F+LH D D SLH
Sbjct: 448 SEVPQGGATVFPEINVT----------------VFPQKGSMLYWFNLHDDGKPDIRSLHS 491
Query: 151 SCPVIEGEKWSATKWI 166
CPV+ G++W+ TKW+
Sbjct: 492 VCPVLNGDRWTLTKWV 507
>gi|15808763|gb|AAL08488.1| prolyl-4-hydroxylase alpha subunit-like protein [Onchocerca
volvulus]
Length = 571
Score = 90.1 bits (222), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 56/151 (37%), Positives = 80/151 (52%), Gaps = 20/151 (13%)
Query: 23 LSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFD--FFRDKMNQQLG- 79
L + E V I+ R+ T L E E + +L+Y G ++EPHFD D+ ++LG
Sbjct: 389 LRSTEYETVKRIDKRLELATNLEIETAEDLAVLNYGIGGQFEPHFDCALKGDQCFEKLGT 448
Query: 80 GHRIATVLMYLSHVEKGGETVFP-NSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLH 138
G+RIAT L+YL+ E GG TVF N ++S V +K AL +++L
Sbjct: 449 GNRIATFLIYLTEPEIGGRTVFTSNLKIS----------------VPCVKNAALFWYNLM 492
Query: 139 PDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
+ D+ SLH +CPV G KW+A KW H R
Sbjct: 493 RNGEVDTRSLHAACPVATGIKWTANKWFHER 523
>gi|219126272|ref|XP_002183385.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405141|gb|EEC45085.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 474
Score = 90.1 bits (222), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 59/164 (35%), Positives = 85/164 (51%), Gaps = 23/164 (14%)
Query: 13 SEVRTSSGMFLSKAQD--EIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF 70
SE RTS+ + D E+ I R+ T +PPEN E++Q+L YE GQ Y H D+
Sbjct: 321 SETRTSTNAWCYNECDDHEVTQIIWERMTFLTQIPPENSESLQMLRYEPGQFYAVHHDYI 380
Query: 71 RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
+ N+ +G RI TV +YL+ VE+GG T FP E+ AV+P +G
Sbjct: 381 ENDWNRAVGS-RILTVFLYLNDVEEGGATNFPELEL----------------AVQPKRGR 423
Query: 131 ALLFFSL---HPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
ALL+ S+ +P D T H + V +G K+ A W H R++
Sbjct: 424 ALLWPSVLDQYPHKKDDRTE-HEAQVVTKGIKYGANAWFHQRDY 466
>gi|194765184|ref|XP_001964707.1| GF22906 [Drosophila ananassae]
gi|190614979|gb|EDV30503.1| GF22906 [Drosophila ananassae]
Length = 708
Score = 90.1 bits (222), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 57/175 (32%), Positives = 87/175 (49%), Gaps = 20/175 (11%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
VA N +GKS +++R S +L+ I+ SI I + E MQ+ +Y G
Sbjct: 537 VAGN-AGKSTVADLRVSQQTWLNYT-SPIMKSISRIIQFVSGFDIAGAEFMQVANYGVGG 594
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
+YEPH D+F + QQ G RI+T + YLS+VE+GG TVF V
Sbjct: 595 QYEPHPDYFEFNLPQQFQGDRISTSMFYLSNVEQGGYTVFTKLNV--------------- 639
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH--VRNFDKP 174
+ P++G +++ +LH D+ +LH CPV+ G K W+H + F +P
Sbjct: 640 -FLPPIQGAMVMWHNLHRSLDVDARTLHAGCPVLVGSKRIGNIWMHSGFQEFRRP 693
>gi|195505244|ref|XP_002099420.1| GE10895 [Drosophila yakuba]
gi|194185521|gb|EDW99132.1| GE10895 [Drosophila yakuba]
Length = 533
Score = 89.7 bits (221), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 53/141 (37%), Positives = 74/141 (52%), Gaps = 18/141 (12%)
Query: 29 EIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK-MNQQLGGHRIATVL 87
E++ I RI T L +G MQ+L Y G + PHFD+F K + + G RIATVL
Sbjct: 382 EVLNRIGRRIGDITGLSTRSGRQMQLLKYGFGGHFTPHFDYFDSKTLYLEKVGDRIATVL 441
Query: 88 MYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDA-STDST 146
YL++VE GG TVFP+ + AV KG AL + +L + D+
Sbjct: 442 FYLNNVEHGGATVFPSINL----------------AVPTQKGSALFWHNLDGQSYDYDTR 485
Query: 147 SLHGSCPVIEGEKWSATKWIH 167
+ HG+CP+I G K T+WI+
Sbjct: 486 TFHGACPLISGTKLVMTRWIY 506
>gi|424863736|ref|ZP_18287648.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
SAR86A]
gi|400757057|gb|EJP71269.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
SAR86A]
Length = 205
Score = 89.7 bits (221), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 51/158 (32%), Positives = 78/158 (49%), Gaps = 24/158 (15%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMN 75
RT+ +L + +++ + R + +P N E Q+++Y G +Y+PHFD F DK
Sbjct: 59 RTNDFCWLEHSASDVIHEVSKRFSVLVKMPINNAEQFQLVYYGPGNEYKPHFDAF-DKTT 117
Query: 76 QQ------LGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKG 129
++ GG R+ T L YL+ VE+GG T FP VS VKP KG
Sbjct: 118 KEGQNNWFPGGQRMVTALAYLNDVEEGGATDFPKINVS----------------VKPNKG 161
Query: 130 DALLFFS-LHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
D ++F + + + +LHG PV+ GEKW+ W
Sbjct: 162 DVVVFHNCIEGTTEINPQALHGGSPVVAGEKWAVNLWF 199
>gi|301613006|ref|XP_002936013.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
(Silurana) tropicalis]
Length = 504
Score = 89.7 bits (221), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 49/129 (37%), Positives = 68/129 (52%), Gaps = 22/129 (17%)
Query: 52 MQILHYEHGQKYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVS 107
+++ +Y G +YEPHFDF R D + G+R+AT L Y+S VE GG TVFP
Sbjct: 385 LEVANYGMGGQYEPHFDFARKDEPDAFKELGTGNRVATWLFYMSDVEAGGATVFPEV--- 441
Query: 108 QSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
G AV P KG A+ +++L D ++ H +CPV+ G KW + KWIH
Sbjct: 442 -------------GAAVYPKKGTAVFWYNLFESGEGDYSTRHAACPVLVGNKWVSNKWIH 488
Query: 168 VRN--FDKP 174
R F +P
Sbjct: 489 ERGQEFRRP 497
>gi|292621357|ref|XP_691737.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Danio rerio]
Length = 538
Score = 89.7 bits (221), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 53/166 (31%), Positives = 83/166 (50%), Gaps = 31/166 (18%)
Query: 12 ASEVRTSSGMFLSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYEHGQKYEPHFD- 68
+E R S +L ++ E+V ++ RI T L P E +Q+++Y G YEPHFD
Sbjct: 378 TAEYRISKSAWLKESAHEVVGKLDQRITLVTGLNVQPPYAEYLQVVNYGIGGHYEPHFDH 437
Query: 69 -------FFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
+R K G+R+AT+++YLS V+ GG T F +
Sbjct: 438 ATSDSSPLYRLKT-----GNRVATIMIYLSPVQAGGSTAFIYA----------------N 476
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
++V ++ AL +++LH + + +LH CPVI G KW A KW+H
Sbjct: 477 FSVPVVQNAALFWWNLHKNGQGNVDTLHAGCPVIVGNKWVANKWVH 522
>gi|195505218|ref|XP_002099409.1| GE10887 [Drosophila yakuba]
gi|194185510|gb|EDW99121.1| GE10887 [Drosophila yakuba]
Length = 521
Score = 89.4 bits (220), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 57/177 (32%), Positives = 86/177 (48%), Gaps = 24/177 (13%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
+S+ S VRTS F+ + +++++I+ R+A T L + E Q +Y G Y H
Sbjct: 336 NESVVSNVRTSQFTFIPVSAHKVLSTIDQRVADMTNLNMKYAEDHQFANYGIGGHYGQHM 395
Query: 68 D-FFRDKMNQQL-----GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
D F++ ++ L G+RIATVL YLS V +GG T FP
Sbjct: 396 DWFYQTTIDAGLISSPEMGNRIATVLFYLSDVSQGGGTAFPQLRT--------------- 440
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEP 178
+KP K A + +LH D + HG+CP+I G KW +WI R D+ ++ P
Sbjct: 441 -LLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKWVQNRWI--REVDQSDRRP 494
>gi|195055773|ref|XP_001994787.1| GH17427 [Drosophila grimshawi]
gi|193892550|gb|EDV91416.1| GH17427 [Drosophila grimshawi]
Length = 538
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 48/128 (37%), Positives = 67/128 (52%), Gaps = 19/128 (14%)
Query: 50 EAMQILHYEHGQKYEPHFDFFRDKMNQQLG-GHRIATVLMYLSHVEKGGETVFPNSEVSQ 108
E +Q+ +Y G YEPH+DFF D + G+RIAT + YLS VE+GG T FP
Sbjct: 410 EQLQVANYGVGGHYEPHWDFFVDSQHYPAEEGNRIATAIFYLSDVEQGGATAFPF----- 464
Query: 109 SRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH- 167
+AV+P G+ L +++LH D + H CPV++G KW A WIH
Sbjct: 465 -----------LNFAVRPQLGNILFWYNLHRSLDMDYRTKHAGCPVLKGSKWIANIWIHE 513
Query: 168 -VRNFDKP 174
+ F +P
Sbjct: 514 ATQTFARP 521
>gi|194764881|ref|XP_001964556.1| GF23245 [Drosophila ananassae]
gi|190614828|gb|EDV30352.1| GF23245 [Drosophila ananassae]
Length = 460
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 54/161 (33%), Positives = 80/161 (49%), Gaps = 17/161 (10%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G+S S +RTS M E++ +IE RI T L + E +++Y G Y+ H+
Sbjct: 298 GESQISTLRTSQDMPFGANSGEVMRNIEKRIKDMTGLSMDLSEDFMLINYGIGGTYKMHY 357
Query: 68 DFF-RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKP 126
DF+ + + L G RI TVL YL VE G TVFP +S + P
Sbjct: 358 DFYVYSEPLRFLRGERIVTVLFYLGDVELSGSTVFPFLNIS----------------ITP 401
Query: 127 MKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
KG A+++++LH + H +CPV+ G K+ TKWI+
Sbjct: 402 KKGSAVMWYNLHNSGDVHQKTQHCACPVVVGSKYVLTKWIN 442
>gi|268562483|ref|XP_002638619.1| Hypothetical protein CBG05671 [Caenorhabditis briggsae]
Length = 520
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 54/162 (33%), Positives = 82/162 (50%), Gaps = 26/162 (16%)
Query: 13 SEVRTSSGMFLSKAQD----EIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFD 68
S+VR ++G +L + +I +++ I A L E QIL Y Y PH+D
Sbjct: 99 SQVRAANGTWLIHTKRPNFAKIFWNLQVNIRA---LDLSTAEPWQILSYNSEGYYAPHYD 155
Query: 69 FFRDKMNQQL---GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVK 125
F + N+ L G+RIATVL+ L +KGG TVFP ++ ++
Sbjct: 156 FLNPETNKVLVESRGNRIATVLVILQIAKKGGTTVFPKININ----------------IR 199
Query: 126 PMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIH 167
P GD +++ + PD +DS +LH +CP+ EG K AT W+H
Sbjct: 200 PKIGDVVVWLNTVPDGESDSQTLHAACPIKEGTKIGATLWVH 241
Score = 60.1 bits (144), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 45/179 (25%), Positives = 73/179 (40%), Gaps = 35/179 (19%)
Query: 5 NESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAW----TFLPP---ENGEAMQILHY 57
N+ G S+ R ++G + I A ++ W +P E+ E + L Y
Sbjct: 341 NDDGTEYYSKYRKANGTQI------IAPDFPAALSIWKTVKILIPTLNIESSEDIVALSY 394
Query: 58 EHGQKYEPHFDFFRDKMNQQLGG------HRIATVLMYLSHVEKGGETVFPNSEVSQSRD 111
G Y H DF ++ G +R T++M E GG T+FP+
Sbjct: 395 IRGGHYAAHHDFLEYPSEKEWDGWMKDYGNRFGTLIMAFETAELGGATIFPSLNA----- 449
Query: 112 GNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRN 170
A++P GDA +F+ + + S HG CP+ EG+K +T W ++N
Sbjct: 450 -----------AIRPNTGDAFFWFNAMGNTKQEDLSDHGGCPIYEGKKSISTIWFRMKN 497
>gi|198466401|ref|XP_002135182.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
gi|198150583|gb|EDY73809.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
Length = 530
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 48/140 (34%), Positives = 75/140 (53%), Gaps = 21/140 (15%)
Query: 32 ASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLG---GHRIATVLM 88
A I RI T E + + +Y G + PH+D+ + N +G G + T+L
Sbjct: 391 ARIYQRITDITGFQLFVQEELNVANYGLGTIFGPHYDYTPE--NYDIGWFMGGPLGTILF 448
Query: 89 YLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSL 148
Y+S +++GG T+FP+ ++ V P KG ALL+F+L+ D D +L
Sbjct: 449 YVSDLQQGGATIFPSINIT----------------VSPRKGSALLWFNLYDDGEPDPRTL 492
Query: 149 HGSCPVIEGEKWSATKWIHV 168
H SCPVIEG++W+ TKW+H+
Sbjct: 493 HSSCPVIEGDRWTLTKWVHL 512
>gi|402584932|gb|EJW78873.1| hypothetical protein WUBG_10221 [Wuchereria bancrofti]
Length = 187
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 52/150 (34%), Positives = 76/150 (50%), Gaps = 18/150 (12%)
Query: 22 FLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDK--MNQQLG 79
+L + E+V I R+ T L E E +Q+ +Y G YEPH+D R + +
Sbjct: 9 WLGSTEHEVVNRINKRLDLATNLETETAEELQVQNYGIGGHYEPHYDCSRRESVFEKTKN 68
Query: 80 GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHP 139
G+RIAT+L+Y++ E GG TVF + + S S C K AL +++L
Sbjct: 69 GNRIATILIYMTKPEIGGGTVFIDLKTSIS-------CT---------KNAALFWYNLMR 112
Query: 140 DASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
+ D S H +CPV+ G KW+A KW H R
Sbjct: 113 SGAVDIRSYHAACPVLTGTKWTANKWFHER 142
>gi|357459545|ref|XP_003600053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
gi|355489101|gb|AES70304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
Length = 156
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 46/106 (43%), Positives = 69/106 (65%), Gaps = 8/106 (7%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
++D +GK I + + G F+ +D+I+ +IE RI +P ENGE +Q++HY GQ
Sbjct: 47 ISDKRTGKGIENRFAYACGGFV---KDKIIKNIEQRIPDIISIPVENGEGLQVIHYGVGQ 103
Query: 62 KYEPHFDFFRDKMNQQL--GGHRIATVLMYLSHVEKGGETVFPNSE 105
K+ PH+D + N+ GG R+AT LMYLS VE+GGETVFP+++
Sbjct: 104 KFVPHYD---SRSNESFWNGGPRVATFLMYLSDVEEGGETVFPSAK 146
>gi|260787668|ref|XP_002588874.1| hypothetical protein BRAFLDRAFT_235878 [Branchiostoma floridae]
gi|229274045|gb|EEN44885.1| hypothetical protein BRAFLDRAFT_235878 [Branchiostoma floridae]
Length = 151
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 56/149 (37%), Positives = 70/149 (46%), Gaps = 19/149 (12%)
Query: 22 FLSKAQDEIVASIEARIAAWTFLPPE--NGEAMQILHYEHGQKYEPHFDFFRDKMNQQL- 78
+L + ++A + R+ T L GEA Q+L+Y G YEPH D+FRD+ L
Sbjct: 3 WLFDTEHTVIAKLSRRVEYITGLDVNWPYGEAFQVLNYGLGGFYEPHVDYFRDEQPALLT 62
Query: 79 GGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLH 138
G RI T L YLS VE GG TVF R V +K A+LF L
Sbjct: 63 NGQRIVTFLFYLSDVEAGGATVF----------------TRLNLTVPAVKNSAVLFHDLK 106
Query: 139 PDASTDSTSLHGSCPVIEGEKWSATKWIH 167
+ S H CPV+ G KW A KWIH
Sbjct: 107 RSLEFEKDSEHAGCPVLMGSKWIANKWIH 135
>gi|196011908|ref|XP_002115817.1| hypothetical protein TRIADDRAFT_30052 [Trichoplax adhaerens]
gi|190581593|gb|EDV21669.1| hypothetical protein TRIADDRAFT_30052, partial [Trichoplax
adhaerens]
Length = 495
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 71/145 (48%), Gaps = 18/145 (12%)
Query: 22 FLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGGH 81
+L A D +V I T L E +Q+ +Y G Y PH+D L
Sbjct: 352 WLEDAYDPVVEKISRLTQELTHLDVNYAEDLQVANYGIGGHYVPHYDSTIIAPEDPL--Q 409
Query: 82 RIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDA 141
R+AT++ YLS+VE GG T+FP R G AV+P KG AL + +L +
Sbjct: 410 RLATMMFYLSNVEIGGATIFP----------------RLGVAVRPQKGSALFWINLKRNG 453
Query: 142 STDSTSLHGSCPVIEGEKWSATKWI 166
T+ +LH +CPV+ G KW A KWI
Sbjct: 454 LTNRQTLHAACPVVIGSKWIANKWI 478
>gi|195113245|ref|XP_002001178.1| GI22115 [Drosophila mojavensis]
gi|193917772|gb|EDW16639.1| GI22115 [Drosophila mojavensis]
Length = 498
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 61/181 (33%), Positives = 92/181 (50%), Gaps = 30/181 (16%)
Query: 7 SGKSIASEVRTSSGMF-----LSKAQDEIVASIEARIAAWTFL--PPENGEAMQILHYEH 59
+ +S+ S+VRT+ G F LS ++V ++ R+ + L E MQ L+Y+
Sbjct: 325 NNESVVSKVRTAKGAFMHADRLSPESAQVVQRLKQRMGDLSDLNIKREGYNEMQYLNYDF 384
Query: 60 GQKYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
G Y H D+F MN RIAT L+YL+ V +GG T+FP +V Q
Sbjct: 385 GDHYLLHMDYFNISMND-----RIATFLIYLNDVTRGGGTIFP--QVKQ----------- 426
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI--HVRNFDKPEKE 177
AV P KG +L+++++ + + SLHG+CPV+ G K + WI H + F KP
Sbjct: 427 ---AVHPEKGKLILWYNMNSNLDYELASLHGACPVLIGRKIAIVYWIREHDQMFVKPCLN 483
Query: 178 P 178
P
Sbjct: 484 P 484
>gi|167524906|ref|XP_001746788.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774568|gb|EDQ88195.1| predicted protein [Monosiga brevicollis MX1]
Length = 321
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 53/151 (35%), Positives = 75/151 (49%), Gaps = 17/151 (11%)
Query: 21 MFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKMNQQLGG 80
M ++ IV +E RI LP N E Q+L Y + Q Y H D ++ + G
Sbjct: 178 MAVNATAATIVRQLEERIGKLVGLPVVNQEHFQVLRYNNNQYYRVHNDLIDEQYDMPCGP 237
Query: 81 HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPD 140
R+ T+ +YL+ V GGET F R G AVKP KG A+L++S+ D
Sbjct: 238 -RVLTLFIYLNDVPAGGETSF----------------TRLGLAVKPKKGKAVLWYSVTND 280
Query: 141 ASTDSTSLHGSCPVIEGEKWSATKWIHVRNF 171
+ + H + PV +G K++A KWIHV NF
Sbjct: 281 LEPEERTDHEARPVKQGTKYAANKWIHVGNF 311
>gi|194905376|ref|XP_001981185.1| GG11927 [Drosophila erecta]
gi|190655823|gb|EDV53055.1| GG11927 [Drosophila erecta]
Length = 539
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 52/164 (31%), Positives = 80/164 (48%), Gaps = 18/164 (10%)
Query: 5 NESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYE 64
N + + S+ RTS ++L + +E + R+A T L ++ E Q+++Y G +E
Sbjct: 363 NAANDFVVSKFRTSKSVWLDRDANEATVKLTQRLADATGLDVKHSEHFQVINYGIGGVFE 422
Query: 65 PHFDFFRDKMNQQLGGH--RIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGY 122
HFD + N+ +GG RIAT L YL+ V +GG T FP ++
Sbjct: 423 SHFDTTLEDTNRFVGGFIDRIATTLFYLNDVPQGGATHFPGLNIT--------------- 467
Query: 123 AVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
V P G AL +++L ++H CPVI G KW +KWI
Sbjct: 468 -VFPRLGAALFWYNLDTQGMLQVRTMHTGCPVIVGSKWVVSKWI 510
>gi|194905397|ref|XP_001981189.1| GG11929 [Drosophila erecta]
gi|190655827|gb|EDV53059.1| GG11929 [Drosophila erecta]
Length = 538
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 54/166 (32%), Positives = 83/166 (50%), Gaps = 21/166 (12%)
Query: 6 ESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEP 65
+S S ++ RTS +L + ++ I+ R+ T L E+ E +Q+L+Y G +YEP
Sbjct: 367 QSENSKIADRRTSQNTWLWYDVNPWLSRIKQRLEDVTGLSTESAEPLQLLNYGIGGQYEP 426
Query: 66 HFDFFRDKMNQQLGG---HRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGY 122
HFDF D +++ G R+ T + Y++ V GG T FP +
Sbjct: 427 HFDFVEDA--EKIFGWQDDRLMTAIFYINDVALGGATAFPFLRL---------------- 468
Query: 123 AVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHV 168
AV P KG L++ +LH D S H CP+++G KW T+W HV
Sbjct: 469 AVPPEKGSLLMWNNLHSSLHKDYRSKHAGCPILQGSKWICTEWFHV 514
>gi|372272594|ref|ZP_09508642.1| Procollagen-proline dioxygenase [Marinobacterium stanieri S30]
Length = 217
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 55/160 (34%), Positives = 75/160 (46%), Gaps = 22/160 (13%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF----- 70
R+ +L A + + RIA +P EN E++Q+LHY Q+Y H+D +
Sbjct: 50 RSGQNCWLRYADYPLAKQVGDRIAKLAGIPLENAESLQVLHYGPEQEYRAHYDAYDLSTA 109
Query: 71 RDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGD 130
R + + GG R+ T L+YL+ VE GG T FP R G V P G
Sbjct: 110 RGQRCCRYGGQRLVTALVYLNAVEAGGGTAFP----------------RLGLEVSPALGR 153
Query: 131 ALLFFSLHPDAST-DSTSLHGSCPVIEGEKWSATKWIHVR 169
+LF + D S SLH PV +GEKW+ W HVR
Sbjct: 154 MVLFQNTDEDVSKPHRDSLHAGMPVTQGEKWAFNIWFHVR 193
>gi|198284815|ref|YP_002221136.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 53993]
gi|218668131|ref|YP_002427500.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
gi|198249336|gb|ACH84929.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 53993]
gi|218520344|gb|ACK80930.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
ferrooxidans ATCC 23270]
Length = 213
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 59/181 (32%), Positives = 84/181 (46%), Gaps = 17/181 (9%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+V D S + + R S+ + S I+ I RI ++ + EN E +QILHY G
Sbjct: 42 VVVDGASDAAYETPGRCSTVVAPSVDAYPIILEIRRRIELFSGISQENQEPLQILHYTRG 101
Query: 61 QKYEPHFDFFRDKMNQ-QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
KY+ H+D F D Q + GG+R+ TVL+YL+ VE GG T FP+ +
Sbjct: 102 GKYDIHYDAFSDGSPQLRNGGNRLLTVLLYLNDVEYGGWTQFPHIMAN------------ 149
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPE 179
+ P G +LF + SLH PV GEKW A+ WI + P +
Sbjct: 150 ----IVPNAGSGILFRNTDAQNRQLRESLHAGLPVTHGEKWIASIWIRENPYITPSVDRV 205
Query: 180 D 180
D
Sbjct: 206 D 206
>gi|323454062|gb|EGB09933.1| hypothetical protein AURANDRAFT_14928, partial [Aureococcus
anophagefferens]
Length = 182
Score = 89.0 bits (219), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 58/167 (34%), Positives = 84/167 (50%), Gaps = 25/167 (14%)
Query: 8 GKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHF 67
G S RTSS +L++ E + S+ ++ A T P E+ E Q+ Y G+ Y+PH+
Sbjct: 35 GNGEVSVSRTSSTCYLAR---EDLPSVCTKVCALTGKPLEHLELPQVGRYRGGEFYKPHY 91
Query: 68 DFFRD-----KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGY 122
D F + Q GG R+ATVL+YL+ VE+GGET F ++ G
Sbjct: 92 DAFDTSSADGRRFAQNGGQRVATVLVYLNDVERGGETSF----------------SKLGV 135
Query: 123 AVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
+KP KG+AL+FF D D LH + P ++ KW + WI R
Sbjct: 136 RIKPRKGNALIFFPATLDGVLDQNYLHAAEPAVD-PKWVSQIWIRQR 181
>gi|415977972|ref|ZP_11559036.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
gi|339834153|gb|EGQ61937.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
Length = 215
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 59/181 (32%), Positives = 84/181 (46%), Gaps = 17/181 (9%)
Query: 1 MVADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHG 60
+V D S + + R S+ + S I+ I RI ++ + EN E +QILHY G
Sbjct: 44 VVVDGASDAAYETPGRCSTVVAPSVDAYPIILEIRRRIELFSGISQENQEPLQILHYTRG 103
Query: 61 QKYEPHFDFFRDKMNQ-QLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECAR 119
KY+ H+D F D Q + GG+R+ TVL+YL+ VE GG T FP+ +
Sbjct: 104 GKYDIHYDAFSDGSPQLRNGGNRLLTVLLYLNDVEYGGWTQFPHIMAN------------ 151
Query: 120 RGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDKPEKEPE 179
+ P G +LF + SLH PV GEKW A+ WI + P +
Sbjct: 152 ----IVPNAGSGILFRNTDAQNRQLRESLHAGLPVTHGEKWIASIWIRENPYITPSVDRV 207
Query: 180 D 180
D
Sbjct: 208 D 208
>gi|55925444|ref|NP_001007286.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Danio rerio]
gi|49900294|gb|AAH76508.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide 2 [Danio rerio]
gi|182891794|gb|AAI65288.1| P4ha2 protein [Danio rerio]
Length = 514
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 55/175 (31%), Positives = 79/175 (45%), Gaps = 36/175 (20%)
Query: 2 VADNESGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQ 61
V D ++G + R S +L D ++A + RI T L + E +Q+ +Y G
Sbjct: 367 VRDPKTGVLTVAHYRVSKSAWLEGEDDPVIARVNQRIEDITGLTVDTAELLQVANYGVGG 426
Query: 62 KYEPHFDFFRDKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRG 121
+YEPHFDF R +S VE GG TVFP+ G
Sbjct: 427 QYEPHFDFSR------------------MSDVEAGGATVFPDF----------------G 452
Query: 122 YAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR--NFDKP 174
+V P KG A+ +++L D + H +CPV+ G KW + KWIH R F +P
Sbjct: 453 ASVWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKWVSNKWIHERGQEFRRP 507
>gi|242003035|ref|XP_002436120.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215499456|gb|EEC08950.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 173
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 59/165 (35%), Positives = 80/165 (48%), Gaps = 32/165 (19%)
Query: 19 SGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDF-FRDKMN-- 75
S +LS +V + RIAA T L + E +Q+++Y G Y PHFDF +DK
Sbjct: 2 SAAWLSDHHHPVVKKLSRRIAAATGLSTSSAEHLQVVNYGVGGHYSPHFDFSTKDKPLRG 61
Query: 76 -QQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLF 134
+ G R AT L+YLS VE+GG T+F V V+P G AL +
Sbjct: 62 WETFAGQRQATWLVYLSSVERGGATLFKRLRVR----------------VQPEAGMALFW 105
Query: 135 FSLHPDAST------------DSTSLHGSCPVIEGEKWSATKWIH 167
+L P ++ D + HG+CPV+ G KW ATKWIH
Sbjct: 106 HNLPPGSTNSLPSCCVHRSVGDERTEHGACPVLVGSKWIATKWIH 150
>gi|403274090|ref|XP_003928822.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saimiri
boliviensis boliviensis]
Length = 149
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/130 (38%), Positives = 69/130 (53%), Gaps = 22/130 (16%)
Query: 51 AMQILHYEHGQKYEPHFDFFR----DKMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEV 106
+Q+ +Y G +YEPHFDF R D + G+RIAT L Y+S V GG TVFP EV
Sbjct: 29 GLQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFP--EV 86
Query: 107 SQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWI 166
G +V P KG A+ +++L D ++ H +CPV+ G KW + KW+
Sbjct: 87 --------------GASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKWVSNKWL 132
Query: 167 HVRN--FDKP 174
H R F +P
Sbjct: 133 HERGQEFRRP 142
>gi|195591302|ref|XP_002085381.1| GD14757 [Drosophila simulans]
gi|194197390|gb|EDX10966.1| GD14757 [Drosophila simulans]
Length = 525
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/136 (36%), Positives = 66/136 (48%), Gaps = 19/136 (13%)
Query: 34 IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM---NQQLGGHRIATVLMYL 90
I RI T E +QI +Y G ++PHFD+ D N G R+A++L Y
Sbjct: 388 INQRIIDMTEFNFSKDEKLQITNYGVGTYFQPHFDYSSDGFETPNITTLGDRLASILFYA 447
Query: 91 SHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHG 150
S V +GG TVFP V+ V P KG L +F+LH D D S H
Sbjct: 448 SEVPQGGATVFPEINVT----------------VFPQKGSMLYWFNLHDDGRPDIRSKHS 491
Query: 151 SCPVIEGEKWSATKWI 166
CPVI G++W+ TKW+
Sbjct: 492 VCPVINGDRWTLTKWV 507
>gi|195159319|ref|XP_002020529.1| GL14044 [Drosophila persimilis]
gi|194117298|gb|EDW39341.1| GL14044 [Drosophila persimilis]
Length = 536
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 53/161 (32%), Positives = 76/161 (47%), Gaps = 19/161 (11%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF---RD 72
RTS G + +Q + +A + L + E +QI +Y G YEPH+D F +
Sbjct: 372 RTSQGASFNYSQYATTQRLSQHVADLSGLDMDYAENLQIANYGIGGHYEPHWDSFPEHHE 431
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
L G+R+AT + YLS V GG T FP + V P +G L
Sbjct: 432 YPEDDLYGNRLATAIYYLSDVVAGGGTAFPFLPL----------------LVTPERGSLL 475
Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
+++LHP D + H +CPV++G KW A WI RN D+
Sbjct: 476 FWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQDR 516
>gi|260825355|ref|XP_002607632.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
gi|229292980|gb|EEN63642.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
Length = 519
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 54/166 (32%), Positives = 76/166 (45%), Gaps = 19/166 (11%)
Query: 7 SGKSIASEVRTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPH 66
+G ++ S R S +L + ++ ++ R+ T L E E +Q+++Y G YEPH
Sbjct: 356 TGGAVLSSYRISKNAWLYYWEHRLINRVKQRVEDATGLTMETAEPLQVINYGIGGHYEPH 415
Query: 67 FDFFRDKMNQQLG---GHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYA 123
FD L G RIAT+L Y+S VE GG TVFP G
Sbjct: 416 FDCATKDEEFALDPNEGDRIATMLFYMSDVEAGGATVFPQV----------------GAR 459
Query: 124 VKPMKGDALLFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVR 169
V P KG +++L D + H CPV+ G KW + WIH R
Sbjct: 460 VVPEKGAGAFWYNLLKSGEGDMLTEHAGCPVLVGSKWVSNMWIHER 505
>gi|198449648|ref|XP_001357666.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
gi|198130700|gb|EAL26801.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
Length = 536
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 53/161 (32%), Positives = 76/161 (47%), Gaps = 19/161 (11%)
Query: 16 RTSSGMFLSKAQDEIVASIEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFF---RD 72
RTS G + +Q + +A + L + E +QI +Y G YEPH+D F +
Sbjct: 372 RTSQGASFNYSQYATTQRLSQHVADLSGLDMDYAENLQIANYGIGGHYEPHWDSFPEHHE 431
Query: 73 KMNQQLGGHRIATVLMYLSHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDAL 132
L G+R+AT + YLS V GG T FP + V P +G L
Sbjct: 432 YPEDDLYGNRLATAIYYLSDVVAGGGTAFPFLPL----------------LVTPERGSLL 475
Query: 133 LFFSLHPDASTDSTSLHGSCPVIEGEKWSATKWIHVRNFDK 173
+++LHP D + H +CPV++G KW A WI RN D+
Sbjct: 476 FWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWIRERNQDR 516
>gi|195352182|ref|XP_002042593.1| GM14980 [Drosophila sechellia]
gi|194124477|gb|EDW46520.1| GM14980 [Drosophila sechellia]
Length = 520
Score = 88.2 bits (217), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 50/136 (36%), Positives = 66/136 (48%), Gaps = 19/136 (13%)
Query: 34 IEARIAAWTFLPPENGEAMQILHYEHGQKYEPHFDFFRDKM---NQQLGGHRIATVLMYL 90
I RI T E +QI +Y G ++PHFD+ D N G R+A++L Y
Sbjct: 388 INQRIIDMTEFNFSKDEKLQIANYGVGTYFQPHFDYSSDGFETPNITTLGDRLASILFYA 447
Query: 91 SHVEKGGETVFPNSEVSQSRDGNWSECARRGYAVKPMKGDALLFFSLHPDASTDSTSLHG 150
S V +GG TVFP V+ V P KG L +F+LH D D S H
Sbjct: 448 SEVPQGGATVFPEINVT----------------VFPQKGSMLYWFNLHDDGRPDIRSKHS 491
Query: 151 SCPVIEGEKWSATKWI 166
CPVI G++W+ TKW+
Sbjct: 492 VCPVINGDRWTLTKWL 507
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.130 0.404
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,918,866,119
Number of Sequences: 23463169
Number of extensions: 163928495
Number of successful extensions: 335651
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1467
Number of HSP's successfully gapped in prelim test: 588
Number of HSP's that attempted gapping in prelim test: 330647
Number of HSP's gapped (non-prelim): 2346
length of query: 230
length of database: 8,064,228,071
effective HSP length: 138
effective length of query: 92
effective length of database: 9,121,278,045
effective search space: 839157580140
effective search space used: 839157580140
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 74 (33.1 bits)