BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 023052
(288 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|225468574|ref|XP_002263060.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296084059|emb|CBI24447.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 491 bits (1265), Expect = e-136, Method: Compositional matrix adjust.
Identities = 232/288 (80%), Positives = 268/288 (93%)
Query: 1 MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
MAKPRY RFP RKSSSST++L++L+MF+F +L+LLA G+LS+PS SGDS +ANDLS+IV
Sbjct: 1 MAKPRYPRFPPRKSSSSTVVLSMLLMFSFVVLVLLALGLLSIPSHSGDSPRANDLSTIVH 60
Query: 61 KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
+++ERS+G++G+ E W EVISWEPRAFVYHNFLSK+ECEYLI LA PHM+KSTVVDS TG
Sbjct: 61 RTVERSDGNDGKGEPWSEVISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTVVDSSTG 120
Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFD 180
KSKDSRVRTSSGTFL RG+DKIIR IEKR++DFTF P+E+GEGLQ+LHYE GQKYEPH+D
Sbjct: 121 KSKDSRVRTSSGTFLTRGQDKIIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQKYEPHYD 180
Query: 181 YFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIK 240
YF+D++NTKNGGQRMATVLMYLSDVEEGGETVFP A+GN S+VPWWNELS+CGK GLS+K
Sbjct: 181 YFLDDYNTKNGGQRMATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKEGLSVK 240
Query: 241 PKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
PKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK+
Sbjct: 241 PKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKI 288
>gi|147800995|emb|CAN64470.1| hypothetical protein VITISV_014644 [Vitis vinifera]
Length = 288
Score = 491 bits (1263), Expect = e-136, Method: Compositional matrix adjust.
Identities = 232/288 (80%), Positives = 267/288 (92%)
Query: 1 MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
MAKPRY RFP RKSSSST++L++L+MF+F +L+LLA G+LS+PS SGDS +ANDLS+IV
Sbjct: 1 MAKPRYPRFPPRKSSSSTVVLSMLLMFSFVVLVLLALGLLSIPSHSGDSPRANDLSTIVH 60
Query: 61 KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
+++ERS+G++G+ E W EVISWEPRAFVYHNFLSK+ECEYLI LA PHM+KSTVVDS TG
Sbjct: 61 RTVERSDGNDGKGEPWSEVISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTVVDSSTG 120
Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFD 180
KSKDSRVRTSSGTFL RG+DKIIR IEKR++DFTF P+E+GEGLQ+LHYE GQKYEPH+D
Sbjct: 121 KSKDSRVRTSSGTFLTRGQDKIIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQKYEPHYD 180
Query: 181 YFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIK 240
YF+D++NTKNGGQRMATVLMYLSDVEEGGETVFP A+GN S+VPWWNELS CGK GLS+K
Sbjct: 181 YFLDDYNTKNGGQRMATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSXCGKEGLSVK 240
Query: 241 PKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
PKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK+
Sbjct: 241 PKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKI 288
>gi|357483925|ref|XP_003612249.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355513584|gb|AES95207.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 289
Score = 488 bits (1256), Expect = e-135, Method: Compositional matrix adjust.
Identities = 229/288 (79%), Positives = 260/288 (90%), Gaps = 1/288 (0%)
Query: 1 MAKPRYSRFPTRKSSSS-TLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIV 59
MAK RYSR P+RKSSS TLI +L I FTF ILILL FGILS+PSS+ + K NDL+SIV
Sbjct: 1 MAKSRYSRLPSRKSSSPYTLIFSLFIAFTFLILILLVFGILSIPSSNQNLPKPNDLTSIV 60
Query: 60 RKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDT 119
+++R++ +EG+ EQWVEV+SWEPRAFVYHNFL+KEECEYLI++A P M KSTVVDS+T
Sbjct: 61 HNTVDRNDDEEGKGEQWVEVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVVDSET 120
Query: 120 GKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHF 179
GKSKDSRVRTSSGTFLARGRDKI+R+IEK+IADFTF P+E+GEGLQVLHYE GQKYEPH+
Sbjct: 121 GKSKDSRVRTSSGTFLARGRDKIVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQKYEPHY 180
Query: 180 DYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSI 239
DYF+DEFNTKNGGQR+ATVLMYL+DVEEGGETVFP A+GN S VPW+NELS+CGK GLSI
Sbjct: 181 DYFLDEFNTKNGGQRIATVLMYLTDVEEGGETVFPAAKGNFSNVPWYNELSDCGKKGLSI 240
Query: 240 KPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
KPK GDALLFWSMKPDA+LD SSLHGGCPVIKGNKWSSTKWIRVNEYK
Sbjct: 241 KPKRGDALLFWSMKPDATLDASSLHGGCPVIKGNKWSSTKWIRVNEYK 288
>gi|240256489|ref|NP_201407.4| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
gi|332010770|gb|AED98153.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
Length = 289
Score = 486 bits (1251), Expect = e-135, Method: Compositional matrix adjust.
Identities = 227/288 (78%), Positives = 256/288 (88%)
Query: 1 MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
MA+PR R RKSS STL+ +LIM TF ILILLAFGILS+PS++ S KANDL+SIVR
Sbjct: 2 MARPRNHRPSARKSSHSTLVFAVLIMSTFVILILLAFGILSVPSNNAGSSKANDLTSIVR 61
Query: 61 KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
K+++RS D+ + E+WVE+ISWEPRA VYHNFL+KEEC+YLI LA PHM KSTVVD TG
Sbjct: 62 KTLQRSGEDDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTG 121
Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFD 180
KS DSRVRTSSGTFLARGRDK IR+IEKRI+DFTF P+E+GEGLQVLHYE GQKYEPH+D
Sbjct: 122 KSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYD 181
Query: 181 YFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIK 240
YFMDE+NT+NGGQR+ATVLMYLSDVEEGGETVFP A+GN SAVPWWNELSECGK GLS+K
Sbjct: 182 YFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVK 241
Query: 241 PKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
PKMGDALLFWSM PDA+LDPSSLHGGC VIKGNKWSSTKW+RV+EYKV
Sbjct: 242 PKMGDALLFWSMTPDATLDPSSLHGGCAVIKGNKWSSTKWLRVHEYKV 289
>gi|356540840|ref|XP_003538892.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Glycine max]
Length = 290
Score = 482 bits (1241), Expect = e-134, Method: Compositional matrix adjust.
Identities = 231/289 (79%), Positives = 263/289 (91%), Gaps = 2/289 (0%)
Query: 1 MAKPRYSRFPTRKSSSS-TLILTLLIMFTFAILILLAFGILSMPSSS-GDSRKANDLSSI 58
MAKPRYSR P RKSSSS TLILTL ++FTF +LILLA GILS+PSSS G+ K NDL+SI
Sbjct: 1 MAKPRYSRLPPRKSSSSSTLILTLFLVFTFLVLILLALGILSIPSSSRGNLPKPNDLASI 60
Query: 59 VRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSD 118
R ++ S+ D+ R EQWVEV+SWEPRAFVYHNFL+KEECEYLI++A P+M KS+VVDS+
Sbjct: 61 ARNTIHTSDDDDVRGEQWVEVVSWEPRAFVYHNFLTKEECEYLIDIAKPNMHKSSVVDSE 120
Query: 119 TGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPH 178
TGKSKDSRVRTSSGTFLARGRDKI+RDIEKRIA ++F P+E+GEGLQVLHYE GQKYEPH
Sbjct: 121 TGKSKDSRVRTSSGTFLARGRDKIVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQKYEPH 180
Query: 179 FDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLS 238
+DYF+D+FNTKNGGQR+ATVLMYL+DVEEGGETVFP A+GN S+VPWWNELSECGK GLS
Sbjct: 181 YDYFLDDFNTKNGGQRIATVLMYLTDVEEGGETVFPAAKGNFSSVPWWNELSECGKKGLS 240
Query: 239 IKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
IKPK GDALLFWSMKPDA+LDPSSLHGGCPVIKGNKWSSTKW+RV+EYK
Sbjct: 241 IKPKRGDALLFWSMKPDATLDPSSLHGGCPVIKGNKWSSTKWMRVSEYK 289
>gi|363806698|ref|NP_001242522.1| uncharacterized protein LOC100806046 [Glycine max]
gi|255647110|gb|ACU24023.1| unknown [Glycine max]
Length = 289
Score = 479 bits (1233), Expect = e-133, Method: Compositional matrix adjust.
Identities = 233/288 (80%), Positives = 261/288 (90%), Gaps = 3/288 (1%)
Query: 1 MAKPRYSRFPTRKSSSS-TLILTLLIMFTFAILILLAFGILSMPSSS-GDSRKANDLSSI 58
MAKPRYSR P RKSSSS TLILTL ++FTF +LILLA GILS+PSSS G+ K NDL+SI
Sbjct: 1 MAKPRYSRLPPRKSSSSSTLILTLFLVFTFLVLILLALGILSIPSSSRGNLPKPNDLASI 60
Query: 59 VRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSD 118
R ++E S+ DE R EQWVEV+SWEPRAFVYHNFL+KEECEYLI++A P M KSTVVDS+
Sbjct: 61 ARNTIETSDSDE-RGEQWVEVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVVDSE 119
Query: 119 TGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPH 178
TGKSKDSRVRTSSGTFLARGRDKI+R+IEK+I+DFTF P+E+GEGLQVLHYE GQKYEPH
Sbjct: 120 TGKSKDSRVRTSSGTFLARGRDKIVRNIEKKISDFTFIPVEHGEGLQVLHYEVGQKYEPH 179
Query: 179 FDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLS 238
+DYF+D+FNTKNGGQR+ATVLMYL+DVEEGGETVFP A+GN S VPWWNEL ECGK GLS
Sbjct: 180 YDYFLDDFNTKNGGQRIATVLMYLTDVEEGGETVFPAAKGNFSFVPWWNELFECGKKGLS 239
Query: 239 IKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
IKPK GDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKW+RV+EY
Sbjct: 240 IKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWMRVSEY 287
>gi|255579590|ref|XP_002530636.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223529809|gb|EEF31744.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 287
Score = 473 bits (1217), Expect = e-131, Method: Compositional matrix adjust.
Identities = 232/289 (80%), Positives = 263/289 (91%), Gaps = 3/289 (1%)
Query: 1 MAKPRYSRFPTRKSSS-STLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIV 59
MAK RYSR P RKSSS +T+ILT+L+MFTF ILILLA GILS+PS+SGD KA+DL++IV
Sbjct: 1 MAKARYSRLPARKSSSPTTMILTMLLMFTFVILILLALGILSVPSNSGD--KAHDLTTIV 58
Query: 60 RKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDT 119
+ +GD+G+ E+W EVISWEPRAFVYHNFL+KEECEYLINLA P+M+KSTVVDS+T
Sbjct: 59 HNKEQSFDGDDGKGERWAEVISWEPRAFVYHNFLTKEECEYLINLAKPNMQKSTVVDSET 118
Query: 120 GKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHF 179
G+SKDSRVRTSSGTFL+RGRDK IRDIEKRIADF+F P+E+GEGLQVLHYE GQKYEPHF
Sbjct: 119 GRSKDSRVRTSSGTFLSRGRDKKIRDIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEPHF 178
Query: 180 DYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSI 239
DYF DEFNTKNGGQR+AT+LMYLSDVEEGGETVFP A+GN SAVPWWNELSECGK GLS+
Sbjct: 179 DYFNDEFNTKNGGQRVATLLMYLSDVEEGGETVFPAAKGNFSAVPWWNELSECGKKGLSV 238
Query: 240 KPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
KP MGDALLFWSMKPDA+LDPSSLHGGCPVI GNKWS+TKW+RVNEY+V
Sbjct: 239 KPNMGDALLFWSMKPDATLDPSSLHGGCPVINGNKWSATKWMRVNEYRV 287
>gi|449491267|ref|XP_004158845.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 287
Score = 468 bits (1203), Expect = e-129, Method: Compositional matrix adjust.
Identities = 233/289 (80%), Positives = 261/289 (90%), Gaps = 3/289 (1%)
Query: 1 MAKPRYSRFPTRKSSSS-TLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIV 59
MAK R SRFPTRKSSSS TL+ TLLIMFTF ILILLA GILS+P +SG S K +DLSSIV
Sbjct: 1 MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60
Query: 60 RKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDT 119
RK+ + + DE + EQWVEVISWEPRAFVYHNFL+KEECEYLI+LA PHM+KSTVVDS+T
Sbjct: 61 RKTSD--DVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSET 118
Query: 120 GKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHF 179
G+SKDSRVRTSSGTFL RGRDK +R IEKR++DF+F P+E+GEGLQVLHYE GQKYEPHF
Sbjct: 119 GQSKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHF 178
Query: 180 DYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSI 239
DYF+DE+NTKNGGQR+ATVLMYLSDVEEGGETVFP A+GN S+VPWWNELS+CGK GLS+
Sbjct: 179 DYFLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSV 238
Query: 240 KPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
KPK GDALLFWSMKPDASLDPSSLHGGCPVIKGNKWS+TKW+RV EYK
Sbjct: 239 KPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWVRVEEYKA 287
>gi|449434114|ref|XP_004134841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 287
Score = 467 bits (1201), Expect = e-129, Method: Compositional matrix adjust.
Identities = 233/289 (80%), Positives = 261/289 (90%), Gaps = 3/289 (1%)
Query: 1 MAKPRYSRFPTRKSSSS-TLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIV 59
MAK R SRFPTRKSSSS TL+ TLLIMFTF ILILLA GILS+P +SG S K +DLSSIV
Sbjct: 1 MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60
Query: 60 RKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDT 119
RK+ + + DE + EQWVEVISWEPRAFVYHNFL+KEECEYLI+LA PHM+KSTVVDS+T
Sbjct: 61 RKTSD--DVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSET 118
Query: 120 GKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHF 179
G+SKDSRVRTSSGTFL RGRDK +R IEKR++DF+F P+E+GEGLQVLHYE GQKYEPHF
Sbjct: 119 GQSKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPHF 178
Query: 180 DYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSI 239
DYF+DE+NTKNGGQR+ATVLMYLSDVEEGGETVFP A+GN S+VPWWNELS+CGK GLS+
Sbjct: 179 DYFLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSV 238
Query: 240 KPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
KPK GDALLFWSMKPDASLDPSSLHGGCPVIKGNKWS+TKW+RV EYK
Sbjct: 239 KPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 287
>gi|21537370|gb|AAM61711.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
Length = 287
Score = 456 bits (1172), Expect = e-126, Method: Compositional matrix adjust.
Identities = 215/288 (74%), Positives = 247/288 (85%), Gaps = 1/288 (0%)
Query: 1 MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
MAK R+SRF RK S+ L+L +L M T +L+LLAFG+ S+P ++ +S DLS R
Sbjct: 1 MAKLRHSRFQARKWSTLMLVLFMLFMLTIVLLMLLAFGVFSLPINNDESSPI-DLSYFRR 59
Query: 61 KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
+ ERSEG R +QW EV+SWEPRAFVYHNFLSKEECEYLI+LA PHM KSTVVDS+TG
Sbjct: 60 AATERSEGLGKRGDQWTEVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETG 119
Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFD 180
KSKDSRVRTSSGTFL RGRDKII+ IEKRIAD+TF P ++GEGLQVLHYEAGQKYEPH+D
Sbjct: 120 KSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYD 179
Query: 181 YFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIK 240
YF+DEFNTKNGGQRMAT+LMYLSDVEEGGETVFP A N S+VPW+NELSECGK GLS+K
Sbjct: 180 YFVDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVK 239
Query: 241 PKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
P+MGDALLFWSM+PDA+LDP+SLHGGCPVI+GNKWSSTKWI V EYK+
Sbjct: 240 PRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWIHVGEYKI 287
>gi|18394842|ref|NP_564109.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|9558598|gb|AAF88161.1|AC026234_12 Contains similarity to a prolyl 4-hydroxylase alpha subunit protein
from Gallus gallus gi|212530 [Arabidopsis thaliana]
gi|90962978|gb|ABE02413.1| At1g20270 [Arabidopsis thaliana]
gi|332191835|gb|AEE29956.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 287
Score = 454 bits (1169), Expect = e-125, Method: Compositional matrix adjust.
Identities = 214/288 (74%), Positives = 247/288 (85%), Gaps = 1/288 (0%)
Query: 1 MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
MAK R+SRF RK S+ L+L +L M T +L+LLAFG+ S+P ++ +S DLS R
Sbjct: 1 MAKLRHSRFQARKWSTLMLVLFMLFMLTIVLLMLLAFGVFSLPINNDESSPI-DLSYFRR 59
Query: 61 KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
+ ERSEG R +QW EV+SWEPRAFVYHNFLSKEECEYLI+LA PHM KSTVVDS+TG
Sbjct: 60 AATERSEGLGKRGDQWTEVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETG 119
Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFD 180
KSKDSRVRTSSGTFL RGRDKII+ IEKRIAD+TF P ++GEGLQVLHYEAGQKYEPH+D
Sbjct: 120 KSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYD 179
Query: 181 YFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIK 240
YF+DEFNTKNGGQRMAT+LMYLSDVEEGGETVFP A N S+VPW+NELSECGK GLS+K
Sbjct: 180 YFVDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVK 239
Query: 241 PKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
P+MGDALLFWSM+PDA+LDP+SLHGGCPVI+GNKWSSTKW+ V EYK+
Sbjct: 240 PRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWMHVGEYKI 287
>gi|297850430|ref|XP_002893096.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297338938|gb|EFH69355.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 287
Score = 454 bits (1167), Expect = e-125, Method: Compositional matrix adjust.
Identities = 213/288 (73%), Positives = 247/288 (85%), Gaps = 1/288 (0%)
Query: 1 MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
MAK R+SRF RK S+ L+L +L M T +L+LLAFG+ S+P ++ +S DLS R
Sbjct: 1 MAKLRHSRFQARKWSTLMLVLFMLFMLTIVLLMLLAFGVFSLPINNDESSPI-DLSYFRR 59
Query: 61 KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
+ ERSEG R +QW EV+SWEPRAFVYHNFLSKEECEYLI+LA PHM KSTVVDS+TG
Sbjct: 60 AATERSEGLGKRGDQWTEVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETG 119
Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFD 180
KSKDSRVRTSSGTFL RGRDKII+ IEKRIAD+TF P ++GEGLQ+LHYEAGQKYEPH+D
Sbjct: 120 KSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQILHYEAGQKYEPHYD 179
Query: 181 YFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIK 240
YF+DEFNTKNGGQRMAT+LMYLSDVEEGGETVFP A N S+VPW+NELSECGK GLS+K
Sbjct: 180 YFVDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVK 239
Query: 241 PKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
P+MGDALLFWSM+PDA+LDP+SLHGGCPVI+GNKWSSTKW+ V EYK+
Sbjct: 240 PRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWMHVGEYKI 287
>gi|224133600|ref|XP_002327635.1| predicted protein [Populus trichocarpa]
gi|222836720|gb|EEE75113.1| predicted protein [Populus trichocarpa]
Length = 291
Score = 446 bits (1148), Expect = e-123, Method: Compositional matrix adjust.
Identities = 236/291 (81%), Positives = 261/291 (89%), Gaps = 3/291 (1%)
Query: 1 MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDS--RKANDLSSI 58
MAK RYSR P+RKS SSTLILTLL+MFTF ILILL GILS+PS+S R+ANDLSSI
Sbjct: 1 MAKARYSRIPSRKSPSSTLILTLLLMFTFVILILLGLGILSIPSTSSSDSSRQANDLSSI 60
Query: 59 VRKS-MERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
S ++ S DEG+AEQW EVISW+PRAFVYHNFL+K ECEYLINLA P M+KSTVVDS
Sbjct: 61 AHHSRIDGSGDDEGKAEQWAEVISWKPRAFVYHNFLTKAECEYLINLAKPRMQKSTVVDS 120
Query: 118 DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEP 177
TGKSKDS+VRTSSGTFL RGRDKI+RDIEKRIADF+F P+E+GEGLQ+LHYE GQ+YEP
Sbjct: 121 STGKSKDSKVRTSSGTFLPRGRDKIVRDIEKRIADFSFIPVEHGEGLQILHYEVGQRYEP 180
Query: 178 HFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGL 237
HFDYFMDE+NTKNGGQR+ATVLMYLSDVEEGGETVFP+A+GNISAVPWWNELSECGK GL
Sbjct: 181 HFDYFMDEYNTKNGGQRIATVLMYLSDVEEGGETVFPSAEGNISAVPWWNELSECGKGGL 240
Query: 238 SIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
S+KPKMGDALLFWSM PD S DPSSLHGGCPVI+GNKWSSTKW+RVNEYKV
Sbjct: 241 SVKPKMGDALLFWSMNPDGSPDPSSLHGGCPVIRGNKWSSTKWMRVNEYKV 291
>gi|10177121|dbj|BAB10411.1| prolyl 4-hydroxylase, alpha subunit-like protein [Arabidopsis
thaliana]
Length = 267
Score = 442 bits (1136), Expect = e-121, Method: Compositional matrix adjust.
Identities = 207/265 (78%), Positives = 234/265 (88%)
Query: 1 MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
MA+PR R RKSS STL+ +LIM TF ILILLAFGILS+PS++ S KANDL+SIVR
Sbjct: 2 MARPRNHRPSARKSSHSTLVFAVLIMSTFVILILLAFGILSVPSNNAGSSKANDLTSIVR 61
Query: 61 KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
K+++RS D+ + E+WVE+ISWEPRA VYHNFL+KEEC+YLI LA PHM KSTVVD TG
Sbjct: 62 KTLQRSGEDDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTG 121
Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFD 180
KS DSRVRTSSGTFLARGRDK IR+IEKRI+DFTF P+E+GEGLQVLHYE GQKYEPH+D
Sbjct: 122 KSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYD 181
Query: 181 YFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIK 240
YFMDE+NT+NGGQR+ATVLMYLSDVEEGGETVFP A+GN SAVPWWNELSECGK GLS+K
Sbjct: 182 YFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVK 241
Query: 241 PKMGDALLFWSMKPDASLDPSSLHG 265
PKMGDALLFWSM PDA+LDPSSLHG
Sbjct: 242 PKMGDALLFWSMTPDATLDPSSLHG 266
>gi|297797785|ref|XP_002866777.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297312612|gb|EFH43036.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 266
Score = 440 bits (1132), Expect = e-121, Method: Compositional matrix adjust.
Identities = 206/265 (77%), Positives = 233/265 (87%)
Query: 1 MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
MA+PR R RKSS STL+ +LIM TF ILILLAFGILS+PS++ S KANDL+SIVR
Sbjct: 1 MARPRSHRPSARKSSRSTLVFAVLIMSTFVILILLAFGILSVPSNNAGSSKANDLTSIVR 60
Query: 61 KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
K+++R D+ + E+WVE+ISWEPRA VYHNFL+KEEC+YLI LA PHM KSTVVD TG
Sbjct: 61 KTLQRGVEDDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTG 120
Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFD 180
KS DSRVRTSSGTFLARGRDK IR+IEKRI+DFTF P+E+GEGLQVLHYE GQKYEPH+D
Sbjct: 121 KSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYD 180
Query: 181 YFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIK 240
YFMDE+NT+NGGQR+ATVLMYLSDVEEGGETVFP A+GN SAVPWWNELSECGK GLS+K
Sbjct: 181 YFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVK 240
Query: 241 PKMGDALLFWSMKPDASLDPSSLHG 265
PKMGDALLFWSM PDA+LDPSSLHG
Sbjct: 241 PKMGDALLFWSMTPDATLDPSSLHG 265
>gi|224085946|ref|XP_002307750.1| predicted protein [Populus trichocarpa]
gi|222857199|gb|EEE94746.1| predicted protein [Populus trichocarpa]
Length = 288
Score = 437 bits (1125), Expect = e-120, Method: Compositional matrix adjust.
Identities = 209/290 (72%), Positives = 241/290 (83%), Gaps = 4/290 (1%)
Query: 1 MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
M K R+SR T+K S+ TL+L++L M T + ILL G S+P SS DS NDL+S R
Sbjct: 1 MVKVRHSRLHTKKWSTFTLVLSMLFMLTVVLFILLGLGAFSLPVSSEDS-SPNDLNSYRR 59
Query: 61 KSMERSEGDE--GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSD 118
+ E S+GD R EQW E++SWEPRAF+YHNFLSKEECEYLINLA PHM KSTVVDS
Sbjct: 60 IASE-SDGDGMGKREEQWTEILSWEPRAFLYHNFLSKEECEYLINLAKPHMMKSTVVDSK 118
Query: 119 TGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPH 178
TG+SKDSRVRTSSG FL RGRD++IR+IEKRIADF+F P+E+GEGLQVLHYE GQKYE H
Sbjct: 119 TGRSKDSRVRTSSGMFLRRGRDRVIREIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEAH 178
Query: 179 FDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLS 238
FDYF+DEFNTKNGGQR AT+LMYLSDVEEGGETVFP A NISAVPWWNELSEC K GLS
Sbjct: 179 FDYFLDEFNTKNGGQRTATLLMYLSDVEEGGETVFPAANMNISAVPWWNELSECAKQGLS 238
Query: 239 IKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
+KPKMG+ALLFWS +PDA+LDPSSLHG CPVI+GNKWS+TKW+ + EYK+
Sbjct: 239 LKPKMGNALLFWSTRPDATLDPSSLHGSCPVIRGNKWSATKWMHLGEYKI 288
>gi|225459748|ref|XP_002285898.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Vitis vinifera]
gi|302141716|emb|CBI18919.3| unnamed protein product [Vitis vinifera]
Length = 288
Score = 429 bits (1102), Expect = e-117, Method: Compositional matrix adjust.
Identities = 213/288 (73%), Positives = 248/288 (86%)
Query: 1 MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
MAK RYSR ++ S+ L+L+LL+M T +L+LLA GI+S+P + DS ANDLSS R
Sbjct: 1 MAKGRYSRGHGKRWSTLALVLSLLLMLTVVLLMLLALGIVSLPIGTVDSDAANDLSSFRR 60
Query: 61 KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
K+ + EG R EQW E++SWEPRAF+YHNFLSKEECEY+I+LA P+M+KSTVVDS+TG
Sbjct: 61 KTFDGGEGLGKRGEQWTEIVSWEPRAFIYHNFLSKEECEYMISLAKPYMKKSTVVDSETG 120
Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFD 180
+SKDSRVRTSSG FL RGRDKIIRDIEKRIADFTF P+E+GEGLQVLHYE GQKY+ H+D
Sbjct: 121 RSKDSRVRTSSGMFLRRGRDKIIRDIEKRIADFTFIPVEHGEGLQVLHYEVGQKYDAHYD 180
Query: 181 YFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIK 240
YF+DEFNTKNGGQR+AT+LMYLSDVEEGGETVFP + N S+VPWWNELSECGK GLS+K
Sbjct: 181 YFLDEFNTKNGGQRIATLLMYLSDVEEGGETVFPATKANFSSVPWWNELSECGKKGLSVK 240
Query: 241 PKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
PKMGDALLFWSM+PDA+LDPSSLHGGCPVIKGNKWSSTKW+ V EYK
Sbjct: 241 PKMGDALLFWSMRPDATLDPSSLHGGCPVIKGNKWSSTKWMHVEEYKA 288
>gi|224117220|ref|XP_002331751.1| predicted protein [Populus trichocarpa]
gi|222874448|gb|EEF11579.1| predicted protein [Populus trichocarpa]
Length = 266
Score = 424 bits (1089), Expect = e-116, Method: Compositional matrix adjust.
Identities = 216/290 (74%), Positives = 238/290 (82%), Gaps = 26/290 (8%)
Query: 1 MAKPRYSRFPTRKSSSSTLIL-TLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIV 59
MAK RYSR TRKS SSTLI +L + F NDLSSI
Sbjct: 1 MAKARYSRISTRKSPSSTLIRKSLNVHF------------------------PNDLSSIA 36
Query: 60 RKS-MERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSD 118
S + S DEG+AEQWVE ISWEPRAF+YHNFL+K EC+YLINLA PHM+KS VVDS
Sbjct: 37 HNSKIHESGDDEGKAEQWVEAISWEPRAFIYHNFLTKAECDYLINLAKPHMQKSMVVDSS 96
Query: 119 TGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPH 178
+GKSKDSRVRTSSGTFL RGRDKIIRDIEKRIADF+F P E+GEGLQ+LHYE GQKYEPH
Sbjct: 97 SGKSKDSRVRTSSGTFLPRGRDKIIRDIEKRIADFSFIPSEHGEGLQILHYEVGQKYEPH 156
Query: 179 FDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLS 238
FDYFMD++NT+NGGQR+ATVLMYLSDVEEGGETVFP+A+GNIS+VPWWNELSECGK GLS
Sbjct: 157 FDYFMDDYNTENGGQRIATVLMYLSDVEEGGETVFPSAKGNISSVPWWNELSECGKGGLS 216
Query: 239 IKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
+KPKMGDALLFWSMKPDASLDPSSLHGGCPVI+GNKWSSTKW+RVNEYK
Sbjct: 217 VKPKMGDALLFWSMKPDASLDPSSLHGGCPVIRGNKWSSTKWMRVNEYKA 266
>gi|255539064|ref|XP_002510597.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223551298|gb|EEF52784.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 289
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 206/290 (71%), Positives = 240/290 (82%), Gaps = 3/290 (1%)
Query: 1 MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
MAK R+SR RK S+ TL+ ++L M T +L+LLA GI S+P S+ DS NDL++ R
Sbjct: 1 MAKMRHSRLQARKMSTLTLVFSMLFMLTVVLLMLLALGIFSLPMSTDDS-PPNDLAASYR 59
Query: 61 K-SMERS-EGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSD 118
+ + ER +G R EQW E+ISWEPRAFVYHNFLSKEECEYLI LA PHM KSTVVDS
Sbjct: 60 RMAAERDYDGLGKRVEQWTEIISWEPRAFVYHNFLSKEECEYLIALAKPHMVKSTVVDSK 119
Query: 119 TGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPH 178
TG+SKDSRVRTSSG FL RGRDKIIR+IEKRIADF+F P+E+GEGLQVLHYE GQKYE H
Sbjct: 120 TGRSKDSRVRTSSGMFLRRGRDKIIRNIEKRIADFSFIPIEHGEGLQVLHYEVGQKYEAH 179
Query: 179 FDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLS 238
+DYF+DEFNTKNGGQR AT+LMYLSDVEEGGETVFP A+ NIS VP WNELSEC + GLS
Sbjct: 180 YDYFLDEFNTKNGGQRTATLLMYLSDVEEGGETVFPAAKANISNVPSWNELSECARQGLS 239
Query: 239 IKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
+KPKMG+ALLFWS +PDA+LDP+SLHG CPVI+GNKWS+TKW+ + EY V
Sbjct: 240 VKPKMGNALLFWSTRPDATLDPASLHGSCPVIRGNKWSATKWMHLGEYSV 289
>gi|357467085|ref|XP_003603827.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492875|gb|AES74078.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 280
Score = 413 bits (1062), Expect = e-113, Method: Compositional matrix adjust.
Identities = 194/288 (67%), Positives = 234/288 (81%), Gaps = 8/288 (2%)
Query: 1 MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
MAK +++ ++ STLIL L M T I++LLA GIL +P+++ DS + R
Sbjct: 1 MAKGKHTHPRSQVKKLSTLILLTLFMLTLVIIVLLALGILYLPNTTDDSLITD------R 54
Query: 61 KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
+ + S + + EQW E++SWEPRAFVYHNFLSKEECE+LINLA P + KS+VVDS TG
Sbjct: 55 RKIYESLAE--KKEQWTEILSWEPRAFVYHNFLSKEECEHLINLAKPFLAKSSVVDSKTG 112
Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFD 180
KS +SRVRTSSG FL RG+DKII++IE+RIADFTF P+ENGEGLQVLHY G+KYEPH+D
Sbjct: 113 KSTESRVRTSSGMFLKRGKDKIIQNIERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYD 172
Query: 181 YFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIK 240
YF+DEFNTKNGGQR+ATVLMYLSDVEEGGETVFP A+ N S+VPWWN+LSEC + GLS+K
Sbjct: 173 YFLDEFNTKNGGQRVATVLMYLSDVEEGGETVFPAAKANFSSVPWWNDLSECARKGLSLK 232
Query: 241 PKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
PKMGDALLFWSM+PDA+LD SSLHGGCPVI GNKWSSTKW+ + EYKV
Sbjct: 233 PKMGDALLFWSMRPDATLDASSLHGGCPVIVGNKWSSTKWMHLEEYKV 280
>gi|297802350|ref|XP_002869059.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297314895|gb|EFH45318.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 290
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 201/287 (70%), Positives = 233/287 (81%), Gaps = 4/287 (1%)
Query: 3 KPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKS 62
KP++ R RKS S T T+LI+ F ILIL+ GILS+P+++ S + DL++IV+
Sbjct: 4 KPKHLRNQPRKSFS-TQAFTVLILGLFVILILVGLGILSLPNTNKSSSRPMDLTTIVQTI 62
Query: 63 MER-SEGDE--GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDT 119
ER S GDE G ++W+EVISWEPRAFVYHNFL+ EECE+LI+LA P M KS VVD T
Sbjct: 63 EERESYGDEEDGNGDRWLEVISWEPRAFVYHNFLTNEECEHLISLAKPSMVKSKVVDVKT 122
Query: 120 GKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHF 179
GKS DSRVRTSSGTFL RG D+I+ +IE RI+DFTF P+ENGEGLQVLHYE GQKYEPH
Sbjct: 123 GKSIDSRVRTSSGTFLKRGHDEIVEEIENRISDFTFIPIENGEGLQVLHYEVGQKYEPHH 182
Query: 180 DYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSI 239
DYF DEFN + GGQR+ATVLMYLSDV+EGGETVFP A+GNIS VPWW+ELS+CGK GLS+
Sbjct: 183 DYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNISDVPWWDELSQCGKEGLSV 242
Query: 240 KPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
PK DALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKW V+EY
Sbjct: 243 LPKKRDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWFHVHEY 289
>gi|356517655|ref|XP_003527502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
Length = 290
Score = 409 bits (1052), Expect = e-112, Method: Compositional matrix adjust.
Identities = 194/288 (67%), Positives = 233/288 (80%), Gaps = 2/288 (0%)
Query: 2 AKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRK 61
K ++R +K S+ +L+L L T +++LLA GI+ + ++ D DLS+ RK
Sbjct: 4 GKHTHTRAQGKKWSTFSLVLWALFFLTLILVVLLALGIVYL-PTTDDDFPTTDLSAFRRK 62
Query: 62 SMERSEG-DEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
+ + +E E EQW E++SWEPRAF+YHNFLSKEECEYLI LA P M KS+VVDS TG
Sbjct: 63 TSQSAESLVENPPEQWTEILSWEPRAFIYHNFLSKEECEYLIELAKPQMVKSSVVDSKTG 122
Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFD 180
KS +SRVRTSSG FL RG+DKI+++IEKRIADFTF P ENGEGLQ+LHYE GQKYEPH+D
Sbjct: 123 KSTESRVRTSSGMFLKRGKDKIVQNIEKRIADFTFIPEENGEGLQILHYEVGQKYEPHYD 182
Query: 181 YFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIK 240
YF+DEFNTKNGGQR+ATVLMYLSDVEEGGETVFP A N S+VPWWN+LS+C + GLS+K
Sbjct: 183 YFLDEFNTKNGGQRIATVLMYLSDVEEGGETVFPAANANFSSVPWWNDLSQCARKGLSVK 242
Query: 241 PKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
PKMGDALLFWSM+PDA+LDPSSLHGGCPVIKGNKWSSTKW+ + EYKV
Sbjct: 243 PKMGDALLFWSMRPDATLDPSSLHGGCPVIKGNKWSSTKWMHLREYKV 290
>gi|297832394|ref|XP_002884079.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297329919|gb|EFH60338.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 291
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 209/286 (73%), Positives = 242/286 (84%), Gaps = 3/286 (1%)
Query: 5 RYSRFPTRKS-SSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSM 63
++ R+ RKS S ST T+LI+ ILILL GILS+P+++ +S K NDL++IVRKS
Sbjct: 7 QHLRYQPRKSVSRSTQAFTVLILLLVVILILLGLGILSLPNANRNSSKTNDLTNIVRKS- 65
Query: 64 ERSEGDE-GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKS 122
E S GDE G E+WVEVISWEPRA VYHNFLS EECE+LINLA P M KSTVVD TG S
Sbjct: 66 ETSYGDEDGNGERWVEVISWEPRAVVYHNFLSNEECEHLINLAKPSMVKSTVVDEKTGGS 125
Query: 123 KDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF 182
KDSRVRTSSGTFL RG D+++ IEKRI+DFTF P+ENGEGLQVLHY+ GQKYEPH+DYF
Sbjct: 126 KDSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYF 185
Query: 183 MDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPK 242
+DEFNTKNGGQR+ATVLMYLSDV++GGETVFP A+GNISAVPWWNELS+CGK GLS+ PK
Sbjct: 186 LDEFNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPK 245
Query: 243 MGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
DALLFW+M+PDASLDPSSLHGGCPV+KGNKWSSTKW V+E+KV
Sbjct: 246 KRDALLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKWFHVHEFKV 291
>gi|15227885|ref|NP_179363.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|25411813|pir||F84555 similar to prolyl 4-hydroxylase alpha subunit [imported] -
Arabidopsis thaliana
gi|89274129|gb|ABD65585.1| At2g17720 [Arabidopsis thaliana]
gi|110738861|dbj|BAF01353.1| similar to prolyl 4-hydroxylase alpha subunit [Arabidopsis
thaliana]
gi|330251579|gb|AEC06673.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 291
Score = 406 bits (1044), Expect = e-111, Method: Compositional matrix adjust.
Identities = 204/285 (71%), Positives = 240/285 (84%), Gaps = 1/285 (0%)
Query: 5 RYSRFPTRKS-SSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSM 63
++ R+ RKS S ST T+LI+ ILILL GILS+P+++ +S K NDL++IVRKS
Sbjct: 7 QHLRYQPRKSVSRSTQAFTVLILLLVVILILLGLGILSLPNANRNSSKTNDLTNIVRKSE 66
Query: 64 ERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSK 123
S +EG E+WVEVISWEPRA VYHNFL+ EECE+LI+LA P M KSTVVD TG SK
Sbjct: 67 TSSGDEEGNGERWVEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSK 126
Query: 124 DSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM 183
DSRVRTSSGTFL RG D+++ IEKRI+DFTF P+ENGEGLQVLHY+ GQKYEPH+DYF+
Sbjct: 127 DSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFL 186
Query: 184 DEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKM 243
DEFNTKNGGQR+ATVLMYLSDV++GGETVFP A+GNISAVPWWNELS+CGK GLS+ PK
Sbjct: 187 DEFNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKK 246
Query: 244 GDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
DALLFW+M+PDASLDPSSLHGGCPV+KGNKWSSTKW V+E+KV
Sbjct: 247 RDALLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKWFHVHEFKV 291
>gi|21593091|gb|AAM65040.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
Length = 291
Score = 406 bits (1044), Expect = e-111, Method: Compositional matrix adjust.
Identities = 204/285 (71%), Positives = 240/285 (84%), Gaps = 1/285 (0%)
Query: 5 RYSRFPTRKS-SSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSM 63
++ R+ RKS S ST T+LI+ ILILL GILS+P+++ +S K NDL++IVRKS
Sbjct: 7 QHLRYQPRKSVSRSTQAFTVLILLLVVILILLGLGILSLPNANRNSSKTNDLTNIVRKSE 66
Query: 64 ERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSK 123
S +EG E+WVEVISWEPRA VYHNFL+ EECE+LI+LA P M KSTVVD TG SK
Sbjct: 67 TSSGDEEGNGERWVEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSK 126
Query: 124 DSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM 183
DSRVRTSSGTFL RG D+++ IEKRI+DFTF P+ENGEGLQVLHY+ GQKYEPH+DYF+
Sbjct: 127 DSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQKYEPHYDYFL 186
Query: 184 DEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKM 243
DEFNTKNGGQR+ATVLMYLSDV++GGETVFP A+GNISAVPWWNELS+CGK GLS+ PK
Sbjct: 187 DEFNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKX 246
Query: 244 GDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
DALLFW+M+PDASLDPSSLHGGCPV+KGNKWSSTKW V+E+KV
Sbjct: 247 RDALLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKWFHVHEFKV 291
>gi|48716447|dbj|BAD23054.1| putative prolyl 4-hydroxylase [Oryza sativa Japonica Group]
Length = 310
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 201/294 (68%), Positives = 232/294 (78%), Gaps = 14/294 (4%)
Query: 9 FPTR--KSSSSTLILTLLIMFTFAILILLAFGILSMPSSS------------GDSRKAND 54
FPTR ++S L L L++ + +L L+AFG+ S+P S+ GD+ A+
Sbjct: 17 FPTRGGRTSPLALALAALLLASALLLALIAFGVFSLPVSAPNAATTDSAAAGGDAEPADP 76
Query: 55 LSSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTV 114
R + SEG R QW EVISWEPRAFVYHNFLSKEEC+YLI LA PHM KSTV
Sbjct: 77 RPPRTRARRDLSEGLGERGAQWTEVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV 136
Query: 115 VDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQK 174
VDS TGKSKDSRVRTSSG FL RGRDK+IR IEKRIAD+TF P+E+GEGLQVLHYE GQK
Sbjct: 137 VDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQK 196
Query: 175 YEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGK 234
YEPHFDYF+DE+NTKNGGQRMAT+LMYLSDVEEGGET+FP+A N S++PW+NELSEC +
Sbjct: 197 YEPHFDYFLDEYNTKNGGQRMATLLMYLSDVEEGGETIFPDANVNSSSLPWYNELSECAR 256
Query: 235 TGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
GL++KPKMGDALLFWSMKPDA+LDP SLHGGCPVIKGNKWSSTKW+ V EYK
Sbjct: 257 KGLAVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHVREYKA 310
>gi|357137804|ref|XP_003570489.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 318
Score = 402 bits (1034), Expect = e-110, Method: Compositional matrix adjust.
Identities = 197/294 (67%), Positives = 224/294 (76%), Gaps = 18/294 (6%)
Query: 12 RKSSSSTLILTLLIMFTFAILILLAFGILSMPSS--------------SGDSRKANDLSS 57
R S + + LLI F +L L+AFG+ S+P S SG + ++ S
Sbjct: 26 RVSPYAVALGALLIASAF-LLALIAFGVFSLPVSAPNLATTAGGGETESGSTEESGGSES 84
Query: 58 IVRKSMERSEGDEG---RAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTV 114
+S R + EG R QW EVISWEPRAFVYHNFLSKEECEYLI LA P M KSTV
Sbjct: 85 HSARSRARRDLSEGLGERGAQWTEVISWEPRAFVYHNFLSKEECEYLIGLAKPRMEKSTV 144
Query: 115 VDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQK 174
VDS TGKSKDSRVRTSSG FL RGRDK+IR IE+RIAD+TF P E+GEGLQVLHYE GQK
Sbjct: 145 VDSTTGKSKDSRVRTSSGMFLRRGRDKVIRAIERRIADYTFIPAEHGEGLQVLHYEVGQK 204
Query: 175 YEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGK 234
YEPHFDYF+DEFNTKNGGQRMAT+LMYLSDVEEGGET+FP+A N S++PW NELSEC +
Sbjct: 205 YEPHFDYFLDEFNTKNGGQRMATILMYLSDVEEGGETIFPDANVNSSSLPWHNELSECAR 264
Query: 235 TGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
GL++KPKMGDALLFWSM PDA+LDP SLHGGCPVI+GNKWSSTKW+ V EYK
Sbjct: 265 KGLAVKPKMGDALLFWSMNPDATLDPLSLHGGCPVIRGNKWSSTKWMHVGEYKT 318
>gi|449529555|ref|XP_004171765.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 284
Score = 399 bits (1026), Expect = e-109, Method: Compositional matrix adjust.
Identities = 194/289 (67%), Positives = 230/289 (79%), Gaps = 11/289 (3%)
Query: 1 MAKPRYSRFPTRKSSS---STLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSS 57
++K +Y + RK S+ S +I+ L++ F +LI L F S P +S + S
Sbjct: 3 ISKGKYIKLQGRKWSTFQLSKMIMALVLALGFFMLIALRF--FSPPETS------HHRFS 54
Query: 58 IVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
VR + S+G R +QWVE ISWEPRAFVYHNFLSKEEC YLI+LA PHM KSTVVDS
Sbjct: 55 SVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDS 114
Query: 118 DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEP 177
TG+S DSRVRTSSG FL RG+DKIIR+IEKRIADFTF P+E+GEGLQ+LHYE GQKY+
Sbjct: 115 KTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDA 174
Query: 178 HFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGL 237
H+DYF+DE+N K GGQRMAT+LMYLSDVEEGGETVFP A+GN S+VPWWNELSECGK GL
Sbjct: 175 HYDYFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGL 234
Query: 238 SIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
S+KPKMGDALLFWSMKPDA+LDP+SLHG CPVI+GNKWS TKW+ V++Y
Sbjct: 235 SVKPKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVDKY 283
>gi|326489721|dbj|BAK01841.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 315
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 191/268 (71%), Positives = 213/268 (79%), Gaps = 16/268 (5%)
Query: 37 FGILSMP--------------SSSGDSRKANDLSSIVRKSMER--SEGDEGRAEQWVEVI 80
FG+ S+P SS G A++ S R R SEG R QW EVI
Sbjct: 48 FGVFSLPVSSPTVPTTGAETESSGGGGEAASESSRPARNRGRRDLSEGLGERGAQWTEVI 107
Query: 81 SWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
SWEPRAFVYHNFLSKEECEYLI LA P M KSTVVDS+TGKSKDSRVRTSSG FL RGRD
Sbjct: 108 SWEPRAFVYHNFLSKEECEYLIELAKPRMVKSTVVDSETGKSKDSRVRTSSGMFLQRGRD 167
Query: 141 KIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLM 200
K+IR IE+RIAD+TF P E+GEGLQVLHYE GQKYEPHFDYF+DEFNTKNGGQRMAT+LM
Sbjct: 168 KVIRAIERRIADYTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEFNTKNGGQRMATILM 227
Query: 201 YLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDP 260
YLSD+EEGGET+FP+A N S++PW+NELSEC + GL++KPKMGDALLFWSMKPDA+LDP
Sbjct: 228 YLSDIEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALLFWSMKPDATLDP 287
Query: 261 SSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
SLHGGCPVIKGNKWSSTKW+ V EYK
Sbjct: 288 LSLHGGCPVIKGNKWSSTKWLHVGEYKA 315
>gi|226529219|ref|NP_001151238.1| LOC100284871 [Zea mays]
gi|195645242|gb|ACG42089.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
gi|347978812|gb|AEP37748.1| prolyl 4-hydroxylase 5 [Zea mays]
gi|413923983|gb|AFW63915.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
Length = 308
Score = 398 bits (1022), Expect = e-108, Method: Compositional matrix adjust.
Identities = 200/292 (68%), Positives = 235/292 (80%), Gaps = 12/292 (4%)
Query: 9 FPTR--KSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSR----------KANDLS 56
FPTR ++S + LT L++ + A+L L+AFG+ S+P S+ ++ A+
Sbjct: 17 FPTRGGRASPYAVALTALLLVSAALLALIAFGVFSLPVSAPNAAATTTAGGETESADTRP 76
Query: 57 SIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVD 116
+ R + EG R QW EVISWEPRAFVYHNFLSKEECEYLI LA PHM KSTVVD
Sbjct: 77 ARPRARRDLGEGLGERGAQWTEVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTVVD 136
Query: 117 SDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYE 176
S TGKSKDSRVRTSSG FL RGRDK+IR IEKRIAD+TF P+++GEGLQVLHYE GQKYE
Sbjct: 137 STTGKSKDSRVRTSSGMFLQRGRDKVIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQKYE 196
Query: 177 PHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTG 236
PHFDYF+DEFNTKNGGQRMAT+LMYLSDVEEGGET+FP+A N+S++PW+NELSEC K G
Sbjct: 197 PHFDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETIFPDANVNVSSLPWYNELSECAKRG 256
Query: 237 LSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
LS+KPKMGDALLFWSMKPDA+LDP SLHGGCPVI+GNKWSSTKW+ ++EYK
Sbjct: 257 LSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHEYKA 308
>gi|242063586|ref|XP_002453082.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
gi|241932913|gb|EES06058.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
Length = 307
Score = 397 bits (1019), Expect = e-108, Method: Compositional matrix adjust.
Identities = 201/291 (69%), Positives = 234/291 (80%), Gaps = 11/291 (3%)
Query: 9 FPTR--KSSSSTLILTLLIMFTFAILILLAFGILSMPSSS---------GDSRKANDLSS 57
FPTR ++S + LT L++ + A+L L+AFG+ S+P S+ G++ A+ +
Sbjct: 17 FPTRGGRASPYAVALTALLLVSAALLALIAFGVFSLPVSAPNAATTGAGGETESADARPA 76
Query: 58 IVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
R + EG R QW EVISWEPRAFVYHNFLSKEECEYLI LA PHM KSTVVDS
Sbjct: 77 RPRARRDLGEGLGERGAQWTEVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTVVDS 136
Query: 118 DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEP 177
TGKSKDSRVRTSSG FL RGRDK+IR IEKRIAD+TF P ++GEGLQVLHYE GQKYEP
Sbjct: 137 TTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPADHGEGLQVLHYEVGQKYEP 196
Query: 178 HFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGL 237
HFDYF+DEFNTKNGGQRMAT+LMYLSDVEEGGET+FP+A N S++PW+NELSEC K GL
Sbjct: 197 HFDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSECAKRGL 256
Query: 238 SIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
S+KPKMGDALLFWSMKPDA+LDP SLHGGCPVI+GNKWSSTKW+ ++EYK
Sbjct: 257 SVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHEYKA 307
>gi|259490206|ref|NP_001159002.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
gi|195626402|gb|ACG35031.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
gi|347978830|gb|AEP37757.1| prolyl 4-hydroxylase 8 [Zea mays]
gi|347978832|gb|AEP37758.1| prolyl 4-hydroxylase 8-1 [Zea mays]
gi|413939569|gb|AFW74120.1| prolyl 4-hydroxylase alpha-2 subunit isoform 1 [Zea mays]
gi|413939570|gb|AFW74121.1| prolyl 4-hydroxylase alpha-2 subunit isoform 2 [Zea mays]
gi|413939571|gb|AFW74122.1| prolyl 4-hydroxylase alpha-2 subunit isoform 3 [Zea mays]
gi|413939572|gb|AFW74123.1| prolyl 4-hydroxylase alpha-2 subunit isoform 4 [Zea mays]
Length = 307
Score = 396 bits (1018), Expect = e-108, Method: Compositional matrix adjust.
Identities = 201/291 (69%), Positives = 235/291 (80%), Gaps = 11/291 (3%)
Query: 9 FPTR--KSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDL-------SSIV 59
FPTR ++S T+ LT L++ + A+L L+AFG+ S+P S+ ++ S+ V
Sbjct: 17 FPTRGGRASPYTVALTALLLVSAALLALIAFGVFSLPVSAPNAAATTGTAAGGETESADV 76
Query: 60 RKSMER--SEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
R R EG R QW EVISWEPRAFVYHNFLSK+ECEYLI LA PHM KSTVVDS
Sbjct: 77 RPRARRDLGEGLGERGAQWTEVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDS 136
Query: 118 DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEP 177
TGKSKDSRVRTSSG FL RGRDK+IR IEKRIAD+TF P+++GEGLQVLHYE GQKYEP
Sbjct: 137 TTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEP 196
Query: 178 HFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGL 237
HFDYF+DEFNTKNGGQR+AT+LMYLSDVEEGGET+FP+A N S++PW+NELS+C K GL
Sbjct: 197 HFDYFLDEFNTKNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGL 256
Query: 238 SIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
S+KPKMGDALLFWSMKPDA+LDP SLHGGCPVIKGNKWSSTKW+ ++EYK
Sbjct: 257 SVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHEYKA 307
>gi|363543369|ref|NP_001241694.1| prolyl 4-hydroxylase 8-4 [Zea mays]
gi|347978838|gb|AEP37761.1| prolyl 4-hydroxylase 8-4 [Zea mays]
Length = 307
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 200/291 (68%), Positives = 235/291 (80%), Gaps = 11/291 (3%)
Query: 9 FPTR--KSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDL-------SSIV 59
FPTR ++S T+ LT L++ + A+L L+AFG+ S+P S+ ++ S+ V
Sbjct: 17 FPTRGGRASPYTVALTALLLVSAALLALIAFGVFSLPVSAPNAAATTGTAAGGETESADV 76
Query: 60 RKSMER--SEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
R R EG R QW EVISWEPRAFVYHNFLSK+ECEYLI LA PHM KSTVVDS
Sbjct: 77 RPRARRDLGEGLGERGAQWTEVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDS 136
Query: 118 DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEP 177
TGKSKDSRVRTSSG FL RGR+K+IR IEKRIAD+TF P+++GEGLQVLHYE GQKYEP
Sbjct: 137 TTGKSKDSRVRTSSGMFLQRGRNKVIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEP 196
Query: 178 HFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGL 237
HFDYF+DEFNTKNGGQR+AT+LMYLSDVEEGGET+FP+A N S++PW+NELS+C K GL
Sbjct: 197 HFDYFLDEFNTKNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGL 256
Query: 238 SIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
S+KPKMGDALLFWSMKPDA+LDP SLHGGCPVIKGNKWSSTKW+ ++EYK
Sbjct: 257 SVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHEYKA 307
>gi|363543371|ref|NP_001241695.1| prolyl 4-hydroxylase 8-5 [Zea mays]
gi|347978840|gb|AEP37762.1| prolyl 4-hydroxylase 8-5 [Zea mays]
Length = 307
Score = 394 bits (1011), Expect = e-107, Method: Compositional matrix adjust.
Identities = 200/291 (68%), Positives = 234/291 (80%), Gaps = 11/291 (3%)
Query: 9 FPTR--KSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDL-------SSIV 59
FPTR ++S T+ LT L++ + A+L L+AFG+ S+P S+ ++ S+ V
Sbjct: 17 FPTRGGRASPYTVALTALLLVSAALLALIAFGVFSLPVSAPNAAATTGTAAGGETESADV 76
Query: 60 RKSMER--SEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
R R EG R QW EVISWEPRAFVYHNFLSK+ECEYLI LA PHM KSTVVDS
Sbjct: 77 RPRARRDLGEGLGERGAQWTEVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDS 136
Query: 118 DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEP 177
TGKSKDSRVRTSSG FL RGRDK+IR IEKRIAD+TF P+++GEGLQVLHYE GQKYEP
Sbjct: 137 TTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEP 196
Query: 178 HFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGL 237
HFDYF+DEFNTKNGGQR+AT+LMYLSDVEEGGET+FP+A N S++PW+NELS+C K GL
Sbjct: 197 HFDYFLDEFNTKNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGL 256
Query: 238 SIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
S+KPKMGDALLFWSMKP A+LDP SLHGGCPVIKGNKWSSTKW+ ++EYK
Sbjct: 257 SVKPKMGDALLFWSMKPGATLDPLSLHGGCPVIKGNKWSSTKWMHIHEYKA 307
>gi|326495334|dbj|BAJ85763.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 300
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 177/217 (81%), Positives = 201/217 (92%)
Query: 72 RAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
+ E W EV+SWEPRAF+YHNFLSKEECEYLI+LA PHM+KSTVVDS TG SKDSRVRTSS
Sbjct: 84 KGEPWTEVLSWEPRAFIYHNFLSKEECEYLISLAKPHMKKSTVVDSATGGSKDSRVRTSS 143
Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG 191
GTFL RG+DKI+R IEKRI+DFTF P+ENGEGLQVLHYE GQKYEPHFDYF D+FNTKNG
Sbjct: 144 GTFLRRGQDKIVRTIEKRISDFTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDDFNTKNG 203
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
GQR+ATVLMYLSDVEEGGETVFP+A+ N S++P++NELSEC K G+S+KPKMGDALLFWS
Sbjct: 204 GQRIATVLMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRGISVKPKMGDALLFWS 263
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
M+PD +LDP+SLHGGCPVIKG+KWSSTKWIRV+EYKV
Sbjct: 264 MRPDGTLDPTSLHGGCPVIKGDKWSSTKWIRVHEYKV 300
>gi|357146834|ref|XP_003574128.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 306
Score = 389 bits (1000), Expect = e-106, Method: Compositional matrix adjust.
Identities = 176/217 (81%), Positives = 200/217 (92%)
Query: 72 RAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
+ E W EV+SWEPRAF+YHNFLSKEECEYLI+LA PHM+KSTVVDS TG SKDSRVRTSS
Sbjct: 90 KGEPWTEVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDSATGGSKDSRVRTSS 149
Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG 191
GTFL RG+DK+IR IEKRI+DFTF P ENGEGLQVLHYE GQKYEPHFDYF D+FNTKNG
Sbjct: 150 GTFLRRGQDKVIRTIEKRISDFTFIPAENGEGLQVLHYEVGQKYEPHFDYFHDDFNTKNG 209
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
GQR+AT+LMYLSDVEEGGETVFP+A+ N S++P++NELSEC K G+S+KPKMGDALLFWS
Sbjct: 210 GQRIATLLMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRGISVKPKMGDALLFWS 269
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
M+PD +LDP+SLHGGCPVIKG+KWSSTKWIRV+EYKV
Sbjct: 270 MRPDGTLDPTSLHGGCPVIKGDKWSSTKWIRVHEYKV 306
>gi|449443243|ref|XP_004139389.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 284
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 186/285 (65%), Positives = 226/285 (79%), Gaps = 11/285 (3%)
Query: 5 RYSRFPTRKSSS---STLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRK 61
+Y + +K S+ S +I+ L++ F +LI L F LS P +S + S VR
Sbjct: 7 KYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRF--LSPPETS------HHRFSSVRH 58
Query: 62 SMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGK 121
+ S+G R +QWVE ISWEPRAFVYHNFLSKEEC YLI+LA PHM KSTVVD++TGK
Sbjct: 59 TAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDNETGK 118
Query: 122 SKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDY 181
+ + VRTSSG FL RG+DKI+ +IEKRIADFTF P+E+GEGLQ+LHYE GQKY+ H+DY
Sbjct: 119 NVEDSVRTSSGMFLNRGQDKIVSNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDY 178
Query: 182 FMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKP 241
F+DE+N K GGQRMAT+LMYLSDVEEGGETVFP A+GN S+VPWWNELS+CGK GLS+KP
Sbjct: 179 FVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSKCGKGGLSVKP 238
Query: 242 KMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
KMGDALLFWSMKPDA+LDP+SLHG CPVI+GNKWS TKW+ V++Y
Sbjct: 239 KMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVDKY 283
>gi|116788056|gb|ABK24739.1| unknown [Picea sitchensis]
Length = 303
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 187/280 (66%), Positives = 218/280 (77%), Gaps = 22/280 (7%)
Query: 28 TFAILILLAFGILSMPSSSGDSRKANDLS--------SIVRKSMERSEGDEGRAEQWVEV 79
+ +L+LLA GI+S+P +SR +++S S +K E +QW EV
Sbjct: 27 SIVLLMLLALGIVSLPV---NSRAPDEISNGGVYSEHSGGKKLQETYSNGMDEPKQWAEV 83
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR-----------VR 128
+SWEPRA +YHNFL+KEECEYLINLA PHM KSTVVDS TGKSKDSR VR
Sbjct: 84 LSWEPRAILYHNFLNKEECEYLINLAKPHMAKSTVVDSATGKSKDSRFVHRWKSNDSRVR 143
Query: 129 TSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT 188
TSSG FL RG+DK IR IEKRIADFTF P E+GEGLQVLHYE GQKYEPHFDYF+DEFNT
Sbjct: 144 TSSGMFLNRGQDKTIRSIEKRIADFTFIPAEHGEGLQVLHYEVGQKYEPHFDYFLDEFNT 203
Query: 189 KNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALL 248
KNGGQR+ATVLMYLSDVE+GGETVFP ++ N S+VPWW+ELSEC K G+S++P+MGDALL
Sbjct: 204 KNGGQRIATVLMYLSDVEKGGETVFPASKVNSSSVPWWDELSECAKAGISVRPRMGDALL 263
Query: 249 FWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
FWSM+PDA LDPSSLH GCPVI+G+KWS+TKWI V EYKV
Sbjct: 264 FWSMRPDAELDPSSLHAGCPVIQGDKWSATKWIHVGEYKV 303
>gi|357517881|ref|XP_003629229.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523251|gb|AET03705.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 278
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 191/291 (65%), Positives = 227/291 (78%), Gaps = 16/291 (5%)
Query: 1 MAKPRYSRFPTRKSS---SSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSS 57
M K R+SR RK S S TLI TL + FT ILILL I K N ++S
Sbjct: 1 MVKFRHSRLGPRKPSLTASQTLIFTLFVTFTLLILILLTLRI----------PKLNHINS 50
Query: 58 IVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
I ++ + D R WV+++SWEPRAF+YHNFL+K+ECE+LIN A P M+KS+VVD+
Sbjct: 51 ISHNALRSEDNDNKR---WVQIVSWEPRAFLYHNFLTKKECEHLINTAKPSMQKSSVVDN 107
Query: 118 DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEP 177
+TGKSKDS VRTSSGTFL RG D+I+R+IEKRIADFTF P+ENGE VL YE GQKY+P
Sbjct: 108 ETGKSKDSSVRTSSGTFLDRGGDEIVRNIEKRIADFTFIPVENGESFNVLRYEVGQKYDP 167
Query: 178 HFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGL 237
H DYF D++NT NGGQR+AT+LMYLSDVEEGGETVFP A+GNIS+VPWWNELS+CGK GL
Sbjct: 168 HLDYFADDYNTVNGGQRIATMLMYLSDVEEGGETVFPAAKGNISSVPWWNELSDCGKKGL 227
Query: 238 SIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
SIKPKMGDALLFWSMKPD +LDPSSLHG CPVIKG+KWS TKW+R+NE++
Sbjct: 228 SIKPKMGDALLFWSMKPDGTLDPSSLHGACPVIKGDKWSCTKWMRINEFRA 278
>gi|42567428|ref|NP_195306.2| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|332661174|gb|AEE86574.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 290
Score = 383 bits (984), Expect = e-104, Method: Compositional matrix adjust.
Identities = 197/287 (68%), Positives = 229/287 (79%), Gaps = 4/287 (1%)
Query: 3 KPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKS 62
KP+ R RKS S T T++++ F ILIL+ GI S+PS++ S DL++IV+
Sbjct: 4 KPKQLRNKPRKSFS-TQTFTVVVLVLFVILILVGLGIFSLPSTNKTSSMPMDLTTIVQTI 62
Query: 63 MER-SEGDE--GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDT 119
ER S GDE G ++W+EVISWEPRAFVYHNFL+ EECE+LI+LA P M KS VVD T
Sbjct: 63 QERESFGDEEDGNGDRWLEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKT 122
Query: 120 GKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHF 179
GKS DSRVRTSSGTFL RG D+I+ +IE RI+DFTF P ENGEGLQVLHYE GQ+YEPH
Sbjct: 123 GKSIDSRVRTSSGTFLNRGHDEIVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHH 182
Query: 180 DYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSI 239
DYF DEFN + GGQR+ATVLMYLSDV+EGGETVFP A+GN+S VPWW+ELS+CGK GLS+
Sbjct: 183 DYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGKEGLSV 242
Query: 240 KPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
PK DALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKW V+EY
Sbjct: 243 LPKKRDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWFHVHEY 289
>gi|115482738|ref|NP_001064962.1| Os10g0497800 [Oryza sativa Japonica Group]
gi|78708853|gb|ABB47828.1| prolyl 4-hydroxylase alpha subunit, putative, expressed [Oryza
sativa Japonica Group]
gi|113639571|dbj|BAF26876.1| Os10g0497800 [Oryza sativa Japonica Group]
gi|215767852|dbj|BAH00081.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218184821|gb|EEC67248.1| hypothetical protein OsI_34188 [Oryza sativa Indica Group]
Length = 321
Score = 379 bits (973), Expect = e-103, Method: Compositional matrix adjust.
Identities = 176/233 (75%), Positives = 205/233 (87%), Gaps = 2/233 (0%)
Query: 56 SSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV 115
S+ +E G++G E W EV+SWEPRAF+YHNFLSKEECEYLI+LA PHM+KSTVV
Sbjct: 91 SAAFESGLEMRGGEKG--EPWTEVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVV 148
Query: 116 DSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKY 175
D+ TG SKDSRVRTSSG FL RG+DKIIR IEKRI+D+TF P+ENGEGLQVLHYE GQKY
Sbjct: 149 DASTGGSKDSRVRTSSGMFLGRGQDKIIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKY 208
Query: 176 EPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKT 235
EPHFDYF DEFNTKNGGQR+AT+LMYLSDVEEGGET+FP+++ N S+ P++NELSEC K
Sbjct: 209 EPHFDYFHDEFNTKNGGQRIATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKK 268
Query: 236 GLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
GL++KPKMGDALLFWSM+PD SLD +SLHGGCPVIKGNKWSSTKW+RV+EYK+
Sbjct: 269 GLAVKPKMGDALLFWSMRPDGSLDATSLHGGCPVIKGNKWSSTKWMRVHEYKI 321
>gi|334188665|ref|NP_001190630.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
gi|332010771|gb|AED98154.1| iron ion binding / oxidoreductase/ oxidoreductase protein
[Arabidopsis thaliana]
Length = 243
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 180/236 (76%), Positives = 204/236 (86%), Gaps = 2/236 (0%)
Query: 1 MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
MA+PR R RKSS STL+ +LIM TF ILILLAFGILS+PS++ S KANDL+SIVR
Sbjct: 2 MARPRNHRPSARKSSHSTLVFAVLIMSTFVILILLAFGILSVPSNNAGSSKANDLTSIVR 61
Query: 61 KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
K+++RS D+ + E+WVE+ISWEPRA VYHNFL EEC+YLI LA PHM KSTVVD TG
Sbjct: 62 KTLQRSGEDDSKNERWVEIISWEPRASVYHNFL--EECKYLIELAKPHMEKSTVVDEKTG 119
Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFD 180
KS DSRVRTSSGTFLARGRDK IR+IEKRI+DFTF P+E+GEGLQVLHYE GQKYEPH+D
Sbjct: 120 KSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHYD 179
Query: 181 YFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTG 236
YFMDE+NT+NGGQR+ATVLMYLSDVEEGGETVFP A+GN SAVPWWNELSECGK G
Sbjct: 180 YFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGG 235
>gi|414870899|tpg|DAA49456.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 364
Score = 376 bits (966), Expect = e-102, Method: Compositional matrix adjust.
Identities = 174/217 (80%), Positives = 197/217 (90%)
Query: 72 RAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
+ E W EV+SWEPRAFVYHNFLSKEEC++LI+LA PHM+KSTVVDS TG SKDSRVRTSS
Sbjct: 148 KGEPWTEVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGGSKDSRVRTSS 207
Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG 191
G FL RG+DKIIR IEKRIAD+TF P+E GEGLQVLHYE GQKYEPHFDYF D++NTKNG
Sbjct: 208 GMFLRRGQDKIIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNTKNG 267
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
GQR+AT+LMYLSDVE+GGETVFP++ N S+ P++NELSEC K GLS+KPKMGDALLFWS
Sbjct: 268 GQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALLFWS 327
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
MKPD SLDP+SLHGGCPVIKGNKWSSTKW+RV+EYKV
Sbjct: 328 MKPDGSLDPTSLHGGCPVIKGNKWSSTKWMRVHEYKV 364
>gi|356502598|ref|XP_003520105.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 296
Score = 376 bits (965), Expect = e-102, Method: Compositional matrix adjust.
Identities = 194/297 (65%), Positives = 236/297 (79%), Gaps = 10/297 (3%)
Query: 1 MAKPRYSRFPTRKSSSST---LILTLLIMFTFAILILLAFGILSMPSSSGDSR---KAND 54
M K R SR RK S + ++LTLL +F ILILLA ILS +++ R K ND
Sbjct: 1 MVKGRQSRLGHRKPSRGSSWPVMLTLLATCSFLILILLALPILSNSNANSSGRLIIKPND 60
Query: 55 LSSIVRKS---MERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRK 111
L+SI + + +E D+ E+WVE+ISWEPR F+YHNFL+KEECE+LIN+A P+MRK
Sbjct: 61 LNSIALNTTTHISEAEYDQ-LGERWVEIISWEPRIFLYHNFLTKEECEHLINIAKPNMRK 119
Query: 112 STVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEA 171
STV++S+TG S +SRVRTSSGTFLARGRDKI+R+IE RIADFTF P++NGE LQVLHY+
Sbjct: 120 STVIESETGMSIESRVRTSSGTFLARGRDKIVRNIENRIADFTFIPVDNGEELQVLHYQV 179
Query: 172 GQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSE 231
G+KY PH DYFMD+ NT NGG R+AT+LMYLSDVEEGGETVFP+A+GN S++P WNELS
Sbjct: 180 GEKYVPHHDYFMDDINTANGGDRIATMLMYLSDVEEGGETVFPDAKGNFSSMPGWNELSV 239
Query: 232 CGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
CGK GLSIKPKM +ALLFWS+KPDA+ DP SLHG CPVIKGNKWSSTKWIR+ E+K+
Sbjct: 240 CGKKGLSIKPKMRNALLFWSIKPDATYDPLSLHGSCPVIKGNKWSSTKWIRIGEHKL 296
>gi|212720775|ref|NP_001131953.1| uncharacterized protein LOC100193348 [Zea mays]
gi|194693016|gb|ACF80592.1| unknown [Zea mays]
gi|347978798|gb|AEP37741.1| prolyl 4-hydroxylase 1 [Zea mays]
gi|414870898|tpg|DAA49455.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 307
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 174/217 (80%), Positives = 197/217 (90%)
Query: 72 RAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
+ E W EV+SWEPRAFVYHNFLSKEEC++LI+LA PHM+KSTVVDS TG SKDSRVRTSS
Sbjct: 91 KGEPWTEVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGGSKDSRVRTSS 150
Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG 191
G FL RG+DKIIR IEKRIAD+TF P+E GEGLQVLHYE GQKYEPHFDYF D++NTKNG
Sbjct: 151 GMFLRRGQDKIIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNTKNG 210
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
GQR+AT+LMYLSDVE+GGETVFP++ N S+ P++NELSEC K GLS+KPKMGDALLFWS
Sbjct: 211 GQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALLFWS 270
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
MKPD SLDP+SLHGGCPVIKGNKWSSTKW+RV+EYKV
Sbjct: 271 MKPDGSLDPTSLHGGCPVIKGNKWSSTKWMRVHEYKV 307
>gi|222613083|gb|EEE51215.1| hypothetical protein OsJ_32038 [Oryza sativa Japonica Group]
Length = 222
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 172/217 (79%), Positives = 198/217 (91%)
Query: 72 RAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
+ E W EV+SWEPRAF+YHNFLSKEECEYLI+LA PHM+KSTVVD+ TG SKDSRVRTSS
Sbjct: 6 KGEPWTEVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSRVRTSS 65
Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG 191
G FL RG+DKIIR IEKRI+D+TF P+ENGEGLQVLHYE GQKYEPHFDYF DEFNTKNG
Sbjct: 66 GMFLGRGQDKIIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNTKNG 125
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
GQR+AT+LMYLSDVEEGGET+FP+++ N S+ P++NELSEC K GL++KPKMGDALLFWS
Sbjct: 126 GQRIATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALLFWS 185
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
M+PD SLD +SLHGGCPVIKGNKWSSTKW+RV+EYK+
Sbjct: 186 MRPDGSLDATSLHGGCPVIKGNKWSSTKWMRVHEYKI 222
>gi|449520146|ref|XP_004167095.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 249
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 176/253 (69%), Positives = 206/253 (81%), Gaps = 6/253 (2%)
Query: 33 ILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNF 92
+L+A LS P +S + S VR + S+G R +QWVE ISWEPRAFVYHNF
Sbjct: 1 MLIALRFLSPPETS------HHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNF 54
Query: 93 LSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIAD 152
LSKEEC YLI+LA PHM KSTVVD++TGK+ + VRTSSG FL RG+DKI+ +IEKRIAD
Sbjct: 55 LSKEECLYLISLAKPHMEKSTVVDNETGKNVEDSVRTSSGMFLNRGQDKIVSNIEKRIAD 114
Query: 153 FTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETV 212
FTF P+E+GEGLQ+LHYE GQKY+ H+D+F DEFN K GQRMAT+LMYLSDVEEGGETV
Sbjct: 115 FTFIPIEHGEGLQILHYEVGQKYDAHYDFFDDEFNLKEIGQRMATLLMYLSDVEEGGETV 174
Query: 213 FPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKG 272
FP A+GN S+VPWWNELS+CGK GLS+KPKMGDALLFWSMKPD +LDP+SLHG CPVI+G
Sbjct: 175 FPAAKGNFSSVPWWNELSKCGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRG 234
Query: 273 NKWSSTKWIRVNE 285
NKWS TKWI VN+
Sbjct: 235 NKWSCTKWIHVNQ 247
>gi|242039227|ref|XP_002467008.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
gi|241920862|gb|EER94006.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
Length = 307
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 176/235 (74%), Positives = 201/235 (85%), Gaps = 6/235 (2%)
Query: 60 RKSMERSEGDEG------RAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKST 113
R RS D G + E W EV+SWEPRAFVYHNFLSKEEC++LI+LA PHM+KST
Sbjct: 73 RSHFRRSTFDSGLEMRGEKGEPWTEVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKST 132
Query: 114 VVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQ 173
VVDS TG SKDSRVRTSSG FL RG+DKII+ IEKRIADFTF P+E+GEGLQVLHYE GQ
Sbjct: 133 VVDSATGASKDSRVRTSSGMFLRRGQDKIIQTIEKRIADFTFIPVEHGEGLQVLHYEVGQ 192
Query: 174 KYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECG 233
KYEPHFDYF D++NTKNGGQR+AT+LMYLSDVE+GGETVFP++ N S+ P++NELSEC
Sbjct: 193 KYEPHFDYFHDDYNTKNGGQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECA 252
Query: 234 KTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
K GLS+KPKMGDALLFWSMKPD S+D +SLHGGCPVIKGNKWSSTKW+RV+EYK
Sbjct: 253 KGGLSVKPKMGDALLFWSMKPDGSMDSTSLHGGCPVIKGNKWSSTKWMRVHEYKA 307
>gi|357517895|ref|XP_003629236.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523258|gb|AET03712.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 326
Score = 363 bits (933), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 168/267 (62%), Positives = 211/267 (79%), Gaps = 2/267 (0%)
Query: 22 TLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVIS 81
+L+I +T + + + F L + + K N L+SI + R+E D+ + +WV++IS
Sbjct: 62 SLIICWTLFLTLFVTFTFLILIILTLRIPKPNHLNSITHSNTLRNEDDDNK--RWVQIIS 119
Query: 82 WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDK 141
WEPRAF+YHNFL+KEECE+LIN+A P M KS V+D +TG DSR RTSSG FL RG D+
Sbjct: 120 WEPRAFLYHNFLTKEECEHLINIAKPSMHKSAVIDEETGNGVDSRERTSSGAFLKRGSDR 179
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
I+++IE+RIADFTF P+E+GE VLHYE GQKYEPH+DYFMD F+T GQR+AT+LMY
Sbjct: 180 IVKNIERRIADFTFIPVEHGENFNVLHYEVGQKYEPHYDYFMDTFSTTYAGQRIATMLMY 239
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
LSDVEEGGETVFPNA+GN S+VPWWNELS+CGK GLSIKPKMG+A+LFWSMKPDA+LDPS
Sbjct: 240 LSDVEEGGETVFPNAKGNFSSVPWWNELSDCGKGGLSIKPKMGNAILFWSMKPDATLDPS 299
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYKV 288
SLHG CPVIKG+KW KW+ V E+K+
Sbjct: 300 SLHGACPVIKGDKWLCAKWMHVGEFKI 326
>gi|90704797|dbj|BAE92293.1| putative prolyl 4-hydroxylase, alpha subunit [Cryptomeria japonica]
Length = 302
Score = 363 bits (933), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 176/269 (65%), Positives = 214/269 (79%), Gaps = 5/269 (1%)
Query: 21 LTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLS-SIVRKSMERSEGDEGRAEQWVEV 79
TLL +F +L +LA + S+P + + LS SI +RS+G + E+ VEV
Sbjct: 38 FTLLSVF---VLFVLAIWVFSVPVKRTPYQISRQLSESIAADYAKRSDGKDEPKER-VEV 93
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
+SWEPRAF+YHNFL+K+ECEYLIN+A PHM KS VVDS TG S DS VRTSSG FL RG+
Sbjct: 94 LSWEPRAFLYHNFLAKDECEYLINIAKPHMVKSMVVDSKTGGSMDSNVRTSSGWFLNRGQ 153
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
DKIIR IEKRIADF+ P+E+GEGL VLHYE QKY+ H+DYF D N KNGGQR AT+L
Sbjct: 154 DKIIRRIEKRIADFSHIPVEHGEGLHVLHYEVEQKYDAHYDYFSDTINVKNGGQRGATML 213
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
MYLSDVE+GGETVFP ++ N S+VPWW+ELSECG++GLS++PKMGDALLFWS+KPDASLD
Sbjct: 214 MYLSDVEKGGETVFPQSKVNSSSVPWWDELSECGRSGLSVRPKMGDALLFWSVKPDASLD 273
Query: 260 PSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
PSSLHG CPVI+GNKWS+TKW+R+N+Y V
Sbjct: 274 PSSLHGSCPVIQGNKWSATKWMRLNKYSV 302
>gi|222623961|gb|EEE58093.1| hypothetical protein OsJ_08962 [Oryza sativa Japonica Group]
Length = 387
Score = 360 bits (925), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 181/270 (67%), Positives = 211/270 (78%), Gaps = 14/270 (5%)
Query: 9 FPTR--KSSSSTLILTLLIMFTFAILILLAFGILSMPSSS------------GDSRKAND 54
FPTR ++S L L L++ + +L L+AFG+ S+P S+ GD+ A+
Sbjct: 17 FPTRGGRTSPLALALAALLLASALLLALIAFGVFSLPVSAPNAATTDSAAAGGDAEPADP 76
Query: 55 LSSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTV 114
R + SEG R QW EVISWEPRAFVYHNFLSKEEC+YLI LA PHM KSTV
Sbjct: 77 RPPRTRARRDLSEGLGERGAQWTEVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV 136
Query: 115 VDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQK 174
VDS TGKSKDSRVRTSSG FL RGRDK+IR IEKRIAD+TF P+E+GEGLQVLHYE GQK
Sbjct: 137 VDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQK 196
Query: 175 YEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGK 234
YEPHFDYF+DE+NTKNGGQRMAT+LMYLSDVEEGGET+FP+A N S++PW+NELSEC +
Sbjct: 197 YEPHFDYFLDEYNTKNGGQRMATLLMYLSDVEEGGETIFPDANVNSSSLPWYNELSECAR 256
Query: 235 TGLSIKPKMGDALLFWSMKPDASLDPSSLH 264
GL++KPKMGDALLFWSMKPDA+LDP SLH
Sbjct: 257 KGLAVKPKMGDALLFWSMKPDATLDPLSLH 286
>gi|357517897|ref|XP_003629237.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523259|gb|AET03713.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|388513409|gb|AFK44766.1| unknown [Medicago truncatula]
gi|388516345|gb|AFK46234.1| unknown [Medicago truncatula]
Length = 275
Score = 360 bits (924), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 173/286 (60%), Positives = 216/286 (75%), Gaps = 11/286 (3%)
Query: 1 MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
M K ++S RK S T L +F ++L L +P K N L+SI
Sbjct: 1 MVKFKHSNVGLRKPSLITCWTLFLTLFVTFTFLILIILTLRIP-------KLNHLNSITH 53
Query: 61 KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
+ R++ ++ +WV++ISWEPRAF+YHNFL+KEECE+LIN+A P M KS V+D TG
Sbjct: 54 SNTLRNDDNK----RWVQIISWEPRAFLYHNFLTKEECEHLINIAKPSMHKSEVIDEKTG 109
Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFD 180
KS +S +RTSSGTFL R D+I+ +IEKRIADFTF P+E+GE VLHYE GQKYEPH+D
Sbjct: 110 KSLNSSIRTSSGTFLDREGDEIVSNIEKRIADFTFIPVEHGESFNVLHYEVGQKYEPHYD 169
Query: 181 YFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIK 240
YF+D F+T++ GQR+AT+LMYLSDVEEGGETVFPNA+GN S+VPWWNELS+CGK GLSIK
Sbjct: 170 YFLDTFSTRHAGQRIATMLMYLSDVEEGGETVFPNAKGNFSSVPWWNELSDCGKGGLSIK 229
Query: 241 PKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
PKMG+A+LFWSMKPDA+LDPSSLHG CPVIKG+KWS KW+ +EY
Sbjct: 230 PKMGNAILFWSMKPDATLDPSSLHGACPVIKGDKWSCAKWMHADEY 275
>gi|218191856|gb|EEC74283.1| hypothetical protein OsI_09531 [Oryza sativa Indica Group]
Length = 376
Score = 357 bits (917), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 170/240 (70%), Positives = 193/240 (80%), Gaps = 12/240 (5%)
Query: 37 FGILSMPSSS------------GDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEP 84
FG+ S+P S+ GD+ A+ R + SEG R QW EVISWEP
Sbjct: 47 FGVFSLPVSAPNAATTDSAAAGGDAEPADPRPPRTRARRDLSEGLGERGAQWTEVISWEP 106
Query: 85 RAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIR 144
RAFVYHNFLSKEEC+YLI LA PHM KSTVVDS TGKSKDSRVRTSSG FL RGRDK+IR
Sbjct: 107 RAFVYHNFLSKEECDYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKVIR 166
Query: 145 DIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSD 204
IEKRIAD+TF P+E+GEGLQVLHYE GQKYEPHFDYF+DE+NTKNGGQRMAT+LMYLSD
Sbjct: 167 AIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRMATLLMYLSD 226
Query: 205 VEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLH 264
VEEGGET+FP+A N S++PW+NELSEC + GL++KPKMGDALLFWSMKPDA+LDP SLH
Sbjct: 227 VEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALLFWSMKPDATLDPLSLH 286
>gi|356502610|ref|XP_003520111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 286
Score = 353 bits (907), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 165/256 (64%), Positives = 204/256 (79%), Gaps = 8/256 (3%)
Query: 33 ILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNF 92
ILLA ILS P AN SS+ R + +E D+ A + +EVISW+PRAF+YHNF
Sbjct: 38 ILLALHILSTP-------HANANSSVSRNTHIEAEEDDQVALR-MEVISWQPRAFLYHNF 89
Query: 93 LSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIAD 152
L+KEECEYLIN+ATPHM+KSTV D+ +G+S VR S+G FL RG+D+I+R+IEKRIAD
Sbjct: 90 LTKEECEYLINIATPHMQKSTVADNQSGQSVVHDVRKSTGAFLDRGQDEIVRNIEKRIAD 149
Query: 153 FTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETV 212
TF P+ENGE + V+HYE GQ Y+PH+DYF+D+FN +NGGQR+AT+LMYLS+VEEGGET+
Sbjct: 150 VTFIPIENGEPIYVIHYEVGQYYDPHYDYFIDDFNIENGGQRIATMLMYLSNVEEGGETM 209
Query: 213 FPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKG 272
FP A+ N S+VPWWNELS CGK GLSIKPKMGDALLFWSMKP+A+LD +LH CPVIKG
Sbjct: 210 FPRAKANFSSVPWWNELSNCGKMGLSIKPKMGDALLFWSMKPNATLDALTLHSACPVIKG 269
Query: 273 NKWSSTKWIRVNEYKV 288
NKWS TKW+ E+K+
Sbjct: 270 NKWSCTKWMHPTEFKM 285
>gi|357517885|ref|XP_003629231.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523253|gb|AET03707.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 279
Score = 352 bits (902), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 176/289 (60%), Positives = 222/289 (76%), Gaps = 13/289 (4%)
Query: 1 MAKPRYSRFPTRKS---SSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSS 57
M K ++SR RK +S TLI TL + F F I+ L+ L +P K L+S
Sbjct: 1 MGKLKHSRVGPRKPLLPTSRTLIFTLFVTFIFLIIFLILLS-LRIP-------KPKHLNS 52
Query: 58 IVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
I + R + D+ + +WVE++SWEPR F+YHNFL+KEECE+LIN+A P ++KSTVVD
Sbjct: 53 ITHINNLRRDDDDNK--RWVEIVSWEPRVFLYHNFLAKEECEHLINIAKPDVQKSTVVDD 110
Query: 118 DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEP 177
TGKS +S RTSSGTF+ RG DKI+ DIEKRIADFTF P+E+GE + +LHYE GQKY+
Sbjct: 111 TTGKSVNSSARTSSGTFIDRGYDKILSDIEKRIADFTFIPVEHGEDVNILHYEVGQKYDF 170
Query: 178 HFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGL 237
H DYF DE NTK+GG+R+AT+LMYLSDVEEGGETVFP+A+GN S+VPWWNELS+CGK GL
Sbjct: 171 HTDYFEDEVNTKHGGERIATMLMYLSDVEEGGETVFPSAKGNFSSVPWWNELSDCGKKGL 230
Query: 238 SIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
SIKPKMG+A+LFW MKPDA++DP S+HG CPVIKG+KWS TKW+RV ++
Sbjct: 231 SIKPKMGNAILFWGMKPDATVDPLSVHGACPVIKGDKWSCTKWMRVGKW 279
>gi|363807814|ref|NP_001242181.1| uncharacterized protein LOC100782154 [Glycine max]
gi|255644463|gb|ACU22735.1| unknown [Glycine max]
Length = 285
Score = 348 bits (893), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 166/254 (65%), Positives = 202/254 (79%), Gaps = 3/254 (1%)
Query: 28 TFAILILLAFGILSMP--SSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPR 85
+F ILI LA ILS P +SS K NDL+S+ R + SEG+ R ++WVEV+SWEPR
Sbjct: 33 SFLILIPLALRILSTPHVNSSSALSKPNDLNSVPRNT-HVSEGENNRVKRWVEVMSWEPR 91
Query: 86 AFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRD 145
AF+YHNFL+KEECEYLIN ATP+M KS V+D+++G+ ++ RTS+ + RG+DKI+R+
Sbjct: 92 AFLYHNFLTKEECEYLINTATPNMLKSLVIDNESGEGIETSYRTSTEYVVERGKDKIVRN 151
Query: 146 IEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDV 205
IEKRIAD TF P+E+GE L V+ Y GQ YEPH DYF +EF+ NGGQR+AT+LMYLS+V
Sbjct: 152 IEKRIADVTFIPIEHGEPLHVIRYAVGQYYEPHVDYFEEEFSLVNGGQRIATMLMYLSNV 211
Query: 206 EEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHG 265
E GGETVFP A N S+VPWWNELSECG+TGLSIKPKMGDALLFWSMKPDA+LDP +LH
Sbjct: 212 EGGGETVFPIANANFSSVPWWNELSECGQTGLSIKPKMGDALLFWSMKPDATLDPLTLHR 271
Query: 266 GCPVIKGNKWSSTK 279
CPVIKGNKWS TK
Sbjct: 272 ACPVIKGNKWSCTK 285
>gi|302793288|ref|XP_002978409.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
gi|300153758|gb|EFJ20395.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
Length = 256
Score = 346 bits (887), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 158/211 (74%), Positives = 183/211 (86%)
Query: 76 WVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
W E ISW+PRA V+HNFLS EEC++LI LA P+M++S VVD+ TGKSKDSRVRTSSGTFL
Sbjct: 45 WTETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSRVRTSSGTFL 104
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
RG+D+II IE+RIA FTF P E+GEGLQVLHYE GQKY+ H DYF D+ NTKNGGQR+
Sbjct: 105 RRGQDEIISRIEERIAKFTFIPKEHGEGLQVLHYEVGQKYDAHHDYFHDKVNTKNGGQRV 164
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
ATVLMYLSDVEEGGETVFP+A+ N S+VPWW+ELSECGK G+S+KP+ GDALLFWSM PD
Sbjct: 165 ATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECGKKGVSVKPRKGDALLFWSMSPD 224
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
A LDP SLHGGCPVIKGNKWS+TKW+ + EY
Sbjct: 225 AELDPFSLHGGCPVIKGNKWSATKWMHLREY 255
>gi|302773668|ref|XP_002970251.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
gi|300161767|gb|EFJ28381.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
Length = 256
Score = 344 bits (883), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 157/211 (74%), Positives = 182/211 (86%)
Query: 76 WVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
W E ISW+PRA V+HNFLS EEC++LI LA P+M++S VVD+ TGKSKDSRVRTSSGTFL
Sbjct: 45 WTETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSRVRTSSGTFL 104
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
RG+D+II IE+RIA FTF P E+GEGLQVLHYE GQKY+ H DYF D+ NTKNGGQR+
Sbjct: 105 RRGQDEIISRIEERIAKFTFIPKEHGEGLQVLHYEVGQKYDAHHDYFHDKVNTKNGGQRV 164
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
ATVLMYLSDVEEGGETVFP+A+ N S+VPWW+ELSEC K G+S+KP+ GDALLFWSM PD
Sbjct: 165 ATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECAKKGVSVKPRKGDALLFWSMSPD 224
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
A LDP SLHGGCPVIKGNKWS+TKW+ + EY
Sbjct: 225 AELDPFSLHGGCPVIKGNKWSATKWMHLREY 255
>gi|168002780|ref|XP_001754091.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162694645|gb|EDQ80992.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 214
Score = 342 bits (878), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 154/212 (72%), Positives = 184/212 (86%)
Query: 75 QWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTF 134
+WVEV+SWEPRAF+YH+FL++EEC +LI +A P + KSTVVDSDTGKSKDSR+RTSSGTF
Sbjct: 1 RWVEVLSWEPRAFLYHHFLTEEECNHLIEVARPSLVKSTVVDSDTGKSKDSRLRTSSGTF 60
Query: 135 LARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQR 194
L RG+D +I+ IEKRIADFTF P E GEGLQVL Y+ +KYEPH+DYF D +NTKNGGQR
Sbjct: 61 LMRGQDPVIKRIEKRIADFTFIPAEQGEGLQVLQYKESEKYEPHYDYFHDAYNTKNGGQR 120
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+ATVLMYLS+VEEGGETVFP AQ N + VP W++LSEC + GLS++P+MGDALLFWSMKP
Sbjct: 121 IATVLMYLSNVEEGGETVFPAAQVNKTEVPDWDKLSECAQKGLSVRPRMGDALLFWSMKP 180
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
DA+LD +SLHGGCPVIKG KWS+TKW+ V Y
Sbjct: 181 DATLDSTSLHGGCPVIKGTKWSATKWLHVENY 212
>gi|168060785|ref|XP_001782374.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666166|gb|EDQ52828.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 211
Score = 338 bits (868), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 152/211 (72%), Positives = 180/211 (85%)
Query: 76 WVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
WVEV+SWEPRAF+YH+FL++ EC +LI +A P + KSTV+DS TGKSKDSRVRTSSGTFL
Sbjct: 1 WVEVLSWEPRAFLYHHFLTQVECNHLIEVAKPSLVKSTVIDSATGKSKDSRVRTSSGTFL 60
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
RG+D II+ IEKRIADFTF P+E GEGLQVL Y +KYEPH+DYF D FNTKNGGQR+
Sbjct: 61 VRGQDHIIKRIEKRIADFTFIPVEQGEGLQVLQYRESEKYEPHYDYFHDAFNTKNGGQRI 120
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
ATVLMYLSDVE+GGETVFP ++ N S VP W++ SEC K GLS++P+MGDALLFWSMKPD
Sbjct: 121 ATVLMYLSDVEKGGETVFPASKVNASEVPDWDQRSECAKRGLSVRPRMGDALLFWSMKPD 180
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
A LDP+SLHG CPVI+G KWS+TKW+ V +Y
Sbjct: 181 AKLDPTSLHGACPVIQGTKWSATKWLHVEKY 211
>gi|18071415|gb|AAL58274.1|AC068923_16 putative prolyl 4-hydroxylase, alpha subunit [Oryza sativa Japonica
Group]
Length = 343
Score = 337 bits (864), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 157/215 (73%), Positives = 185/215 (86%), Gaps = 2/215 (0%)
Query: 56 SSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV 115
S+ +E G++G E W EV+SWEPRAF+YHNFLSKEECEYLI+LA PHM+KSTVV
Sbjct: 91 SAAFESGLEMRGGEKG--EPWTEVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVV 148
Query: 116 DSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKY 175
D+ TG SKDSRVRTSSG FL RG+DKIIR IEKRI+D+TF P+ENGEGLQVLHYE GQKY
Sbjct: 149 DASTGGSKDSRVRTSSGMFLGRGQDKIIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKY 208
Query: 176 EPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKT 235
EPHFDYF DEFNTKNGGQR+AT+LMYLSDVEEGGET+FP+++ N S+ P++NELSEC K
Sbjct: 209 EPHFDYFHDEFNTKNGGQRIATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKK 268
Query: 236 GLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVI 270
GL++KPKMGDALLFWSM+PD SLD +SLHG P++
Sbjct: 269 GLAVKPKMGDALLFWSMRPDGSLDATSLHGEIPIL 303
>gi|357483927|ref|XP_003612250.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355513585|gb|AES95208.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 204
Score = 330 bits (845), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 155/203 (76%), Positives = 181/203 (89%), Gaps = 1/203 (0%)
Query: 1 MAKPRYSRFPTRKSSSS-TLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIV 59
MAK RYSR P+RKSSS TLI +L I FTF ILILL FGILS+PSS+ + K NDL+SIV
Sbjct: 1 MAKSRYSRLPSRKSSSPYTLIFSLFIAFTFLILILLVFGILSIPSSNQNLPKPNDLTSIV 60
Query: 60 RKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDT 119
+++R++ +EG+ EQWVEV+SWEPRAFVYHNFL+KEECEYLI++A P M KSTVVDS+T
Sbjct: 61 HNTVDRNDDEEGKGEQWVEVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVVDSET 120
Query: 120 GKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHF 179
GKSKDSRVRTSSGTFLARGRDKI+R+IEK+IADFTF P+E+GEGLQVLHYE GQKYEPH+
Sbjct: 121 GKSKDSRVRTSSGTFLARGRDKIVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQKYEPHY 180
Query: 180 DYFMDEFNTKNGGQRMATVLMYL 202
DYF+DEFNTKNGGQR+ATVLMYL
Sbjct: 181 DYFLDEFNTKNGGQRIATVLMYL 203
>gi|302762452|ref|XP_002964648.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
gi|300168377|gb|EFJ34981.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
Length = 225
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 148/224 (66%), Positives = 185/224 (82%)
Query: 65 RSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD 124
R E EG+ E W E+ISW PRA + HNFL+ +EC++LI +A P M+KSTVVDS TG S+D
Sbjct: 2 REEVGEGKHEPWSEIISWTPRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRD 61
Query: 125 SRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD 184
SRVRTSSG FL RG+D++I +IE +IA TF P ++GEG+QVLHYE GQKY+ H D+F D
Sbjct: 62 SRVRTSSGMFLNRGQDRVISEIEDKIAKLTFIPKDHGEGIQVLHYEPGQKYDAHHDFFYD 121
Query: 185 EFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMG 244
NT+NGGQR+AT+LMYL+DVEEGGETVFP + N S++PW N+LSECG+ G+S++PK G
Sbjct: 122 TVNTRNGGQRIATLLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRG 181
Query: 245 DALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
DALLFWSM PDA LD SSLHGGCPVIKG+KWS+TKW+RV+EYK+
Sbjct: 182 DALLFWSMSPDAQLDHSSLHGGCPVIKGDKWSATKWMRVSEYKL 225
>gi|3805847|emb|CAA21467.1| putative protein [Arabidopsis thaliana]
gi|7270533|emb|CAB81490.1| putative protein [Arabidopsis thaliana]
Length = 307
Score = 328 bits (840), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 165/264 (62%), Positives = 190/264 (71%), Gaps = 34/264 (12%)
Query: 36 AFGILSMPSSSGDSRKANDLSSIVRKSMER-SEGDE--GRAEQWVEVISWEPRAFVYHNF 92
GI S+PS++ S DL++IV+ ER S GDE G ++W+EVISWEPRAFVYHNF
Sbjct: 36 GLGIFSLPSTNKTSSMPMDLTTIVQTIQERESFGDEEDGNGDRWLEVISWEPRAFVYHNF 95
Query: 93 LSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR-------------------------- 126
L+ EECE+LI+LA P M KS VVD TGKS DSR
Sbjct: 96 LTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSRFCTLTSVVVFTFQLNLERFENSKFAN 155
Query: 127 -----VRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDY 181
VRTSSGTFL RG D+I+ +IE RI+DFTF P ENGEGLQVLHYE GQ+YEPH DY
Sbjct: 156 PSLCRVRTSSGTFLNRGHDEIVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHHDY 215
Query: 182 FMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKP 241
F DEFN + GGQR+ATVLMYLSDV+EGGETVFP A+GN+S VPWW+ELS+CGK GLS+ P
Sbjct: 216 FFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGKEGLSVLP 275
Query: 242 KMGDALLFWSMKPDASLDPSSLHG 265
K DALLFWSMKPDASLDPSSLHG
Sbjct: 276 KKRDALLFWSMKPDASLDPSSLHG 299
>gi|388520325|gb|AFK48224.1| unknown [Lotus japonicus]
Length = 188
Score = 326 bits (835), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 147/188 (78%), Positives = 168/188 (89%)
Query: 101 LINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLEN 160
+INLA PHM KS+VVDS TGKS SRVRTSSG FL RG+DK+I+ IEKRIADF F P+EN
Sbjct: 1 MINLAKPHMAKSSVVDSQTGKSVGSRVRTSSGMFLKRGKDKVIQTIEKRIADFAFIPVEN 60
Query: 161 GEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNI 220
GEGLQVLHYE GQKYEPH+DYF+DEFNTKNGGQR+ATVLMYLSDVEEGGET+FP A+ N
Sbjct: 61 GEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVEEGGETIFPAAKANF 120
Query: 221 SAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKW 280
S+VPW+N+LS C K GLS+KPK GDALLFWS++PDA+LDPSSLHGGCPVI+GNKWSSTKW
Sbjct: 121 SSVPWYNDLSVCAKKGLSVKPKRGDALLFWSIRPDATLDPSSLHGGCPVIRGNKWSSTKW 180
Query: 281 IRVNEYKV 288
+ + EYKV
Sbjct: 181 MHLEEYKV 188
>gi|255633460|gb|ACU17088.1| unknown [Glycine max]
Length = 207
Score = 325 bits (834), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 158/207 (76%), Positives = 185/207 (89%), Gaps = 2/207 (0%)
Query: 1 MAKPRYSRFPTRKSSSS-TLILTLLIMFTFAILILLAFGILSMPSSS-GDSRKANDLSSI 58
MAKPRYSR P RKSSSS TLILTL ++FTF +LILLA GILS+PSSS G+ K NDL+SI
Sbjct: 1 MAKPRYSRLPPRKSSSSSTLILTLFLVFTFLVLILLALGILSIPSSSRGNLPKPNDLASI 60
Query: 59 VRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSD 118
R ++ S+ D+ R EQWVEV+SWEPRAFVYHNFL+KEECEYLI++A P+M KS+VVDS+
Sbjct: 61 ARNTIHTSDDDDVRGEQWVEVVSWEPRAFVYHNFLTKEECEYLIDIAKPNMHKSSVVDSE 120
Query: 119 TGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPH 178
TGKSKDSRVRTSSGTFLARGRDKI+RDIEKRIA ++F P+E+GEGLQVLHYE GQKYEPH
Sbjct: 121 TGKSKDSRVRTSSGTFLARGRDKIVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQKYEPH 180
Query: 179 FDYFMDEFNTKNGGQRMATVLMYLSDV 205
+DYF+D+FNTKNGGQR+ATVLMYL+DV
Sbjct: 181 YDYFLDDFNTKNGGQRIATVLMYLTDV 207
>gi|238007346|gb|ACR34708.1| unknown [Zea mays]
Length = 180
Score = 324 bits (830), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 147/179 (82%), Positives = 166/179 (92%)
Query: 109 MRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLH 168
M KSTVVDS TGKSKDSRVRTSSG FL RGRDK+IR IEKRIAD+TF P+++GEGLQVLH
Sbjct: 1 MVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRVIEKRIADYTFIPVDHGEGLQVLH 60
Query: 169 YEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNE 228
YE GQKYEPHFDYF+DEFNTKNGGQRMAT+LMYLSDVEEGGET+FP+A N+S++PW+NE
Sbjct: 61 YEVGQKYEPHFDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETIFPDANVNVSSLPWYNE 120
Query: 229 LSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
LSEC K GLS+KPKMGDALLFWSMKPDA+LDP SLHGGCPVI+GNKWSSTKW+ ++EYK
Sbjct: 121 LSECAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHEYK 179
>gi|302815629|ref|XP_002989495.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
gi|300142673|gb|EFJ09371.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
Length = 213
Score = 321 bits (823), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 143/213 (67%), Positives = 179/213 (84%)
Query: 76 WVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
W E+ISW PRA + HNFL+ +EC++LI +A P M+KSTVVDS TG S+DSRVRTSSG FL
Sbjct: 1 WSEIISWTPRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRDSRVRTSSGMFL 60
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
RG+D++I +IE +IA TF P ++GEG+QVLHYE GQKY+ H D+F D NT+NGGQR+
Sbjct: 61 NRGQDRVISEIEDKIAKLTFIPKDHGEGIQVLHYEPGQKYDAHHDFFYDTVNTRNGGQRI 120
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
AT+LMYL+DVEEGGETVFP + N S++PW N+LSECG+ G+S++PK GDALLFWSM PD
Sbjct: 121 ATLLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMSPD 180
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
A LD SSLHGGCPVIKG+KWS+TKW+RV+EYK+
Sbjct: 181 AQLDHSSLHGGCPVIKGDKWSATKWMRVSEYKL 213
>gi|224033439|gb|ACN35795.1| unknown [Zea mays]
Length = 180
Score = 321 bits (823), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 146/180 (81%), Positives = 165/180 (91%)
Query: 109 MRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLH 168
M KSTVVDS TGKSKDSRVRTSSG FL RGRDK+IR IEKRIAD+TF P+++GEGLQVLH
Sbjct: 1 MVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPVDHGEGLQVLH 60
Query: 169 YEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNE 228
YE GQKYEPHFDYF+DEFNTKNGGQR+AT+LMYLSDVEEGGET+FP+A N S++PW+NE
Sbjct: 61 YEVGQKYEPHFDYFLDEFNTKNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNE 120
Query: 229 LSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
LS+C K GLS+KPKMGDALLFWSMKPDA+LDP SLHGGCPVIKGNKWSSTKW+ ++EYK
Sbjct: 121 LSDCAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHEYKA 180
>gi|363543299|ref|NP_001241865.1| prolyl 4-hydroxylase 5-1 [Zea mays]
gi|347978814|gb|AEP37749.1| prolyl 4-hydroxylase 5-1 [Zea mays]
Length = 180
Score = 318 bits (815), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 145/180 (80%), Positives = 164/180 (91%)
Query: 109 MRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLH 168
M KSTVVDS TGKSKDSRVRTSSG FL RGRDK+IR IEKRI D+TF P+++GEGLQVLH
Sbjct: 1 MVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRVIEKRITDYTFIPVDHGEGLQVLH 60
Query: 169 YEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNE 228
YE GQKYEPHFDYF+DEFNTKNGGQRMAT+LM+LSDVEEGGET+FP+A N S++PW+NE
Sbjct: 61 YEVGQKYEPHFDYFLDEFNTKNGGQRMATLLMHLSDVEEGGETIFPDANVNDSSLPWYNE 120
Query: 229 LSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
LSEC K GLS+KPKMGDALLFWSMKPDA+LDP SLHGGCPVI+GNKWSSTKW+ ++EYK
Sbjct: 121 LSECAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHEYKA 180
>gi|357467075|ref|XP_003603822.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492870|gb|AES74073.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 683
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 143/217 (65%), Positives = 171/217 (78%), Gaps = 6/217 (2%)
Query: 72 RAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
R +QW E++S PRA +YHNFLSKEECE+LINLA P M +S VVD TG+ K+S RTSS
Sbjct: 107 RKDQWTEILSSVPRASMYHNFLSKEECEHLINLAKPFMARSLVVDGVTGEVKESSSRTSS 166
Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG 191
G FL RG+DKI+++IE+RIAD T P+ENGEGL V+HY GQK EPH+DY D TKNG
Sbjct: 167 GMFLDRGKDKIVQNIERRIADITSVPIENGEGLHVIHYGVGQKCEPHYDYTSDGVVTKNG 226
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
G R+ATVLMYLSDVEEGGETVFP+AQ N ++V S+C GLS+KPKMGDALLFWS
Sbjct: 227 GPRVATVLMYLSDVEEGGETVFPDAQPNFTSV------SKCSGDGLSVKPKMGDALLFWS 280
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
MKPD +LD SSLHGG PVI+GNKW+STKW+ + E K+
Sbjct: 281 MKPDGTLDTSSLHGGSPVIRGNKWASTKWLHLRECKL 317
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 115/197 (58%), Positives = 138/197 (70%), Gaps = 21/197 (10%)
Query: 92 FLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIA 151
F SKEECE+LINLA P M +S VVD TGK ++S RTSSG FL RG+DKI+++IE+RIA
Sbjct: 372 FGSKEECEHLINLAKPFMTRSLVVDGLTGKGRESSARTSSGRFLERGKDKIVQNIEQRIA 431
Query: 152 DFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGET 211
D T P + + AG TKNGG R+ATVLMYLSDVEEGGET
Sbjct: 432 DITSIPRM---ARDFMLFTAGGVV------------TKNGGPRVATVLMYLSDVEEGGET 476
Query: 212 VFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIK 271
VFPNA+ NI++V + E GLS+KPKMGDALLF SMKPD +LD SSLHGG PVI+
Sbjct: 477 VFPNAKPNINSVSKYPE------KGLSVKPKMGDALLFRSMKPDGTLDTSSLHGGSPVIR 530
Query: 272 GNKWSSTKWIRVNEYKV 288
GNKW+STKW+ + E+KV
Sbjct: 531 GNKWASTKWLHLTEFKV 547
>gi|357467077|ref|XP_003603823.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492871|gb|AES74074.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 291
Score = 291 bits (745), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 139/217 (64%), Positives = 174/217 (80%), Gaps = 5/217 (2%)
Query: 72 RAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
R +QW EV+S EPRA +YHNFLSKEECE+LINLA P M++S VVD TG+ + VRTSS
Sbjct: 80 RKDQWTEVLSSEPRASMYHNFLSKEECEHLINLAKPFMQRSLVVDGVTGQGILNSVRTSS 139
Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG 191
GTFL RG+DKI++++E+RIAD T P+ENGEGLQ++HYE GQK+EPH+DY + T NG
Sbjct: 140 GTFLERGKDKIVQNVERRIADITSIPIENGEGLQIIHYEVGQKFEPHYDYNFNWRITNNG 199
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
G R+ATVLMYLSDVEEGGETVFPNA+ N ++V ++ GK GL +KPKMGDALLFWS
Sbjct: 200 GPRVATVLMYLSDVEEGGETVFPNAKPNFNSVSKYHP----GK-GLVVKPKMGDALLFWS 254
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
+KPD SLD +SLHGG PVI+G+KW+S K + + E+KV
Sbjct: 255 VKPDGSLDTASLHGGSPVIRGSKWASNKLLHLTEFKV 291
>gi|357517893|ref|XP_003629235.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355523257|gb|AET03711.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 196
Score = 282 bits (722), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 132/195 (67%), Positives = 156/195 (80%), Gaps = 14/195 (7%)
Query: 94 SKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADF 153
+KEECE+LIN+A P M KSTV D +TGKS D+ RTSSGTF+ RG DKI+R+IE+RIADF
Sbjct: 14 TKEECEHLINIAKPSMHKSTV-DDETGKSVDNSARTSSGTFINRGHDKILRNIEQRIADF 72
Query: 154 TFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVF 213
TF P+ENGE + +LHYE GQKYEPH D+F DE NTKNGG E+GGETVF
Sbjct: 73 TFIPVENGESVNILHYEVGQKYEPHPDFFTDEINTKNGG-------------EQGGETVF 119
Query: 214 PNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGN 273
P A+GN S+VPWWNELS+CGK GLSIKPKMGDALLFWSMKPD +LDP S+HG CPVIKG+
Sbjct: 120 PFAEGNFSSVPWWNELSDCGKKGLSIKPKMGDALLFWSMKPDGTLDPLSMHGACPVIKGD 179
Query: 274 KWSSTKWIRVNEYKV 288
KWS TKW+RV ++ +
Sbjct: 180 KWSCTKWMRVGKWSI 194
>gi|50845214|gb|AAT84604.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
Length = 316
Score = 280 bits (715), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 140/265 (52%), Positives = 180/265 (67%), Gaps = 4/265 (1%)
Query: 24 LIMFTFAILILLAFGILS-MPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISW 82
++ F + I L+F +LS PSS G S++R E G V +SW
Sbjct: 1 MVKFDVFLTIFLSFFLLSPHPSSCGWLNNVKKGKSVLRLKSENVPSSVGVDPSHVTQLSW 60
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PRAF+Y FL+ EEC++LI++A + KS V D+++GKS S VRTSSG FL + +D +
Sbjct: 61 KPRAFLYEGFLTHEECDHLIDMAKDKLEKSMVADNESGKSIPSEVRTSSGMFLQKAQDDV 120
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
+ IE RIA +TF P+ENGE +Q+LHYE GQKYEPHFDYF D+ N + GG R+ATVLMYL
Sbjct: 121 VAAIEARIAAWTFLPIENGEAMQILHYERGQKYEPHFDYFHDKVNQQLGGHRIATVLMYL 180
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNE-LSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
S+VEEGGETVFPNA+ + NE LS+C K G S+KPK GDALLF+S+ PDAS D
Sbjct: 181 SNVEEGGETVFPNAEAKLQLAN--NESLSDCAKGGYSVKPKKGDALLFFSLHPDASTDSL 238
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEY 286
SLHG CPVI+G KWS+TKWI V +
Sbjct: 239 SLHGSCPVIEGEKWSATKWIHVRSF 263
>gi|357467087|ref|XP_003603828.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
gi|355492876|gb|AES74079.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
Length = 156
Score = 279 bits (713), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 124/155 (80%), Positives = 142/155 (91%)
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL RG+DKII++IE+RIADFTF P+ENGEGLQVLHY G+KYEPH+DYF+DEFNTKNGGQ
Sbjct: 2 FLKRGKDKIIQNIERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYFLDEFNTKNGGQ 61
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R+ATVLMYLSDVEEGGETVFP A+ N S+VPWWN+LSEC + GLS+KPKMGDALLFWSM+
Sbjct: 62 RVATVLMYLSDVEEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPKMGDALLFWSMR 121
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
PDA+LD SSLHGGCPVI GNKWSSTKW+ + EYKV
Sbjct: 122 PDATLDASSLHGGCPVIVGNKWSSTKWMHLEEYKV 156
>gi|384251901|gb|EIE25378.1| hypothetical protein COCSUDRAFT_35772 [Coccomyxa subellipsoidea
C-169]
Length = 222
Score = 279 bits (713), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 125/214 (58%), Positives = 163/214 (76%), Gaps = 6/214 (2%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+EV+SWEPRA++YHNFL++ E +YL+ PHM KS VVD++TGKS S+VRTSSG FL
Sbjct: 1 MEVLSWEPRAYLYHNFLTEAEADYLVQKGKPHMEKSEVVDNETGKSAPSKVRTSSGMFLN 60
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
RG D +I IE RIA +T P ENGEGLQ+LHY+A ++Y PHFDYF D FNT+NGGQR+A
Sbjct: 61 RGEDDVIERIEARIAKYTAIPKENGEGLQILHYQASEEYRPHFDYFHDNFNTQNGGQRIA 120
Query: 197 TVLMYLSDVEEGGETVFPNA--QGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
T+LMYLSDVE+GGETVFP + + N+ + S+C + G + KPK GDAL F+S+ P
Sbjct: 121 TMLMYLSDVEDGGETVFPESSDKPNVGNT----KFSQCAQAGAAAKPKKGDALFFYSLTP 176
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
D +D SLH GCPV+KG+KWS+TKW+RV+ ++
Sbjct: 177 DGRMDEKSLHAGCPVMKGDKWSATKWLRVDRFEA 210
>gi|255552788|ref|XP_002517437.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223543448|gb|EEF44979.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 311
Score = 271 bits (693), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 126/211 (59%), Positives = 163/211 (77%), Gaps = 1/211 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +SW PRAF+Y FLS EEC++LI+LA + KS V D+++GKS +S VRTSSG F+A
Sbjct: 49 VTQLSWHPRAFLYKGFLSYEECDHLIDLARDKLEKSMVADNESGKSIESEVRTSSGMFIA 108
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D+I+ DIE RIA +TF P ENGE +Q+LHYE GQKYEPHFDYF D+ N + GG R+A
Sbjct: 109 KAQDEIVADIEARIAAWTFLPEENGESMQILHYEHGQKYEPHFDYFHDKANQELGGHRVA 168
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVLMYLS+VE+GGETVFPNA+G +S P + S+C K G ++KP+ GDALLF+S+ PDA
Sbjct: 169 TVLMYLSNVEKGGETVFPNAEGKLSQ-PKEDSWSDCAKGGYAVKPEKGDALLFFSLHPDA 227
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+ D SLHG CPVI+G KWS+TKWI V ++
Sbjct: 228 TTDSDSLHGSCPVIEGEKWSATKWIHVRSFE 258
>gi|359477455|ref|XP_002278454.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Vitis
vinifera]
Length = 296
Score = 268 bits (686), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 130/260 (50%), Positives = 180/260 (69%), Gaps = 18/260 (6%)
Query: 27 FTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPRA 86
F +L+ ++ IL SS D+ +N ++ VR+ ISW+PRA
Sbjct: 5 LQFLLLLWISSTILEFSSSYADAAGSNVSAAKVRQ------------------ISWKPRA 46
Query: 87 FVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDI 146
FVY FLS+EEC++LI+LA +++S V D+ +GKS+ S VRTSSG F+ +G+D I+ I
Sbjct: 47 FVYEGFLSEEECDHLISLAKSELKRSAVADNVSGKSRLSEVRTSSGMFIGKGKDPIVAGI 106
Query: 147 EKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVE 206
E +IA +TF P +NGE +QVL YE GQKY+ H+DYF+D+ N GG R+ATVLMYLSDV
Sbjct: 107 EDKIAAWTFLPKDNGEDMQVLRYEPGQKYDAHYDYFVDKVNIARGGHRIATVLMYLSDVV 166
Query: 207 EGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGG 266
+GGETVFP A+ + S +P ++LSEC + G+++KP+ GDALLF+S+ P A DP SLHGG
Sbjct: 167 KGGETVFPMAEVSSSTLPTNDDLSECARKGIAVKPRKGDALLFFSLHPTAIPDPMSLHGG 226
Query: 267 CPVIKGNKWSSTKWIRVNEY 286
CPVI+G KWS+TKWI V+ +
Sbjct: 227 CPVIEGEKWSATKWIHVDSF 246
>gi|302845234|ref|XP_002954156.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
nagariensis]
gi|300260655|gb|EFJ44873.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
nagariensis]
Length = 309
Score = 268 bits (686), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 139/262 (53%), Positives = 175/262 (66%), Gaps = 12/262 (4%)
Query: 27 FTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVI--SWEP 84
F + +L ILS+ +S +A S ++ S E + E EVI SW P
Sbjct: 6 MAFQVSAVLLLTILSLAVAS----EAASTSHVITGSGHTVGFGELKEEWRGEVIHLSWSP 61
Query: 85 RAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIR 144
RAF+ FLS EECE++I A P M KS+VVD+ +GKS DS +RTS+G +LA+G D+II
Sbjct: 62 RAFLLKGFLSDEECEHIIAKAKPRMVKSSVVDNASGKSVDSEIRTSTGAWLAKGEDEIIS 121
Query: 145 DIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMYL 202
IEKR+A T PLEN EGLQVLHY GQKYEPH+DYF D N ++GGQR+ TVLMYL
Sbjct: 122 RIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNASPEHGGQRVVTVLMYL 181
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
+ VEEGGETV P+A +S W SEC K GL++KP GDAL+F+S+KPD S DP+S
Sbjct: 182 TTVEEGGETVLPHADQKVSGEGW----SECAKRGLAVKPVKGDALMFYSLKPDGSNDPAS 237
Query: 263 LHGGCPVIKGNKWSSTKWIRVN 284
LHG CP +KG+KWS+TKWI V
Sbjct: 238 LHGSCPTLKGDKWSATKWIHVG 259
>gi|215490183|dbj|BAG86625.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
Length = 318
Score = 268 bits (686), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 125/233 (53%), Positives = 167/233 (71%), Gaps = 1/233 (0%)
Query: 56 SSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV 115
SS+++ +RS V ISW PRAFVY NFL+ EEC++ I LA + KS V
Sbjct: 41 SSVLKLLTDRSSSSPTIDPTRVTQISWRPRAFVYRNFLTDEECDHFITLAKHKLEKSMVA 100
Query: 116 DSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKY 175
D+++GKS +S VRTSSG F + +D+++ ++E RIA +TF P ENGE +Q+LHYE GQKY
Sbjct: 101 DNESGKSVESEVRTSSGMFFRKAQDQVVANVEARIAAWTFLPEENGESIQILHYEHGQKY 160
Query: 176 EPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKT 235
EPHFDYF D+ N + GG R+ATVLMYLSDVE+GGETVFPN++ + ++ S+C K
Sbjct: 161 EPHFDYFHDKVNQELGGHRVATVLMYLSDVEKGGETVFPNSEAKKTQAK-GDDWSDCAKK 219
Query: 236 GLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
G ++KP+ GDALLF+S+ PDA+ DP SLHG CPVI+G KWS+TKWI V ++
Sbjct: 220 GYAVKPRKGDALLFFSLHPDATTDPLSLHGSCPVIEGEKWSATKWIHVRSFET 272
>gi|159794881|pdb|2JIJ|A Chain A, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
gi|159794882|pdb|2JIJ|B Chain B, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
gi|159794883|pdb|2JIJ|C Chain C, Crystal Structure Of The Apo Form Of Chlamydomonas
Reinhardtii Prolyl-4 Hydroxylase Type I
Length = 233
Score = 267 bits (683), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 126/215 (58%), Positives = 160/215 (74%), Gaps = 9/215 (4%)
Query: 74 EQW---VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTS 130
E+W V +SW PRAF+ NFLS EEC+Y++ A P M KS+VVD+++GKS DS +RTS
Sbjct: 16 EEWRGEVVHLSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTS 75
Query: 131 SGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT-- 188
+GT+ A+G D +I IEKR+A T PLEN EGLQVLHY GQKYEPH+DYF D N
Sbjct: 76 TGTWFAKGEDSVISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGP 135
Query: 189 KNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALL 248
++GGQR+ T+LMYL+ VEEGGETV PNA+ ++ W SEC K GL++KP GDAL+
Sbjct: 136 EHGGQRVVTMLMYLTTVEEGGETVLPNAEQKVTGDGW----SECAKRGLAVKPIKGDALM 191
Query: 249 FWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRV 283
F+S+KPD S DP+SLHG CP +KG+KWS+TKWI V
Sbjct: 192 FYSLKPDGSNDPASLHGSCPTLKGDKWSATKWIHV 226
>gi|225452614|ref|XP_002281420.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296087745|emb|CBI35001.3| unnamed protein product [Vitis vinifera]
Length = 316
Score = 267 bits (683), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 126/210 (60%), Positives = 158/210 (75%), Gaps = 1/210 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +SW PRAF+Y FLS+EEC++LI LA + KS V D+++GKS S VRTSSG FL
Sbjct: 54 VTQLSWRPRAFLYKGFLSEEECDHLITLAKDKLEKSMVADNESGKSIMSEVRTSSGMFLL 113
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D+I+ DIE RIA +TF P+ENGE +Q+LHYE G+KYEPHFDYF D+ N GG R+A
Sbjct: 114 KAQDEIVADIEARIAAWTFLPVENGESIQILHYENGEKYEPHFDYFHDKVNQLLGGHRIA 173
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVLMYL+ VEEGGETVFPN++G S P + S+C K G ++ PK GDALLF+S+ PDA
Sbjct: 174 TVLMYLATVEEGGETVFPNSEGRFSQ-PKDDSWSDCAKKGYAVNPKKGDALLFFSLHPDA 232
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
+ DPSSLHG CPVI G KWS+TKWI V +
Sbjct: 233 TTDPSSLHGSCPVIAGEKWSATKWIHVRSF 262
>gi|241913390|pdb|3GZE|A Chain A, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913391|pdb|3GZE|B Chain B, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913392|pdb|3GZE|C Chain C, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
gi|241913393|pdb|3GZE|D Chain D, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
(Ser-Pro)5 Peptide Substrate
Length = 225
Score = 267 bits (682), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 126/215 (58%), Positives = 160/215 (74%), Gaps = 9/215 (4%)
Query: 74 EQW---VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTS 130
E+W V +SW PRAF+ NFLS EEC+Y++ A P M KS+VVD+++GKS DS +RTS
Sbjct: 8 EEWRGEVVHLSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTS 67
Query: 131 SGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT-- 188
+GT+ A+G D +I IEKR+A T PLEN EGLQVLHY GQKYEPH+DYF D N
Sbjct: 68 TGTWFAKGEDSVISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGP 127
Query: 189 KNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALL 248
++GGQR+ T+LMYL+ VEEGGETV PNA+ ++ W SEC K GL++KP GDAL+
Sbjct: 128 EHGGQRVVTMLMYLTTVEEGGETVLPNAEQKVTGDGW----SECAKRGLAVKPIKGDALM 183
Query: 249 FWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRV 283
F+S+KPD S DP+SLHG CP +KG+KWS+TKWI V
Sbjct: 184 FYSLKPDGSNDPASLHGSCPTLKGDKWSATKWIHV 218
>gi|159794879|pdb|2JIG|A Chain A, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
Dicarboxylate
gi|159794880|pdb|2JIG|B Chain B, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
Dicarboxylate
Length = 224
Score = 267 bits (682), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 126/215 (58%), Positives = 160/215 (74%), Gaps = 9/215 (4%)
Query: 74 EQW---VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTS 130
E+W V +SW PRAF+ NFLS EEC+Y++ A P M KS+VVD+++GKS DS +RTS
Sbjct: 7 EEWRGEVVHLSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTS 66
Query: 131 SGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT-- 188
+GT+ A+G D +I IEKR+A T PLEN EGLQVLHY GQKYEPH+DYF D N
Sbjct: 67 TGTWFAKGEDSVISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGP 126
Query: 189 KNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALL 248
++GGQR+ T+LMYL+ VEEGGETV PNA+ ++ W SEC K GL++KP GDAL+
Sbjct: 127 EHGGQRVVTMLMYLTTVEEGGETVLPNAEQKVTGDGW----SECAKRGLAVKPIKGDALM 182
Query: 249 FWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRV 283
F+S+KPD S DP+SLHG CP +KG+KWS+TKWI V
Sbjct: 183 FYSLKPDGSNDPASLHGSCPTLKGDKWSATKWIHV 217
>gi|159478673|ref|XP_001697425.1| predicted protein [Chlamydomonas reinhardtii]
gi|158274304|gb|EDP00087.1| predicted protein [Chlamydomonas reinhardtii]
Length = 297
Score = 267 bits (682), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 126/215 (58%), Positives = 160/215 (74%), Gaps = 9/215 (4%)
Query: 74 EQW---VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTS 130
E+W V +SW PRAF+ NFLS EEC+Y++ A P M KS+VVD+++GKS DS +RTS
Sbjct: 36 EEWRGEVVHLSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTS 95
Query: 131 SGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT-- 188
+GT+ A+G D +I IEKR+A T PLEN EGLQVLHY GQKYEPH+DYF D N
Sbjct: 96 TGTWFAKGEDSVISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGP 155
Query: 189 KNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALL 248
++GGQR+ T+LMYL+ VEEGGETV PNA+ ++ W SEC K GL++KP GDAL+
Sbjct: 156 EHGGQRVVTMLMYLTTVEEGGETVLPNAEQKVTGDGW----SECAKRGLAVKPIKGDALM 211
Query: 249 FWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRV 283
F+S+KPD S DP+SLHG CP +KG+KWS+TKWI V
Sbjct: 212 FYSLKPDGSNDPASLHGSCPTLKGDKWSATKWIHV 246
>gi|224141325|ref|XP_002324024.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
trichocarpa]
gi|222867026|gb|EEF04157.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
trichocarpa]
Length = 308
Score = 265 bits (678), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 131/259 (50%), Positives = 180/259 (69%), Gaps = 8/259 (3%)
Query: 29 FAILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPRAFV 88
F + + L +++ P S S + + I++K +S D R Q +SW PRAF+
Sbjct: 5 FFVALCLCSMLVNFPLFSCSSIRLHPHKKILQK---KSVFDPTRVTQ----LSWNPRAFL 57
Query: 89 YHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEK 148
Y FLS EEC++L+NLA + KS V D+++GKS +S VRTSSG F+ + +D+I+ DIE
Sbjct: 58 YKGFLSDEECDHLMNLARDKLEKSMVADNESGKSIESEVRTSSGMFIGKSQDEIVDDIEA 117
Query: 149 RIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEG 208
RIA +TF P ENGE +Q+LHYE GQKYEPHFDYF D+ N + GG R+ TVLMYLS+V +G
Sbjct: 118 RIAAWTFLPQENGESIQILHYEHGQKYEPHFDYFHDKANQELGGHRVVTVLMYLSNVGKG 177
Query: 209 GETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCP 268
GETVFPN++G + P + S+C K G ++KP+ GDALLF+S+ PDA+ D +SLHG CP
Sbjct: 178 GETVFPNSEGK-TIQPKDDSWSDCAKNGYAVKPQKGDALLFFSLHPDATTDTNSLHGSCP 236
Query: 269 VIKGNKWSSTKWIRVNEYK 287
VI+G KWS+TKWI V ++
Sbjct: 237 VIEGEKWSATKWIHVRSFE 255
>gi|255551575|ref|XP_002516833.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223543921|gb|EEF45447.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 297
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 125/212 (58%), Positives = 161/212 (75%), Gaps = 2/212 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +SW+PRAFVY FL+ EC++LI+LA +++S V D+++GKSK S VRTSSG F+A
Sbjct: 36 VKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNESGKSKLSEVRTSSGMFIA 95
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+G+D II IE++I+ +TF P ENGE LQVL YE GQKY+PH+DYF D+ N GG RMA
Sbjct: 96 KGKDPIIAGIEEKISTWTFLPKENGEDLQVLRYEHGQKYDPHYDYFADKINIARGGHRMA 155
Query: 197 TVLMYLSDVEEGGETVFPNAQ--GNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
TVLMYLSDV +GGETVFPNA+ A +LSEC K G+S+KP+ GDALLF+S+ P
Sbjct: 156 TVLMYLSDVVKGGETVFPNAEEPPRRKATESHEDLSECAKKGISVKPRRGDALLFFSLHP 215
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
A DP+SLH GCPVI+G KWS+TKWI V+ +
Sbjct: 216 TAIPDPNSLHAGCPVIEGEKWSATKWIHVDSF 247
>gi|218193936|gb|EEC76363.1| hypothetical protein OsI_13952 [Oryza sativa Indica Group]
Length = 1062
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 123/213 (57%), Positives = 159/213 (74%), Gaps = 7/213 (3%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +SW PRAF+Y FLS +EC++L+NLA M KS V D+D+GKS S+VRTSSGTFL+
Sbjct: 37 VTQLSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQVRTSSGTFLS 96
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ D I+ IEKR+A +TF P EN E +Q+LHYE GQKY+ HFDYF D+ N K GG R+A
Sbjct: 97 KHEDDIVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKRGGHRVA 156
Query: 197 TVLMYLSDVEEGGETVFPNAQG---NISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
TVLMYL+DV++GGETVFPNA G + W S+C ++GL++KPK GDALLF+S+
Sbjct: 157 TVLMYLTDVKKGGETVFPNAAGRHLQLKDETW----SDCARSGLAVKPKKGDALLFFSLH 212
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
+A+ DP+SLHG CPVI+G KWS+TKWI V +
Sbjct: 213 VNATTDPASLHGSCPVIEGEKWSATKWIHVRSF 245
>gi|215490181|dbj|BAG86624.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
Length = 294
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 124/232 (53%), Positives = 171/232 (73%), Gaps = 6/232 (2%)
Query: 57 SIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVD 116
+ VR+S + + +A+Q ISW+PRAFVY FL+ EEC +LI+LA +++S V D
Sbjct: 17 AFVRESSSSAIINPSKAKQ----ISWKPRAFVYEGFLTDEECNHLISLAKSELKRSAVAD 72
Query: 117 SDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYE 176
+++G SK S VRTSSG F+ + +D I+ IE++IA +TF P ENGE +QVL YE GQKYE
Sbjct: 73 NESGNSKTSEVRTSSGMFIPKAKDPIVSGIEEKIATWTFLPKENGEEIQVLRYEEGQKYE 132
Query: 177 PHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNI--SAVPWWNELSECGK 234
PH+DYF+D+ N GG R+ATVLMYL++VE+GGETVFP A+ + ++ + LSEC K
Sbjct: 133 PHYDYFVDKVNIARGGHRLATVLMYLTNVEKGGETVFPKAEESPRRRSMIADDSLSECAK 192
Query: 235 TGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
G+ +KP+ GDALLF+S+ P+A+ DP SLHGGCPVI+G KWS+TKWI V+ +
Sbjct: 193 KGIPVKPRKGDALLFYSLHPNATPDPLSLHGGCPVIQGEKWSATKWIHVDSF 244
>gi|114796723|gb|ABI79328.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
Length = 297
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 119/213 (55%), Positives = 164/213 (76%), Gaps = 3/213 (1%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V ISW+PRAFVY FL+ EEC++LI++A +++S V D+++GKS+ S VRTSSG F++
Sbjct: 37 VRQISWKPRAFVYEGFLTDEECDHLISIAKTELKRSAVADNESGKSQVSEVRTSSGAFIS 96
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D I++ IE+++A +TF P+ENGE +QVL YE GQKYE HFD+F D+ N GG R A
Sbjct: 97 KAKDAIVQRIEEKLATWTFLPIENGEDIQVLRYEEGQKYENHFDFFSDKVNIARGGHRYA 156
Query: 197 TVLMYLSDVEEGGETVFPNA---QGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
TVLMYLS+VE+GG+TVFPNA + +A+ ++LSEC K G+S+KP+ GDALLF+S+
Sbjct: 157 TVLMYLSNVEKGGDTVFPNAELSERQKAAIAANDDLSECAKRGISVKPRKGDALLFFSLT 216
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
P A+ D SLHGGCPVI+G KWS+TKWI V+ +
Sbjct: 217 PTATPDQLSLHGGCPVIEGEKWSATKWIHVDSF 249
>gi|307106819|gb|EFN55064.1| hypothetical protein CHLNCDRAFT_35843 [Chlorella variabilis]
Length = 287
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 126/213 (59%), Positives = 155/213 (72%), Gaps = 6/213 (2%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE +SW PRAFVYHNFLS EECE+L LA + KSTVVD+ TGKS DS VRTSSGTFLA
Sbjct: 38 VEQVSWRPRAFVYHNFLSDEECEHLKELARKRLTKSTVVDNKTGKSMDSTVRTSSGTFLA 97
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFN--TKNGGQR 194
RG D+++R IEKRI+ T P ENGE +Q+L Y GQKYEPH DYF D++N T+NGGQR
Sbjct: 98 RGEDEVVRAIEKRISLVTMIPEENGEAIQILKYVDGQKYEPHTDYFHDKYNSRTENGGQR 157
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+AT+LMYLS EEGGETVFP A+ + W SEC + GL++K G ALLF+S+KP
Sbjct: 158 VATILMYLSTPEEGGETVFPYAEKKVEGEGW----SECARKGLAVKAVKGSALLFYSLKP 213
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+ D +S HG CP + G KWS+T+WI V ++
Sbjct: 214 NGEEDQASTHGSCPTLAGEKWSATRWIHVGAFQ 246
>gi|359477453|ref|XP_003631980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Vitis
vinifera]
gi|297736941|emb|CBI26142.3| unnamed protein product [Vitis vinifera]
Length = 298
Score = 261 bits (668), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 129/262 (49%), Positives = 178/262 (67%), Gaps = 20/262 (7%)
Query: 27 FTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPRA 86
F +L+ ++ IL SS D+ +N ++ VR+ ISW+PRA
Sbjct: 5 LQFLLLLWISSTILEFSSSYADAAGSNVSAAKVRQ------------------ISWKPRA 46
Query: 87 FVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDI 146
FVY FLS+EEC++LI+LA +++S V D+ +GKS+ S VRTSSG F+ +G+D I+ I
Sbjct: 47 FVYEGFLSEEECDHLISLAKSELKRSAVADNVSGKSRLSEVRTSSGMFIGKGKDPIVAGI 106
Query: 147 EKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVE 206
E +IA +TF P +NGE +QVL YE GQKY+ H+DYF+D+ N GG R+ATVLMYLSDV
Sbjct: 107 EDKIAAWTFLPKDNGEDMQVLRYEPGQKYDAHYDYFVDKVNIARGGHRIATVLMYLSDVV 166
Query: 207 EGGETVFPNAQ--GNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLH 264
+GGETVFP A+ +P ++LSEC + G+++KP+ GDALLF+S+ P A DP SLH
Sbjct: 167 KGGETVFPMAEEPSRRKPLPTNDDLSECARKGIAVKPRKGDALLFFSLHPTAIPDPMSLH 226
Query: 265 GGCPVIKGNKWSSTKWIRVNEY 286
GGCPVI+G KWS+TKWI V+ +
Sbjct: 227 GGCPVIEGEKWSATKWIHVDSF 248
>gi|356550516|ref|XP_003543632.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 318
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 123/211 (58%), Positives = 156/211 (73%), Gaps = 1/211 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +SW PRAF+Y FLS EEC++LI LA + KS V D+++GKS S VRTSSG FL
Sbjct: 56 VTQLSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVADNESGKSIMSEVRTSSGMFLN 115
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D+I+ IE RIA +TF P+ENGE +Q+LHYE GQKYEPHFDYF D+ N GG R+A
Sbjct: 116 KAQDEIVAGIEARIAAWTFLPIENGESMQILHYENGQKYEPHFDYFHDKANQVMGGHRIA 175
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVLMYLSDVE+GGET+FPNA+ + P SEC G ++KP+ GDALLF+S+ DA
Sbjct: 176 TVLMYLSDVEKGGETIFPNAKAKL-LQPKDESWSECAHKGYAVKPRKGDALLFFSLHLDA 234
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
S D SLHG CPVI+G KWS+TKWI V++++
Sbjct: 235 STDNKSLHGSCPVIEGEKWSATKWIHVSDFQ 265
>gi|356572148|ref|XP_003554232.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
Length = 319
Score = 260 bits (665), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 123/211 (58%), Positives = 157/211 (74%), Gaps = 1/211 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +SW PRAF+Y FLS+EEC++LI LA + KS V D+D+GKS S +RTSSG FL
Sbjct: 57 VTQLSWSPRAFLYKGFLSEEECDHLIVLAKDKLEKSMVADNDSGKSIMSDIRTSSGMFLN 116
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D+I+ IE RIA +TF P+ENGE +Q+LHYE GQKYEPHFDYF D+ N GG R+A
Sbjct: 117 KAQDEIVAGIEARIAAWTFLPVENGESMQILHYENGQKYEPHFDYFHDKANQVMGGHRIA 176
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVLMYLSDVE+GGET+FPNA+ + P SEC G ++KP+ GDALLF+S+ DA
Sbjct: 177 TVLMYLSDVEKGGETIFPNAEAKL-LQPKDESWSECAHKGYAVKPQKGDALLFFSLHLDA 235
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
S D SLHG CPVI+G KWS+TKWI V++++
Sbjct: 236 STDTKSLHGSCPVIEGEKWSATKWIHVSDFE 266
>gi|115456019|ref|NP_001051610.1| Os03g0803500 [Oryza sativa Japonica Group]
gi|29150365|gb|AAO72374.1| putative oxidoreductase [Oryza sativa Japonica Group]
gi|108711618|gb|ABF99413.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative,
expressed [Oryza sativa Japonica Group]
gi|113550081|dbj|BAF13524.1| Os03g0803500 [Oryza sativa Japonica Group]
gi|215765410|dbj|BAG87107.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222625993|gb|EEE60125.1| hypothetical protein OsJ_13003 [Oryza sativa Japonica Group]
Length = 299
Score = 260 bits (664), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 123/213 (57%), Positives = 159/213 (74%), Gaps = 7/213 (3%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +SW PRAF+Y FLS +EC++L+NLA M KS V D+D+GKS S+VRTSSGTFL+
Sbjct: 37 VTQLSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQVRTSSGTFLS 96
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ D I+ IEKR+A +TF P EN E +Q+LHYE GQKY+ HFDYF D+ N K GG R+A
Sbjct: 97 KHEDDIVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKRGGHRVA 156
Query: 197 TVLMYLSDVEEGGETVFPNAQG---NISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
TVLMYL+DV++GGETVFPNA G + W S+C ++GL++KPK GDALLF+S+
Sbjct: 157 TVLMYLTDVKKGGETVFPNAAGRHLQLKDETW----SDCARSGLAVKPKKGDALLFFSLH 212
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
+A+ DP+SLHG CPVI+G KWS+TKWI V +
Sbjct: 213 VNATTDPASLHGSCPVIEGEKWSATKWIHVRSF 245
>gi|159795555|pdb|2V4A|A Chain A, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795556|pdb|2V4A|B Chain B, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795557|pdb|2V4A|C Chain C, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii.
gi|159795558|pdb|2V4A|D Chain D, Crystal Structure Of The Semet-Labeled Prolyl-4
Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
Reinhardtii
Length = 233
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 124/215 (57%), Positives = 156/215 (72%), Gaps = 9/215 (4%)
Query: 74 EQW---VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTS 130
E+W V +SW PRAF+ NFLS EEC+Y++ A P KS+VVD+++GKS DS +RTS
Sbjct: 16 EEWRGEVVHLSWSPRAFLLKNFLSDEECDYIVEKARPKXVKSSVVDNESGKSVDSEIRTS 75
Query: 131 SGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT-- 188
+GT+ A+G D +I IEKR+A T PLEN EGLQVLHY GQKYEPH+DYF D N
Sbjct: 76 TGTWFAKGEDSVISKIEKRVAQVTXIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGP 135
Query: 189 KNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALL 248
++GGQR+ T L YL+ VEEGGETV PNA+ ++ W SEC K GL++KP GDAL
Sbjct: 136 EHGGQRVVTXLXYLTTVEEGGETVLPNAEQKVTGDGW----SECAKRGLAVKPIKGDALX 191
Query: 249 FWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRV 283
F+S+KPD S DP+SLHG CP +KG+KWS+TKWI V
Sbjct: 192 FYSLKPDGSNDPASLHGSCPTLKGDKWSATKWIHV 226
>gi|357125236|ref|XP_003564301.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 293
Score = 259 bits (661), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 126/219 (57%), Positives = 163/219 (74%), Gaps = 7/219 (3%)
Query: 69 DEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVR 128
D+ R Q +SW PRAF+Y FLS EC++L+ LA ++KS V D+D+GKS S+VR
Sbjct: 27 DQARVTQ----LSWRPRAFLYSGFLSHAECDHLVKLAKGRLQKSMVADNDSGKSVMSQVR 82
Query: 129 TSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT 188
TSSGTFL + D+II IEKR+A +TF P EN E +QVLHYE GQKY+ HFDYF D+ N
Sbjct: 83 TSSGTFLNKHEDEIISGIEKRVAAWTFLPEENAESIQVLHYEVGQKYDAHFDYFHDKNNQ 142
Query: 189 KNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNEL-SECGKTGLSIKPKMGDAL 247
K GG R+ATVLMYL+DV++GGETVFPNA+G + +E SEC ++GL++KP+ GDAL
Sbjct: 143 KLGGHRVATVLMYLTDVKKGGETVFPNAEGR--HLQHKDETWSECARSGLAVKPRKGDAL 200
Query: 248 LFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
LF+S+ +A+ DPSSLHG CPVI+G KWS+TKWI V +
Sbjct: 201 LFFSLHINATTDPSSLHGSCPVIEGEKWSATKWIHVRSF 239
>gi|242032633|ref|XP_002463711.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
gi|241917565|gb|EER90709.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
Length = 297
Score = 259 bits (661), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 126/211 (59%), Positives = 160/211 (75%), Gaps = 3/211 (1%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +SW PRAF+Y FLS EC++LINLA M KS V D+D+GKS S+VRTSSG FLA
Sbjct: 35 VTQLSWRPRAFLYSGFLSDTECDHLINLAKGSMEKSMVADNDSGKSLMSQVRTSSGAFLA 94
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ D+I+ IEKR+A +TF P EN E +QVL YE GQKY+ HFDYF D+ N K+GGQR A
Sbjct: 95 KHEDEIVSAIEKRVAAWTFLPEENAESMQVLRYEIGQKYDAHFDYFHDKNNVKHGGQRFA 154
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNEL-SECGKTGLSIKPKMGDALLFWSMKPD 255
TVLMYL+DV++GGETVFPNA+G S + + +E SEC ++GL++KPK GDALLF+ + +
Sbjct: 155 TVLMYLTDVKKGGETVFPNAEG--SHLQYKDETWSECSRSGLAVKPKKGDALLFFGLHLN 212
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
A+ D SSLHG CPVI+G KWS+TKWI V +
Sbjct: 213 ATTDTSSLHGSCPVIEGEKWSATKWIHVRSF 243
>gi|255637501|gb|ACU19077.1| unknown [Glycine max]
Length = 318
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 122/211 (57%), Positives = 155/211 (73%), Gaps = 1/211 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +SW PRAF+Y FLS EEC++LI LA + KS V D+++GKS S VRTSSG FL
Sbjct: 56 VTQLSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVADNESGKSIMSEVRTSSGMFLN 115
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D+I+ IE RIA +TF P+ENGE +Q+LHYE GQKYEPHFDYF D+ N GG R+A
Sbjct: 116 KAQDEIVAGIEARIAAWTFLPIENGESMQILHYENGQKYEPHFDYFHDKANQVMGGHRIA 175
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVLMYLSDVE+GGET+F NA+ + P SEC G ++KP+ GDALLF+S+ DA
Sbjct: 176 TVLMYLSDVEKGGETIFSNAKAKL-LQPKDESWSECAHKGYAVKPRKGDALLFFSLHLDA 234
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
S D SLHG CPVI+G KWS+TKWI V++++
Sbjct: 235 STDNKSLHGSCPVIEGEKWSATKWIHVSDFQ 265
>gi|449432777|ref|XP_004134175.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 303
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 122/212 (57%), Positives = 160/212 (75%), Gaps = 2/212 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ ISW PRAFVY FL+ EC++LI+LA +++S+V D+ +GKSK S VRTSSG F+
Sbjct: 41 VKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSEVRTSSGAFIH 100
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D I+ IE +IA +TF P +NGE +QVL YE GQKY+ HFDYF D+ N GG RMA
Sbjct: 101 KAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMA 160
Query: 197 TVLMYLSDVEEGGETVFPNAQGNI--SAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
TVLMYLSDVE+GGETVFP+A+ + A +LS+C K G+++KP+ GDALLF+S+ P
Sbjct: 161 TVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHP 220
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
+A D SSLHGGCPVI+G KWS+TKWIRV+ +
Sbjct: 221 NAIPDTSSLHGGCPVIEGEKWSATKWIRVDSF 252
>gi|384246332|gb|EIE19822.1| hypothetical protein COCSUDRAFT_25518 [Coccomyxa subellipsoidea
C-169]
Length = 347
Score = 256 bits (655), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 130/249 (52%), Positives = 167/249 (67%), Gaps = 8/249 (3%)
Query: 42 MPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVI--SWEPRAFVYHNFLSKEECE 99
+P D +++ND + G +G+ E EVI SW PRAF+ FL + ECE
Sbjct: 46 LPVPIMDFKESNDSDPSAPPLLTIGLGADGKEEWRGEVIEVSWSPRAFLLKGFLKEAECE 105
Query: 100 YLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLE 159
+LI+ A P M KSTVVD+DTGKS DS VRTS+GTF R D++I+ IE+RI+ T P
Sbjct: 106 HLISKAKPSMVKSTVVDNDTGKSIDSTVRTSTGTFFGREEDEVIQGIERRISMITHLPEV 165
Query: 160 NGEGLQVLHYEAGQKYEPHFDYFMDEFNTK--NGGQRMATVLMYLSDVEEGGETVFPNAQ 217
NGEGLQ+LHYE GQKYE H D+F D+FN++ NGGQR+ATVLMYL+ EEGGETVFP A
Sbjct: 166 NGEGLQILHYEDGQKYEAHHDFFHDKFNSRPENGGQRIATVLMYLTTAEEGGETVFPMAA 225
Query: 218 GNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSS 277
++ W SEC + G ++K + GDALLF+S+ P+ DP+SLHG CP KG KWS+
Sbjct: 226 NKVTGPQW----SECARGGAAVKSRRGDALLFYSLLPNGETDPTSLHGSCPTTKGEKWSA 281
Query: 278 TKWIRVNEY 286
TKWI V +
Sbjct: 282 TKWIHVGPF 290
>gi|357140446|ref|XP_003571778.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 298
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 116/208 (55%), Positives = 156/208 (75%), Gaps = 1/208 (0%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
+SW PRAF++ FLS+ EC+++I LA + KS V D+++GKS S VRTSSG FL + +
Sbjct: 38 LSWRPRAFLHKGFLSEPECDHMIELAKDKLEKSMVADNESGKSVQSEVRTSSGMFLEKRQ 97
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
D+++ IE+RIA +TF P ENGE +Q+LHY+ G+KYEPH+DYF D+ N GG R+ATVL
Sbjct: 98 DEVVARIEERIAAWTFLPSENGESIQILHYKNGEKYEPHYDYFHDKNNQALGGHRIATVL 157
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
MYLS+VE+GGET+FPNA+G ++ SEC K G ++KP GDALLF+S+ PDA+ D
Sbjct: 158 MYLSNVEKGGETIFPNAEGKLTQHK-DETASECAKNGYAVKPMKGDALLFFSLHPDATTD 216
Query: 260 PSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
P SLHG CPVI+G KWS+TKWI V ++
Sbjct: 217 PDSLHGSCPVIEGQKWSATKWIHVRSFE 244
>gi|357496283|ref|XP_003618430.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
gi|217073992|gb|ACJ85356.1| unknown [Medicago truncatula]
gi|355493445|gb|AES74648.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
gi|388494436|gb|AFK35284.1| unknown [Medicago truncatula]
Length = 313
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 119/211 (56%), Positives = 158/211 (74%), Gaps = 1/211 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +SW PRAF+Y NFL+ EEC++LI L+ + KS V D+++GKS S VRTSSG FL
Sbjct: 51 VTQLSWSPRAFLYKNFLTDEECDHLIELSKDKLEKSMVADNESGKSIQSEVRTSSGMFLN 110
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D+I+ IE RIA +TF P+ENGE +QVLHY G+KYEPHFD+F D+ N + GG R+A
Sbjct: 111 KQQDEIVSGIEARIAAWTFLPVENGESMQVLHYMNGEKYEPHFDFFHDKANQRLGGHRVA 170
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVLMYLS+VE+GGET+FP+A+G +S P SEC G ++KP+ GDALLF+S+ DA
Sbjct: 171 TVLMYLSNVEKGGETIFPHAEGKLSQ-PKDESWSECAHKGYAVKPRKGDALLFFSLHLDA 229
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+ D SLHG CPVI+G KWS+TKWI V +++
Sbjct: 230 TTDSKSLHGSCPVIEGEKWSATKWIHVADFE 260
>gi|294461211|gb|ADE76168.1| unknown [Picea sitchensis]
Length = 280
Score = 254 bits (650), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 121/205 (59%), Positives = 153/205 (74%), Gaps = 1/205 (0%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P F+Y NFL+ EC++LI LA ++KS V D+++GKS S +RTSSG FL + +D+I+
Sbjct: 28 PGLFLYKNFLTDAECDHLIFLARDKLQKSMVADNESGKSVMSEIRTSSGMFLNKAQDEIV 87
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLS 203
+E RIA +TF P+ENGE +QVLHYE GQKYEPHFDYF D+ N GG R+ATVLMYLS
Sbjct: 88 ASVEDRIAAWTFLPIENGEAMQVLHYELGQKYEPHFDYFHDKINQAMGGHRIATVLMYLS 147
Query: 204 DVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSL 263
DV +GGETVFPNA+ S P + SEC K G S+KP GDALLF+S++PDA+ D SSL
Sbjct: 148 DVVKGGETVFPNAETKDSQ-PKDDSWSECAKGGYSVKPNKGDALLFFSLRPDATTDQSSL 206
Query: 264 HGGCPVIKGNKWSSTKWIRVNEYKV 288
HG CPVI+G KWS+TKWI V ++V
Sbjct: 207 HGSCPVIEGEKWSATKWIHVRSFEV 231
>gi|18405808|ref|NP_566838.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
gi|21617881|gb|AAM66931.1| prolyl 4-hydroxylase, putative [Arabidopsis thaliana]
gi|332643929|gb|AEE77450.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 316
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 123/263 (46%), Positives = 176/263 (66%), Gaps = 2/263 (0%)
Query: 25 IMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEP 84
I F++ L ++S + +R +N V K M+ S G V +SW P
Sbjct: 5 IFLAFSLCFLFTLPLISSAPNRFLTRSSNTRDGSVIK-MKTSASSFGFDPTRVTQLSWTP 63
Query: 85 RAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIR 144
R F+Y FLS EEC++ I LA + KS V D+D+G+S +S VRTSSG FL++ +D I+
Sbjct: 64 RVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLSKRQDDIVS 123
Query: 145 DIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSD 204
++E ++A +TF P ENGE +Q+LHYE GQKYEPHFDYF D+ N + GG R+ATVLMYLS+
Sbjct: 124 NVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIATVLMYLSN 183
Query: 205 VEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLH 264
VE+GGETVFP +G + + + +EC K G ++KP+ GDALLF+++ P+A+ D +SLH
Sbjct: 184 VEKGGETVFPMWKGKATQLK-DDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSNSLH 242
Query: 265 GGCPVIKGNKWSSTKWIRVNEYK 287
G CPV++G KWS+T+WI V ++
Sbjct: 243 GSCPVVEGEKWSATRWIHVKSFE 265
>gi|326526235|dbj|BAJ97134.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 308
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 117/207 (56%), Positives = 158/207 (76%), Gaps = 2/207 (0%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
ISW PRAF+Y +FLS +E +L++LA +++S V D +GKS+ S VRTSSGTF+++G+
Sbjct: 54 ISWHPRAFLYPHFLSDDEANHLVSLARAELKRSAVADETSGKSQLSEVRTSSGTFISKGK 113
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
D I+ IE +IA +TF P ENGE +QVL Y+ G+KYEPH+D+F D NT GG R+ATVL
Sbjct: 114 DPIVAGIEDKIAAWTFLPKENGEDMQVLRYKRGEKYEPHYDFFTDSVNTILGGHRVATVL 173
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
+YL+DV EGGETVFP A+G + LSEC + G+++KP+ GDALLF++++PDA+ D
Sbjct: 174 LYLTDVAEGGETVFPLAKGRKGS--HHKGLSECAQKGIAVKPRKGDALLFFNLRPDAATD 231
Query: 260 PSSLHGGCPVIKGNKWSSTKWIRVNEY 286
P+SLHGGC VIKG KWS+TKWIRV +
Sbjct: 232 PTSLHGGCEVIKGEKWSATKWIRVASF 258
>gi|18086437|gb|AAL57673.1| AT3g28480/MFJ20_16 [Arabidopsis thaliana]
gi|24796986|gb|AAN64505.1| At3g28480/MFJ20_16 [Arabidopsis thaliana]
Length = 316
Score = 254 bits (649), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 123/263 (46%), Positives = 176/263 (66%), Gaps = 2/263 (0%)
Query: 25 IMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEP 84
I F++ L ++S + +R +N V K M+ S G V +SW P
Sbjct: 5 IFLAFSLCFLFTLPLISSAPNRFLTRSSNTRDGSVIK-MKTSASSFGFDPTRVTQLSWTP 63
Query: 85 RAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIR 144
R F+Y FLS EEC++ I LA + KS V D+D+G+S +S VRTSSG FL++ +D I+
Sbjct: 64 RVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLSKRQDDIVN 123
Query: 145 DIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSD 204
++E ++A +TF P ENGE +Q+LHYE GQKYEPHFDYF D+ N + GG R+ATVLMYLS+
Sbjct: 124 NVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIATVLMYLSN 183
Query: 205 VEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLH 264
VE+GGETVFP +G + + + +EC K G ++KP+ GDALLF+++ P+A+ D +SLH
Sbjct: 184 VEKGGETVFPMWKGKATQLK-DDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSNSLH 242
Query: 265 GGCPVIKGNKWSSTKWIRVNEYK 287
G CPV++G KWS+T+WI V ++
Sbjct: 243 GSCPVVEGEKWSATRWIHVKSFE 265
>gi|302791635|ref|XP_002977584.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
gi|300154954|gb|EFJ21588.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
Length = 296
Score = 253 bits (647), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 115/204 (56%), Positives = 157/204 (76%), Gaps = 3/204 (1%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
+SW+PRAF+Y F+S EC++++ +A ++KS V D+++GKS S +RTSSG FL++G+
Sbjct: 45 LSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVLSNIRTSSGMFLSKGQ 104
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
D++I IE+RIA +TF P ENGE +QVL YE G+KYEPH+DYF D++N GG R+ATVL
Sbjct: 105 DEVINRIEERIAAWTFLPKENGEAIQVLRYEFGEKYEPHYDYFHDKYNQALGGHRIATVL 164
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
MYLSDV +GGETVFP+++ W S+C K G+++KP+ GDALLF+S+ PDA+ D
Sbjct: 165 MYLSDVVKGGETVFPSSEDTTVKDDSW---SDCAKKGIAVKPRKGDALLFYSLHPDATPD 221
Query: 260 PSSLHGGCPVIKGNKWSSTKWIRV 283
SSLHGGCPVI+G KWS+TKWI V
Sbjct: 222 ESSLHGGCPVIEGEKWSATKWIHV 245
>gi|20260280|gb|AAM13038.1| unknown protein [Arabidopsis thaliana]
gi|22136524|gb|AAM91340.1| unknown protein [Arabidopsis thaliana]
Length = 298
Score = 253 bits (646), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 118/212 (55%), Positives = 162/212 (76%), Gaps = 2/212 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +S +PRAFVY FL++ EC+++++LA +++S V D+D+G+SK S VRTSSGTF++
Sbjct: 37 VKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRTSSGTFIS 96
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+G+D I+ IE +I+ +TF P ENGE +QVL YE GQKY+ HFDYF D+ N GG RMA
Sbjct: 97 KGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRMA 156
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNE--LSECGKTGLSIKPKMGDALLFWSMKP 254
T+LMYLS+V +GGETVFP+A+ V NE LS+C K G+++KP+ GDALLF+++ P
Sbjct: 157 TILMYLSNVTKGGETVFPDAEIPSRRVLSENEEDLSDCAKRGIAVKPRKGDALLFFNLHP 216
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
DA DP SLHGGCPVI+G KWS+TKWI V+ +
Sbjct: 217 DAIPDPLSLHGGCPVIEGEKWSATKWIHVDSF 248
>gi|297818456|ref|XP_002877111.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297322949|gb|EFH53370.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 316
Score = 253 bits (646), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 124/263 (47%), Positives = 177/263 (67%), Gaps = 4/263 (1%)
Query: 24 LIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWE 83
+ F+ L +L+ I S P+ N S+++ M+ S G V +SW
Sbjct: 6 FLAFSLCFLFILS-KISSAPNRFLTRSSNNRDGSVIK--MKTSASSFGFDPTRVTQLSWT 62
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PRAF+Y FLS EEC++ I LA + KS V D+D+G+S +S VRTSSG FL++ +D I+
Sbjct: 63 PRAFLYKGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLSKRQDDIV 122
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLS 203
++E ++A +TF P ENGE +Q+LHYE GQKYEPHFDYF D+ N + GG R+ATVLMYLS
Sbjct: 123 ANVEAKLAAWTFIPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIATVLMYLS 182
Query: 204 DVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSL 263
+VE+GGETVFP +G + + + +EC K G ++KP+ GDALLF+++ P+A+ D +SL
Sbjct: 183 NVEKGGETVFPMWKGKTTQLK-DDSWTECAKQGYAVKPRKGDALLFFNLHPNATTDSNSL 241
Query: 264 HGGCPVIKGNKWSSTKWIRVNEY 286
HG CPV++G KWS+T+WI V +
Sbjct: 242 HGSCPVVEGEKWSATRWIHVRSF 264
>gi|224102545|ref|XP_002312720.1| predicted protein [Populus trichocarpa]
gi|222852540|gb|EEE90087.1| predicted protein [Populus trichocarpa]
Length = 300
Score = 253 bits (645), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 117/212 (55%), Positives = 158/212 (74%), Gaps = 2/212 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +SW+PRAFVY FL+ EC++LI+LA +++S V D+++GKSK S VRTSSG F+
Sbjct: 39 VKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNESGKSKLSEVRTSSGMFIT 98
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D I+ IE +IA +TF P ENGE +QVL YE GQKY+PH+DYF D+ N GG R+A
Sbjct: 99 KAKDPIVAGIEDKIATWTFLPRENGEDIQVLRYEHGQKYDPHYDYFSDKVNIARGGHRVA 158
Query: 197 TVLMYLSDVEEGGETVFPNAQ--GNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
TVLMYL+DVE+GGETVFP+A+ A +LSEC + G+++KP+ GDALLF+S+ P
Sbjct: 159 TVLMYLTDVEKGGETVFPSAEELPRRKASVSHEDLSECARKGIAVKPRRGDALLFFSLYP 218
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
A D SS+H GCPVI+G KWS+TKWI V+ +
Sbjct: 219 TAVPDTSSIHAGCPVIEGEKWSATKWIHVDSF 250
>gi|242039723|ref|XP_002467256.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
gi|241921110|gb|EER94254.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
Length = 303
Score = 252 bits (643), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 116/209 (55%), Positives = 156/209 (74%), Gaps = 1/209 (0%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
+SW PRAF++ FLS EC++LI LA + KS V D+++GKS S VRTSSG FL + +
Sbjct: 43 LSWRPRAFLHKGFLSDAECDHLIVLAKDKLEKSMVADNESGKSVQSEVRTSSGMFLEKKQ 102
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
D+++R IE+RIA +TF P ENGE +Q+LHY+ G+KYEPH+DYF D+ N GG R+ATVL
Sbjct: 103 DEVVRGIEERIAAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVL 162
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
MYLS+VE+GGET+FPNA+G + P + S+C + G ++KP GDALLF+S+ PDA+ D
Sbjct: 163 MYLSNVEKGGETIFPNAEGKL-LQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDATTD 221
Query: 260 PSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
SLHG CPVI+G KWS+TKWI V + +
Sbjct: 222 SESLHGSCPVIEGQKWSATKWIHVRSFDL 250
>gi|293337056|ref|NP_001169835.1| uncharacterized protein LOC100383727 precursor [Zea mays]
gi|224031897|gb|ACN35024.1| unknown [Zea mays]
gi|347978800|gb|AEP37742.1| prolyl 4-hydroxylase 2 [Zea mays]
gi|414871435|tpg|DAA49992.1| TPA: hypothetical protein ZEAMMB73_500506 [Zea mays]
Length = 299
Score = 252 bits (643), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 117/212 (55%), Positives = 156/212 (73%), Gaps = 1/212 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +SW PRAF++ FLS EC++LI LA + KS V D+++GKS S VRTSSG FL
Sbjct: 36 VVQLSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSEVRTSSGMFLE 95
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
R +D+++ IE+RI+ +TF P ENGE +Q+LHY+ G+KYEPH+DYF D+ N GG R+A
Sbjct: 96 RKQDEVVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRIA 155
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVLMYLS+VE+GGET+FPNA+G + P N S+C + G ++KP GDALLF+S+ PDA
Sbjct: 156 TVLMYLSNVEKGGETIFPNAEGKL-LQPKDNTWSDCARNGYAVKPVKGDALLFFSLHPDA 214
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
+ D SLHG CPVI+G KWS+TKWI V + +
Sbjct: 215 TTDSDSLHGSCPVIEGQKWSATKWIHVRSFDL 246
>gi|15239594|ref|NP_197391.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|21593296|gb|AAM65245.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
gi|332005243|gb|AED92626.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 298
Score = 252 bits (643), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 117/212 (55%), Positives = 162/212 (76%), Gaps = 2/212 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +S +PRAFVY FL++ EC+++++LA +++S V D+D+G+SK S VRTSSGTF++
Sbjct: 37 VKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRTSSGTFIS 96
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+G+D I+ IE +I+ +TF P ENGE +QVL YE GQKY+ HFDYF D+ N GG RMA
Sbjct: 97 KGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRMA 156
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWN--ELSECGKTGLSIKPKMGDALLFWSMKP 254
T+LMYLS+V +GGETVFP+A+ V N +LS+C K G+++KP+ GDALLF+++ P
Sbjct: 157 TILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSDCAKRGIAVKPRKGDALLFFNLHP 216
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
DA DP SLHGGCPVI+G KWS+TKWI V+ +
Sbjct: 217 DAIPDPLSLHGGCPVIEGEKWSATKWIHVDSF 248
>gi|413932756|gb|AFW67307.1| oxidoreductase [Zea mays]
Length = 297
Score = 251 bits (641), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 123/211 (58%), Positives = 157/211 (74%), Gaps = 3/211 (1%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +S PRAF+Y FLS EC++L++LA M KS V D+D+GKS S+ RTSSGTFLA
Sbjct: 35 VTQLSSRPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVADNDSGKSVASQARTSSGTFLA 94
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ D+I+ IEKR+A +TF P EN E LQVL YE GQKY+ HFDYF D N K GGQR+A
Sbjct: 95 KREDEIVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRVA 154
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNEL-SECGKTGLSIKPKMGDALLFWSMKPD 255
TVLMYL+DV +GGETVFPNA+G S + + +E SEC ++GL++KPK GDALLF+++ +
Sbjct: 155 TVLMYLTDVNKGGETVFPNAEG--SHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVN 212
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
A+ D SLHG CPVI+G KWS+TKWI V +
Sbjct: 213 ATADTGSLHGSCPVIEGEKWSATKWIHVRSF 243
>gi|302786814|ref|XP_002975178.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
gi|300157337|gb|EFJ23963.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
Length = 283
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 113/204 (55%), Positives = 158/204 (77%), Gaps = 2/204 (0%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
+SW+PRAF+Y F+S EC++++ +A ++KS V D+++GKS S +RTSSG FL++G+
Sbjct: 31 LSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVLSNIRTSSGMFLSKGQ 90
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
D++I IE+RIA +TF P ENGE +QVL YE G+KYEPH+DYF D++N GG R+ATVL
Sbjct: 91 DEVINRIEERIAAWTFLPKENGEAIQVLRYEFGEKYEPHYDYFHDKYNQALGGHRIATVL 150
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
MYLSD +GGETVFP+++ + + + S+C K G+++KP+ GDALLF+S+ PDA+ D
Sbjct: 151 MYLSDAVKGGETVFPSSEEDTTVKD--DSWSDCAKKGIAVKPRKGDALLFYSLHPDATPD 208
Query: 260 PSSLHGGCPVIKGNKWSSTKWIRV 283
SSLHGGCPVI+G KWS+TKWI V
Sbjct: 209 ESSLHGGCPVIEGEKWSATKWIHV 232
>gi|363543301|ref|NP_001241866.1| prolyl 4-hydroxylase 6 precursor [Zea mays]
gi|195624808|gb|ACG34234.1| oxidoreductase [Zea mays]
gi|347978818|gb|AEP37751.1| prolyl 4-hydroxylase 6 [Zea mays]
Length = 297
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 122/211 (57%), Positives = 158/211 (74%), Gaps = 3/211 (1%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +S PRAF+Y FLS EC+++++LA M KS V D+D+GKS S+ RTSSGTFLA
Sbjct: 35 VTQLSSRPRAFLYSGFLSDTECDHIVSLAKGSMEKSMVADNDSGKSVASQARTSSGTFLA 94
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ D+I+ IEKR+A +TF P EN E LQVL YE GQKY+ HFDYF D N K GGQR+A
Sbjct: 95 KREDEIVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRVA 154
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNEL-SECGKTGLSIKPKMGDALLFWSMKPD 255
TVLMYL+DV++GGETVFPNA+G S + + +E SEC ++GL++KPK GDALLF+++ +
Sbjct: 155 TVLMYLTDVKKGGETVFPNAEG--SHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVN 212
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
A+ D SLHG CPVI+G KWS+TKWI V +
Sbjct: 213 ATADTGSLHGSCPVIEGEKWSATKWIHVRSF 243
>gi|168046048|ref|XP_001775487.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162673157|gb|EDQ59684.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 263
Score = 251 bits (640), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 121/211 (57%), Positives = 157/211 (74%), Gaps = 2/211 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +SW+PRAF+Y NFLS EC+++I+LA + KS V D+++GKS S +RTSSG FL
Sbjct: 6 VKQLSWKPRAFLYSNFLSDAECDHMISLAKDKLEKSMVADNESGKSVKSEIRTSSGMFLM 65
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+G+D II IE RIA +TF P ENGE +QVL Y+ G+KYEPHFDYF D+ N GG R+A
Sbjct: 66 KGQDDIISRIEDRIAAWTFLPKENGEAIQVLRYQDGEKYEPHFDYFHDKNNQALGGHRIA 125
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVLMYLSDV +GGETVFP+++ P + S CGKTG+++KP+ GDALLF+S+ P A
Sbjct: 126 TVLMYLSDVVKGGETVFPSSEDR--GGPKDDSWSACGKTGVAVKPRKGDALLFFSLHPSA 183
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SSLH GCPVI+G KWS+TKWI V ++
Sbjct: 184 VPDESSLHTGCPVIEGEKWSATKWIHVAAFE 214
>gi|449495423|ref|XP_004159836.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 304
Score = 250 bits (639), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 121/213 (56%), Positives = 157/213 (73%), Gaps = 3/213 (1%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ ISW PRAFVY FL+ EC++LI+LA +++S+V D+ +GKSK S VRTSSG F+
Sbjct: 41 VKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSEVRTSSGAFIH 100
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D I+ IE +IA +TF P +NGE +QVL YE GQKY+ HFDYF D+ N GG RMA
Sbjct: 101 KAKDPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMA 160
Query: 197 TVLMYLSDVEEGGETVF--PNAQGNISAVPWWNE-LSECGKTGLSIKPKMGDALLFWSMK 253
TVLMYLSDVE+GGETVF ++ NE LS+C K G+++KP+ GDALLF+S+
Sbjct: 161 TVLMYLSDVEKGGETVFLLRRSESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLH 220
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
P+A D SSLHGGCPVI+G KWS+TKWIRV+ +
Sbjct: 221 PNAIPDTSSLHGGCPVIEGEKWSATKWIRVDSF 253
>gi|297812067|ref|XP_002873917.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297319754|gb|EFH50176.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 298
Score = 250 bits (639), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 117/212 (55%), Positives = 161/212 (75%), Gaps = 2/212 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +S +PRAFVY FL++ EC+++++LA +++S V D+D+G+SK S VRTSSGTF+
Sbjct: 37 VKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRTSSGTFIP 96
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+G+D I+ IE +I+ +TF P ENGE +QVL YE GQKY+ HFDYF D+ N GG R+A
Sbjct: 97 KGKDPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRIA 156
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWN--ELSECGKTGLSIKPKMGDALLFWSMKP 254
TVLMYLS+V +GGETVFP+A+ V N +LS+C K G+++KP+ GDALLF+++ P
Sbjct: 157 TVLMYLSNVTKGGETVFPDAEVPSCRVLSENKEDLSDCAKRGIAVKPRKGDALLFFNLHP 216
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
DA DP SLHGGCPVI+G KWS+TKWI V+ +
Sbjct: 217 DAIPDPLSLHGGCPVIEGEKWSATKWIHVDSF 248
>gi|218199253|gb|EEC81680.1| hypothetical protein OsI_25242 [Oryza sativa Indica Group]
Length = 487
Score = 249 bits (637), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 119/216 (55%), Positives = 149/216 (68%), Gaps = 11/216 (5%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +SW PR FVY FLS +EC++L+ L M++S V D+ +GKS S VRTSSG FL
Sbjct: 57 VRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSSGMFLD 116
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D ++ IEKRIA +TF P EN E +Q+L YE GQKYEPHFDYF D+ N GG R A
Sbjct: 117 KRQDPVVSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHDKVNQALGGHRYA 176
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNE-----LSECGKTGLSIKPKMGDALLFWS 251
TVLMYLS VE+GGETVFPNA+G W N+ SEC + GL++KP GDA+LF+S
Sbjct: 177 TVLMYLSTVEKGGETVFPNAEG------WENQPKDDTFSECAQKGLAVKPVKGDAVLFFS 230
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+ D DP SLHG CPVI+G KWS+ KWIR+ Y+
Sbjct: 231 LHIDGVPDPLSLHGSCPVIEGEKWSAPKWIRIRSYE 266
>gi|9294583|dbj|BAB02864.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
Length = 332
Score = 249 bits (636), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 112/211 (53%), Positives = 157/211 (74%), Gaps = 1/211 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +SW PR F+Y FLS EEC++ I LA + KS V D+D+G+S +S VRTSSG FL+
Sbjct: 72 VTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLS 131
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D I+ ++E ++A +TF P ENGE +Q+LHYE GQKYEPHFDYF D+ N + GG R+A
Sbjct: 132 KRQDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIA 191
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVLMYLS+VE+GGETVFP +G + + + +EC K G ++KP+ GDALLF+++ P+A
Sbjct: 192 TVLMYLSNVEKGGETVFPMWKGKATQLK-DDSWTECAKQGYAVKPRKGDALLFFNLHPNA 250
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+ D +SLHG CPV++G KWS+T+WI V ++
Sbjct: 251 TTDSNSLHGSCPVVEGEKWSATRWIHVKSFE 281
>gi|449454448|ref|XP_004144967.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449474082|ref|XP_004154068.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449515181|ref|XP_004164628.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 300
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 117/212 (55%), Positives = 161/212 (75%), Gaps = 2/212 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ ISW+PRAFVY FL+ EC++L+++A +++S V D+D+GKSK S VRTSSG F++
Sbjct: 39 VKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFIS 98
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D I+ IE +I+ +TF P ENGE +QVL YE GQKYE H+DYF+D+ N GG R+A
Sbjct: 99 KNKDPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLA 158
Query: 197 TVLMYLSDVEEGGETVFPNAQ--GNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
TVLMYLS+V +GGETVFP A+ + A +LSEC K G+++KPK GDALLF+S++P
Sbjct: 159 TVLMYLSNVTQGGETVFPLAEKPSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEP 218
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
+A D +SLHGGCPV++G KWS+TKWI V+ +
Sbjct: 219 NAIPDTNSLHGGCPVLEGEKWSATKWIHVDSF 250
>gi|363543295|ref|NP_001241863.1| prolyl 4-hydroxylase 4 precursor [Zea mays]
gi|347978806|gb|AEP37745.1| prolyl 4-hydroxylase 4 [Zea mays]
gi|414591890|tpg|DAA42461.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
Length = 274
Score = 249 bits (635), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 119/211 (56%), Positives = 147/211 (69%), Gaps = 1/211 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +SW PR FVY FLS EC++L+ LA +++S V D+++GKS S VRTSSG FL
Sbjct: 45 VKAVSWHPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNESGKSVKSEVRTSSGMFLD 104
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D ++ IE+RIA +TF P EN E +QVL YE GQKYEPHFDYF D N GG R A
Sbjct: 105 KRQDPVVSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFHDRVNQARGGHRYA 164
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVLMYLS V EGGETVFPNA+G S P SEC GL++KP GDA+LF+S+ D
Sbjct: 165 TVLMYLSTVREGGETVFPNAKGWESQ-PKDATFSECAHKGLAVKPVKGDAVLFFSLHADG 223
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+ DP SLHG CPVI+G KWS+ KWI V Y+
Sbjct: 224 TPDPLSLHGSCPVIRGEKWSAPKWIHVRSYE 254
>gi|218184507|gb|EEC66934.1| hypothetical protein OsI_33548 [Oryza sativa Indica Group]
Length = 308
Score = 248 bits (633), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 114/209 (54%), Positives = 154/209 (73%), Gaps = 1/209 (0%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
+SW PRAF++ FL+ ECE+LI+LA + KS V D+++GKS S VRTSSG FL + +
Sbjct: 48 LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSEVRTSSGMFLEKKQ 107
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
D+++ IE+RIA +TF P +NGE +Q+LHY+ G+KYEPH+DYF D+ N GG R+ATVL
Sbjct: 108 DEVVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVL 167
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
MYLSDV +GGET+FP A+G + P + S+C K G ++KP GDALLF+S+ PDA+ D
Sbjct: 168 MYLSDVGKGGETIFPEAEGKL-LQPKDDTWSDCAKNGYAVKPVKGDALLFFSLHPDATTD 226
Query: 260 PSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
SLHG CPVI+G KWS+TKWI V + +
Sbjct: 227 SDSLHGSCPVIEGQKWSATKWIHVRSFDI 255
>gi|357447555|ref|XP_003594053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
gi|355483101|gb|AES64304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
Length = 303
Score = 248 bits (632), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 116/214 (54%), Positives = 159/214 (74%), Gaps = 4/214 (1%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +SW+PRAFVY FL+ EC++LI++A +++S V D+ +G+SK S VRTSSG F++
Sbjct: 40 VKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFIS 99
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D I+ IE +I+ +TF P ENGE +QVL YE GQKY+PH+DYF D+ N GG R+A
Sbjct: 100 KNKDAIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGHRVA 159
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNE----LSECGKTGLSIKPKMGDALLFWSM 252
TVLMYL++V +GGETVFPNA+ S +E LSECGK G+++KP+ GDALLF+S+
Sbjct: 160 TVLMYLTNVTKGGETVFPNAELQESPRHKLSETDEDLSECGKKGVAVKPRRGDALLFFSL 219
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
P+A D SLH GCPVI+G KWS+TKWI V+ +
Sbjct: 220 HPNAIPDTLSLHAGCPVIEGEKWSATKWIHVDSF 253
>gi|115481998|ref|NP_001064592.1| Os10g0413500 [Oryza sativa Japonica Group]
gi|110289075|gb|ABG66075.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|113639201|dbj|BAF26506.1| Os10g0413500 [Oryza sativa Japonica Group]
gi|215692577|dbj|BAG87997.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222612821|gb|EEE50953.1| hypothetical protein OsJ_31503 [Oryza sativa Japonica Group]
Length = 308
Score = 248 bits (632), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 114/209 (54%), Positives = 154/209 (73%), Gaps = 1/209 (0%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
+SW PRAF++ FL+ ECE+LI+LA + KS V D+++GKS S VRTSSG FL + +
Sbjct: 48 LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSEVRTSSGMFLEKKQ 107
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
D+++ IE+RIA +TF P +NGE +Q+LHY+ G+KYEPH+DYF D+ N GG R+ATVL
Sbjct: 108 DEVVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVL 167
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
MYLSDV +GGET+FP A+G + P + S+C K G ++KP GDALLF+S+ PDA+ D
Sbjct: 168 MYLSDVGKGGETIFPEAEGKL-LQPKDDTWSDCAKNGYAVKPVKGDALLFFSLHPDATTD 226
Query: 260 PSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
SLHG CPVI+G KWS+TKWI V + +
Sbjct: 227 SDSLHGSCPVIEGQKWSATKWIHVRSFDI 255
>gi|224141327|ref|XP_002324025.1| predicted protein [Populus trichocarpa]
gi|222867027|gb|EEF04158.1| predicted protein [Populus trichocarpa]
Length = 239
Score = 248 bits (632), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 122/220 (55%), Positives = 156/220 (70%), Gaps = 9/220 (4%)
Query: 69 DEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVR 128
D RA Q +SW+PRAFVY FLS EEC++LINLA + KS V + +TG+S +S+ R
Sbjct: 14 DPTRAAQ----LSWQPRAFVYKGFLSDEECDHLINLAKGKLVKSMVANDETGESMESQER 69
Query: 129 TSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT 188
TSSG F+ + D+I+ IE RIA +TF P ENGE +Q+L YE GQKYE H DYF+D+ N
Sbjct: 70 TSSGMFIFKTEDEIVNGIEARIAAWTFLPEENGEPIQILRYEHGQKYEAHIDYFVDKANQ 129
Query: 189 KNGGQRMATVLMYLSDVEEGGETVFPN--AQGNISAVPWWNELSECGKTGLSIKPKMGDA 246
+ GG R ATVLMYLSDV++GGETVFP A+G+ + W S+C K G ++KP GDA
Sbjct: 130 EEGGHRAATVLMYLSDVKKGGETVFPTSEAEGSQAKDDSW---SDCAKKGYAVKPNKGDA 186
Query: 247 LLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
LLF+S+ PDA+ DP SLH CPVI+G KWS+TKWI V +
Sbjct: 187 LLFFSLHPDATPDPGSLHASCPVIEGEKWSATKWIHVRSF 226
>gi|222636605|gb|EEE66737.1| hypothetical protein OsJ_23428 [Oryza sativa Japonica Group]
Length = 487
Score = 248 bits (632), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 118/216 (54%), Positives = 148/216 (68%), Gaps = 11/216 (5%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +SW PR FVY FLS +EC++L+ L M++S V D+ +GKS S VRTSSG FL
Sbjct: 57 VRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSSGMFLD 116
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D ++ IEKRIA +TF P EN E +Q+L YE GQKYEPHFDYF D+ N GG R A
Sbjct: 117 KRQDPVVSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHDKVNQALGGHRYA 176
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNE-----LSECGKTGLSIKPKMGDALLFWS 251
TVLMYLS VE+GGETVFPNA+G W N+ SEC + GL++KP GD +LF+S
Sbjct: 177 TVLMYLSTVEKGGETVFPNAEG------WENQPKDDTFSECAQKGLAVKPVKGDTVLFFS 230
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+ D DP SLHG CPVI+G KWS+ KWIR+ Y+
Sbjct: 231 LHIDGVPDPLSLHGSCPVIEGEKWSAPKWIRIRSYE 266
>gi|242088305|ref|XP_002439985.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
gi|241945270|gb|EES18415.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
Length = 308
Score = 248 bits (632), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 117/207 (56%), Positives = 152/207 (73%), Gaps = 2/207 (0%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
ISW+PR F+Y +FLS +E +LI+LA +++S V D+ +GKS S VRTSSGTFL +G+
Sbjct: 54 ISWKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSDVRTSSGTFLRKGQ 113
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
D I+ IE +IA +TF P ENGE +QVL Y+ G+KYEPH+DYF D NT GG R ATVL
Sbjct: 114 DPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTIRGGHRYATVL 173
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
+YL+DV EGGETVFP A+ A SEC + G+++KP+ GDALLF+++KPD + D
Sbjct: 174 LYLTDVAEGGETVFPLAEEVDDAKD--ATFSECAQKGIAVKPRKGDALLFFNLKPDGTTD 231
Query: 260 PSSLHGGCPVIKGNKWSSTKWIRVNEY 286
P SLHGGC VI+G KWS+TKWIRV +
Sbjct: 232 PVSLHGGCAVIRGEKWSATKWIRVASF 258
>gi|21618073|gb|AAM67123.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
thaliana]
Length = 297
Score = 248 bits (632), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 116/212 (54%), Positives = 161/212 (75%), Gaps = 2/212 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +S +PRAFVY FL+ EC++LI+LA ++++S V D+D G+S+ S VRTSSGTF++
Sbjct: 36 VKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFIS 95
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+G+D I+ IE +++ +TF P ENGE LQVL YE GQKY+ HFDYF D+ N GG R+A
Sbjct: 96 KGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRIA 155
Query: 197 TVLMYLSDVEEGGETVFPNAQ--GNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
TVL+YLS+V +GGETVFP+AQ S ++LS+C K G+++KPK G+ALLF++++
Sbjct: 156 TVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALLFFNLQQ 215
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
DA DP SLHGGCPVI+G KWS+TKWI V+ +
Sbjct: 216 DAIPDPFSLHGGCPVIEGEKWSATKWIHVDSF 247
>gi|18397528|ref|NP_566279.1| P4H isoform 2 [Arabidopsis thaliana]
gi|332640849|gb|AEE74370.1| P4H isoform 2 [Arabidopsis thaliana]
Length = 299
Score = 248 bits (632), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 116/212 (54%), Positives = 161/212 (75%), Gaps = 2/212 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +S +PRAFVY FL+ EC++LI+LA ++++S V D+D G+S+ S VRTSSGTF++
Sbjct: 38 VKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFIS 97
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+G+D I+ IE +++ +TF P ENGE LQVL YE GQKY+ HFDYF D+ N GG R+A
Sbjct: 98 KGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRIA 157
Query: 197 TVLMYLSDVEEGGETVFPNAQ--GNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
TVL+YLS+V +GGETVFP+AQ S ++LS+C K G+++KPK G+ALLF++++
Sbjct: 158 TVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALLFFNLQQ 217
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
DA DP SLHGGCPVI+G KWS+TKWI V+ +
Sbjct: 218 DAIPDPFSLHGGCPVIEGEKWSATKWIHVDSF 249
>gi|297829156|ref|XP_002882460.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
lyrata]
gi|297328300|gb|EFH58719.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
lyrata]
Length = 299
Score = 247 bits (631), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 116/212 (54%), Positives = 161/212 (75%), Gaps = 2/212 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +S +PRAFVY FL+ EC++LI+LA ++++S V D+D G+S+ S VRTSSGTF++
Sbjct: 38 VKQVSAKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFIS 97
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+G+D I+ IE +++ +TF P ENGE LQVL YE GQKY+ HFDYF D+ N GG R+A
Sbjct: 98 KGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEPGQKYDAHFDYFHDKVNIARGGHRIA 157
Query: 197 TVLMYLSDVEEGGETVFPNAQ--GNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
TVL+YLS+V +GGETVFP+AQ S ++LS+C K G+++KPK G+ALLF++++
Sbjct: 158 TVLLYLSNVTKGGETVFPDAQEYSRRSLSENKDDLSDCAKKGIAVKPKKGNALLFFNLQQ 217
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
DA DP SLHGGCPVI+G KWS+TKWI V+ +
Sbjct: 218 DAIPDPFSLHGGCPVIEGEKWSATKWIHVDSF 249
>gi|110738390|dbj|BAF01121.1| hypothetical protein [Arabidopsis thaliana]
Length = 299
Score = 247 bits (630), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 116/212 (54%), Positives = 161/212 (75%), Gaps = 2/212 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +S +PRAFVY FL+ EC++LI+LA ++++S V D+D G+S+ S VRTSSGTF++
Sbjct: 38 VKQVSSKPRAFVYGGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFIS 97
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+G+D I+ IE +++ +TF P ENGE LQVL YE GQKY+ HFDYF D+ N GG R+A
Sbjct: 98 KGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRIA 157
Query: 197 TVLMYLSDVEEGGETVFPNAQ--GNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
TVL+YLS+V +GGETVFP+AQ S ++LS+C K G+++KPK G+ALLF++++
Sbjct: 158 TVLLYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALLFFNLQQ 217
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
DA DP SLHGGCPVI+G KWS+TKWI V+ +
Sbjct: 218 DAIPDPFSLHGGCPVIEGEKWSATKWIHVDSF 249
>gi|449522594|ref|XP_004168311.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Cucumis
sativus]
Length = 313
Score = 247 bits (630), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 115/214 (53%), Positives = 153/214 (71%), Gaps = 7/214 (3%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +SW+PRAF+Y FLS EC++LI+LA + KS V D+D+GKS S VRTSSG FL
Sbjct: 53 VTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLR 112
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D+++ +E RIA +T P ENGE +Q+LHYE GQKYEPHFD+F D+ N + GG R+A
Sbjct: 113 KAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIA 172
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAV---PWWNELSECGKTGLSIKPKMGDALLFWSMK 253
TVLMYLS+VE+GGET+FPN++ S W S+C + G ++K + GDALLF+S+
Sbjct: 173 TVLMYLSNVEKGGETIFPNSEFKESQAKDESW----SDCSRKGYAVKAQKGDALLFFSLN 228
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
DA+ D SLHG CPVI G KWS+TKWI V ++
Sbjct: 229 LDATTDERSLHGSCPVIAGEKWSATKWIHVRSFE 262
>gi|357447553|ref|XP_003594052.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
gi|355483100|gb|AES64303.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
Length = 301
Score = 247 bits (630), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 115/214 (53%), Positives = 159/214 (74%), Gaps = 6/214 (2%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +SW+PRAFVY FL+ EC++LI++A +++S V D+ +G+SK S VRTSSG F++
Sbjct: 40 VKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFIS 99
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D I+ IE +I+ +TF P ENGE +QVL YE GQKY+PH+DYF D+ N GG R+A
Sbjct: 100 KNKDAIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGHRVA 159
Query: 197 TVLMYLSDVEEGGETVFPNAQ----GNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
TVLMYL++V +GGETVFPNA+ +S +LSECGK G+++KP+ GDALLF+S+
Sbjct: 160 TVLMYLTNVTKGGETVFPNAEESPRHKLSETD--EDLSECGKKGVAVKPRRGDALLFFSL 217
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
P+A D SLH GCPVI+G KWS+TKWI V+ +
Sbjct: 218 HPNAIPDTLSLHAGCPVIEGEKWSATKWIHVDSF 251
>gi|297818458|ref|XP_002877112.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297322950|gb|EFH53371.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 289
Score = 246 bits (628), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 116/211 (54%), Positives = 156/211 (73%), Gaps = 2/211 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV-DSDTGKSKDSRVRTSSGTFL 135
V +SW PRAF+Y+ FLS EEC++LINLA + KS VV D ++G+S DS RTSSG FL
Sbjct: 32 VTQLSWTPRAFLYNGFLSDEECDHLINLAKGKLEKSMVVADDNSGESIDSEERTSSGVFL 91
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
+ +D I+ ++E ++A +TF P ENGE LQ+LHYE GQKY+PHFDY+ D+ K GG R+
Sbjct: 92 TKRQDDIVANVEAKLATWTFLPEENGEALQILHYENGQKYDPHFDYYYDKETLKLGGHRI 151
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
ATVLMYLS+V +GGETVFP +G + + SEC K G ++KP+ GDALLF+++ P+
Sbjct: 152 ATVLMYLSNVTKGGETVFPMWKGKTPQLK-DDTWSECAKQGYAVKPRKGDALLFFNLHPN 210
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
A+ DP+SLHG CPVI+G KWS+T+WI V +
Sbjct: 211 ATTDPTSLHGSCPVIEGEKWSATRWIHVRSF 241
>gi|356555587|ref|XP_003546112.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Glycine max]
Length = 297
Score = 246 bits (628), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 112/210 (53%), Positives = 156/210 (74%), Gaps = 2/210 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +SW+PRAFVY FL++ EC++LI++A +++S V D+ +G+SK S VRTSSG F+
Sbjct: 40 VKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFIP 99
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D I+ +E +I+ +T P ENGE +QVL YE GQKY+PH+DYF D+ N GG R+A
Sbjct: 100 KNKDPIVAGVEDKISSWTLLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGHRVA 159
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVLMYL+DV +GGETVFPNA+ S +LSEC + G+++KP+ GDALLF+S+ P+A
Sbjct: 160 TVLMYLTDVTKGGETVFPNAELKSSETK--EDLSECAQKGIAVKPRRGDALLFFSLYPNA 217
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
D SLH GCPVI+G KWS+TKWI V+ +
Sbjct: 218 IPDTMSLHAGCPVIEGEKWSATKWIHVDSF 247
>gi|110289076|gb|ABB47602.2| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
Length = 309
Score = 246 bits (628), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 113/209 (54%), Positives = 152/209 (72%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
+SW PRAF++ FL+ ECE+LI+LA + KS V D+++GKS S VRTSSG FL + +
Sbjct: 48 LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSEVRTSSGMFLEKKQ 107
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
D+++ IE+RIA +TF P +NGE +Q+LHY+ G+KYEPH+DYF D+ N GG R+ATVL
Sbjct: 108 DEVVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVL 167
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
MYLSDV +GGET+FP A+ P + S+C K G ++KP GDALLF+S+ PDA+ D
Sbjct: 168 MYLSDVGKGGETIFPEAEVGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFFSLHPDATTD 227
Query: 260 PSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
SLHG CPVI+G KWS+TKWI V + +
Sbjct: 228 SDSLHGSCPVIEGQKWSATKWIHVRSFDI 256
>gi|413923982|gb|AFW63914.1| hypothetical protein ZEAMMB73_179176 [Zea mays]
Length = 222
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 121/176 (68%), Positives = 136/176 (77%), Gaps = 10/176 (5%)
Query: 37 FGILSMPSSSGDSR----------KANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPRA 86
FG+ S+P S+ ++ A+ + R + EG R QW EVISWEPRA
Sbjct: 47 FGVFSLPVSAPNAAATTTAGGETESADTRPARPRARRDLGEGLGERGAQWTEVISWEPRA 106
Query: 87 FVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDI 146
FVYHNFLSKEECEYLI LA PHM KSTVVDS TGKSKDSRVRTSSG FL RGRDK+IR I
Sbjct: 107 FVYHNFLSKEECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRVI 166
Query: 147 EKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
EKRIAD+TF P+++GEGLQVLHYE GQKYEPHFDYF+DEFNTKNGGQRMAT+LMYL
Sbjct: 167 EKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNTKNGGQRMATLLMYL 222
>gi|115471029|ref|NP_001059113.1| Os07g0194500 [Oryza sativa Japonica Group]
gi|113610649|dbj|BAF21027.1| Os07g0194500 [Oryza sativa Japonica Group]
gi|215768445|dbj|BAH00674.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 319
Score = 245 bits (626), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 118/216 (54%), Positives = 148/216 (68%), Gaps = 11/216 (5%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +SW PR FVY FLS +EC++L+ L M++S V D+ +GKS S VRTSSG FL
Sbjct: 57 VRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSSGMFLD 116
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D ++ IEKRIA +TF P EN E +Q+L YE GQKYEPHFDYF D+ N GG R A
Sbjct: 117 KRQDPVVSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHDKVNQALGGHRYA 176
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNE-----LSECGKTGLSIKPKMGDALLFWS 251
TVLMYLS VE+GGETVFPNA+G W N+ SEC + GL++KP GD +LF+S
Sbjct: 177 TVLMYLSTVEKGGETVFPNAEG------WENQPKDDTFSECAQKGLAVKPVKGDTVLFFS 230
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+ D DP SLHG CPVI+G KWS+ KWIR+ Y+
Sbjct: 231 LHIDGVPDPLSLHGSCPVIEGEKWSAPKWIRIRSYE 266
>gi|449461905|ref|XP_004148682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 295
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 116/217 (53%), Positives = 155/217 (71%), Gaps = 10/217 (4%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +SW+PRAF+Y FLS EC++LI+LA + KS V D+D+GKS S VRTSSG FL
Sbjct: 32 VTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLR 91
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D+++ +E RIA +T P ENGE +Q+LHYE GQKYEPHFD+F D+ N + GG R+A
Sbjct: 92 KAQDEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIA 151
Query: 197 TVLMYLSDVEEGGETVFPNAQ---GNISAV---PWWNELSECGKTGLSIKPKMGDALLFW 250
TVLMYLS+VE+GGET+FPN++ G+ S W S+C + G ++K + GDALLF+
Sbjct: 152 TVLMYLSNVEKGGETIFPNSEVWYGSESQAKDESW----SDCSRKGYAVKAQKGDALLFF 207
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
S+ DA+ D SLHG CPVI G KWS+TKWI V ++
Sbjct: 208 SLNLDATTDERSLHGSCPVIAGEKWSATKWIHVRSFE 244
>gi|195627276|gb|ACG35468.1| prolyl 4-hydroxylase [Zea mays]
Length = 298
Score = 244 bits (624), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 112/209 (53%), Positives = 153/209 (73%), Gaps = 1/209 (0%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
+SW PRAF++ FL EC++LI LA + KS V D+ +GKS S VRTSSG FL + +
Sbjct: 38 LSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSEVRTSSGMFLEKKQ 97
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
D+++ IE+RI+ +TF P ENGE +Q+LHY+ G+KYEPH+DYF D+ N GG R+ATVL
Sbjct: 98 DEVVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVL 157
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
MYLS+VE+GGET+FPNA+G + P + S+C + G ++KP GDALLF+S+ PD++ D
Sbjct: 158 MYLSNVEKGGETIFPNAEGKL-LQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDSTTD 216
Query: 260 PSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
SLHG CPVI+G KWS+TKWI V + +
Sbjct: 217 SDSLHGSCPVIEGQKWSATKWIHVRSFDL 245
>gi|34393269|dbj|BAC83179.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
sativa Japonica Group]
gi|50509101|dbj|BAD30161.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
sativa Japonica Group]
Length = 313
Score = 244 bits (624), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 118/216 (54%), Positives = 148/216 (68%), Gaps = 11/216 (5%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +SW PR FVY FLS +EC++L+ L M++S V D+ +GKS S VRTSSG FL
Sbjct: 51 VRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSSGMFLD 110
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D ++ IEKRIA +TF P EN E +Q+L YE GQKYEPHFDYF D+ N GG R A
Sbjct: 111 KRQDPVVSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHDKVNQALGGHRYA 170
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNE-----LSECGKTGLSIKPKMGDALLFWS 251
TVLMYLS VE+GGETVFPNA+G W N+ SEC + GL++KP GD +LF+S
Sbjct: 171 TVLMYLSTVEKGGETVFPNAEG------WENQPKDDTFSECAQKGLAVKPVKGDTVLFFS 224
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+ D DP SLHG CPVI+G KWS+ KWIR+ Y+
Sbjct: 225 LHIDGVPDPLSLHGSCPVIEGEKWSAPKWIRIRSYE 260
>gi|242047772|ref|XP_002461632.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
gi|241925009|gb|EER98153.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
Length = 307
Score = 244 bits (623), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 117/211 (55%), Positives = 149/211 (70%), Gaps = 1/211 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +SW+PR FVY FLS EC++L+ LA +++S V D+ +GKS S VRTSSG FL
Sbjct: 44 VKAVSWQPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNQSGKSVMSEVRTSSGMFLN 103
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D ++ IE+RIA +TF P EN E +Q+L YE GQKYEPHFDYF D+ N GG R A
Sbjct: 104 KRQDPVVSRIEERIAAWTFLPQENAENMQILRYEHGQKYEPHFDYFHDKINQVRGGHRYA 163
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVLMYLS V++GGETVFPNA+G S P + SEC GL++KP GDA+LF+S+ D
Sbjct: 164 TVLMYLSTVDKGGETVFPNAKGWESQ-PKDDTFSECAHQGLAVKPVKGDAVLFFSLHVDG 222
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
DP SLHG CPVI+G KWS+ KWI V Y+
Sbjct: 223 VPDPLSLHGSCPVIQGEKWSAPKWIHVRSYE 253
>gi|388496942|gb|AFK36537.1| unknown [Lotus japonicus]
Length = 302
Score = 243 bits (621), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 114/216 (52%), Positives = 158/216 (73%), Gaps = 8/216 (3%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +SW+PRAFVY FL++ EC++LI+LA +++S V D+ +G SK S VRTSSG F++
Sbjct: 41 VKQVSWKPRAFVYKGFLTELECDHLISLAKSELKRSAVADNLSGDSKLSDVRTSSGMFIS 100
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D I+ IE +I+ +TF P ENGE +QVL YE GQKY+PH+D+F D+ N GG R+A
Sbjct: 101 KNKDPIVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDFFADKVNIARGGHRVA 160
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWW------NELSECGKTGLSIKPKMGDALLFW 250
TVLMYL++V GGETVFPNA+ + P ++LSEC K G+++KP+ GDALLF+
Sbjct: 161 TVLMYLTNVTRGGETVFPNAE--VEEFPRHRGSETIDDLSECAKKGIAVKPRRGDALLFF 218
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
S+ P+A D SLH GCPVI+G KWS+TKWI V+ +
Sbjct: 219 SLYPNAVPDTMSLHAGCPVIEGEKWSATKWIHVDSF 254
>gi|226495689|ref|NP_001149322.1| LOC100282945 precursor [Zea mays]
gi|194697650|gb|ACF82909.1| unknown [Zea mays]
gi|194708468|gb|ACF88318.1| unknown [Zea mays]
gi|195626376|gb|ACG35018.1| oxidoreductase [Zea mays]
gi|347978842|gb|AEP37763.1| prolyl 4-hydroxylase 9 [Zea mays]
gi|413945802|gb|AFW78451.1| oxidoreductase [Zea mays]
Length = 308
Score = 243 bits (621), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 116/207 (56%), Positives = 151/207 (72%), Gaps = 2/207 (0%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
IS +PR F+Y +FLS +E +LI+LA +++S V D+ +GKS S VRTSSGTFL +G+
Sbjct: 54 ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSEVRTSSGTFLRKGQ 113
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
D I+ IE +IA +TF P ENGE +QVL Y+ G+KYEPH+DYF D NT GG R ATVL
Sbjct: 114 DPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVRGGHRYATVL 173
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
+YL+DV EGGETVFP A+ A LSEC + G++++P+ GDALLF+++ PD + D
Sbjct: 174 LYLTDVPEGGETVFPLAEEPDDAKD--ATLSECAQKGIAVRPRKGDALLFFNLNPDGTTD 231
Query: 260 PSSLHGGCPVIKGNKWSSTKWIRVNEY 286
SLHGGCPVIKG KWS+TKWIRV +
Sbjct: 232 SVSLHGGCPVIKGEKWSATKWIRVASF 258
>gi|212720650|ref|NP_001132477.1| uncharacterized protein LOC100193935 precursor [Zea mays]
gi|194694488|gb|ACF81328.1| unknown [Zea mays]
gi|347978828|gb|AEP37756.1| prolyl 4-hydroxylase 7 [Zea mays]
gi|413934218|gb|AFW68769.1| prolyl 4-hydroxylase [Zea mays]
Length = 298
Score = 243 bits (620), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 112/212 (52%), Positives = 153/212 (72%), Gaps = 1/212 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +SW PRAF++ FL EC++LI LA + KS V D+ +GKS S VRTSSG FL
Sbjct: 35 VVQLSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSEVRTSSGMFLE 94
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D+++ IE+RI+ +TF P ENGE +Q+LHY+ G+KYEPH+DYF D+ N GG R+A
Sbjct: 95 KKQDEVVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIA 154
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVLMYLS+VE+GGET+FPNA+G + P + S+C + G ++KP GDALLF+S+ PD+
Sbjct: 155 TVLMYLSNVEKGGETIFPNAEGKL-LQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDS 213
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
+ D SLHG CP I+G KWS+TKWI V + +
Sbjct: 214 TTDSDSLHGSCPAIEGQKWSATKWIHVRSFDL 245
>gi|359806348|ref|NP_001241485.1| uncharacterized protein LOC100783075 precursor [Glycine max]
gi|255645457|gb|ACU23224.1| unknown [Glycine max]
Length = 298
Score = 243 bits (620), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 115/216 (53%), Positives = 160/216 (74%), Gaps = 10/216 (4%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ ISW+PRAFVY FL+ EC++LI+LA +++S V D+ +G+S+ S VRTSSG F++
Sbjct: 37 VKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLSDVRTSSGMFIS 96
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D II IE +I+ +TF P ENGE +QVL YE GQKY+PH+DYF D+ N GG R+A
Sbjct: 97 KNKDPIISGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFTDKVNIARGGHRIA 156
Query: 197 TVLMYLSDVEEGGETVFPNAQ------GNISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
TVLMYL++V +GGETVFP+A+ G ++ ++LSEC K G+++KP GDALLF+
Sbjct: 157 TVLMYLTNVTKGGETVFPSAEEPPRRRGTETS----SDLSECAKKGIAVKPHRGDALLFF 212
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
S+ +A+ D SSLH GCPVI+G KWS+TKWI V+ +
Sbjct: 213 SLHTNATPDTSSLHAGCPVIEGEKWSATKWIHVDSF 248
>gi|255085592|ref|XP_002505227.1| predicted protein [Micromonas sp. RCC299]
gi|226520496|gb|ACO66485.1| predicted protein [Micromonas sp. RCC299]
Length = 267
Score = 243 bits (619), Expect = 8e-62, Method: Compositional matrix adjust.
Identities = 119/210 (56%), Positives = 151/210 (71%), Gaps = 6/210 (2%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
+S +P+A++Y FL + EC+Y+ A P + KSTVVD+ TG+S S +RTS G F R
Sbjct: 8 LSEKPKAYLYRGFLRQAECDYIKERAKPKLEKSTVVDNKTGQSVPSNIRTSDGMFFDRHE 67
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTK--NGGQRMAT 197
D II DIE+RIA++T P ENGEG+QVL YE GQKYEPH D F D+FNT+ GGQRMAT
Sbjct: 68 DDIIEDIERRIAEWTNVPWENGEGIQVLRYEVGQKYEPHLDAFSDKFNTEESKGGQRMAT 127
Query: 198 VLMYLSDVEEGGETVFPNAQGNI-SAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
VLMYLSDVEEGGETVFP + P W SEC + G+++K + GDALLFWS+ D+
Sbjct: 128 VLMYLSDVEEGGETVFPRSVDKPHKGDPKW---SECAQRGVAVKARKGDALLFWSLDIDS 184
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
++D SLHGGCPVIKG KWS+TKW+ + +
Sbjct: 185 NVDELSLHGGCPVIKGTKWSATKWMHLKSF 214
>gi|449443245|ref|XP_004139390.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 295
Score = 243 bits (619), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 124/216 (57%), Positives = 156/216 (72%), Gaps = 11/216 (5%)
Query: 72 RAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG-KSKDSRVRTS 130
R ++W PRAF+YHNFLS++EC LINLA P M +S V +T + S RTS
Sbjct: 78 RGDEW-------PRAFIYHNFLSEKECSQLINLAKPRMERSLVSAQNTNWEGVVSSRRTS 130
Query: 131 SGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDY-FMDEFNTK 189
SG FLA+G+++++R IEKRIA+FTF P+ENGEGL +LHYE GQK+EPH DY D F+ K
Sbjct: 131 SGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSFSFK 190
Query: 190 NGGQRMATVLMYLSDVEEGGETVFPNAQGNI-SAVPWWNELSECGK-TGLSIKPKMGDAL 247
+ GQR AT++MYLS V+EGG TVFP A+ SA WW +L E GK GLS+KPKMGDAL
Sbjct: 191 SLGQRNATLVMYLSGVKEGGATVFPEAKKCASSARRWWKKLPEYGKDNGLSVKPKMGDAL 250
Query: 248 LFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRV 283
LFWS+KPD +LDP+SLH PV+KG+KW K + V
Sbjct: 251 LFWSVKPDGTLDPTSLHASSPVVKGDKWVGVKLMHV 286
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 48/72 (66%), Positives = 58/72 (80%)
Query: 204 DVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSL 263
++EEGGETVFP A +S+VPWW +L GK GLSIKPKMGDAL FWSMKPD +LD +SL
Sbjct: 11 NIEEGGETVFPAANQCVSSVPWWKKLPTHGKDGLSIKPKMGDALFFWSMKPDGTLDYTSL 70
Query: 264 HGGCPVIKGNKW 275
HG PVI+G++W
Sbjct: 71 HGSYPVIRGDEW 82
>gi|449520144|ref|XP_004167094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 323
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 122/204 (59%), Positives = 152/204 (74%), Gaps = 4/204 (1%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG-KSKDSRVRTSSGTFLARGRDKI 142
PRAF+YHNFLS++EC LINLA P M +S V +T + S RTSSG FLA+G++++
Sbjct: 74 PRAFIYHNFLSEKECSQLINLAKPRMERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQL 133
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDY-FMDEFNTKNGGQRMATVLMY 201
+R IEKRIA+FTF P+ENGEGL +LHYE GQK+EPH DY D F+ K+ GQR AT++MY
Sbjct: 134 VRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSFSFKSLGQRNATLVMY 193
Query: 202 LSDVEEGGETVFPNAQGNI-SAVPWWNELSECGK-TGLSIKPKMGDALLFWSMKPDASLD 259
LS V+EGG TVFP A+ SA WW +L E GK GLS+KPKMGDALLFWS+KPD +LD
Sbjct: 194 LSGVKEGGATVFPEAKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLD 253
Query: 260 PSSLHGGCPVIKGNKWSSTKWIRV 283
P+SLH PV+KG+KW K + V
Sbjct: 254 PTSLHASSPVVKGDKWVGVKLMHV 277
Score = 94.7 bits (234), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 42/62 (67%), Positives = 49/62 (79%)
Query: 204 DVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSL 263
++EEGGETVFP A +S+VPWW +L GK GLSIKPKMGDAL FWSMKPD +LD +SL
Sbjct: 11 NIEEGGETVFPAANKCVSSVPWWKKLPTHGKDGLSIKPKMGDALFFWSMKPDGTLDYTSL 70
Query: 264 HG 265
H
Sbjct: 71 HA 72
>gi|307111754|gb|EFN59988.1| hypothetical protein CHLNCDRAFT_49444 [Chlorella variabilis]
Length = 344
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 120/217 (55%), Positives = 147/217 (67%), Gaps = 12/217 (5%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+V+ + R F+YHNFL+ EEC+++I LA P M +S VV++D+GKSK VRTS GTFL
Sbjct: 63 VQVLHEDARIFLYHNFLTDEECDHIIKLAEPTMARSGVVETDSGKSKIDNVRTSKGTFLN 122
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
RG D +I DIE RIA +T P NGEGLQVL YE GQ+YE H+DYF + T NGG R
Sbjct: 123 RGHDSVIADIEARIAKWTLMPAGNGEGLQVLKYEHGQEYEGHYDYFFHKAGTANGGNRYL 182
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWN-----ELSECGKTGLSIKPKMGDALLFWS 251
TVLMYL+DVEEGGET FPN +P N E SEC + L+ KPK G+A+LF S
Sbjct: 183 TVLMYLNDVEEGGETCFPN-------IPSPNGDNGPEFSECARKVLAAKPKKGNAVLFHS 235
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
+KP L+ SLH CPVIKG KWS+ KW+ V Y V
Sbjct: 236 IKPTGELERRSLHTACPVIKGVKWSAPKWVHVGHYAV 272
>gi|30689216|ref|NP_189490.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
gi|332643931|gb|AEE77452.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
Length = 288
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 114/211 (54%), Positives = 155/211 (73%), Gaps = 2/211 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV-DSDTGKSKDSRVRTSSGTFL 135
+ +SW PRAF+Y FLS EEC++LI LA + KS VV D D+G+S+DS VRTSSG FL
Sbjct: 32 ITQLSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSEVRTSSGMFL 91
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
+ +D I+ ++E ++A +TF P ENGE LQ+LHYE GQKY+PHFDYF D+ + GG R+
Sbjct: 92 TKRQDDIVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHRI 151
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
ATVLMYLS+V +GGETVFPN +G + + S+C K G ++KP+ GDALLF+++ +
Sbjct: 152 ATVLMYLSNVTKGGETVFPNWKGKTPQLK-DDSWSKCAKQGYAVKPRKGDALLFFNLHLN 210
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
+ DP+SLHG CPVI+G KWS+T+WI V +
Sbjct: 211 GTTDPNSLHGSCPVIEGEKWSATRWIHVRSF 241
>gi|363807286|ref|NP_001242363.1| uncharacterized protein LOC100796794 precursor [Glycine max]
gi|255641119|gb|ACU20838.1| unknown [Glycine max]
Length = 297
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 113/212 (53%), Positives = 157/212 (74%), Gaps = 2/212 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ ISW+PRAFVY FL+ EC++LI+LA +++S V D+ +G+S+ S VRTSSG F++
Sbjct: 36 VKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLSDVRTSSGMFIS 95
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D I+ IE +I+ +TF P ENGE +QV YE GQKY+PH+DYF D+ N GG R+A
Sbjct: 96 KNKDPIVAGIEDKISSWTFLPKENGEDIQVSRYEHGQKYDPHYDYFTDKVNIARGGHRIA 155
Query: 197 TVLMYLSDVEEGGETVFPNAQ--GNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
TVLMYL+DV +GGETVFP+A+ ++LSEC K G+++KP+ GDALLF+S+
Sbjct: 156 TVLMYLTDVAKGGETVFPSAEEPPRRRGAETSSDLSECAKKGIAVKPRRGDALLFFSLHT 215
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
+A+ D SSLH GCPVI+G KWS+TKWI V+ +
Sbjct: 216 NATPDTSSLHAGCPVIEGEKWSATKWIHVDSF 247
>gi|356546462|ref|XP_003541645.1| PREDICTED: uncharacterized protein LOC100818794 [Glycine max]
Length = 839
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 112/212 (52%), Positives = 156/212 (73%), Gaps = 2/212 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +SW+PRAFVY FL++ EC++LI++A +++S V D+ +G+SK S VRTSSG F+
Sbjct: 578 VKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFIP 637
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D I+ IE +I+ +TF P ENGE +QVL YE GQKY+PH+DYF D+ N GG R+A
Sbjct: 638 KNKDLIVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGHRVA 697
Query: 197 TVLMYLSDVEEGGETVFPNAQGN--ISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
TVLMYL+DV +GGETVFP+A+ + LSEC + G+++KP+ GDALLF+S+ P
Sbjct: 698 TVLMYLTDVTKGGETVFPDAEESPRHKGSETNENLSECAQKGIAVKPRRGDALLFFSLYP 757
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
+A D SLH GCPVI+G KWS+TKWI V+ +
Sbjct: 758 NAIPDTLSLHAGCPVIEGEKWSATKWIHVDSF 789
>gi|356555585|ref|XP_003546111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Glycine max]
Length = 301
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 112/216 (51%), Positives = 158/216 (73%), Gaps = 10/216 (4%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +SW+PRAFVY FL++ EC++LI++A +++S V D+ +G+SK S VRTSSG F+
Sbjct: 40 VKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFIP 99
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D I+ +E +I+ +T P ENGE +QVL YE GQKY+PH+DYF D+ N GG R+A
Sbjct: 100 KNKDPIVAGVEDKISSWTLLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGHRVA 159
Query: 197 TVLMYLSDVEEGGETVFPNAQ------GNISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
TVLMYL+DV +GGETVFPNA+ G+ + +LSEC + G+++KP+ GDALLF+
Sbjct: 160 TVLMYLTDVTKGGETVFPNAEESPRHRGSETK----EDLSECAQKGIAVKPRRGDALLFF 215
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
S+ P+A D SLH GCPVI+G KWS+TKWI V+ +
Sbjct: 216 SLYPNAIPDTMSLHAGCPVIEGEKWSATKWIHVDSF 251
>gi|28393447|gb|AAO42145.1| putative prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 253
Score = 241 bits (615), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 114/207 (55%), Positives = 153/207 (73%), Gaps = 2/207 (0%)
Query: 81 SWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV-DSDTGKSKDSRVRTSSGTFLARGR 139
SW PRAF+Y FLS EEC++LI LA + KS VV D D+G+S+DS VRTSSG FL + +
Sbjct: 1 SWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSEVRTSSGMFLTKRQ 60
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
D I+ ++E ++A +TF P ENGE LQ+LHYE GQKY+PHFDYF D+ + GG R+ATVL
Sbjct: 61 DDIVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHRIATVL 120
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
MYLS+V +GGETVFPN +G + + S+C K G ++KP+ GDALLF+++ + + D
Sbjct: 121 MYLSNVTKGGETVFPNWKGKTPQLK-DDSWSKCAKQGYAVKPRKGDALLFFNLHLNGTTD 179
Query: 260 PSSLHGGCPVIKGNKWSSTKWIRVNEY 286
P+SLHG CPVI+G KWS+T+WI V +
Sbjct: 180 PNSLHGSCPVIEGEKWSATRWIHVRSF 206
>gi|125552794|gb|EAY98503.1| hypothetical protein OsI_20415 [Oryza sativa Indica Group]
Length = 319
Score = 241 bits (615), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 117/210 (55%), Positives = 154/210 (73%), Gaps = 4/210 (1%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
ISW+PR F+Y +FLS +E +L++LA +++S V D+ +GKS+ S RTSSGTF+ + +
Sbjct: 61 ISWKPRVFLYQHFLSDDEANHLVSLARAELKRSAVADNLSGKSELSDARTSSGTFIRKSQ 120
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
D I+ IE++IA +TF P ENGE +QVL Y+ G+KYE H+DYF D NT GG R+ATVL
Sbjct: 121 DPIVAGIEEKIAAWTFLPKENGEDIQVLRYKHGEKYERHYDYFSDNVNTLRGGHRIATVL 180
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNE---LSECGKTGLSIKPKMGDALLFWSMKPDA 256
MYL+DV EGGETVFP A+ + NE LSEC K G+++KP+ GDALLF+++ PDA
Sbjct: 181 MYLTDVAEGGETVFPLAE-EFTESGTNNEDSTLSECAKKGVAVKPRKGDALLFFNLSPDA 239
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
S D SLH GCPVIKG KWS+TKWIRV +
Sbjct: 240 SKDSLSLHAGCPVIKGEKWSATKWIRVASF 269
>gi|115464581|ref|NP_001055890.1| Os05g0489100 [Oryza sativa Japonica Group]
gi|50511363|gb|AAT77286.1| putative prolyl 4-hydroxylase alpha subunit [Oryza sativa Japonica
Group]
gi|113579441|dbj|BAF17804.1| Os05g0489100 [Oryza sativa Japonica Group]
gi|125587281|gb|EAZ27945.1| hypothetical protein OsJ_11906 [Oryza sativa Japonica Group]
gi|215737307|dbj|BAG96236.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 319
Score = 241 bits (615), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 117/210 (55%), Positives = 154/210 (73%), Gaps = 4/210 (1%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
ISW+PR F+Y +FLS +E +L++LA +++S V D+ +GKS+ S RTSSGTF+ + +
Sbjct: 61 ISWKPRVFLYQHFLSDDEANHLVSLARTELKRSAVADNLSGKSELSDARTSSGTFIRKSQ 120
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
D I+ IE++IA +TF P ENGE +QVL Y+ G+KYE H+DYF D NT GG R+ATVL
Sbjct: 121 DPIVAGIEEKIAAWTFLPKENGEDIQVLRYKHGEKYERHYDYFSDNVNTLRGGHRIATVL 180
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNE---LSECGKTGLSIKPKMGDALLFWSMKPDA 256
MYL+DV EGGETVFP A+ + NE LSEC K G+++KP+ GDALLF+++ PDA
Sbjct: 181 MYLTDVAEGGETVFPLAE-EFTESGTNNEDSTLSECAKKGVAVKPRKGDALLFFNLSPDA 239
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
S D SLH GCPVIKG KWS+TKWIRV +
Sbjct: 240 SKDSLSLHAGCPVIKGEKWSATKWIRVASF 269
>gi|29150368|gb|AAO72377.1| putative oxidoreductase [Oryza sativa Japonica Group]
gi|108711617|gb|ABF99412.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|125546090|gb|EAY92229.1| hypothetical protein OsI_13949 [Oryza sativa Indica Group]
gi|125588294|gb|EAZ28958.1| hypothetical protein OsJ_13002 [Oryza sativa Japonica Group]
Length = 310
Score = 240 bits (613), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 111/210 (52%), Positives = 149/210 (70%), Gaps = 1/210 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +ISW+PR F Y FLS +EC++L+ L +++S V D+++GKS S VRTSSG FL
Sbjct: 48 VTIISWKPRIFFYKGFLSDDECDHLVKLGKEKLKRSMVADNESGKSVMSEVRTSSGMFLD 107
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D ++ IE+RIA +T P EN E +Q+L YE GQKY+PHFDYF D+ N GG R A
Sbjct: 108 KQQDPVVSGIEERIAAWTLLPQENAENIQILRYENGQKYDPHFDYFQDKVNQLQGGHRYA 167
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVL YLS VE+GGETVFPNA+G + P + S+C K GL++K GD++LF++++PD
Sbjct: 168 TVLTYLSTVEKGGETVFPNAEG-WESQPKDDSFSDCAKKGLAVKAVKGDSVLFFNLQPDG 226
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
+ DP SLHG CPVI+G KWS+ KWI V Y
Sbjct: 227 TPDPLSLHGSCPVIEGEKWSAPKWIHVRSY 256
>gi|168001068|ref|XP_001753237.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162695523|gb|EDQ81866.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 284
Score = 240 bits (613), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 118/211 (55%), Positives = 151/211 (71%), Gaps = 16/211 (7%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EVISW+PR + HNFLS +EC++LINLA P + KSTVVD+ TGK +S+VRTS+G FL
Sbjct: 79 EVISWQPRIILLHNFLSADECDHLINLARPRLVKSTVVDATTGKGIESKVRTSTGMFL-N 137
Query: 138 GRDK---IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQR 194
G D+ I+ IE RIA ++ P++NGE LQVL YE+ Q Y+ H DYF DEFN K GGQR
Sbjct: 138 GNDRRHHTIQAIETRIAAYSMVPVQNGELLQVLRYESDQYYKAHHDYFSDEFNLKRGGQR 197
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECG---KTGLSIKPKMGDALLFWS 251
+AT+LMYL++ EGGET+FP A ++ CG K G+ +KPK GDA+LFWS
Sbjct: 198 VATMLMYLTEGVEGGETIFPQAG---------DKECSCGGEMKIGVCVKPKRGDAVLFWS 248
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+K D +DP+SLHGGC V+ G KWSSTKW+R
Sbjct: 249 IKLDGQVDPTSLHGGCKVLSGEKWSSTKWMR 279
>gi|302834449|ref|XP_002948787.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
nagariensis]
gi|300265978|gb|EFJ50167.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
nagariensis]
Length = 329
Score = 239 bits (610), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 112/210 (53%), Positives = 144/210 (68%), Gaps = 2/210 (0%)
Query: 79 VISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
V+SW+PR F+Y L++EEC+YLI +A + +S V D+ TG+ S +RTSSG F RG
Sbjct: 52 VLSWQPRVFLYKGILTQEECDYLIKIAQGRLERSGVSDATTGEGGVSDIRTSSGMFYTRG 111
Query: 139 RDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATV 198
+ +++ IE R+A +T P+ENGEG+QVL YE QKY+PH DYF E NGG RMATV
Sbjct: 112 ENDVVKRIETRLAMWTMLPVENGEGIQVLRYEKTQKYDPHHDYFSFEGRDANGGNRMATV 171
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
LMYL+ EEGGETVFP + A SECG GL++KP GDA+LFWS++PD
Sbjct: 172 LMYLATPEEGGETVFPKIP--VPAGQTRANFSECGMKGLAVKPVKGDAVLFWSIRPDGRF 229
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
+P SLHG CPVI+G KWS+TKWI V Y +
Sbjct: 230 EPGSLHGSCPVIRGVKWSATKWIHVGPYSM 259
>gi|255641919|gb|ACU21228.1| unknown [Glycine max]
Length = 301
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 111/212 (52%), Positives = 156/212 (73%), Gaps = 2/212 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +SW+PRAFVY FL++ EC++LI++A +++S V D+ +G+SK S VRTSSG F+
Sbjct: 40 VKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFIP 99
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D I+ IE +I+ +TF P ENGE +QVL YE GQKY+PH+DYF D+ N GG R+A
Sbjct: 100 KNKDLIVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGHRVA 159
Query: 197 TVLMYLSDVEEGGETVFPNAQGNI--SAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
TVLMYL+DV +GGETVFP+A+ + LSEC + G+++KP+ GDALLF+S+ P
Sbjct: 160 TVLMYLTDVTKGGETVFPDAEESPRHKGSETNENLSECAQKGIAVKPRRGDALLFFSLYP 219
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
+A D SLH GCPVI+G KWS+T+WI V+ +
Sbjct: 220 NAIPDTLSLHAGCPVIEGEKWSATEWIHVDSF 251
>gi|357128903|ref|XP_003566109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
distachyon]
Length = 313
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 112/209 (53%), Positives = 153/209 (73%), Gaps = 2/209 (0%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
ISW+PR F+Y +FLS +E +L++LA +++S V D+ +GKS S VRTS GTF+++G+
Sbjct: 55 ISWKPRVFLYQHFLSDDEANHLLSLARAELKRSAVADNTSGKSTLSEVRTSYGTFISKGK 114
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
D I+ IE +IA +TF P ENGE +QVL Y+ G+K EP FD+F D NT GG R+ATVL
Sbjct: 115 DPIVAGIEDKIAAWTFLPKENGEDMQVLRYKRGEKDEPQFDFFTDTVNTVRGGHRVATVL 174
Query: 200 MYLSDVEEGGETVFPNAQG--NISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS 257
+YL+DV EGGETVFP A+ + LSEC + G+++KP+ GDALLF++++PDA+
Sbjct: 175 LYLTDVAEGGETVFPLAKDFTDTGLHDKDTTLSECAQKGIAVKPRKGDALLFFNLRPDAA 234
Query: 258 LDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
DP SLHGGC VIKG KW++TKWIRV +
Sbjct: 235 TDPLSLHGGCTVIKGEKWTATKWIRVASF 263
>gi|357478545|ref|XP_003609558.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355510613|gb|AES91755.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 299
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 113/216 (52%), Positives = 157/216 (72%), Gaps = 10/216 (4%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ ISW PRAFVY FL+ EC++LI+LA +++S V D+ +G S+ S VRTSSG F++
Sbjct: 37 VKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSDVRTSSGMFIS 96
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D I+ IE RI+ +TF P ENGE +QVL YE GQKY+PH+DYF D+ N GG R+A
Sbjct: 97 KNKDPIVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIVQGGHRLA 156
Query: 197 TVLMYLSDVEEGGETVFPNAQ------GNISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
TVLMYL++V +GGETVFP A+ G+ + ++LSEC K G+++KP+ GDALLF+
Sbjct: 157 TVLMYLTNVTKGGETVFPEAEEPPRRRGSKKS----SDLSECAKKGIAVKPRRGDALLFF 212
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
S+ +A D +SLH GCPV++G KWS+TKWI V+ +
Sbjct: 213 SLDTNAIPDTNSLHAGCPVLEGEKWSATKWIHVDSF 248
>gi|388492638|gb|AFK34385.1| unknown [Medicago truncatula]
Length = 299
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 113/214 (52%), Positives = 156/214 (72%), Gaps = 10/214 (4%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ ISW PRAFVY FL+ EC++LI+LA +++S V D+ +G S+ S VRTSSG F++
Sbjct: 37 VKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSDVRTSSGMFIS 96
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D I+ IE RI+ +TF P ENGE +QVL YE GQKY+PH+DYF D+ N GG R+A
Sbjct: 97 KNKDPIVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIVQGGHRLA 156
Query: 197 TVLMYLSDVEEGGETVFPNAQ------GNISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
TVLMYL++V +GGETVFP A+ G+ + ++LSEC K G+++KP+ GDALLF+
Sbjct: 157 TVLMYLTNVTKGGETVFPEAEEPPRRRGSKKS----SDLSECAKKGIAVKPRRGDALLFF 212
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVN 284
S+ +A D +SLH GCPV++G KWS+TKWI V+
Sbjct: 213 SLDTNAIPDTNSLHAGCPVLEGEKWSATKWIHVD 246
>gi|302823087|ref|XP_002993198.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
gi|300138968|gb|EFJ05718.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
Length = 269
Score = 237 bits (604), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 116/217 (53%), Positives = 154/217 (70%), Gaps = 17/217 (7%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSK---DSRVRTSSGTF 134
EV++W PR + H FLS EEC+YLI +A P + KSTVVD+ TGK++ +S+VRTS+G F
Sbjct: 61 EVLNWSPRIILLHKFLSAEECDYLIAIAGPRLAKSTVVDTSTGKARHGIESKVRTSTGMF 120
Query: 135 LAR--GRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGG 192
L+ R +I+ IE+RIA ++ P+ENGE LQVL YE Q Y+PH DYF D+FN K GG
Sbjct: 121 LSNYDRRYPMIQAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYFSDQFNLKRGG 180
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECG---KTGLSIKPKMGDALLF 249
QR+ATVLMYLSDVEEGGET+FP+ + ECG + GL +KP+ GDA+LF
Sbjct: 181 QRVATVLMYLSDVEEGGETIFPSVG---------DGECECGGELRKGLCVKPRKGDAILF 231
Query: 250 WSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
WS D ++D +SLHGGC V++G KWS+TKW+R + +
Sbjct: 232 WSAALDGNVDSNSLHGGCSVLRGEKWSATKWLRQSRF 268
>gi|388500582|gb|AFK38357.1| unknown [Medicago truncatula]
Length = 299
Score = 237 bits (604), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 112/216 (51%), Positives = 156/216 (72%), Gaps = 10/216 (4%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ ISW PRAFVY FL+ EC++LI+LA +++S V D+ +G S+ S VRTSSG ++
Sbjct: 37 VKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSDVRTSSGMLIS 96
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D I+ IE RI+ +TF P ENGE +QVL YE GQKY+PH+DYF D+ N GG R+A
Sbjct: 97 KNKDPIVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIVQGGHRLA 156
Query: 197 TVLMYLSDVEEGGETVFPNAQ------GNISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
TVLMYL++V +GGETVFP A+ G+ + ++LSEC K G+++KP+ GDALLF+
Sbjct: 157 TVLMYLTNVTKGGETVFPEAEEPPRRRGSKKS----SDLSECAKKGIAVKPRRGDALLFF 212
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
S+ +A D +SLH GCPV++G KWS+TKWI V+ +
Sbjct: 213 SLDTNAIPDTNSLHAGCPVLEGEKWSATKWIHVDSF 248
>gi|449459442|ref|XP_004147455.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
gi|449515722|ref|XP_004164897.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 319
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 113/204 (55%), Positives = 151/204 (74%), Gaps = 2/204 (0%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
+S +PRAF+Y FLS EEC++LIN A + +S +V + TG+S S+ RTS+G FL + +
Sbjct: 63 LSSKPRAFLYKGFLSAEECQHLINSAKGKLHQS-LVAAGTGQSVTSKERTSTGMFLHKAQ 121
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
D+I+ IE RIA +TF PL+NGE +Q+L YE GQKYEPHFD+F D N GG R+AT+L
Sbjct: 122 DEIVARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATIL 181
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
MYLS+VE+GGETVFPN+ +S +LSECGK G ++PK+GDALLF+SM P+ + D
Sbjct: 182 MYLSNVEKGGETVFPNSPVKLSEEE-KADLSECGKVGYGVRPKLGDALLFFSMNPNVTPD 240
Query: 260 PSSLHGGCPVIKGNKWSSTKWIRV 283
+S HG CPVI+G KWS+TKWI +
Sbjct: 241 TTSYHGSCPVIEGEKWSATKWIHM 264
>gi|388495016|gb|AFK35574.1| unknown [Lotus japonicus]
Length = 297
Score = 235 bits (599), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 112/212 (52%), Positives = 152/212 (71%), Gaps = 2/212 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +SW+PRAFVY FL+ EC++LI+LA +++S V D+ G SK S VRTSSG F++
Sbjct: 36 VKQVSWKPRAFVYEGFLTGLECDHLISLAKSELKRSAVADNLPGDSKLSEVRTSSGMFIS 95
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D I+ IE +I+ +TF P ENGE +QVL YE GQKY+PH+DYF D+ N GG RMA
Sbjct: 96 KKKDPIVAGIEDKISAWTFLPKENGEDMQVLRYEHGQKYDPHYDYFTDKVNIVRGGHRMA 155
Query: 197 TVLMYLSDVEEGGETVFPNAQ--GNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
TVL+YL++V GGETVFP A+ + ++LSEC K G+++KP+ GDALLF+S+
Sbjct: 156 TVLLYLTNVTRGGETVFPVAEEPPRRRGLETNSDLSECAKKGIAVKPRRGDALLFFSLHT 215
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
A D SLH GCPVI+G KWS+TKWI V+ +
Sbjct: 216 TAIPDTDSLHAGCPVIEGEKWSATKWIHVDSF 247
>gi|302764100|ref|XP_002965471.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
gi|300166285|gb|EFJ32891.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
Length = 264
Score = 234 bits (598), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 116/213 (54%), Positives = 151/213 (70%), Gaps = 17/213 (7%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSK---DSRVRTSSGTF 134
EV++W PR + H FLS EEC+YLI +A P + KSTVVD+ TGK++ +S+VRTS+G F
Sbjct: 60 EVLNWSPRITLLHKFLSAEECDYLIAIAGPRLAKSTVVDTSTGKARHGIESKVRTSTGMF 119
Query: 135 LAR--GRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGG 192
L+ R +I IE+RIA ++ P+ENGE LQVL YE Q Y+PH DYF D+FN K GG
Sbjct: 120 LSNYDRRYPMIEAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYFSDQFNLKRGG 179
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECG---KTGLSIKPKMGDALLF 249
QR+ATVLMYLSDVEEGGET+FP+ + ECG + GL +KP+ GDA+LF
Sbjct: 180 QRVATVLMYLSDVEEGGETIFPSVG---------DGECECGGELRKGLCVKPRKGDAILF 230
Query: 250 WSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
WS D ++D +SLHGGC V++G KWS+TKW+R
Sbjct: 231 WSAALDGNVDSNSLHGGCSVLRGEKWSATKWLR 263
>gi|148537204|dbj|BAF63493.1| prolyl 4-hydroxylase [Potamogeton distinctus]
Length = 246
Score = 234 bits (598), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 110/195 (56%), Positives = 146/195 (74%), Gaps = 1/195 (0%)
Query: 92 FLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIA 151
FLS EEC++LI L + KS V D+++GKS S +RTSSG FL R +D+ I IEKRIA
Sbjct: 3 FLSHEECDHLIALGKDKLEKSMVADNESGKSVMSEIRTSSGMFLERRQDETITRIEKRIA 62
Query: 152 DFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGET 211
+TF P ENGE +Q+LHYE GQKY+ H+DYF D+ N + GG RMATVLMYLSDV++GGET
Sbjct: 63 AWTFLPEENGEPIQILHYEKGQKYDAHYDYFHDKNNQRVGGHRMATVLMYLSDVKKGGET 122
Query: 212 VFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIK 271
VFP+A+G + V + S+C ++G ++KP+ GDALLF+S P+A+ DP+SLH CPVI+
Sbjct: 123 VFPDAEGKLLQVK-DDTWSDCARSGYAVKPRKGDALLFFSCHPNATTDPNSLHASCPVIE 181
Query: 272 GNKWSSTKWIRVNEY 286
G KWS+T+WI V +
Sbjct: 182 GEKWSATRWIHVRSF 196
>gi|218192156|gb|EEC74583.1| hypothetical protein OsI_10158 [Oryza sativa Indica Group]
Length = 299
Score = 234 bits (596), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 111/208 (53%), Positives = 146/208 (70%), Gaps = 2/208 (0%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATP-HMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
+SW PR F+Y FLS ECE+LI LA M +STVV+ +G+S S+ RTSSG FL R
Sbjct: 40 VSWSPRVFLYEGFLSDAECEHLIALAKQGRMERSTVVNGKSGESVMSKTRTSSGMFLIRK 99
Query: 139 RDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATV 198
+D+++ IE+RIA +T FP ENGE +Q+L Y G+KYEPHFDY + GG R+ATV
Sbjct: 100 QDEVVARIEERIAAWTMFPAENGESMQMLRYGQGEKYEPHFDYIRGRQASARGGHRIATV 159
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
LMYLS+V+ GGETVFP+A+ +S P S+C + G ++KP G A+LF+S+ P+A+
Sbjct: 160 LMYLSNVKMGGETVFPDAEARLSQ-PKDETWSDCAEQGFAVKPTKGSAVLFFSLYPNATF 218
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
DP SLHG CPVI+G KWS+TKWI V Y
Sbjct: 219 DPGSLHGSCPVIQGEKWSATKWIHVRSY 246
>gi|145345764|ref|XP_001417370.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577597|gb|ABO95663.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 328
Score = 233 bits (595), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 112/209 (53%), Positives = 146/209 (69%), Gaps = 3/209 (1%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E +SW P A VY FL++EEC++L LATP + +STVVD+ G S S +RTSSG FL
Sbjct: 56 IERVSWRPHAEVYRGFLTREECDHLKALATPSLGRSTVVDASNGGSVPSDIRTSSGMFLL 115
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTK--NGGQR 194
RG D ++ IE+RIA +T P +GEG QVL YE GQ+Y PHFDYF DEFN K GGQR
Sbjct: 116 RGEDDVVASIERRIASWTHVPESHGEGFQVLRYEFGQEYRPHFDYFQDEFNQKREKGGQR 175
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+ATVLMYL+DVEEGGET+FP+A+ + ++ S C L++KP+ GDAL F S+
Sbjct: 176 VATVLMYLTDVEEGGETIFPDAEAGANP-GGGDDASSCAAGKLAVKPRKGDALFFRSLHH 234
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRV 283
+ + D S H GCPV+KG K+S+TKW+ V
Sbjct: 235 NGTSDAMSSHAGCPVVKGVKFSATKWMHV 263
>gi|108706361|gb|ABF94156.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|222624253|gb|EEE58385.1| hypothetical protein OsJ_09545 [Oryza sativa Japonica Group]
Length = 299
Score = 233 bits (594), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 111/208 (53%), Positives = 146/208 (70%), Gaps = 2/208 (0%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATP-HMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
+SW PR F+Y FLS ECE+LI LA M +STVV+ +G+S S+ RTSSG FL R
Sbjct: 40 VSWSPRVFLYEGFLSDVECEHLIALAKQGRMERSTVVNGKSGESVMSKTRTSSGMFLIRK 99
Query: 139 RDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATV 198
+D+++ IE+RIA +T FP ENGE +Q+L Y G+KYEPHFDY + GG R+ATV
Sbjct: 100 QDEVVARIEERIAAWTMFPAENGESMQMLRYGQGEKYEPHFDYIRGRQASARGGHRIATV 159
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
LMYLS+V+ GGETVFP+A+ +S P S+C + G ++KP G A+LF+S+ P+A+
Sbjct: 160 LMYLSNVKMGGETVFPDAEARLSQ-PKDETWSDCAEQGFAVKPTKGSAVLFFSLYPNATF 218
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
DP SLHG CPVI+G KWS+TKWI V Y
Sbjct: 219 DPGSLHGSCPVIQGEKWSATKWIHVRSY 246
>gi|414870897|tpg|DAA49454.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
Length = 222
Score = 232 bits (592), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 105/131 (80%), Positives = 118/131 (90%)
Query: 72 RAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
+ E W EV+SWEPRAFVYHNFLSKEEC++LI+LA PHM+KSTVVDS TG SKDSRVRTSS
Sbjct: 91 KGEPWTEVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGGSKDSRVRTSS 150
Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG 191
G FL RG+DKIIR IEKRIAD+TF P+E GEGLQVLHYE GQKYEPHFDYF D++NTKNG
Sbjct: 151 GMFLRRGQDKIIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFDYFHDDYNTKNG 210
Query: 192 GQRMATVLMYL 202
GQR+AT+LMYL
Sbjct: 211 GQRIATLLMYL 221
>gi|6437556|gb|AAF08583.1|AC011623_16 unknown protein [Arabidopsis thaliana]
Length = 278
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 110/210 (52%), Positives = 151/210 (71%), Gaps = 19/210 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +S +PRAFVY FL+ EC++LI+LA ++++S V D+D G+S+ S VRTSSGTF++
Sbjct: 38 VKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFIS 97
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+G+D I+ IE +++ +TF P ENGE LQVL YE GQKY+ HFDYF D+ N GG R+A
Sbjct: 98 KGKDPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRIA 157
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVL+YLS+V +GGETVFP+AQ + +KPK G+ALLF++++ DA
Sbjct: 158 TVLLYLSNVTKGGETVFPDAQ-------------------VCLKPKKGNALLFFNLQQDA 198
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
DP SLHGGCPVI+G KWS+TKWI V+ +
Sbjct: 199 IPDPFSLHGGCPVIEGEKWSATKWIHVDSF 228
>gi|297802348|ref|XP_002869058.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
gi|297314894|gb|EFH45317.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
Length = 245
Score = 230 bits (586), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 116/201 (57%), Positives = 144/201 (71%), Gaps = 30/201 (14%)
Query: 73 AEQWVEVISWEPRAFVYHNFL--------SKEECEYLINLATPHMRKSTVVDSDTGKSKD 124
+E+W+EVI+ EPRAFVYHNFL + EECE+LI+LA P M +S V ++ TG ++
Sbjct: 51 SERWLEVIAKEPRAFVYHNFLALFFKFCKTNEECEHLISLAKPSMARSKVRNAITGLGEE 110
Query: 125 SRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD 184
S RTSSGTFL +G DKI+++IEKRI++FTF P ENGE LQV+HYE GQK+EPHFD F
Sbjct: 111 SSSRTSSGTFLRKGHDKIVKEIEKRISEFTFIPEENGEALQVIHYEVGQKFEPHFDGF-- 168
Query: 185 EFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMG 244
QR+ATVLMYLSDV++GGETVFP A+G S K G+S++PK G
Sbjct: 169 --------QRIATVLMYLSDVDKGGETVFPEAKGIKS------------KKGVSVRPKKG 208
Query: 245 DALLFWSMKPDASLDPSSLHG 265
DALLFWSM+PD S DPSS HG
Sbjct: 209 DALLFWSMRPDGSQDPSSKHG 229
>gi|326501992|dbj|BAK06488.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 306
Score = 229 bits (585), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 113/210 (53%), Positives = 148/210 (70%), Gaps = 4/210 (1%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPH-MRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
+SW PRAF+Y FL++ EC++L+ LA ++KS VVD TGKS S VRTSSGTFLA+
Sbjct: 41 VSWRPRAFLYKGFLTEAECDHLVALAEEGGLQKSMVVDRQTGKSVMSEVRTSSGTFLAKK 100
Query: 139 RDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD--EFNTKNGGQRMA 196
+D+++ IE RIA +T P ENGE +QVL YE GQKYEPH D+ + + GG R+A
Sbjct: 101 QDQVVATIEARIAAWTLLPQENGESIQVLRYENGQKYEPHVDFIRHAAKGHHSRGGHRVA 160
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVLMYLSDV+ GGETVFPN+ + P + SEC + G ++KP GDA+LF+S+ P+
Sbjct: 161 TVLMYLSDVKMGGETVFPNSDAK-TLQPKDDTQSECARRGYAVKPVKGDAVLFFSLHPNG 219
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
+ D SLHGGCPVI+G KWS+TKWI V +
Sbjct: 220 TTDRDSLHGGCPVIEGEKWSATKWIHVRPF 249
>gi|116309432|emb|CAH66506.1| OSIGBa0111I14.1 [Oryza sativa Indica Group]
Length = 267
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 117/213 (54%), Positives = 143/213 (67%), Gaps = 12/213 (5%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EVISW PR V+HNFLS EEC+YL ++A P ++ STVVD TGK S VRTSSG F++
Sbjct: 62 EVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVKSNVRTSSGMFVSS 121
Query: 138 GRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
K +I+ IEKRI+ ++ P ENGE +QVL YE Q Y PH DYF D FN K GGQR+
Sbjct: 122 EERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEPSQYYRPHHDYFSDTFNIKRGGQRV 181
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKT--GLSIKPKMGDALLFWSMK 253
AT+LMYL+D EGGET FP A E S GK GL +KP GDA+LFWSM
Sbjct: 182 ATMLMYLTDGVEGGETHFPQAGD--------GECSCGGKMVKGLCVKPNKGDAVLFWSMG 233
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
D D +S+HGGCPV++G KWS+TKW+R E+
Sbjct: 234 LDGETDSNSIHGGCPVLEGEKWSATKWMRQKEF 266
>gi|115457822|ref|NP_001052511.1| Os04g0346000 [Oryza sativa Japonica Group]
gi|38346023|emb|CAE03962.2| OSJNBb0085H11.11 [Oryza sativa Japonica Group]
gi|113564082|dbj|BAF14425.1| Os04g0346000 [Oryza sativa Japonica Group]
gi|125547818|gb|EAY93640.1| hypothetical protein OsI_15426 [Oryza sativa Indica Group]
gi|125589953|gb|EAZ30303.1| hypothetical protein OsJ_14349 [Oryza sativa Japonica Group]
gi|215693934|dbj|BAG89133.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 267
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 117/213 (54%), Positives = 143/213 (67%), Gaps = 12/213 (5%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EVISW PR V+HNFLS EEC+YL ++A P ++ STVVD TGK S VRTSSG F++
Sbjct: 62 EVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVKSNVRTSSGMFVSS 121
Query: 138 GRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
K +I+ IEKRI+ ++ P ENGE +QVL YE Q Y PH DYF D FN K GGQR+
Sbjct: 122 EERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEPSQYYRPHHDYFSDTFNIKRGGQRV 181
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKT--GLSIKPKMGDALLFWSMK 253
AT+LMYL+D EGGET FP A E S GK GL +KP GDA+LFWSM
Sbjct: 182 ATMLMYLTDGVEGGETHFPQAGD--------GECSCGGKMVKGLCVKPNKGDAVLFWSMG 233
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
D D +S+HGGCPV++G KWS+TKW+R E+
Sbjct: 234 LDGETDSNSIHGGCPVLEGEKWSATKWMRQKEF 266
>gi|334185677|ref|NP_001189994.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
gi|332643930|gb|AEE77451.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
Length = 324
Score = 228 bits (581), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 119/271 (43%), Positives = 171/271 (63%), Gaps = 10/271 (3%)
Query: 25 IMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEP 84
I F++ L ++S + +R +N V K M+ S G V +SW P
Sbjct: 5 IFLAFSLCFLFTLPLISSAPNRFLTRSSNTRDGSVIK-MKTSASSFGFDPTRVTQLSWTP 63
Query: 85 RAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR----VRTSSGTFLARGR- 139
R F+Y FLS EEC++ I LA + KS V D+D+G+S +S V S +F+A
Sbjct: 64 RVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEDSVSVVRQSSSFIANMDS 123
Query: 140 ---DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
D I+ ++E ++A +TF P ENGE +Q+LHYE GQKYEPHFDYF D+ N + GG R+A
Sbjct: 124 LEIDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIA 183
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVLMYLS+VE+GGETVFP +G + + + +EC K G ++KP+ GDALLF+++ P+A
Sbjct: 184 TVLMYLSNVEKGGETVFPMWKGKATQLK-DDSWTECAKQGYAVKPRKGDALLFFNLHPNA 242
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+ D +SLHG CPV++G KWS+T+WI V ++
Sbjct: 243 TTDSNSLHGSCPVVEGEKWSATRWIHVKSFE 273
>gi|145343778|ref|XP_001416487.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576712|gb|ABO94780.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 255
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 110/201 (54%), Positives = 143/201 (71%), Gaps = 4/201 (1%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PRAFVY FL+ EEC++++ L+ H+ KS VVD+ TG S S +RTS+GTF++R D I
Sbjct: 1 PRAFVYEGFLTDEECDHILALSKGHLHKSGVVDAKTGGSTTSDIRTSTGTFISRAHDPTI 60
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLS 203
IE+RI ++ P+++GE LQVL YE GQ+Y+ HFDYF + +N R+ATVL+YLS
Sbjct: 61 TAIEERIELWSQIPVDHGEALQVLRYENGQEYKAHFDYFFHKGGKRN--NRIATVLLYLS 118
Query: 204 DVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSL 263
DVEEGGETVFPN ++ ++ SECG G S+K + GDALLFWSMKP LDP S
Sbjct: 119 DVEEGGETVFPNT--DVPTDRDRSQYSECGNGGKSVKARKGDALLFWSMKPGGELDPGSS 176
Query: 264 HGGCPVIKGNKWSSTKWIRVN 284
H GCPVIKG KW++TKW+ VN
Sbjct: 177 HAGCPVIKGVKWTATKWMHVN 197
>gi|224069056|ref|XP_002302889.1| predicted protein [Populus trichocarpa]
gi|222844615|gb|EEE82162.1| predicted protein [Populus trichocarpa]
Length = 287
Score = 227 bits (579), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 117/211 (55%), Positives = 143/211 (67%), Gaps = 16/211 (7%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
E+ISW PR V H+FLS EEC+YL LA P +R STVVD TGK +S+VRTSSG FL+
Sbjct: 82 EIISWSPRIIVLHDFLSSEECDYLRALAKPRLRISTVVDVKTGKGIESKVRTSSGMFLSS 141
Query: 138 GRD--KIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
++++ IEKRI+ ++ P+ENGE +QVL YE Q Y+PH DYF D FN K GGQR+
Sbjct: 142 EEKTYQVVQAIEKRISVYSQVPIENGELIQVLRYEKNQYYKPHHDYFSDTFNLKRGGQRV 201
Query: 196 ATVLMYLSDVEEGGETVFPNA-QGNISAVPWWNELSECGKT---GLSIKPKMGDALLFWS 251
AT+LMYLSD EGGET FP A G S CG GLS+KP G+A+LFWS
Sbjct: 202 ATMLMYLSDNVEGGETYFPMAGSGKCS----------CGGKVVDGLSVKPIKGNAVLFWS 251
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
M D DPSS+HGGC V+ G KWS+TKW+R
Sbjct: 252 MGLDGQSDPSSIHGGCEVLSGVKWSATKWMR 282
>gi|414587756|tpg|DAA38327.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 263
Score = 227 bits (578), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 117/211 (55%), Positives = 141/211 (66%), Gaps = 16/211 (7%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EVISW PR V+HNFLS EEC+YL+ +A P ++ STVVD TGK S VRTSSG F+
Sbjct: 58 EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSDVRTSSGMFVNS 117
Query: 138 GRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
K +++ IEKRI+ F+ P ENGE +QVL YEA Q Y PH DYF D FN K GGQR+
Sbjct: 118 EERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTFNLKRGGQRV 177
Query: 196 ATVLMYLSDVEEGGETVFPNA-QGNISAVPWWNELSECGKT---GLSIKPKMGDALLFWS 251
AT+LMYL+D GGET FP A G S CG GL +KP GDA+LFWS
Sbjct: 178 ATMLMYLTDGVVGGETHFPQAGDGECS----------CGGNVVKGLCVKPNKGDAVLFWS 227
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
M D + DP+S+H GCPV+KG KWS+TKW+R
Sbjct: 228 MGLDGNTDPNSIHSGCPVLKGEKWSATKWMR 258
>gi|242075290|ref|XP_002447581.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
gi|241938764|gb|EES11909.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
Length = 263
Score = 226 bits (576), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 117/211 (55%), Positives = 141/211 (66%), Gaps = 16/211 (7%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EVISW PR ++HNFLS EEC+YL+ +A P ++ STVVD TGK S VRTSSG F+
Sbjct: 58 EVISWTPRIIIFHNFLSSEECDYLMAIARPRLQMSTVVDVATGKGVKSDVRTSSGMFVNS 117
Query: 138 GRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
K +I+ IEKRI+ F+ P ENGE +QVL YEA Q Y PH DYF D FN K GGQR+
Sbjct: 118 EERKSPVIQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTFNLKRGGQRV 177
Query: 196 ATVLMYLSDVEEGGETVFPNA-QGNISAVPWWNELSECGKT---GLSIKPKMGDALLFWS 251
AT+LMYL+D EGGET F A G S CG GL +KP GDA+LFWS
Sbjct: 178 ATMLMYLTDGVEGGETHFLQAGDGECS----------CGGNVVKGLCVKPNKGDAVLFWS 227
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
M D + DP+S+H GCPV+KG KWS+TKW+R
Sbjct: 228 MGLDGNTDPNSIHSGCPVLKGEKWSATKWMR 258
>gi|159476104|ref|XP_001696154.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Chlamydomonas reinhardtii]
gi|158275325|gb|EDP01103.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Chlamydomonas reinhardtii]
Length = 343
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 108/210 (51%), Positives = 139/210 (66%), Gaps = 2/210 (0%)
Query: 79 VISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
V+SW PR F+Y L+ EEC+ L++ + + +S V D+ TG S +RTSSG F RG
Sbjct: 69 VLSWHPRVFLYKGILTHEECDQLMDNSRSRLERSGVSDATTGAGAVSDIRTSSGMFYERG 128
Query: 139 RDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATV 198
++++ IE R+A +T P+ENGEG+QVL YE QKY+PH DYF + NGG RMATV
Sbjct: 129 ETELVKRIENRLAMWTMLPVENGEGIQVLRYEKTQKYDPHHDYFSFDGADDNGGNRMATV 188
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
LMYL+ EEGGETVFP G + V S + GL++KP GDA+LFWS++PD
Sbjct: 189 LMYLATPEEGGETVFPKVVGWV--VQLTTTASAPCRQGLAVKPAKGDAVLFWSIRPDGRF 246
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
DP SLHG CPVIKG KWS+TKWI V Y +
Sbjct: 247 DPGSLHGSCPVIKGVKWSATKWIHVGHYAM 276
>gi|255072321|ref|XP_002499835.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
gi|226515097|gb|ACO61093.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
Length = 454
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 108/213 (50%), Positives = 142/213 (66%), Gaps = 12/213 (5%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
EP+A+++ NFL+ ECE+L+ LA + STVV S S++RTS+G FL RG+D
Sbjct: 176 EPKAYMFRNFLTPHECEHLMQLAKKQLAPSTVVGDKGSGSMVSKIRTSAGMFLGRGQDPT 235
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLM 200
+R IE+RIA + P NGEGLQ+L YE GQKY+PHFDYF D+ N+ + GGQRMAT+L+
Sbjct: 236 VRAIEERIAAASGLPEPNGEGLQILRYENGQKYDPHFDYFHDQVNSSPRRGGQRMATMLI 295
Query: 201 YLSDVEEGGETVFPNAQGNISAVPW-------WNELSECGKTGLSIKPKMGDALLFWSMK 253
YL D EGGET+FPN + W N S+C K G+ +K GDA+LFWS+K
Sbjct: 296 YLEDTTEGGETIFPNG---VRPEDWDADEPGNHNSWSDCAKKGIPVKSHRGDAVLFWSLK 352
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
D +LD SLHG CPVI G KW++ KWIRV ++
Sbjct: 353 EDYTLDNGSLHGACPVIAGEKWTAVKWIRVAKF 385
>gi|308799217|ref|XP_003074389.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116000560|emb|CAL50240.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 294
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 110/210 (52%), Positives = 140/210 (66%), Gaps = 5/210 (2%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E +SW P A VY FL++ ECE++ LAT ++ STVVD+ TG S +RTSSG FL
Sbjct: 26 IERLSWAPHAEVYRGFLTEAECEHIERLATAELKPSTVVDASTGGDASSEIRTSSGMFLG 85
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTK--NGGQR 194
R D +I IE RIA +T P +GEG QVL YE Q+Y H+DYF D+FN K GGQR
Sbjct: 86 RAEDDVIEAIEARIAAWTHVPESHGEGFQVLRYEKHQEYRAHYDYFHDKFNVKREKGGQR 145
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
M TVLMYLSDVEEGGETVFP + P +E SEC + L+++P+ GDAL F S++
Sbjct: 146 MGTVLMYLSDVEEGGETVFPKFE---DGTPAGSEASECARNKLAVRPRKGDALFFRSLRH 202
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVN 284
D D S H GCPVI+G K+S+TKW+ V+
Sbjct: 203 DGVPDTFSEHAGCPVIRGVKFSATKWMHVS 232
>gi|357445147|ref|XP_003592851.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355481899|gb|AES63102.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 281
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 115/209 (55%), Positives = 141/209 (67%), Gaps = 12/209 (5%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EV+SW PR + HNFLS EEC+YL +A P ++ STVVD++TGK S VRTSSG FL+
Sbjct: 76 EVLSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGKGIKSDVRTSSGMFLSH 135
Query: 138 GRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
K +I IEKRI+ ++ P+ENGE +QVL YE Q Y PH DYF D FN K GGQR+
Sbjct: 136 EERKYPMIHAIEKRISVYSQIPIENGELMQVLRYEKNQYYRPHHDYFSDTFNLKRGGQRI 195
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGK--TGLSIKPKMGDALLFWSMK 253
AT+LMYL D EGGET FP+A +E S GK GL +KP G+A+LFWSM
Sbjct: 196 ATMLMYLGDNVEGGETHFPSAGS--------DECSCGGKLTKGLCVKPVKGNAVLFWSMG 247
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D DP S+HGGCPV+ G KWS+TKW+R
Sbjct: 248 LDGQSDPDSVHGGCPVLAGEKWSATKWMR 276
>gi|326503458|dbj|BAJ86235.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326516134|dbj|BAJ88090.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 266
Score = 223 bits (569), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 126/265 (47%), Positives = 158/265 (59%), Gaps = 18/265 (6%)
Query: 23 LLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISW 82
LL + TF L ++ +L + R+ +D S ++ D EVISW
Sbjct: 10 LLPLLTFVALGMILGSLLQLAFF----RRLDDHSHTRHFDNDQGAADLRLGYVKPEVISW 65
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDK- 141
PR V+HNFLS EEC+YL +A P + STVVD TGK S VRTSSG F+ K
Sbjct: 66 TPRIIVFHNFLSSEECDYLREIARPRLEISTVVDVATGKGVKSDVRTSSGMFVNSEERKL 125
Query: 142 -IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLM 200
+I+ IEKRI+ F+ P+ENGE +QVL YE Q Y PH DYF D FN K GGQR+AT+LM
Sbjct: 126 PVIKAIEKRISVFSQIPVENGELIQVLRYEPNQYYRPHHDYFSDTFNLKRGGQRVATMLM 185
Query: 201 YLSDVEEGGETVFPNAQGNISAVPWWNELSECGK---TGLSIKPKMGDALLFWSMKPDAS 257
YL+D EGGET FP A G+ + CG GL +KP GDA+LFWSM D +
Sbjct: 186 YLTDGVEGGETHFPQA-GDGECI--------CGGRLVRGLCVKPNKGDAVLFWSMGLDGN 236
Query: 258 LDPSSLHGGCPVIKGNKWSSTKWIR 282
D +SLH GC V+KG KWS+TKW+R
Sbjct: 237 TDSNSLHSGCAVVKGEKWSATKWMR 261
>gi|15233345|ref|NP_195307.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|3805848|emb|CAA21468.1| putative protein [Arabidopsis thaliana]
gi|7270534|emb|CAB81491.1| putative protein [Arabidopsis thaliana]
gi|332661175|gb|AEE86575.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 272
Score = 223 bits (568), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 113/200 (56%), Positives = 142/200 (71%), Gaps = 30/200 (15%)
Query: 74 EQWVEVISWEPRAFVYHNFL--------SKEECEYLINLATPHMRKSTVVDSDTGKSKDS 125
E+W+EVI+ EPRAFVYHNFL + EEC++LI+LA P M +S V ++ TG ++S
Sbjct: 85 ERWLEVITKEPRAFVYHNFLALFFKICKTNEECDHLISLAKPSMARSKVRNALTGLGEES 144
Query: 126 RVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE 185
RTSSGTF+ G DKI+++IEKRI++FTF P ENGE LQV++YE GQK+EPHFD F
Sbjct: 145 SSRTSSGTFIRSGHDKIVKEIEKRISEFTFIPQENGETLQVINYEVGQKFEPHFDGF--- 201
Query: 186 FNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGD 245
QR+ATVLMYLSDV++GGETVFP A+G S K G+S++PK GD
Sbjct: 202 -------QRIATVLMYLSDVDKGGETVFPEAKGIKS------------KKGVSVRPKKGD 242
Query: 246 ALLFWSMKPDASLDPSSLHG 265
ALLFWSM+PD S DPSS HG
Sbjct: 243 ALLFWSMRPDGSRDPSSKHG 262
>gi|297824279|ref|XP_002880022.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
gi|297325861|gb|EFH56281.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
Length = 283
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 117/211 (55%), Positives = 139/211 (65%), Gaps = 16/211 (7%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EV+SW PR V H+FLS EECEYL +A P ++ STVVD TGK S VRTSSG FL
Sbjct: 78 EVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVKSDVRTSSGMFLTH 137
Query: 138 GR--DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
+ II+ IEKRIA F+ P ENGE +QVL YE Q Y+PH DYF D FN K GGQR+
Sbjct: 138 VERSNPIIQAIEKRIAVFSQVPAENGELIQVLRYEPKQFYKPHHDYFADTFNLKRGGQRV 197
Query: 196 ATVLMYLSDVEEGGETVFPNA-QGNISAVPWWNELSECG---KTGLSIKPKMGDALLFWS 251
AT+LMYL+D EGGET FP A G+ + CG G+S+KP GDA+LFWS
Sbjct: 198 ATMLMYLTDDVEGGETYFPLAGDGDCT----------CGGKIMKGISVKPTKGDAVLFWS 247
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
M D DP S+HGGC V+ G KWS+TKW+R
Sbjct: 248 MGLDGQSDPRSIHGGCEVLSGEKWSATKWMR 278
>gi|303285562|ref|XP_003062071.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226456482|gb|EEH53783.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 522
Score = 221 bits (564), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 110/235 (46%), Positives = 147/235 (62%), Gaps = 25/235 (10%)
Query: 72 RAEQWVEVISW-----------EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
RA+Q V V + P+A+++ NFL++EEC +LI LA + STVV
Sbjct: 213 RADQLVNVPGFPRSPPLVLSATRPKAYLFRNFLTEEECRHLIALAKAQLAPSTVVADGGK 272
Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFD 180
KS S +RTS+G FL +G+ +R +E+R+A P ENGEG+Q+L YE GQKY+PH+D
Sbjct: 273 KSTKSGIRTSAGMFLTKGQTPTVRMVEERVAAAVGLPEENGEGMQILRYEHGQKYDPHYD 332
Query: 181 YFMDEFN--TKNGGQRMATVLMYLSDVEEGGETVFPNAQ-------GNISAVPWWNELSE 231
YF D+ N GGQRMAT+L+YL D EEGGET+FPNA+ G S+
Sbjct: 333 YFHDKINPSPNRGGQRMATMLIYLKDTEEGGETIFPNAKKPEGFHDGEKDGA-----FSD 387
Query: 232 CGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
C K GL +K K GDA+LFWS+ D LD SLHG CPV++G KW++ KWIRV ++
Sbjct: 388 CAKRGLPVKSKRGDAVLFWSLTSDYKLDEGSLHGACPVLRGEKWTAVKWIRVAKF 442
>gi|15224220|ref|NP_181836.1| P4H isoform 1 [Arabidopsis thaliana]
gi|3763917|gb|AAC64297.1| hypothetical protein [Arabidopsis thaliana]
gi|20197628|gb|AAM15158.1| hypothetical protein [Arabidopsis thaliana]
gi|26450452|dbj|BAC42340.1| unknown protein [Arabidopsis thaliana]
gi|29824245|gb|AAP04083.1| unknown protein [Arabidopsis thaliana]
gi|330255112|gb|AEC10206.1| P4H isoform 1 [Arabidopsis thaliana]
Length = 283
Score = 221 bits (564), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 117/211 (55%), Positives = 138/211 (65%), Gaps = 16/211 (7%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EV+SW PR V H+FLS EECEYL +A P ++ STVVD TGK S VRTSSG FL
Sbjct: 78 EVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVKSDVRTSSGMFLTH 137
Query: 138 GRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
II+ IEKRIA F+ P ENGE +QVL YE Q Y+PH DYF D FN K GGQR+
Sbjct: 138 VERSYPIIQAIEKRIAVFSQVPAENGELIQVLRYEPQQFYKPHHDYFADTFNLKRGGQRV 197
Query: 196 ATVLMYLSDVEEGGETVFPNA-QGNISAVPWWNELSECG---KTGLSIKPKMGDALLFWS 251
AT+LMYL+D EGGET FP A G+ + CG G+S+KP GDA+LFWS
Sbjct: 198 ATMLMYLTDDVEGGETYFPLAGDGDCT----------CGGKIMKGISVKPTKGDAVLFWS 247
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
M D DP S+HGGC V+ G KWS+TKW+R
Sbjct: 248 MGLDGQSDPRSIHGGCEVLSGEKWSATKWMR 278
>gi|412992163|emb|CCO19876.1| predicted protein [Bathycoccus prasinos]
Length = 350
Score = 220 bits (561), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 107/219 (48%), Positives = 147/219 (67%), Gaps = 10/219 (4%)
Query: 76 WVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
W E ISW+PRAFV H+ LS+EECE ++ +A P M++STVVDS TG+ K +RTS TFL
Sbjct: 79 WTEPISWQPRAFVLHSILSEEECEEILRIAKPMMKRSTVVDSITGEIKTDPIRTSKQTFL 138
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTK------ 189
ARG+ ++ +E+R++ FT P NGE +Q+L Y G+KY H D + E NTK
Sbjct: 139 ARGKYPVVTRVEERLSRFTMLPWYNGEDMQILSYGVGEKYSAHHD--VGEKNTKSGQQLS 196
Query: 190 -NGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNE-LSECGKTGLSIKPKMGDAL 247
+GGQR+ATVL+YL D EEGGET FP+++ + + SEC K G++ KPK GD L
Sbjct: 197 ADGGQRVATVLLYLQDTEEGGETAFPDSEWIEPESEYAQQKFSECAKNGVAFKPKRGDGL 256
Query: 248 LFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
LF+S+ P+ +D S+H GCPV+KG KW++TKWI +
Sbjct: 257 LFFSITPEGDIDQKSMHAGCPVVKGTKWTATKWIHARPF 295
>gi|125542543|gb|EAY88682.1| hypothetical protein OsI_10157 [Oryza sativa Indica Group]
Length = 321
Score = 219 bits (559), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 109/225 (48%), Positives = 147/225 (65%), Gaps = 19/225 (8%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATP-HMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
+SW PRAF+Y FLS EC++LI+LA M KSTVVD ++G+S S+VRTSSG FL +
Sbjct: 45 VSWRPRAFLYEGFLSDAECDHLISLAKQGKMEKSTVVDGESGESVTSKVRTSSGMFLDKK 104
Query: 139 RDKIIRDIEKRIADFTFFPLE-----------------NGEGLQVLHYEAGQKYEPHFDY 181
+D+++ IE+RIA +T P E NGE +Q+L Y G+KYEPHFDY
Sbjct: 105 QDEVVARIEERIAAWTMLPTECIIFYCFANFAILKLSENGESMQILRYGQGEKYEPHFDY 164
Query: 182 FMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKP 241
+ G R+ATVLMYLS+V+ GGET+FP+ + +S P S+C + G ++KP
Sbjct: 165 ISGRQGSTREGDRVATVLMYLSNVKMGGETIFPDCEARLSQ-PKDETWSDCAEQGFAVKP 223
Query: 242 KMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
G A+LF+S+ P+A+LD SLHG CPVI+G KWS+TKWI V Y
Sbjct: 224 AKGSAVLFFSLHPNATLDTDSLHGSCPVIEGEKWSATKWIHVRSY 268
>gi|357162904|ref|XP_003579560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 266
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 114/209 (54%), Positives = 138/209 (66%), Gaps = 12/209 (5%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EVISW PR V+HNFLS EEC++L +A P + STVVD TGK S VRTSSG F+
Sbjct: 61 EVISWTPRIIVFHNFLSSEECDFLKEIARPRLEISTVVDVATGKGVKSDVRTSSGMFVNS 120
Query: 138 GRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
K +I+ IEKRI+ F+ P+ENGE +QVL YE Q Y PH DYF D FN K GGQR+
Sbjct: 121 EERKFPVIQAIEKRISVFSQIPVENGELIQVLRYEPSQYYRPHHDYFSDTFNLKRGGQRV 180
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKT--GLSIKPKMGDALLFWSMK 253
AT+LMYL+D EGGET FP A E S G+ GL +KP GDA+LFWSM
Sbjct: 181 ATMLMYLTDGVEGGETHFPQAGD--------GECSCGGRIVRGLCVKPNKGDAVLFWSMG 232
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D + D +S+H GC V+KG KWS+TKW+R
Sbjct: 233 LDGNTDSNSIHSGCAVLKGEKWSATKWMR 261
>gi|449468746|ref|XP_004152082.1| PREDICTED: putative prolyl 4-hydroxylase-like [Cucumis sativus]
Length = 290
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 113/209 (54%), Positives = 138/209 (66%), Gaps = 12/209 (5%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EV+SW PR V HNFLS +EC+YL +A + STVVD+ TGK S RTSSG FL+
Sbjct: 83 EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSH 142
Query: 138 GRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
+++ IEKRI+ ++ P+ENGE +QVL YE Q Y+PH DYF D FN K GGQR+
Sbjct: 143 HEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLKRGGQRI 202
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKT--GLSIKPKMGDALLFWSMK 253
AT+LMYLS+ EGGET FP A E S GKT GLS+KP GDA+LFWSM
Sbjct: 203 ATMLMYLSENIEGGETYFPKAGS--------GECSCGGKTVPGLSVKPAKGDAVLFWSMG 254
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D DP S+HGGC V+ G KWS+TKW+R
Sbjct: 255 LDGQSDPKSIHGGCEVLSGEKWSATKWMR 283
>gi|307102962|gb|EFN51227.1| hypothetical protein CHLNCDRAFT_28161 [Chlorella variabilis]
Length = 300
Score = 219 bits (557), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 99/216 (45%), Positives = 150/216 (69%), Gaps = 8/216 (3%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDT-GKSKDSRVRTSSGTFL 135
++V+SW+PR F+Y L++EEC++++ A P + +S VVD D G S +RTS G F
Sbjct: 16 LKVLSWDPRIFLYQRLLTEEECDHMMTKAGPRLTRSGVVDVDNPGGESVSDIRTSYGMFF 75
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
RG D+++R++E+R+++++ P +GEG+QVL YE G++Y+PHFDYF D + +NGG R+
Sbjct: 76 DRGEDEVVREVERRLSEWSLIPPGHGEGIQVLRYENGEEYKPHFDYFFDNLSVQNGGNRL 135
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWN---ELSECGKTGLSIKPKMGDALLFWSM 252
AT+LMYL++ E GGETVFP N+ A P SEC GL++KP+ GDA+LF+S+
Sbjct: 136 ATILMYLAEPEFGGETVFP----NVKAPPEQTLEAGYSECATQGLAVKPRKGDAVLFFSL 191
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
+ + +LD SLHG CP +KG K+++TKW V Y +
Sbjct: 192 RTEGTLDKGSLHGSCPTLKGFKFAATKWYHVAHYAM 227
>gi|356576923|ref|XP_003556579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 287
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 114/209 (54%), Positives = 139/209 (66%), Gaps = 12/209 (5%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EV++W PR + HNFLS EEC+YL +A P + S VVD+ TGK S VRTSSG FL
Sbjct: 82 EVLNWSPRIILLHNFLSMEECDYLRAIALPRLHISNVVDTKTGKGIKSDVRTSSGMFLNP 141
Query: 138 GRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
K +++ IEKRI+ ++ P+ENGE +QVL YE Q Y+PH DYF D FN K GGQR+
Sbjct: 142 QERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPHHDYFSDTFNLKRGGQRI 201
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGK--TGLSIKPKMGDALLFWSMK 253
AT+LMYLSD EGGET FP A E S GK GLS+KP G+A+LFWSM
Sbjct: 202 ATMLMYLSDNIEGGETYFPLAGS--------GECSCGGKLVKGLSVKPIKGNAVLFWSMG 253
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D DP+S+HGGC VI G KWS+TKW+R
Sbjct: 254 LDGQSDPNSVHGGCEVISGEKWSATKWMR 282
>gi|308801080|ref|XP_003075321.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116061875|emb|CAL52593.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 541
Score = 218 bits (554), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 107/201 (53%), Positives = 139/201 (69%), Gaps = 4/201 (1%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PRAF+Y NFLS++ECE+L+ L+ + KS VVD+ TG S S VRTS+GTF++R D II
Sbjct: 265 PRAFLYENFLSEKECEHLLALSKGKLHKSGVVDAQTGGSSLSEVRTSTGTFISRKYDDII 324
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLS 203
+E+RI ++ P + E Q+L YE GQ+Y+ HFDYF + +N R+ATVL+YLS
Sbjct: 325 AGVEERIELWSQIPQSHHEAFQILRYEPGQEYKAHFDYFFHKSGMRN--NRIATVLLYLS 382
Query: 204 DVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSL 263
DVEEGGETVFPN ++ + SECG G ++K + GDALLFWSMKP LD S
Sbjct: 383 DVEEGGETVFPNT--DVPTSRNRSMYSECGNGGKALKARKGDALLFWSMKPGGELDAGSS 440
Query: 264 HGGCPVIKGNKWSSTKWIRVN 284
H GCPVIKG KW++TKW+ VN
Sbjct: 441 HAGCPVIKGEKWTATKWMHVN 461
>gi|145345836|ref|XP_001417405.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577632|gb|ABO95698.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 330
Score = 217 bits (552), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 105/214 (49%), Positives = 149/214 (69%), Gaps = 11/214 (5%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+P+A++ NFLS EEC++L+ LA + STVV + G S S +RTS+G FL +G+DKI
Sbjct: 48 QPKAYLLRNFLSAEECDHLMKLAKRELAPSTVV-GEAGDSVPSDIRTSAGMFLRKGQDKI 106
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFN--TKNGGQRMATVLM 200
++ IE+RIA + P++NGEG+Q+L Y+ GQKY+PHFDYF D+ N K GGQR+AT+L+
Sbjct: 107 VKAIEERIARLSGTPVDNGEGMQILRYDVGQKYDPHFDYFHDKVNPAPKRGGQRLATMLI 166
Query: 201 YLSDVEEGGETVFPNA------QGNISAVPWWN--ELSECGKTGLSIKPKMGDALLFWSM 252
YL D ++GGET FPNA + + P+ + E ++C K G+ +K GDA+LF+SM
Sbjct: 167 YLVDTDKGGETTFPNAKLPQSFEADEPENPFASHIEHTDCAKKGIPVKSVRGDAILFFSM 226
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
D LD SLHG CPVI+G KW++ KWIRV ++
Sbjct: 227 TQDGVLDRGSLHGACPVIEGQKWTAVKWIRVGKF 260
>gi|255637879|gb|ACU19258.1| unknown [Glycine max]
Length = 287
Score = 216 bits (550), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 114/209 (54%), Positives = 138/209 (66%), Gaps = 12/209 (5%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EV++W PR + HNFLS EEC+YL LA P + STVVD+ TGK S VRTSSG FL
Sbjct: 82 EVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTKTGKGIKSDVRTSSGMFLNS 141
Query: 138 GRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
K +++ IEKRI+ ++ P+ENGE +QVL YE Q Y+P DYF D FN K GGQ +
Sbjct: 142 KERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPRHDYFFDTFNLKRGGQGI 201
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGK--TGLSIKPKMGDALLFWSMK 253
AT+LMYLSD EGGET FP A E S GK GLS+KP G+A+LFWSM
Sbjct: 202 ATMLMYLSDNIEGGETYFPLAGS--------GECSCGGKLVKGLSVKPIKGNAVLFWSMG 253
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D DP+S+HGGC VI G KWS+TKW+R
Sbjct: 254 LDGQSDPNSVHGGCEVISGEKWSATKWLR 282
>gi|224034451|gb|ACN36301.1| unknown [Zea mays]
gi|413945801|gb|AFW78450.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
Length = 295
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 106/207 (51%), Positives = 139/207 (67%), Gaps = 15/207 (7%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
IS +PR F+Y +FLS +E +LI+LA +++S V D+ +GKS S
Sbjct: 54 ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSE------------- 100
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
D I+ IE +IA +TF P ENGE +QVL Y+ G+KYEPH+DYF D NT GG R ATVL
Sbjct: 101 DPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVRGGHRYATVL 160
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
+YL+DV EGGETVFP A+ A LSEC + G++++P+ GDALLF+++ PD + D
Sbjct: 161 LYLTDVPEGGETVFPLAEEPDDAKD--ATLSECAQKGIAVRPRKGDALLFFNLNPDGTTD 218
Query: 260 PSSLHGGCPVIKGNKWSSTKWIRVNEY 286
SLHGGCPVIKG KWS+TKWIRV +
Sbjct: 219 SVSLHGGCPVIKGEKWSATKWIRVASF 245
>gi|225433714|ref|XP_002268409.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296089634|emb|CBI39453.3| unnamed protein product [Vitis vinifera]
Length = 287
Score = 214 bits (544), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 121/285 (42%), Positives = 161/285 (56%), Gaps = 38/285 (13%)
Query: 27 FTFAILILLAFG-----ILSMPSSSG----DSRKANDLSSIVRKSMERSEGDEGRAEQWV 77
F +L L++FG +L + G D + + ++ + S G QWV
Sbjct: 7 FVIGLLALVSFGMIFGALLQLEFIGGLEEVDGKAFPSFRKVHKRQNDGSLQLHGGISQWV 66
Query: 78 ---------------EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKS 122
E+++W PR + H+FLS EEC+YL +A P ++ STVVD+ TGK
Sbjct: 67 NDKDADILRIGYVKPEILNWSPRIILLHSFLSSEECDYLRAMAEPLLQISTVVDAQTGKG 126
Query: 123 KDSRVRTSSGTFLARGRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFD 180
S VRTSSG FL+ I+R IEKRI+ ++ P+ENGE +QVL Y+ Q Y+PH D
Sbjct: 127 IQSDVRTSSGMFLSPDDSTYPIVRAIEKRISVYSQVPVENGELIQVLRYKKSQFYKPHHD 186
Query: 181 YFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKT---GL 237
YF D FN K GGQR+AT+L+YLSD EGGET FP A CG GL
Sbjct: 187 YFSDSFNLKRGGQRVATMLIYLSDNVEGGETYFPMAGSG---------FCRCGGKSVRGL 237
Query: 238 SIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
S+ P G+A+LFWSM D DP+S+HGGC V+ G KWS+TKW+R
Sbjct: 238 SVAPVKGNAVLFWSMGLDGQSDPNSIHGGCEVLAGEKWSATKWMR 282
>gi|302765413|ref|XP_002966127.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
gi|300165547|gb|EFJ32154.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
Length = 201
Score = 214 bits (544), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 100/202 (49%), Positives = 133/202 (65%), Gaps = 1/202 (0%)
Query: 87 FVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDI 146
++ S +EC++LI LA P +R+S+V+D TG KDSR RTS G FL R D I+ I
Sbjct: 1 LIFFYLYSDDECDHLIGLALPRLRRSSVIDEKTGLGKDSRNRTSWGAFLRRDHDNIVSGI 60
Query: 147 EKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVE 206
E RI+ TF P E GE LQV+ Y+ GQK+EPH DY+ N NGG R+ T+L+YL++VE
Sbjct: 61 EDRISSITFIPKEYGESLQVVRYKTGQKFEPHQDYYKLTENNNNGGHRIGTLLLYLTNVE 120
Query: 207 EGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGG 266
GGETVFP A N+ + SEC K G+ I+P+ GD LLFW +P +DP S HGG
Sbjct: 121 NGGETVFPRALANVIN-DYSTNTSECTKKGIVIRPRRGDGLLFWITRPSGEIDPFSFHGG 179
Query: 267 CPVIKGNKWSSTKWIRVNEYKV 288
CPV+KG KW +TK++ +E K+
Sbjct: 180 CPVVKGEKWLATKFLHEHELKL 201
>gi|307102963|gb|EFN51228.1| hypothetical protein CHLNCDRAFT_141231 [Chlorella variabilis]
Length = 313
Score = 213 bits (542), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 109/216 (50%), Positives = 145/216 (67%), Gaps = 12/216 (5%)
Query: 76 WVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
W++V+ E R F+ NFL++EEC++++ LA PH+ +S VVD+ TG S+ S +RTS G FL
Sbjct: 33 WMQVLDAEARIFI--NFLTEEECDHIVALAKPHLERSGVVDTATGGSEISDIRTSKGMFL 90
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTK-NGGQR 194
RG D + IE+RIA +T P+ NGEGLQVL+Y G+KY+ DYF D+ N + NGG R
Sbjct: 91 ERGHDDTVAAIEERIARWTLLPVGNGEGLQVLNYHPGEKYD---DYFFDKVNGESNGGNR 147
Query: 195 MATVLMYLSDVEEGGETVFPN--AQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
ATVLMYL+ VEEGGETVFPN A G + + +EC + L+ KP G A+LF S+
Sbjct: 148 YATVLMYLNTVEEGGETVFPNIPAPGGDNGPTF----TECARRHLAAKPTKGSAVLFHSI 203
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
KP L+ SLH CPV+KG KWS+ KWI V Y +
Sbjct: 204 KPSGDLERRSLHTACPVVKGEKWSAPKWIHVGHYAM 239
>gi|242085722|ref|XP_002443286.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
gi|241943979|gb|EES17124.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
Length = 147
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 103/153 (67%), Positives = 115/153 (75%), Gaps = 8/153 (5%)
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL RG+D I+R IE+RIAD+T P+ENGE LQVLHY GQK+EPHFDY TK GG
Sbjct: 2 FLKRGQDTIVRTIEQRIADYTSVPIENGEPLQVLHYAVGQKFEPHFDYTDGTSVTKIGGP 61
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R AT LMYLSDVEEGGETVFPNA SA K+G+S+KPKMGDALLFWSMK
Sbjct: 62 RKATFLMYLSDVEEGGETVFPNATAKGSA--------PSAKSGISVKPKMGDALLFWSMK 113
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
PD SLDP SLHG PVIKG+KWS+TKWI VN+Y
Sbjct: 114 PDGSLDPKSLHGASPVIKGDKWSATKWIHVNKY 146
>gi|308802438|ref|XP_003078532.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
tauri]
gi|116056985|emb|CAL51412.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
tauri]
Length = 369
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 104/216 (48%), Positives = 145/216 (67%), Gaps = 11/216 (5%)
Query: 81 SWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
S +P+A++ NFLS +EC++L+ LA + STVV D G S S +RTS+G FL + +D
Sbjct: 87 SKKPKAYLMRNFLSPQECDHLMMLAKRELAPSTVV-GDGGSSVASEIRTSAGMFLRKSQD 145
Query: 141 KIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFN--TKNGGQRMATV 198
+R+IE+RIA + P++NGEG+Q+L Y+ GQKY+PHFDYF D+ N K GGQR+ATV
Sbjct: 146 DTVREIEERIARLSGVPVDNGEGMQILRYDKGQKYDPHFDYFHDKVNPAPKRGGQRVATV 205
Query: 199 LMYLSDVEEGGETVFPNA------QGNISAVPWWNEL--SECGKTGLSIKPKMGDALLFW 250
L+YL D EEGGET FPN + + P+ + ++C K G+ +K GDA+LF+
Sbjct: 206 LIYLVDTEEGGETTFPNGRLPENFEEDEPDNPFAAHIKHTDCAKNGIPVKSVRGDAILFF 265
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
SM D LD SLHG CPVI G KW++ KW+RV ++
Sbjct: 266 SMTKDGELDHGSLHGACPVIAGQKWTAVKWLRVAKF 301
>gi|412993142|emb|CCO16675.1| predicted protein [Bathycoccus prasinos]
Length = 564
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 105/210 (50%), Positives = 141/210 (67%), Gaps = 7/210 (3%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+P+A+++ NFLS EEC++L+ LA + STVV + G S S +RTS+G FL + DK
Sbjct: 285 KPKAYLFRNFLSAEECDHLMKLAKAELAPSTVVGAG-GTSVPSTIRTSAGMFLRKAADKT 343
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLM 200
+ +IE RIA + P NGEG+Q+L Y+ GQKY+PHFDYF D N K GGQRMAT+L+
Sbjct: 344 LENIEYRIAAASGTPEPNGEGMQILRYDVGQKYDPHFDYFHDAVNPSPKRGGQRMATMLI 403
Query: 201 YLSDVEEGGETVFPNAQG----NISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
YL + +EGGET+FP +++ +E SEC K GL +K GDALLFWS+ D
Sbjct: 404 YLENTKEGGETIFPRGTRAETFDLTEEGNPHEWSECTKHGLPVKSVKGDALLFWSLTDDY 463
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
LD SLHG CPV+KG KW++ KWIRV ++
Sbjct: 464 KLDMGSLHGACPVVKGQKWTAVKWIRVAKF 493
>gi|255085784|ref|XP_002505323.1| predicted protein [Micromonas sp. RCC299]
gi|226520592|gb|ACO66581.1| predicted protein [Micromonas sp. RCC299]
Length = 215
Score = 210 bits (534), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 107/224 (47%), Positives = 145/224 (64%), Gaps = 24/224 (10%)
Query: 76 WVEVISWEPRAFVYHNFLSKEECEYLINLATP---HMRKSTVVDSDTGKSKDSRVRTSSG 132
W+E ISWEPRAFVYHNFL+ EEC +L+NLA ++++TV D+ TG + SG
Sbjct: 1 WIEQISWEPRAFVYHNFLTPEECAHLVNLAKATDGGLKRATVADARTGGTF-----PGSG 55
Query: 133 TFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD-EFNTKNG 191
FL R D I+ IE+RI+ F P ++GEG+++L Y G+KY+PH DYF D + N +
Sbjct: 56 AFLLRNHDPIVTRIEERISAFAMIPADHGEGMRILRYGRGEKYDPHHDYFDDGDKNLRFY 115
Query: 192 GQRMATVLMYLSDVEEGGETVFP-----------NAQGNISAVPWWNELSECGKTGLSIK 240
GQR+ATVLMYLSDVE GGETVFP + +G S+ + S+C K L +K
Sbjct: 116 GQRVATVLMYLSDVESGGETVFPKHGAWIEPDEMDVRGRSSS----KDSSKCAKGALHVK 171
Query: 241 PKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVN 284
P+ GDALLF + + DP+SLH GCPV++G KW++TKW+R
Sbjct: 172 PRRGDALLFHNCHLNGREDPTSLHAGCPVLRGEKWTATKWMRAG 215
>gi|303282201|ref|XP_003060392.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457863|gb|EEH55161.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 369
Score = 209 bits (533), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 105/208 (50%), Positives = 138/208 (66%), Gaps = 12/208 (5%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PRA+VY FL+ EC++ I A+P + KS VVD+DTG+ S +RTS G F RG D ++
Sbjct: 83 PRAYVYRGFLTDAECDHFIARASPKLAKSNVVDTDTGEGVPSAIRTSDGMFFDRGEDDVV 142
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKN--GGQRMATVLMY 201
+E+RI+ +T P ENGEG+QVL Y GQKY+ H D F+D+FN + GGQR+ATVLMY
Sbjct: 143 DAVERRISAWTRLPTENGEGMQVLRYAGGQKYDAHLDAFVDKFNADDAHGGQRVATVLMY 202
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNE--LSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
L+DV++GGETVFP +A P + S C + G+++KP+ GDALLFWSM +
Sbjct: 203 LNDVDDGGETVFP----ETTAKPHVGDERYSACARRGVAVKPRRGDALLFWSMDETFT-- 256
Query: 260 PSSLHGGCPVIKGN-KWSSTKWIRVNEY 286
SLHGGCPV G KWS TKWI +
Sbjct: 257 -RSLHGGCPVGAGGVKWSMTKWIHKGAF 283
>gi|242047774|ref|XP_002461633.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
gi|241925010|gb|EER98154.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
Length = 275
Score = 209 bits (533), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 104/211 (49%), Positives = 141/211 (66%), Gaps = 6/211 (2%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +SW+PR FVY FLS +EC++L+ LA +K T+V + S + RTSSG FL
Sbjct: 48 VKALSWQPRIFVYKGFLSDDECDHLVTLA----KKGTMV-AHNRSSYYRQTRTSSGMFLR 102
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D ++ IE+RIA +T P EN E +Q+ Y+ GQKY+PHFDYF D+ + GG R A
Sbjct: 103 KRQDPVVSRIEERIAAWTLLPRENVEKMQIQRYQHGQKYDPHFDYFDDKIHHTRGGPRYA 162
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVLMYLS V++GGETVFP A+G S P + SEC GL++KP GDA+LF+S+ D
Sbjct: 163 TVLMYLSTVDKGGETVFPKAKGWESQ-PKDDTFSECAHKGLAVKPVKGDAVLFFSLHVDG 221
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
DP +LHG CPVI+G KWS+ WI V ++
Sbjct: 222 GPDPLTLHGSCPVIQGEKWSAPNWIHVRSFE 252
>gi|297727581|ref|NP_001176154.1| Os10g0415128 [Oryza sativa Japonica Group]
gi|255679404|dbj|BAH94882.1| Os10g0415128 [Oryza sativa Japonica Group]
Length = 241
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 97/175 (55%), Positives = 130/175 (74%), Gaps = 1/175 (0%)
Query: 114 VVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQ 173
V D+++GKS S VRTSSG FL + +D+++ IE+RIA +TF P +NGE +Q+LHY+ G+
Sbjct: 2 VADNESGKSVMSEVRTSSGMFLEKKQDEVVARIEERIAAWTFLPPDNGESIQILHYQNGE 61
Query: 174 KYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECG 233
KYEPH+DYF D+ N GG R+ATVLMYLSDV +GGET+FP A+G + P + S+C
Sbjct: 62 KYEPHYDYFHDKNNQALGGHRIATVLMYLSDVGKGGETIFPEAEGKLLQ-PKDDTWSDCA 120
Query: 234 KTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
K G ++KP GDALLF+S+ PDA+ D SLHG CPVI+G KWS+TKWI V + +
Sbjct: 121 KNGYAVKPVKGDALLFFSLHPDATTDSDSLHGSCPVIEGQKWSATKWIHVRSFDI 175
>gi|302831512|ref|XP_002947321.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
nagariensis]
gi|300267185|gb|EFJ51369.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
nagariensis]
Length = 797
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 105/218 (48%), Positives = 140/218 (64%), Gaps = 5/218 (2%)
Query: 74 EQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGT 133
E W+E ISW PRAFVYHNFL+ EC++L+ + T + +S VVDS TG+SK +RTS G
Sbjct: 490 EPWIETISWSPRAFVYHNFLTSAECDHLVQIGTQRVSRSLVVDSQTGQSKLDDIRTSYGA 549
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKN--- 190
RG D +I +IE+RIA++T P E+GE +Q+L Y GQKY+ H+D+F D + ++
Sbjct: 550 AFGRGEDPVIAEIEERIAEWTHLPPEHGEPMQILRYVDGQKYDAHWDWFDDPVHHRSYLV 609
Query: 191 GGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSEC-GKTGLSIKPKMGDALLF 249
G R ATVL+YLS+VE GGET P A +V S C K GLSI+P+ GDALLF
Sbjct: 610 DGNRYATVLLYLSEVEAGGETNLPLADPIDMSVQAIENPSPCAAKMGLSIRPRKGDALLF 669
Query: 250 WSMKPDASL-DPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
+ M + D +LH CP +KG KW++TKWI Y
Sbjct: 670 YDMDIEGQKGDRKALHASCPTLKGMKWTATKWIHSKPY 707
>gi|255083627|ref|XP_002508388.1| predicted protein [Micromonas sp. RCC299]
gi|226523665|gb|ACO69646.1| predicted protein [Micromonas sp. RCC299]
Length = 253
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 100/218 (45%), Positives = 144/218 (66%), Gaps = 12/218 (5%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
+SW PRAF HNF+S EEC+ ++ +A P +R+STV+DS TG+SK +RTS TFL RG
Sbjct: 1 VSWYPRAFHLHNFMSHEECDRILEIARPRVRRSTVIDSVTGQSKVDPIRTSEQTFLNRGT 60
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKN-------GG 192
I+ +E+R+A T P +GE +Q+L Y GQKY+ H D + E + + GG
Sbjct: 61 WDIVTKVEERLAVVTQLPAYHGEDMQILKYGLGQKYDAHHD--VGELTSASGKQLAAEGG 118
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNE---LSECGKTGLSIKPKMGDALLF 249
R+ATVL+YLSDVEEGGET FP+++ + W E S+C + +++KP+ GD LLF
Sbjct: 119 HRVATVLLYLSDVEEGGETAFPDSEWMTPELRKWAEGQKWSDCAEGNVAVKPRKGDGLLF 178
Query: 250 WSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
WS+ + ++DP S+H GCPVI+G KW++TKWI ++
Sbjct: 179 WSVNNENAIDPHSMHAGCPVIRGEKWTATKWIHARPFR 216
>gi|412985583|emb|CCO19029.1| predicted protein [Bathycoccus prasinos]
Length = 458
Score = 204 bits (519), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 96/211 (45%), Positives = 139/211 (65%), Gaps = 5/211 (2%)
Query: 77 VEVISWE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
+++IS + PRAF+Y F++ EEC++LI+ + M KS VVD++TG + S +RTS+G+F+
Sbjct: 177 MQIISLDHPRAFLYKRFMTDEECDFLIDHSKSRMSKSGVVDAETGGTAKSDIRTSTGSFV 236
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
G + +++ +EKR+A F+ P+++ E QVL YE Q+Y H+DYF + N R+
Sbjct: 237 GIGANDLMKKLEKRVATFSMLPVKHQEATQVLRYEVKQEYRAHYDYFFHKGGMAN--NRI 294
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVP--WWNELSECGKTGLSIKPKMGDALLFWSMK 253
T+LMYL + E GGETVFPN + + W SECG G + + GDAL+FWSMK
Sbjct: 295 VTILMYLHEPEFGGETVFPNTEVPLERAEKGWGKNFSECGNRGRAAVVRKGDALIFWSMK 354
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVN 284
P LDP S H GCPV++G KW++TKWI VN
Sbjct: 355 PGGELDPGSSHAGCPVVRGEKWTATKWIHVN 385
>gi|9294584|dbj|BAB02865.1| unnamed protein product [Arabidopsis thaliana]
Length = 328
Score = 203 bits (517), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 94/174 (54%), Positives = 129/174 (74%), Gaps = 1/174 (0%)
Query: 113 TVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAG 172
V D D+G+S+DS VRTSSG FL + +D I+ ++E ++A +TF P ENGE LQ+LHYE G
Sbjct: 2 VVADVDSGESEDSEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQILHYENG 61
Query: 173 QKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSEC 232
QKY+PHFDYF D+ + GG R+ATVLMYLS+V +GGETVFPN +G + + S+C
Sbjct: 62 QKYDPHFDYFYDKKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTPQLK-DDSWSKC 120
Query: 233 GKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
K G ++KP+ GDALLF+++ + + DP+SLHG CPVI+G KWS+T+WI V +
Sbjct: 121 AKQGYAVKPRKGDALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIHVRSF 174
>gi|255071007|ref|XP_002507585.1| predicted protein [Micromonas sp. RCC299]
gi|226522860|gb|ACO68843.1| predicted protein [Micromonas sp. RCC299]
Length = 433
Score = 202 bits (514), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 102/214 (47%), Positives = 144/214 (67%), Gaps = 11/214 (5%)
Query: 77 VEVISWE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
++V+S + PRAF++ FLS+ EC+ L+ A P+M KS VVD+ G S S +RTS+G+F+
Sbjct: 158 IQVVSLDNPRAFMHIGFLSERECDLLVEYARPNMYKSGVVDASNGGSSFSNIRTSTGSFV 217
Query: 136 AR----GRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG 191
G + ++R IE+RIA +T P +GE +QVL Y+ GQ+Y+ HFDYF E KN
Sbjct: 218 PTVFPLGMNDVVRRIERRIAAWTQIPAAHGEPIQVLRYQIGQEYQSHFDYFFHEGGMKN- 276
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQG-NISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
R+ATVLMYLSDV++GGETVFP+A+ + P + C K G+++ PK GDA+LFW
Sbjct: 277 -NRIATVLMYLSDVKDGGETVFPSAESLQVKPEPIHHA---CAKNGITVIPKKGDAILFW 332
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVN 284
+MK LD S H GCPV+ G KW++TKW+ V+
Sbjct: 333 NMKVGGDLDGGSTHAGCPVVLGEKWTATKWLHVS 366
>gi|145354086|ref|XP_001421326.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144581563|gb|ABO99619.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 309
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 118/299 (39%), Positives = 166/299 (55%), Gaps = 25/299 (8%)
Query: 12 RKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSI------------- 58
R +S +L L+ + T A +P + D R +D + +
Sbjct: 3 RGGASRVAVLALIALATSAAPRRATADRARLPRDARDERLDDDDARLRAEEHVAYASDAR 62
Query: 59 VRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSD 118
R + R D A QW+E IS PRA+VY NFL++EE E I A MR+S VV+
Sbjct: 63 SRVGLRRDGAD---ARQWIERISESPRAYVYRNFLTREEAEATIAAARRTMRRSEVVNEA 119
Query: 119 TGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPH 178
G SK S RTSSG +++ +++ +IE+R+A +T P GE QV+ YEAGQ+Y H
Sbjct: 120 DGTSKTSDERTSSGGWVSGEDSEVMANIERRVAAWTMLPRNRGETTQVMRYEAGQEYAAH 179
Query: 179 FDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSE---CGKT 235
DYF DE N KNGGQR ATVLMYLSDVEEGGETVFP A P + +++ C +
Sbjct: 180 DDYFHDEVNVKNGGQRAATVLMYLSDVEEGGETVFPRGTPLGGAAPEKSGVTQGNACERA 239
Query: 236 ------GLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
L++KP+ GDALLF+++ + +D + H GCPV++G KW++T+W V +
Sbjct: 240 LRGDPNVLAVKPRRGDALLFFNVHLNGEVDERARHAGCPVVRGTKWTATRWQHVGALNI 298
>gi|159469311|ref|XP_001692811.1| predicted protein [Chlamydomonas reinhardtii]
gi|158278064|gb|EDP03830.1| predicted protein [Chlamydomonas reinhardtii]
Length = 273
Score = 198 bits (503), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 101/216 (46%), Positives = 135/216 (62%), Gaps = 7/216 (3%)
Query: 73 AEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSG 132
A + + V+ + R +++ FL+ EEC+Y+ A + +S VVD+ +G S S +RTS G
Sbjct: 32 ARKKIVVLDPDARIYLWKGFLTPEECDYIRMKAEKRLERSGVVDTGSGGSVVSDIRTSDG 91
Query: 133 TFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGG 192
F RG D II +E+R+AD+T P+ GE LQVL Y QKY+ H+DYF + + NGG
Sbjct: 92 MFFERGEDAIIEAVEQRLADWTMTPIWGGESLQVLRYRKDQKYDSHWDYFFHKDGSSNGG 151
Query: 193 QRMATVLMYLSDVEEGGETVFPN--AQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
R ATVL+YL++ EEGGETVFP A I+ SEC K L++KP GDALLF
Sbjct: 152 NRWATVLLYLTETEEGGETVFPKIPAPNGINV-----GFSECAKYNLAVKPHKGDALLFH 206
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
SMKP L+ S+HG CPVI+G K+S TKWI Y
Sbjct: 207 SMKPTGELEERSMHGACPVIRGEKFSMTKWIHAGHY 242
>gi|307110744|gb|EFN58979.1| hypothetical protein CHLNCDRAFT_137600 [Chlorella variabilis]
Length = 327
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 101/223 (45%), Positives = 142/223 (63%), Gaps = 17/223 (7%)
Query: 76 WVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
WVEV++W+PRA + H FLS EC+++I +A P + +STVV + G D +RTSSG F+
Sbjct: 41 WVEVVAWKPRALLLHGFLSHAECDHIIRVADPSLERSTVVSPEGGSMLD-EIRTSSGMFI 99
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDY-----FMDEFNTKN 190
+G D +I +E+R+A T P+ + E LQVL YE GQKY H+D + K
Sbjct: 100 LKGHDAVISGLEERVAALTHLPVSHQEDLQVLRYELGQKYSAHWDINDSPERAQQMRAKG 159
Query: 191 --GGQRMATVLMYLSDVEEGGETVFPNA----QGNISAVPWWNELSECGKTGLSIKPKMG 244
GG R AT+LMYLSDVEEGGET FP+ +G +A P+ +EC G+ +KP+ G
Sbjct: 160 VLGGLRTATLLMYLSDVEEGGETAFPHGRWLDEGVQAAPPY----TECASKGVVVKPRKG 215
Query: 245 DALLFWSMKPDAS-LDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
DA+LF+S+K + D SLH GCPV++G K+S+TKW+ V +
Sbjct: 216 DAILFFSLKLNGQKKDVYSLHAGCPVVRGVKYSATKWVHVEPF 258
>gi|384250156|gb|EIE23636.1| hypothetical protein COCSUDRAFT_53414 [Coccomyxa subellipsoidea
C-169]
Length = 285
Score = 197 bits (502), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 99/258 (38%), Positives = 157/258 (60%), Gaps = 11/258 (4%)
Query: 30 AILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVY 89
A+ +L+A G+L + + + ++ S ++ VE ISW PRAF+Y
Sbjct: 11 ALALLMAVGVLGIRQGAFERE-----DKVIGWSESYQHKTPAKSSLMVERISWNPRAFLY 65
Query: 90 HNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKR 149
LS++EC+Y+IN A P+M K+TV+D+ T K +++R + ++ D +I IE+R
Sbjct: 66 RGLLSQDECDYIINAARPNMVKATVLDAKTKKQVPNKLRNNKEAYIDGSADDVIDQIERR 125
Query: 150 IADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGG 209
IA +TF P +GE ++ Y GQ Y PH D+ D ++ + G +R+AT+++YLSDV EGG
Sbjct: 126 IARYTFLPAAHGEPFHIMQYLPGQGYAPHTDWLDDWWHPRLGNERIATMIIYLSDVVEGG 185
Query: 210 ETVFPNA--QGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGC 267
ETVFPN+ Q ++ + S+C + G+++KP GDALL +++ + D SLH GC
Sbjct: 186 ETVFPNSTMQPHVGDAAY----SKCAQQGIAVKPVKGDALLLYNLLENGRNDGESLHQGC 241
Query: 268 PVIKGNKWSSTKWIRVNE 285
PVI+G KW++TK I VN+
Sbjct: 242 PVIRGVKWTATKRILVNQ 259
>gi|302841711|ref|XP_002952400.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
nagariensis]
gi|300262336|gb|EFJ46543.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
nagariensis]
Length = 269
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 99/218 (45%), Positives = 135/218 (61%), Gaps = 7/218 (3%)
Query: 73 AEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSG 132
A + + + + R +++ FL+ EEC+Y+ A + +S VVD+ +G S S +RTS G
Sbjct: 32 ARKRIITLDADARIYLWRGFLTPEECDYIRMKAEKRLERSGVVDTASGSSVVSDIRTSDG 91
Query: 133 TFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGG 192
F RG D I+ +E+R+AD+T P+ GE LQVL Y QKY+ H +YF + + NGG
Sbjct: 92 MFFERGEDAILEAVEQRLADWTMTPIWAGEALQVLRYRKDQKYDSHVNYFFHKEGSANGG 151
Query: 193 QRMATVLMYLSDVEEGGETVFPN--AQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
R ATVL YL+D EEGGETVFP A G ++ SEC K L++KP+ GDA+LF
Sbjct: 152 NRWATVLTYLTDTEEGGETVFPKIPAPGGVNV-----GFSECAKYNLAVKPRKGDAILFH 206
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
SMK + L+ SLHG CPVIKG K+S TKWI Y +
Sbjct: 207 SMKTNGQLEERSLHGACPVIKGEKFSMTKWIHAGHYDM 244
>gi|308812133|ref|XP_003083374.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
[Ostreococcus tauri]
gi|116055254|emb|CAL57650.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
[Ostreococcus tauri]
Length = 311
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 97/228 (42%), Positives = 138/228 (60%), Gaps = 9/228 (3%)
Query: 70 EGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRT 129
+G + W+E IS PRA+V+ FL+ EC+ +I A P M S V D D+G+++ R+
Sbjct: 61 DGGSSGWIEKISDSPRAYVFREFLTDAECDRVIERAYPTMEASEVTDDDSGEARPDDARS 120
Query: 130 SSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTK 189
S G +++ D++IR+IE R + + P+ GE +QVL YE GQKY+ H D+F DE N K
Sbjct: 121 SIGGWVSGDDDEVIRNIELRASTWAMLPMNRGETMQVLRYEKGQKYDAHDDFFHDEHNVK 180
Query: 190 NGGQRMATVLMYLSDVEEGGETVFP---------NAQGNISAVPWWNELSECGKTGLSIK 240
NGGQR+AT+LMYLSDVEEGGETVFP + ++ S+ L++K
Sbjct: 181 NGGQRVATILMYLSDVEEGGETVFPLGTPLGGRDPEKSGVTGDNACELASQNDPRVLAVK 240
Query: 241 PKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
P+ GDALLF++ +D + H GCPV +G KW+ T+W RV V
Sbjct: 241 PRRGDALLFFNAHLSGEMDEKANHAGCPVNRGTKWTMTRWHRVGAIGV 288
>gi|159486447|ref|XP_001701251.1| hypothetical protein CHLREDRAFT_122372 [Chlamydomonas reinhardtii]
gi|158271833|gb|EDO97644.1| predicted protein [Chlamydomonas reinhardtii]
Length = 251
Score = 194 bits (492), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 96/212 (45%), Positives = 131/212 (61%), Gaps = 9/212 (4%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E +SW PR F+YHNFLS EC ++ A P M++S+VV ++ G S +RTS GTF+
Sbjct: 2 IETVSWNPRVFIYHNFLSDAECRHIKRTAAPMMKRSSVVGTN-GSSVLDTIRTSYGTFIR 60
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
R D ++ + +R+A +T P EN E LQVL Y GQKY H D +D+ RMA
Sbjct: 61 RRHDPVVERVLRRVAAWTKAPPENQEDLQVLRYGPGQKYGAHMDSLIDD------SPRMA 114
Query: 197 TVLMYLSDVEEGGETVFPNAQG--NISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
TVL+YL D E GGET FP++ + S SEC + ++ +PK GDAL+FWS+KP
Sbjct: 115 TVLLYLHDTEYGGETAFPDSGHWLDPSLAQSMGPFSECAQGHVAFRPKKGDALMFWSIKP 174
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
D + DP SLH GCPV+ G KW++T W+ Y
Sbjct: 175 DGTHDPLSLHTGCPVVTGVKWTATSWVHSMPY 206
>gi|159464219|ref|XP_001690339.1| hypothetical protein CHLREDRAFT_114525 [Chlamydomonas reinhardtii]
gi|158279839|gb|EDP05598.1| predicted protein [Chlamydomonas reinhardtii]
Length = 244
Score = 194 bits (492), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 98/200 (49%), Positives = 133/200 (66%), Gaps = 6/200 (3%)
Query: 85 RAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIR 144
R F+ +FL+ EE ++++ ++ + +S VV ++ G S++S++RTS G FL RG D +++
Sbjct: 1 RIFLIEHFLTDEEADHIVQVSERRLERSGVVATNGG-SEESQIRTSFGVFLERGEDPVVK 59
Query: 145 DIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSD 204
+E+RI+ T P+ NGEGLQVL Y+ QKY+ H+DYF + NGG R ATVLMYL D
Sbjct: 60 GVEERISALTLMPVGNGEGLQVLRYQKEQKYDAHWDYFFHKDGIANGGNRYATVLMYLVD 119
Query: 205 VEEGGETVFPNAQGNISAVPWWN-ELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSL 263
EEGGETVFP NI+A N SEC + L+ KPK G A+LF S+KP L+ SL
Sbjct: 120 TEEGGETVFP----NIAAPGGENVGFSECARYHLAAKPKKGTAILFHSIKPTGELERKSL 175
Query: 264 HGGCPVIKGNKWSSTKWIRV 283
H CPVIKG KWS+ KWI V
Sbjct: 176 HTACPVIKGIKWSAAKWIHV 195
>gi|302838815|ref|XP_002950965.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
nagariensis]
gi|300263660|gb|EFJ47859.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
nagariensis]
Length = 298
Score = 190 bits (482), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 97/224 (43%), Positives = 132/224 (58%), Gaps = 21/224 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E +SW PR F+YHNFL+ EC ++ A P M++S+VV + G S +RTS GTF+
Sbjct: 2 IEAVSWNPRVFIYHNFLTDGECRHIKRTAAPMMKRSSVVGQN-GSSVTDNIRTSYGTFIR 60
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQ------------VLHYEAGQKYEPHFDYFMD 184
R D +I I +R+A +T P EN E LQ VL Y GQKY H D +D
Sbjct: 61 RRHDPVIERILRRVAAWTKAPPENQEDLQAGRGEGGREKERVLRYGIGQKYGAHMDSLID 120
Query: 185 EFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISA--VPWWNELSECGKTGLSIKPK 242
+ RMATVL+YL D EEGGET FP++ ++ SEC + ++ +PK
Sbjct: 121 D------SPRMATVLLYLHDTEEGGETAFPDSSSWLTPDLATRMGPFSECAQGHVAFRPK 174
Query: 243 MGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
GDAL+FWS+KPD + DP S+H GCPV+KG KW++T W+ Y
Sbjct: 175 KGDALMFWSIKPDGTHDPLSMHTGCPVVKGVKWTATSWVHSMPY 218
>gi|55741082|gb|AAV64222.1| unknown [Zea mays]
Length = 369
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 84/150 (56%), Positives = 114/150 (76%), Gaps = 1/150 (0%)
Query: 139 RDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATV 198
+D+++ IE+RI+ +TF P ENGE +Q+LHY+ G+KYEPH+DYF D+ N GG R+ATV
Sbjct: 193 QDEVVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRIATV 252
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
LMYLS+VE+GGET+FPNA+G + P N S+C + G ++KP GDALLF+S+ PDA+
Sbjct: 253 LMYLSNVEKGGETIFPNAEGKLLQ-PKDNTWSDCARNGYAVKPVKGDALLFFSLHPDATT 311
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
D SLHG CPVI+G KWS+TKWI V + +
Sbjct: 312 DSDSLHGSCPVIEGQKWSATKWIHVRSFDL 341
>gi|55741040|gb|AAV64184.1| unknown [Zea mays]
Length = 394
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 84/150 (56%), Positives = 114/150 (76%), Gaps = 1/150 (0%)
Query: 139 RDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATV 198
+D+++ IE+RI+ +TF P ENGE +Q+LHY+ G+KYEPH+DYF D+ N GG R+ATV
Sbjct: 193 QDEVVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRIATV 252
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
LMYLS+VE+GGET+FPNA+G + P N S+C + G ++KP GDALLF+S+ PDA+
Sbjct: 253 LMYLSNVEKGGETIFPNAEGKLLQ-PKDNTWSDCARNGYAVKPVKGDALLFFSLHPDATT 311
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
D SLHG CPVI+G KWS+TKWI V + +
Sbjct: 312 DSDSLHGSCPVIEGQKWSATKWIHVRSFDL 341
>gi|307102975|gb|EFN51240.1| hypothetical protein CHLNCDRAFT_28187 [Chlorella variabilis]
Length = 322
Score = 189 bits (479), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 102/226 (45%), Positives = 137/226 (60%), Gaps = 20/226 (8%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSD-TGKSKDSRVR---TSSG 132
+E++SW+PRA + H FL+ EC+++I+LA + S VV D +GK R R +SSG
Sbjct: 15 IELVSWKPRALLLHGFLAHSECDHMISLAEARLEPSKVVSRDGSGKLDSVRTRQGLSSSG 74
Query: 133 TFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTK--- 189
TFL + +D ++ +E RI T P + E LQVL YE GQKY H+D +
Sbjct: 75 TFLTKRQDSVVAGVEDRIELATHLPFSHSEQLQVLKYELGQKYSAHYDVHGSNEQAQLAI 134
Query: 190 ----NGGQRMATVLMYLSDVEEGGETVFPNA----QGNISAVPWWNELSECGKTGLSIKP 241
GG R AT+LMYLSDVEEGGET FP+ +G + P+ SECG G+++KP
Sbjct: 135 RRGEQGGSRYATMLMYLSDVEEGGETSFPHGRWIDEGAQAQPPY----SECGSRGVAVKP 190
Query: 242 KMGDALLFWSMKPDA-SLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
+ GDA+LF+S+K D S D SLH GCPV KG K+S+T WI V Y
Sbjct: 191 RKGDAILFYSLKSDGQSKDFFSLHAGCPVAKGVKYSATAWIHVEPY 236
>gi|303287328|ref|XP_003062953.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226455589|gb|EEH52892.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 259
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 101/225 (44%), Positives = 138/225 (61%), Gaps = 17/225 (7%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE ISW PRAF HN ++ EC+ ++ LA +R+STVVDS TG+SK +RTS FL
Sbjct: 1 VEPISWHPRAFHLHNIMTDAECDEVLELARTRVRRSTVVDSTTGESKVDPIRTSEQCFLN 60
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQ-----VLHYEAGQKYEPHFDYFMDEFNTKN- 190
RG I+ IEKR+ +T P NGE LQ VL Y GQKY+ H D + E +T +
Sbjct: 61 RGHFPIVSVIEKRLERYTMLPWYNGEDLQARPSRVLKYSNGQKYDAHHD--VGELDTASG 118
Query: 191 ------GGQRMATVLMYLSDVEE--GGETVFPNAQGNISAVPWWNELSECGKTGLSIKPK 242
GG R+ATVL+YLSDV++ GGET FP+++ + SEC + +++KPK
Sbjct: 119 KQLAAEGGHRVATVLLYLSDVDDDGGGETAFPDSEWIDPTADRGSGWSECAEDHVAVKPK 178
Query: 243 MGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
GD LLFWS+ P+ +D S+H GCPV+ G W++TKWI ++
Sbjct: 179 KGDGLLFWSITPEGVIDQQSMHAGCPVL-GKSWTATKWIHARPFR 222
>gi|125546091|gb|EAY92230.1| hypothetical protein OsI_13950 [Oryza sativa Indica Group]
Length = 178
Score = 187 bits (476), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 87/140 (62%), Positives = 109/140 (77%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +SW PRAF+Y FLS +EC++L+NLA M KS V D+D+GKS S+VRTSSGTFL+
Sbjct: 37 VTQLSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQVRTSSGTFLS 96
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ D I+ IEKR+A +TF P EN E +Q+LHYE GQKY+ HFDYF D+ N K GG R+A
Sbjct: 97 KHEDDIVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKRGGHRVA 156
Query: 197 TVLMYLSDVEEGGETVFPNA 216
TVLMYL+DV++GGETVFPNA
Sbjct: 157 TVLMYLTDVKKGGETVFPNA 176
>gi|108706360|gb|ABF94155.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative [Oryza
sativa Japonica Group]
gi|125585047|gb|EAZ25711.1| hypothetical protein OsJ_09544 [Oryza sativa Japonica Group]
Length = 277
Score = 187 bits (475), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 97/213 (45%), Positives = 135/213 (63%), Gaps = 22/213 (10%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATP-HMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
+SW PRAF+Y FLS EC++LI+LA M KSTVVD ++G+S S+VRTSSG FL +
Sbjct: 45 VSWRPRAFLYEGFLSDAECDHLISLAKQGKMEKSTVVDGESGESVTSKVRTSSGMFLDKK 104
Query: 139 RDKIIRDIEKRIADFTFFPLE-----------------NGEGLQVLHYEAGQKYEPHFDY 181
+D+++ IE+RIA +T P E NGE +Q+L Y G+KYEPHFDY
Sbjct: 105 QDEVVARIEERIAAWTMLPTECIIFYCFANFAILKLSENGESMQILRYGQGEKYEPHFDY 164
Query: 182 FMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKP 241
+ G R+ATVLMYLS+V+ G+++ P A+ + W S+C + G ++KP
Sbjct: 165 ISGRQGSTREGDRVATVLMYLSNVKM-GDSLLPQARLSQPKDETW---SDCAEQGFAVKP 220
Query: 242 KMGDALLFWSMKPDASLDPSSLHGGCPVIKGNK 274
G A+LF+S+ P+A+LD SLHG CPVI+G K
Sbjct: 221 AKGSAVLFFSLHPNATLDTDSLHGSCPVIEGEK 253
>gi|168043388|ref|XP_001774167.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674574|gb|EDQ61081.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 284
Score = 187 bits (474), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 97/218 (44%), Positives = 132/218 (60%), Gaps = 7/218 (3%)
Query: 68 GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTV-VDSDTGKSKDSR 126
GD + +V+SW+PRA +Y NF SKE+CE +I LA + S + + ++
Sbjct: 66 GDSSVTDIPFQVLSWKPRALLYPNFASKEQCEAIIKLARTRLAPSGLALRKGESEATTKE 125
Query: 127 VRTSSGTFLARGRDKI--IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD 184
+RTSSGTFL DK + ++E+++A T P +NGE VL Y GQKY+ H+D F
Sbjct: 126 IRTSSGTFLRASEDKTQSLAEVEEKMARATMIPRQNGEAFNVLRYNPGQKYDCHYDVFDP 185
Query: 185 EFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMG 244
QRMA+ L+YLSDVEEGGET+FP N + +C GL +KP+ G
Sbjct: 186 AEYGPQPSQRMASFLLYLSDVEEGGETMFPFE--NFQNMNTGYNYKDC--IGLKVKPRQG 241
Query: 245 DALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
DALLF+SM P+ + D ++LHG CPVIKG KW +TKWIR
Sbjct: 242 DALLFYSMHPNGTFDKTALHGSCPVIKGEKWVATKWIR 279
>gi|168006299|ref|XP_001755847.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693166|gb|EDQ79520.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 299
Score = 186 bits (472), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 101/221 (45%), Positives = 136/221 (61%), Gaps = 9/221 (4%)
Query: 66 SEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDS 125
S GD A+ +V+SW+PRA +Y F SKE+CE ++ LA + S + G+S+DS
Sbjct: 79 STGDNFIADIPFQVLSWKPRALLYPRFASKEQCEAIMKLARTRLAPSALA-LRKGESEDS 137
Query: 126 R--VRTSSGTFLARGRD--KIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDY 181
+RTSSGTFL D + + +E+++A T P ENGE VL Y GQKY+ H+D
Sbjct: 138 TKDIRTSSGTFLRADEDTTRSLEQVEEKMAKATMIPRENGEAFNVLKYNVGQKYDCHYDV 197
Query: 182 FMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKP 241
F QRMA+ L+YLSDVEEGGET+FP N + + +C G+ +KP
Sbjct: 198 FDPAEYGPQPSQRMASFLLYLSDVEEGGETMFPFE--NFQNMNIGFDYKKC--IGMKVKP 253
Query: 242 KMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ GDALLF+SM P+ + D S+LHG CPVIKG KW +TKWIR
Sbjct: 254 RQGDALLFYSMHPNGTFDKSALHGSCPVIKGEKWVATKWIR 294
>gi|414591891|tpg|DAA42462.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
Length = 207
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 90/162 (55%), Positives = 111/162 (68%), Gaps = 1/162 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +SW PR FVY FLS EC++L+ LA +++S V D+++GKS S VRTSSG FL
Sbjct: 45 VKAVSWHPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNESGKSVKSEVRTSSGMFLD 104
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D ++ IE+RIA +TF P EN E +QVL YE GQKYEPHFDYF D N GG R A
Sbjct: 105 KRQDPVVSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFHDRVNQARGGHRYA 164
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLS 238
TVLMYLS V EGGETVFPNA+G + P SEC GL+
Sbjct: 165 TVLMYLSTVREGGETVFPNAKG-WESQPKDATFSECAHKGLA 205
>gi|255584898|ref|XP_002533164.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223527036|gb|EEF29223.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 290
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 94/211 (44%), Positives = 134/211 (63%), Gaps = 7/211 (3%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR-VRTSSGTFLA 136
+V+SW+PRA + NF + E+C+ +IN+A P++ ST+ ++++ +RTSSG FL+
Sbjct: 81 QVLSWKPRALYFPNFATAEQCQSVINMAKPNLTPSTLALRKGETEENTKGIRTSSGMFLS 140
Query: 137 RGRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQR 194
DK ++ IE++IA T P NGE +L YE GQKY H+D F QR
Sbjct: 141 ASEDKTGVLDAIEEKIARATMLPRANGEAFNILRYEIGQKYNSHYDAFNPAEYGPQKSQR 200
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+A+ L+YLSDVEEGGET+FP N V + +C GL ++P+ GD LLF+S+ P
Sbjct: 201 VASFLLYLSDVEEGGETMFPFE--NDLDVDESYDFEKC--IGLQVRPRRGDGLLFYSLFP 256
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
+ ++DP+SLHG CPVIKG KW +TKWIR E
Sbjct: 257 NNTIDPTSLHGSCPVIKGEKWVATKWIRDQE 287
>gi|363543309|ref|NP_001241870.1| prolyl 4-hydroxylase 6-3 precursor [Zea mays]
gi|347978824|gb|AEP37754.1| prolyl 4-hydroxylase 6-3 [Zea mays]
Length = 208
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 92/163 (56%), Positives = 120/163 (73%), Gaps = 3/163 (1%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +S PRAF+Y FLS EC++L++LA M KS V D+D+GKS S+ RTSSGTFLA
Sbjct: 35 VTQLSSRPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVADNDSGKSVASQARTSSGTFLA 94
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ D+I+ IEKR+A +TF P EN E LQVL YE GQKY+ HFDYF D N K GGQR+A
Sbjct: 95 KREDEIVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRVA 154
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNEL-SECGKTGLS 238
TVLMYL+DV++GGE VFP+A+G S + + +E S+C ++GL+
Sbjct: 155 TVLMYLTDVKKGGEAVFPDAEG--SHLQYKDETWSDCSRSGLA 195
>gi|363543293|ref|NP_001241862.1| prolyl 4-hydroxylase 2-1 precursor [Zea mays]
gi|347978802|gb|AEP37743.1| prolyl 4-hydroxylase 2-1 [Zea mays]
Length = 204
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 87/162 (53%), Positives = 118/162 (72%), Gaps = 1/162 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +SW PRAF++ FLS EC++LI LA + KS V D+++GKS S VRTSSG FL
Sbjct: 36 VVQLSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSEVRTSSGMFLE 95
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
R +D+++ IE+RI+ +TF P ENGE +Q+LHY+ G+KYEPH+DYF D+ N GG R+A
Sbjct: 96 RKQDEVVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRIA 155
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLS 238
TVLMYLS+VE+GGET+FPNA+G + P N S+C + G +
Sbjct: 156 TVLMYLSNVEKGGETIFPNAEGKL-LQPKDNTWSDCARNGYA 196
>gi|326518408|dbj|BAJ88233.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 276
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 109/236 (46%), Positives = 137/236 (58%), Gaps = 18/236 (7%)
Query: 23 LLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISW 82
LL + TF L ++ +L + R+ +D S ++ D EVISW
Sbjct: 10 LLPLLTFVALGMILGSLLQLAFF----RRLDDHSHTRHFDNDQGAADLRLGYVKPEVISW 65
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDK- 141
PR V+HNFLS EEC+YL +A P + STVVD TGK S VRTSSG F+ K
Sbjct: 66 TPRIIVFHNFLSSEECDYLREIARPRLEISTVVDVATGKGVKSDVRTSSGMFVNSEERKL 125
Query: 142 -IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLM 200
+I+ IEKRI+ F+ P+ENGE +QVL YE Q Y PH DYF D FN K GGQR+AT+LM
Sbjct: 126 PVIKAIEKRISVFSQIPVENGELIQVLRYEPNQYYRPHHDYFSDTFNLKRGGQRVATMLM 185
Query: 201 YLSDVEEGGETVFPNAQGNISAVPWWNELSECGK---TGLSIKPKMGDALLFWSMK 253
YL+D EGGET FP A G+ + CG GL +KP GDA+LFWSM+
Sbjct: 186 YLTDGVEGGETHFPQA-GDGECI--------CGGRLVRGLCVKPNKGDAVLFWSME 232
>gi|226494249|ref|NP_001141909.1| uncharacterized protein LOC100274058 [Zea mays]
gi|194706408|gb|ACF87288.1| unknown [Zea mays]
gi|413932757|gb|AFW67308.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
gi|413932758|gb|AFW67309.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
Length = 217
Score = 184 bits (468), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 89/149 (59%), Positives = 114/149 (76%), Gaps = 3/149 (2%)
Query: 139 RDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATV 198
+D+I+ IEKR+A +TF P EN E LQVL YE GQKY+ HFDYF D N K GGQR+ATV
Sbjct: 17 KDEIVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRVATV 76
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNEL-SECGKTGLSIKPKMGDALLFWSMKPDAS 257
LMYL+DV +GGETVFPNA+G S + + +E SEC ++GL++KPK GDALLF+++ +A+
Sbjct: 77 LMYLTDVNKGGETVFPNAEG--SHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVNAT 134
Query: 258 LDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
D SLHG CPVI+G KWS+TKWI V +
Sbjct: 135 ADTGSLHGSCPVIEGEKWSATKWIHVRSF 163
>gi|159487763|ref|XP_001701892.1| predicted protein [Chlamydomonas reinhardtii]
gi|158281111|gb|EDP06867.1| predicted protein [Chlamydomonas reinhardtii]
Length = 259
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 94/221 (42%), Positives = 137/221 (61%), Gaps = 22/221 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E ++W+PR F+YHNF+++ E ++LI LA P M++STVV + GKS + RTS GTFL
Sbjct: 1 IEHVAWKPRVFIYHNFITEVEAKHLIELAAPQMKRSTVVGAG-GKSVEDNYRTSYGTFLK 59
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
R +D+I+ IE R+A +T P+ + E Q+L Y GQ+Y+ H D DE G R+A
Sbjct: 60 RYQDEIVERIENRVAAWTQIPVAHQEDTQILRYGLGQQYKVHADTLRDE----EAGVRVA 115
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWN---------ELSECGKTGLSIKPKMGDAL 247
TVL+YL++ + GGET FP+++ W N S+C K ++ PK GDAL
Sbjct: 116 TVLIYLNEPDGGGETAFPSSE-------WVNPQLAKTLGANFSDCAKNHVAFAPKRGDAL 168
Query: 248 LFWSMKPDASL-DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
LFWS+ PD + D + H GCPV+ G KW++TKWI ++
Sbjct: 169 LFWSINPDGNTEDTHASHTGCPVLSGVKWTATKWIHARPFR 209
>gi|363543297|ref|NP_001241864.1| prolyl 4-hydroxylase 4-2 precursor [Zea mays]
gi|194704960|gb|ACF86564.1| unknown [Zea mays]
gi|347978810|gb|AEP37747.1| prolyl 4-hydroxylase 4-2 [Zea mays]
Length = 207
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 90/162 (55%), Positives = 110/162 (67%), Gaps = 1/162 (0%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +SW PR FVY FLS EC++L+ LA ++S V D+++GKS S VRTSSG FL
Sbjct: 45 VKAVSWHPRIFVYKGFLSDAECDHLVTLAKKKTQRSMVADNESGKSVKSEVRTSSGMFLD 104
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +D ++ IE+RIA +TF P EN E +QVL YE GQKYEPHFDYF D N GG R A
Sbjct: 105 KRQDPVVSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFHDRVNQARGGHRYA 164
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLS 238
TVLMYLS V EGGETVFPNA+G + P SEC GL+
Sbjct: 165 TVLMYLSTVREGGETVFPNAKG-WESQPKDATFSECAHKGLA 205
>gi|116784858|gb|ABK23496.1| unknown [Picea sitchensis]
Length = 208
Score = 184 bits (466), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 87/157 (55%), Positives = 113/157 (71%), Gaps = 4/157 (2%)
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
F+ +G+D II IE +IA +TF P ENGE +QVL YE G+KY+PHFD+F D+ N GG
Sbjct: 2 FIPKGKDAIISRIEDKIAAWTFLPKENGEDMQVLRYEPGEKYDPHFDFFQDKVNIVRGGH 61
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGN----ISAVPWWNELSECGKTGLSIKPKMGDALLF 249
R+ATVLMYL+DV +GGETVFP+A+ + IS++ + LS+C K G ++KPK GDALLF
Sbjct: 62 RVATVLMYLTDVSKGGETVFPSAEEDTHRRISSIIKDDTLSDCAKRGTAVKPKRGDALLF 121
Query: 250 WSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
+S+ A D SLH GCPVI+G KWS TKWI V +
Sbjct: 122 FSLTTQAKPDTRSLHAGCPVIEGEKWSVTKWIHVESF 158
>gi|413934216|gb|AFW68767.1| hypothetical protein ZEAMMB73_452923 [Zea mays]
Length = 210
Score = 183 bits (465), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 81/150 (54%), Positives = 113/150 (75%), Gaps = 1/150 (0%)
Query: 139 RDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATV 198
+D+++ IE+RI+ +TF P ENGE +Q+LHY+ G+KYEPH+DYF D+ N GG R+ATV
Sbjct: 9 QDEVVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATV 68
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
LMYLS+VE+GGET+FPNA+G + P + S+C + G ++KP GDALLF+S+ PD++
Sbjct: 69 LMYLSNVEKGGETIFPNAEGKL-LQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDSTT 127
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
D SLHG CP I+G KWS+TKWI V + +
Sbjct: 128 DSDSLHGSCPAIEGQKWSATKWIHVRSFDL 157
>gi|255083957|ref|XP_002508553.1| predicted protein [Micromonas sp. RCC299]
gi|226523830|gb|ACO69811.1| predicted protein [Micromonas sp. RCC299]
Length = 262
Score = 183 bits (464), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 100/220 (45%), Positives = 139/220 (63%), Gaps = 17/220 (7%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTGKSKDSRVRTSSGTF 134
VE +S EP+AF+YH FLS EEC++LI + TPH+++STVV DTG D VRTS GTF
Sbjct: 1 VEKLSDEPKAFLYHGFLSAEECDHLIKIGTPHLKRSTVVGGKDDTGVLDD--VRTSFGTF 58
Query: 135 LARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQR 194
L + D ++ IE+R+ DF+ EN E LQ+L Y GQ+Y+ H D + NGG+R
Sbjct: 59 LPKKYDDVLYGIERRVEDFSQISYENQEQLQLLKYHDGQEYKDH----QDGLTSPNGGRR 114
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVP-----WWNELSECG---KTGLSIKPKMGDA 246
+ATVLM+L + E+GGET FP + + AV +ELS+C GL++KP+ GDA
Sbjct: 115 IATVLMFLHEPEKGGETSFPQGK-PLPAVAQRLRGMRDELSDCAWRDGRGLAVKPRRGDA 173
Query: 247 LLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
+LF+S K + D +S H CP + G KW++TKWI +
Sbjct: 174 VLFFSFKKNGGSDIASTHASCPTVGGVKWTATKWIHEKRF 213
>gi|413934217|gb|AFW68768.1| hypothetical protein ZEAMMB73_452923 [Zea mays]
Length = 204
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 81/150 (54%), Positives = 112/150 (74%), Gaps = 1/150 (0%)
Query: 139 RDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATV 198
D+++ IE+RI+ +TF P ENGE +Q+LHY+ G+KYEPH+DYF D+ N GG R+ATV
Sbjct: 3 NDEVVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATV 62
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
LMYLS+VE+GGET+FPNA+G + P + S+C + G ++KP GDALLF+S+ PD++
Sbjct: 63 LMYLSNVEKGGETIFPNAEGKL-LQPKDDTWSDCARNGYAVKPVKGDALLFFSLHPDSTT 121
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
D SLHG CP I+G KWS+TKWI V + +
Sbjct: 122 DSDSLHGSCPAIEGQKWSATKWIHVRSFDL 151
>gi|307109700|gb|EFN57937.1| hypothetical protein CHLNCDRAFT_142031 [Chlorella variabilis]
Length = 325
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 98/221 (44%), Positives = 135/221 (61%), Gaps = 9/221 (4%)
Query: 69 DEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVR 128
D A W E +SW PRAFV HNF SKEE +++I LA P +R+STVV S G+S R
Sbjct: 24 DTAAAHPWFEPVSWYPRAFVAHNFASKEETDHMIKLAQPQLRRSTVVGS-RGESVVDNYR 82
Query: 129 TSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT 188
TS G F+ R D+++ +EKR+A +T + + + E +QVL Y Q+Y+ HFD D+
Sbjct: 83 TSYGMFIRRHHDEVVSTLEKRVATWTKYNVTHQEDIQVLRYGTTQEYKAHFDSLDDD--- 139
Query: 189 KNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVP-WWNELSECGKTGLSIKPKMGDAL 247
R ATVL+YLSDVE GGET FPN++ A+P SEC + +++KPK GDA+
Sbjct: 140 ---SPRTATVLIYLSDVESGGETTFPNSEWIDPALPKALGPFSECAQGHVAMKPKRGDAI 196
Query: 248 LFWSMKPDA-SLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+F S+ PD S D +LH CPVI G K+ + WI ++
Sbjct: 197 VFHSLNPDGRSHDQHALHTACPVIVGVKYVAIFWIHTKPFR 237
>gi|302844247|ref|XP_002953664.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
nagariensis]
gi|300261073|gb|EFJ45288.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
nagariensis]
Length = 364
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 103/233 (44%), Positives = 144/233 (61%), Gaps = 13/233 (5%)
Query: 56 SSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV 115
S++V + + DE A WVE + PRA+++HNFL+K E +++ LA P +++STVV
Sbjct: 28 SALVMHTEADKQFDE-EATPWVEQVGLHPRAYLFHNFLTKAERAHMVRLAAPKLKRSTVV 86
Query: 116 DSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKY 175
S G+ +RTS G F+ R D II IEKRI+ +T P+E+ E +QVL Y GQ Y
Sbjct: 87 GS-KGEGVVDNIRTSFGMFIRRLSDPIIARIEKRISLWTHLPIEHQEDIQVLRYAHGQTY 145
Query: 176 EPHFDYFM--DEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNE----L 229
H+D D K R+AT LMYLSDVEEGGET FP Q ++ P E +
Sbjct: 146 GAHYDSGASSDHVGPK---WRLATFLMYLSDVEEGGETAFP--QNSVWYDPTIPERIGPV 200
Query: 230 SECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
SEC K ++ KPK GDA+LF+S P+ ++DP+++H GCPVIKG KW++ W+
Sbjct: 201 SECAKGHVAAKPKAGDAVLFYSFLPNNTMDPAAMHTGCPVIKGIKWAAPVWMH 253
>gi|302845026|ref|XP_002954052.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
nagariensis]
gi|300260551|gb|EFJ44769.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
nagariensis]
Length = 311
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 98/214 (45%), Positives = 132/214 (61%), Gaps = 10/214 (4%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V VISW+PRAFV NFL++ EC ++ +LA HMR+STVV +D G S RTS GTF+
Sbjct: 1 VSVISWQPRAFVIRNFLTEHECTHIADLAQVHMRRSTVV-ADNGSSVLDDYRTSYGTFIN 59
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
R + +I +E R+A T P+ E +QVL Y GQ Y H D + +N RMA
Sbjct: 60 RYQTPVIAAVEDRVALLTRTPVVYQEDMQVLRYGLGQYYHRHTD------SLENDSPRMA 113
Query: 197 TVLMYLSDVEEGGETVFPNAQ--GNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
TVL+YLS+ E GGET FP A + + + S+C K ++ KP+ GDALLFWS+KP
Sbjct: 114 TVLLYLSEPELGGETAFPQAASWAHPAMAQLFGPFSDCVKGNVAFKPRRGDALLFWSVKP 173
Query: 255 DA-SLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D + DP S H GCPVI+G KW++T W+ ++
Sbjct: 174 DGRTEDPYSEHEGCPVIRGVKWTATVWVHTQPFR 207
>gi|159489450|ref|XP_001702710.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280732|gb|EDP06489.1| predicted protein [Chlamydomonas reinhardtii]
Length = 252
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 97/212 (45%), Positives = 128/212 (60%), Gaps = 8/212 (3%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V VISWEPRAFV NFL+ +E ++ ++A HMR+STVV +D G S RTS GTF+
Sbjct: 1 VSVISWEPRAFVIRNFLTDQEATHIADVAQVHMRRSTVV-ADNGSSVLDDYRTSYGTFIN 59
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
R ++ +E R+A T P+ E +QVL Y GQ Y H D + +N R+A
Sbjct: 60 RYATPVVARVEDRVAVLTRVPVHYQEDMQVLRYGNGQYYHRHTD------SLENDSPRLA 113
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVL+YLSD E GGET FP A + + SEC K ++ KP+ GDALLFWS+KPD
Sbjct: 114 TVLLYLSDPELGGETAFPLAWAHPDMPKVFGPFSECVKNNVAFKPRKGDALLFWSVKPDG 173
Query: 257 SL-DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
DP S H GCPVI+G KW++T W+ ++
Sbjct: 174 KTEDPLSEHEGCPVIRGVKWTATVWVHTKPFR 205
>gi|412988743|emb|CCO15334.1| predicted protein [Bathycoccus prasinos]
Length = 352
Score = 181 bits (458), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 92/213 (43%), Positives = 134/213 (62%), Gaps = 13/213 (6%)
Query: 76 WVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
W+E +SW+PRAF+YHNFLSKEE ++L++L P + +STVV TG+ D +RTS GTF+
Sbjct: 67 WIEALSWDPRAFLYHNFLSKEEAKHLVDLGEPRVTRSTVVGGQTGRVSD--IRTSFGTFI 124
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
+ D+++ IE R A F+ P+ + E +Q+L Y GQKY H D + E NGG+R+
Sbjct: 125 PKKYDEVLEKIEDRCAVFSGIPVAHQEQMQLLRYRDGQKYSDHTDGLISE----NGGKRI 180
Query: 196 ATVLMYLSDVEEGGET--VFPNAQGNISA--VPWWNELSECG---KTGLSIKPKMGDALL 248
AT+LM+L + EGGET V N G + ++ S+CG G ++KPK+GDA+L
Sbjct: 181 ATILMFLHEPTEGGETSFVLGNPLGKVKERIERTKDQFSDCGYRSGKGFAVKPKVGDAIL 240
Query: 249 FWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
F+S D +S+H CP + G KW++T WI
Sbjct: 241 FFSFSEAGITDNNSMHASCPTLGGTKWTATMWI 273
>gi|159487419|ref|XP_001701720.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280939|gb|EDP06695.1| predicted protein [Chlamydomonas reinhardtii]
Length = 274
Score = 181 bits (458), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 93/209 (44%), Positives = 137/209 (65%), Gaps = 5/209 (2%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ + PRA+ +HNFL+K E +L+ LA P +++STVV +D G+ +RTS G F+
Sbjct: 1 VQQVGLHPRAYYFHNFLTKAERGHLVKLAAPKLKRSTVVGND-GEGVVDNIRTSYGMFIR 59
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
R +D ++ IEKRI+ +T P+E+ E +QVL Y GQ Y H+D D+ N R+A
Sbjct: 60 RLQDPVVARIEKRISLWTHLPVEHQEDIQVLRYAHGQTYGAHYDS-GDKSNEPGPKWRLA 118
Query: 197 TVLMYLSDVEEGGETVFP-NAQGNISAVP--WWNELSECGKTGLSIKPKMGDALLFWSMK 253
T LMYLSDVEEGGET FP N+ ++P ++ S+C K ++ KPK GDA+LF+S
Sbjct: 119 TFLMYLSDVEEGGETAFPHNSVWADPSIPEKVGDKFSDCAKGNVAAKPKAGDAVLFYSFY 178
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
P+ ++DP+++H GCPVIKG KW++ W+
Sbjct: 179 PNMTMDPAAMHTGCPVIKGVKWAAPVWMH 207
>gi|449448264|ref|XP_004141886.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
sativus]
Length = 294
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 96/220 (43%), Positives = 134/220 (60%), Gaps = 11/220 (5%)
Query: 68 GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTGKSKDS 125
GD+ + +V+SW PRA + F + E+C+ ++NLA P +R ST+ +T +S
Sbjct: 72 GDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLALRKGETAESTKG 131
Query: 126 RVRTSSGTFLARGRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM 183
VRTSSG F + D+ + IE++IA T P +GE +L YE GQKY H+D F
Sbjct: 132 -VRTSSGVFFSASEDESGTLGVIEEKIARATMIPRTHGEAYNILRYEIGQKYNSHYDAFK 190
Query: 184 DEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQG-NISAVPWWNELSECGKTGLSIKPK 242
QR+A+ L+YL+DVEEGGET+FP G N+ +N C GL +KP+
Sbjct: 191 PSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGT--YN-FQTC--IGLKVKPR 245
Query: 243 MGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
GD LLF+S+ P+ ++DP+SLHG CPVIKG KW +TKWIR
Sbjct: 246 QGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIR 285
>gi|40809925|dbj|BAD07294.1| prolyl 4-hydroxylase [Nicotiana tabacum]
Length = 286
Score = 180 bits (456), Expect = 8e-43, Method: Compositional matrix adjust.
Identities = 96/210 (45%), Positives = 136/210 (64%), Gaps = 11/210 (5%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR--VRTSSGTFL 135
+V+SW PRA + NF S E+C+ +I +A +M S++ TG+++++ +RTSSGTF+
Sbjct: 77 QVLSWFPRALYFPNFASIEQCQSIIKMAKANMEPSSLA-LRTGETEETTKGIRTSSGTFI 135
Query: 136 ARGRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
+ DK I+ IE++IA T P +GE VL YE GQ+Y+ H+D F Q
Sbjct: 136 SASEDKTGILDLIEEKIAKATMIPKTHGEAFNVLRYEIGQRYQSHYDAFDPAQYGPQKSQ 195
Query: 194 RMATVLMYLSDVEEGGETVFPNAQG-NISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R A+ L+YLSDVEEGGETVFP G N+ A + S+C GL +KP+ GD LLF+S+
Sbjct: 196 RAASFLLYLSDVEEGGETVFPYENGQNMDAS---YDFSKC--IGLKVKPRRGDGLLFYSL 250
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
P+ ++D +SLHG CPVI+G KW +TKWIR
Sbjct: 251 FPNGTIDLTSLHGSCPVIRGEKWVATKWIR 280
>gi|388505024|gb|AFK40578.1| unknown [Medicago truncatula]
Length = 297
Score = 179 bits (455), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 108/293 (36%), Positives = 162/293 (55%), Gaps = 23/293 (7%)
Query: 8 RFPTRKSSSSTL---ILTLLIMFTFAILILLA--FG-ILSMPSSSGDSRKANDLSSIVRK 61
R T KSS+ +L LT +F I LA FG L S GD ++
Sbjct: 2 RIKTVKSSNWSLRTNKLTFPYVFLICIFFFLAGFFGSTLFSHSQDGDGYGLRPRPRLLDS 61
Query: 62 SMERSE--------GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKST 113
+ E GD+ +V+SW+PRA + NF + E+CE ++++A ++ S+
Sbjct: 62 TKETEYNLMTAGEFGDDSITSIPFQVLSWKPRALYFPNFATAEQCENIVSVAKAGLKPSS 121
Query: 114 VVDSDTGKSKDSR-VRTSSGTFLARGRDKI--IRDIEKRIADFTFFPLENGEGLQVLHYE 170
+ +++++ +RTSSG FL+ RDK + IE++IA T P +GE +L YE
Sbjct: 122 LALRKGETTENTKGIRTSSGVFLSASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYE 181
Query: 171 AGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQG-NISAVPWWNEL 229
GQ+Y H+D F + QR+A+ L+YL+DVEEGGET+FP G N+ + +
Sbjct: 182 VGQRYYSHYDAFNPDEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGTYGYED- 240
Query: 230 SECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ GL +KP+ GD LLF+S+ P+ ++D +SLHG CPVIKG KW +TKWIR
Sbjct: 241 ----RVGLRVKPRQGDGLLFYSLLPNGTIDQTSLHGSCPVIKGEKWVATKWIR 289
>gi|357476355|ref|XP_003608463.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355509518|gb|AES90660.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 297
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 110/294 (37%), Positives = 164/294 (55%), Gaps = 25/294 (8%)
Query: 8 RFPTRKSSSSTL---ILTLLIMFTFAILILLA--FG-ILSMPSSSGDSRKANDLSSIVRK 61
R T KSS+ +L LT +F I LA FG L S GD ++
Sbjct: 2 RIKTVKSSNWSLRTNKLTFPYVFLICIFFFLAGFFGSTLFSHSQDGDGYGLRPRPRLLDS 61
Query: 62 SMERSE--------GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKST 113
+ E GD+ +V+SW+PRA + NF + E+CE ++++A ++ S+
Sbjct: 62 TKETEYNLMTAGEFGDDSITSIPFQVLSWKPRALYFPNFATAEQCENIVSVAKAGLKPSS 121
Query: 114 VVDSDTGKSKDSR-VRTSSGTFLARGRDKI--IRDIEKRIADFTFFPLENGEGLQVLHYE 170
+ +++++ +RTSSG FL+ RDK + IE++IA T P +GE +L YE
Sbjct: 122 LALRKGETTENTKGIRTSSGVFLSASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYE 181
Query: 171 AGQKYEPHFDYFM-DEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQG-NISAVPWWNE 228
GQ+Y H+D F DE+ + QR+A+ L+YL+DVEEGGET+FP G N+ + +
Sbjct: 182 VGQRYNSHYDAFNPDEYGPQKS-QRVASFLLYLTDVEEGGETMFPFENGLNMDGTYGYED 240
Query: 229 LSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
GL +KP+ GD LLF+S+ P+ ++D +SLHG CPVIKG KW +TKWIR
Sbjct: 241 C-----VGLRVKPRQGDGLLFYSLLPNGTIDQTSLHGSCPVIKGEKWVATKWIR 289
>gi|449511009|ref|XP_004163837.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-1-like [Cucumis sativus]
Length = 294
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 95/220 (43%), Positives = 133/220 (60%), Gaps = 11/220 (5%)
Query: 68 GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTGKSKDS 125
GD+ + +V+SW PRA + F + E+C+ ++NLA P +R ST+ +T +S
Sbjct: 72 GDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLALRKGETAESTKG 131
Query: 126 RVRTSSGTFLARGRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM 183
VRTSSG F + D+ + IE++ A T P +GE +L YE GQKY H+D F
Sbjct: 132 -VRTSSGVFFSASEDESGTLGVIEEKXARATMIPRTHGEAYNILRYEIGQKYNSHYDAFK 190
Query: 184 DEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQG-NISAVPWWNELSECGKTGLSIKPK 242
QR+A+ L+YL+DVEEGGET+FP G N+ +N C GL +KP+
Sbjct: 191 PSEYGPQKSQRVASFLLYLTDVEEGGETMFPFENGLNMDGT--YN-FQTC--IGLKVKPR 245
Query: 243 MGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
GD LLF+S+ P+ ++DP+SLHG CPVIKG KW +TKWIR
Sbjct: 246 QGDGLLFYSVFPNGTIDPTSLHGSCPVIKGQKWVATKWIR 285
>gi|224103711|ref|XP_002313164.1| predicted protein [Populus trichocarpa]
gi|222849572|gb|EEE87119.1| predicted protein [Populus trichocarpa]
Length = 294
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 92/210 (43%), Positives = 137/210 (65%), Gaps = 11/210 (5%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTGKS-KDSRVRTSSGTF 134
+V+SW+PRA + F + E+CE +I + ++ ST+ +T +S KD+R TSSG+F
Sbjct: 83 QVLSWKPRALYFPKFATPEQCESIIKMVESKLKPSTLALRKGETAESTKDTR--TSSGSF 140
Query: 135 LARGRDKI--IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGG 192
++ D+ + IEK+IA T P +GE +L YE GQKY+ H+D F + +
Sbjct: 141 VSGSEDETGTLDFIEKKIAKATMIPQSHGEAFNILRYEIGQKYDSHYDAFNPDEYGQQSS 200
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
QR A+ L+YLS+VEEGGET+FP G+ + +P ++ +C GL +KP+ GD LLF+S+
Sbjct: 201 QRTASFLLYLSNVEEGGETMFPFENGS-AVIPGFD-YKQC--VGLKVKPRQGDGLLFYSL 256
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
P+ ++DP+SLHG CPVIKG KW +TKWIR
Sbjct: 257 FPNGTIDPTSLHGSCPVIKGVKWVATKWIR 286
>gi|302802700|ref|XP_002983104.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
gi|300149257|gb|EFJ15913.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
Length = 292
Score = 177 bits (450), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 106/279 (37%), Positives = 155/279 (55%), Gaps = 13/279 (4%)
Query: 16 SSTLILTLLIMFTFAILILLAFGIL------SMPSSSGDSRKANDLSSIVRKSMERSEGD 69
S+ L+L L ++ + F + S+PS S + + ++L V + GD
Sbjct: 15 SAPLVLVLCVLAFLVGYFIPEFQQVILVTKHSIPSFSSFANQRHELLEDVTVAEHGVTGD 74
Query: 70 EGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR-VR 128
+ + +V+SW PRA ++ F S +CE +I+LA + S++ + +++ VR
Sbjct: 75 DQLSFIPFQVLSWTPRALLFPKFASPAQCEAIISLAKTKLTPSSLALRKGETATETQDVR 134
Query: 129 TSSGTFLARGRDKI--IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEF 186
TS G FL+ +DK + +E+++A T P +GE VL YE GQKY H+D F
Sbjct: 135 TSHGCFLSSRQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEIGQKYNSHYDVFNPAE 194
Query: 187 NTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDA 246
QRMA+ L+YLSDVEEGGET+FP N + + EC GL +KPK GDA
Sbjct: 195 YGPQKSQRMASFLLYLSDVEEGGETMFPFE--NYEHMNENYDYKEC--IGLKVKPKQGDA 250
Query: 247 LLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
LLF+SM P+ + D ++LHG CPVIKG KW +TKWIR E
Sbjct: 251 LLFYSMFPNGTFDKTALHGSCPVIKGEKWVATKWIRDKE 289
>gi|24417248|gb|AAN60234.1| unknown [Arabidopsis thaliana]
Length = 190
Score = 177 bits (449), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 89/187 (47%), Positives = 122/187 (65%), Gaps = 1/187 (0%)
Query: 25 IMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEP 84
I F++ L ++S + +R +N V K M+ S G V +SW P
Sbjct: 5 IFLAFSLCFLFTLPLISSAPNRFLTRSSNTRDGSVIK-MKTSASSFGFDPTRVTQLSWTP 63
Query: 85 RAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIR 144
R F+Y FLS EEC++ I LA + KS V D+D+G+S +S VRTSSG FL++ +D I+
Sbjct: 64 RVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLSKRQDDIVS 123
Query: 145 DIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSD 204
++E ++A +TF P ENGE +Q+LHYE GQKYEPHFDYF D+ N + GG R+ATVLMYLS+
Sbjct: 124 NVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIATVLMYLSN 183
Query: 205 VEEGGET 211
VE+GGET
Sbjct: 184 VEKGGET 190
>gi|307110383|gb|EFN58619.1| hypothetical protein CHLNCDRAFT_19485 [Chlorella variabilis]
Length = 328
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 91/213 (42%), Positives = 133/213 (62%), Gaps = 27/213 (12%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE +SW+PRAFV+HNF+++EE ++++ LA P M++STVV + G S + ++RTS GTFL
Sbjct: 32 VEPVSWKPRAFVFHNFMTEEEADHIVALAKPFMKRSTVVGAG-GASVEDQIRTSYGTFLK 90
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
R +D I+ +E+R+A +T + + E +Q+L Y GQKY H+D + N R+
Sbjct: 91 RLQDPIVTAVEQRLATWTKLNVSHQEDMQILRYGIGQKYGAHYD------SLDNDSPRVC 144
Query: 197 TVLMYLSDV--EEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
TVL+YLSDV + GGET FP + ++ PK GDALLF+S+KP
Sbjct: 145 TVLLYLSDVPADGGGETAFPGVRRQ------------------ALYPKKGDALLFYSLKP 186
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D + D SLH GCP+I G KW++TKWI ++
Sbjct: 187 DGTSDAYSLHTGCPIISGVKWTATKWIHTLPFR 219
>gi|295699617|ref|YP_003607510.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
gi|295438830|gb|ADG17999.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
Length = 286
Score = 176 bits (446), Expect = 9e-42, Method: Compositional matrix adjust.
Identities = 90/208 (43%), Positives = 123/208 (59%), Gaps = 24/208 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P+ V+ + LS EC LI + +++ST V+ TG+ R RTS G + RG D++I
Sbjct: 97 PQLVVFADVLSAAECAELIERSRHRLKRSTTVNPLTGREDVIRNRTSEGVWYRRGEDQLI 156
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE-----FNTKNGGQRMATV 198
+E+RIA T +PLENGEGLQVLHY +Y PHFD+F + +T GGQR+AT+
Sbjct: 157 ARVERRIASLTNWPLENGEGLQVLHYGTSGEYSPHFDFFAPDQPGSAVHTTQGGQRVATL 216
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+DV +GGETVFP A GLS+ + G A+ F M + L
Sbjct: 217 IIYLNDVADGGETVFPTA-------------------GLSVAAQAGGAVYFRYMNAERQL 257
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
DPS+LHGG PV+ G+KW TKW+R Y
Sbjct: 258 DPSTLHGGAPVLAGDKWIMTKWMRERAY 285
>gi|302844249|ref|XP_002953665.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300261074|gb|EFJ45289.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 245
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 98/247 (39%), Positives = 146/247 (59%), Gaps = 9/247 (3%)
Query: 26 MFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPR 85
+ F +L+ L +L++ ++ L S + E + + A WVE + PR
Sbjct: 4 LLAFTVLLFLR-AVLALSENTWGGLPERLLPSALVMHHEADKQFDEEATPWVEQVGLHPR 62
Query: 86 AFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRD 145
A+++HNFL+K E +++ LA P +++STVV +D G+ +RTS G F+ R D +I
Sbjct: 63 AYLFHNFLTKAERAHMVRLAAPKLKRSTVVGND-GEGVVDEIRTSYGMFIRRLADPVITR 121
Query: 146 IEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDV 205
IEKRI+ +T P+E+ E +QVL Y GQ Y H+D D+ N R+AT LMYLSDV
Sbjct: 122 IEKRISLWTHLPIEHQEDIQVLRYAHGQTYGAHYDS-GDKSNEPGPKWRLATFLMYLSDV 180
Query: 206 EEGGETVFPNAQGNISAVPWWNE----LSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
EEGGET FP Q ++ P E +SEC K ++ KPK GDA+LF+S P+ ++DP+
Sbjct: 181 EEGGETAFP--QNSVWYDPTIPERIGPVSECAKGHVAAKPKAGDAVLFYSFYPNLTMDPA 238
Query: 262 SLHGGCP 268
++H GCP
Sbjct: 239 AMHTGCP 245
>gi|224071291|ref|XP_002303388.1| predicted protein [Populus trichocarpa]
gi|222840820|gb|EEE78367.1| predicted protein [Populus trichocarpa]
Length = 297
Score = 176 bits (445), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 105/284 (36%), Positives = 157/284 (55%), Gaps = 32/284 (11%)
Query: 18 TLILTLLIMFTFAILILLAFGILSMPSSSGDSR----KANDLSSIVRKSMERSE------ 67
LIL+ F I L A +L +S D R +A L S+ + + + +
Sbjct: 22 ALILSCSFFF---IAGLFASNLLLSQGTSSDERWLRARARQLQSVEEEIISKYDLLPSGE 78
Query: 68 -GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR 126
GD+ +V+SW PRA Y F++ E+C+++IN+A P ++ ST+ ++ ++
Sbjct: 79 SGDDFITLIPFQVLSWRPRALYYPGFITAEQCQHIINMAKPSLQPSTLALRKGETAETTK 138
Query: 127 -VRTSSGTFLARGRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM 183
+RTSSG F+ D+ +++ IE++IA T P +GE VL YE GQKY+ H+D F
Sbjct: 139 GIRTSSGMFVFSSEDQAGVLQVIEEKIARATMIPSTHGEAFNVLRYEIGQKYDAHYDAFN 198
Query: 184 DEFNTKNGGQRMATVLMYLSDVEEGGETVFP-----NAQGNISAVPWWNELSECGKTGLS 238
QR+AT L+YLS+ EEGGET FP N +G + +C GL
Sbjct: 199 PAEYGPQTSQRVATFLLYLSNFEEGGETTFPIENDENFEG--------YDAQKC--NGLR 248
Query: 239 IKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+KP GDA+LF+S+ P+ ++DP+SLH C VIKG KW +TKWIR
Sbjct: 249 VKPHQGDAILFYSIFPNNTIDPASLHASCHVIKGEKWVATKWIR 292
>gi|225438938|ref|XP_002279411.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296087348|emb|CBI33722.3| unnamed protein product [Vitis vinifera]
Length = 285
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 94/213 (44%), Positives = 127/213 (59%), Gaps = 11/213 (5%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTV---VDSDTGKSKDSRVRTSSGTF 134
+V+SW PRA + NF + E+C+ +IN+A ++ STV V G ++ +RTSSG F
Sbjct: 76 QVLSWRPRALYFPNFATSEQCQSIINMAKSNLTPSTVALRVGEIRGNTEG--IRTSSGVF 133
Query: 135 LARGRDKI--IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGG 192
++ DK + IE++IA P +GE VL YE GQ+Y H+D F
Sbjct: 134 ISASEDKTGTLDLIEQKIARVIMIPRTHGEAFNVLRYEIGQRYNSHYDAFDPAEYGPQKS 193
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT L+YLSDVEEGGET+FP G + + C GL +KP GD LLF+SM
Sbjct: 194 HRIATFLVYLSDVEEGGETMFPFENGLNMDKDY--DFQRC--IGLKVKPHQGDGLLFYSM 249
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
P+ ++DP+SLHG CPVIKG KW +TKWIR E
Sbjct: 250 FPNGTIDPTSLHGSCPVIKGEKWVATKWIRDQE 282
>gi|326492085|dbj|BAJ98267.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 347
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 88/209 (42%), Positives = 131/209 (62%), Gaps = 9/209 (4%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR--VRTSSGTFL 135
+++SW+PRA + F + E+CE ++ A +R ST+ G+S+++ +RTSSGTFL
Sbjct: 140 QILSWQPRALYFPQFATAEQCENVVKTAKARLRPSTLA-LRKGESEETTKGIRTSSGTFL 198
Query: 136 ARGRDKI--IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
+ D + +IE +IA T P +GE VL YE GQKY H+D F Q
Sbjct: 199 SAEEDPTGALAEIETKIAKATMMPRSHGEPFNVLRYEIGQKYASHYDAFDPAQYGPQKSQ 258
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R+A+ L+YL+DVEEGGET+FP G+ + + + +C GL +KP+ GD LLF+S+
Sbjct: 259 RVASFLLYLTDVEEGGETMFPYENGDNMNIGY--DYEQC--IGLKVKPRKGDGLLFYSLM 314
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ ++DP+SLHG CPV++G KW +TKWIR
Sbjct: 315 VNGTIDPTSLHGSCPVVRGEKWVATKWIR 343
>gi|255647903|gb|ACU24410.1| unknown [Glycine max]
Length = 293
Score = 174 bits (441), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 93/223 (41%), Positives = 134/223 (60%), Gaps = 11/223 (4%)
Query: 68 GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR- 126
GD+ +V+SW PRA + NF + E+CE +I++A ++ ST+ G+++D+
Sbjct: 74 GDDSITSIPFQVLSWRPRAVYFPNFATAEQCESIIDVAKDGLKPSTLA-LRQGETEDNTK 132
Query: 127 -VRTSSGTFLARGRDKI--IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM 183
+RTSSG F++ DK + IE++IA T P +GE +L YE Q+Y H+D F
Sbjct: 133 GIRTSSGVFVSASEDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQRYNSHYDAFN 192
Query: 184 DEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQG-NISAVPWWNELSECGKTGLSIKPK 242
QRMA+ L+YL+DVEEGGET+FP G N+ + G GL +KP+
Sbjct: 193 PAEYGPQKSQRMASFLLYLTDVEEGGETMFPFENGLNMDGNYGYE-----GCIGLKVKPR 247
Query: 243 MGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
GD LLF+S+ + ++DP+SLHG CPVIKG KW +TKWIR E
Sbjct: 248 QGDGLLFYSLLTNGTIDPTSLHGSCPVIKGEKWVATKWIRDQE 290
>gi|356563543|ref|XP_003550021.1| PREDICTED: putative prolyl 4-hydroxylase-like [Glycine max]
Length = 293
Score = 174 bits (441), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 94/227 (41%), Positives = 134/227 (59%), Gaps = 19/227 (8%)
Query: 68 GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR- 126
GD+ +V+SW PRA + NF + E+CE +I++A ++ ST+ G+++D+
Sbjct: 74 GDDSITSIPFQVLSWRPRAVYFPNFATAEQCESIIDVAKDGLKPSTLA-LRQGETEDNTK 132
Query: 127 -VRTSSGTFLARGRDKI--IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM 183
+RTSSG F++ DK + IE++IA T P +GE +L YE Q+Y H+D F
Sbjct: 133 GIRTSSGVFVSASEDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQRYNSHYDAFN 192
Query: 184 DEFNTKNGGQRMATVLMYLSDVEEGGETVFP-----NAQGNISAVPWWNELSECGKTGLS 238
QRMA+ L+YL+DVEEGGET+FP N GN +C GL
Sbjct: 193 PAEYGPQKSQRMASFLLYLTDVEEGGETMFPFENGLNMDGNYG-------YEDC--IGLK 243
Query: 239 IKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
+KP+ GD LLF+S+ + ++DP+SLHG CPVIKG KW +TKWIR E
Sbjct: 244 VKPRQGDGLLFYSLLTNGTIDPTSLHGSCPVIKGEKWVATKWIRDQE 290
>gi|363807682|ref|NP_001242420.1| uncharacterized protein LOC100775302 [Glycine max]
gi|255641811|gb|ACU21174.1| unknown [Glycine max]
Length = 293
Score = 174 bits (440), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 93/229 (40%), Positives = 133/229 (58%), Gaps = 17/229 (7%)
Query: 65 RSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD 124
R GD+ +V+SW PRA + NF + E+CE +I++A ++ ST+ ++
Sbjct: 71 RDLGDDSITSIPFQVLSWRPRALYFPNFATAEQCENIIDVAKDGLKPSTLALRQGETEEN 130
Query: 125 SR-VRTSSGTFLARGRDKI--IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDY 181
++ +RTSSG F++ DK + IE++IA T P +GE +L YE Q+Y H+D
Sbjct: 131 TKGIRTSSGVFVSASGDKTGTLAVIEEKIARATMIPRSHGEAFNILRYEVDQRYNSHYDA 190
Query: 182 FMDEFNTKNGGQRMATVLMYLSDVEEGGETVFP-----NAQGNISAVPWWNELSECGKTG 236
F QRMA+ L+YL+DVEEGGET+FP N GN +C G
Sbjct: 191 FNPAEYGPQKSQRMASFLLYLTDVEEGGETMFPFENGLNMDGNYG-------YEDC--IG 241
Query: 237 LSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
L +KP+ GD LLF+S+ + ++DP+SLHG CPVIKG KW +TKWIR E
Sbjct: 242 LKVKPRQGDGLLFYSLLTNGTIDPTSLHGSCPVIKGEKWVATKWIRDQE 290
>gi|225428938|ref|XP_002262952.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296083079|emb|CBI22483.3| unnamed protein product [Vitis vinifera]
Length = 284
Score = 174 bits (440), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 105/270 (38%), Positives = 151/270 (55%), Gaps = 13/270 (4%)
Query: 20 ILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKS-MERSEGDEGRAEQW-V 77
+L L I ++F L L +L +G + L S+ S M E E +
Sbjct: 15 LLLLFISWSFFFLAGLFGSMLFSQDVNGVRSQPRLLESVEEYSPMPHGETGESSVDMIPF 74
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTGKSKDSRVRTSSGTFL 135
+V+SW+PRA + F + E+C+ +I +A H+R ST+ +T +S RTSSGTF+
Sbjct: 75 QVLSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLALRQGETDESTKG-TRTSSGTFI 133
Query: 136 ARGRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
+ DK I+ +E++IA T P +GE +L YE GQ+Y H+D F Q
Sbjct: 134 SASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQRYNSHYDAFNPAEYGPQTSQ 193
Query: 194 RMATVLMYLSDVEEGGETVFPNAQG-NISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+A+ L+YLSDVEEGGET+FP NI + +C GL +KP+ GD LLF+S+
Sbjct: 194 RVASFLLYLSDVEEGGETMFPFEHDLNIGT---GYDYKKC--IGLKVKPQRGDGLLFYSV 248
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
P+ ++D +SLHG CPVI G KW +TKWIR
Sbjct: 249 FPNGTIDRTSLHGSCPVIAGEKWVATKWIR 278
>gi|357114580|ref|XP_003559078.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
distachyon]
Length = 295
Score = 174 bits (440), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 128/208 (61%), Gaps = 7/208 (3%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR-VRTSSGTFLA 136
+++SW+PRA + F + E+CE ++ A +R ST+ + ++ +RTSSGTFL+
Sbjct: 88 QILSWQPRALYFPQFATSEQCENVVKTAKARLRPSTLALRKGETEETTKGIRTSSGTFLS 147
Query: 137 RGRD--KIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQR 194
D + + ++EK+IA T P +GE VL YE GQKY H+D F QR
Sbjct: 148 ADEDPTRTLAEVEKKIAKATMIPRSHGEPFNVLRYEIGQKYASHYDAFDPAQYGPQKSQR 207
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+A+ L+YL+DVEEGGET+FP G + + + +C GL +KP+ GD LLF+S+
Sbjct: 208 VASFLLYLTDVEEGGETMFPYENGENMDIGY--DYEQC--IGLKVKPRKGDGLLFYSLMV 263
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ ++D +SLHG CPVIKG KW +TKWIR
Sbjct: 264 NGTIDLTSLHGSCPVIKGEKWVATKWIR 291
>gi|308799555|ref|XP_003074558.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
gi|116000729|emb|CAL50409.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
Length = 274
Score = 173 bits (439), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 87/215 (40%), Positives = 133/215 (61%), Gaps = 6/215 (2%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE +SW PRAF N L + E ++ LA + +STV+DS++GKS + +RTS TFL+
Sbjct: 9 VEPLSWYPRAFALRNALDETEMRAILALARTRVARSTVIDSESGKSVVNPIRTSKQTFLS 68
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT-----KNG 191
R D ++R + +R++ T P + E LQVL Y AG+KY+ H D + + KNG
Sbjct: 69 RN-DPVVRKVLERMSSVTHLPWYHCEDLQVLEYSAGEKYDAHEDVGEEGTKSGDQLSKNG 127
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
G+R+AT+L+YL + EEGGET FP+++ S+C +++KP GD L+FWS
Sbjct: 128 GKRVATILLYLEEPEEGGETAFPDSEWIDPERAKTETWSKCAHRRVAMKPTRGDGLMFWS 187
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
++PD ++D +LH GCP +G KW++T W+ + Y
Sbjct: 188 VRPDGTIDHRALHVGCPPTRGTKWTATIWVHADPY 222
>gi|307725787|ref|YP_003909000.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
gi|307586312|gb|ADN59709.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
Length = 313
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 87/208 (41%), Positives = 121/208 (58%), Gaps = 24/208 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P+ V+ N LS +EC +I + +++ST+VD TG+ R RTS G + RG D +I
Sbjct: 124 PQVIVFGNVLSPDECAEMIERSRHRLKRSTIVDPATGREDVIRNRTSEGIWYQRGEDALI 183
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE-----FNTKNGGQRMATV 198
+++RIA +PLENGEGLQ+LHY +Y PHFDYF + +T GGQR+AT+
Sbjct: 184 ERLDQRIASLMNWPLENGEGLQILHYGPSGEYRPHFDYFPPDQPGSAVHTARGGQRVATL 243
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+DV +GGET+FP A GLS+ + G A+ F M L
Sbjct: 244 VVYLNDVPDGGETIFPEA-------------------GLSVAAQQGGAVYFRYMNGRRQL 284
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
DP +LHGG PV+ G+KW TKW+R Y
Sbjct: 285 DPLTLHGGAPVLSGDKWIMTKWVRERPY 312
>gi|340787855|ref|YP_004753320.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
[Collimonas fungivorans Ter331]
gi|340553122|gb|AEK62497.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit
[Collimonas fungivorans Ter331]
Length = 289
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 90/210 (42%), Positives = 122/210 (58%), Gaps = 24/210 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PRA ++ N LS +EC+ LI L+ + +S VVD TG +K RTSSGTF RG
Sbjct: 99 KPRAILFGNVLSHDECDQLIALSKTKLLRSGVVDHQTGNTKLHEHRTSSGTFFHRGTTPF 158
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEF-----NTKNGGQRMAT 197
I I+KR+A P +GEGLQ+L+Y+ G +Y PH+DYF + + GGQR AT
Sbjct: 159 IAMIDKRLAALMQVPESHGEGLQILNYQMGGEYRPHYDYFRPDAPGSAKHLARGGQRTAT 218
Query: 198 VLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS 257
+++YL+DV+ GGET+FP + GLSI P G A+ F +
Sbjct: 219 LIIYLNDVDGGGETIFP-------------------RNGLSIVPAKGSAIYFSYTNAENQ 259
Query: 258 LDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
LD S HGG PVI+G KW +TKW+R NEY+
Sbjct: 260 LDSLSFHGGSPVIEGEKWIATKWVRQNEYR 289
>gi|356574299|ref|XP_003555286.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 290
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 92/210 (43%), Positives = 134/210 (63%), Gaps = 12/210 (5%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTGKS-KDSRVRTSSGTF 134
+++SW PRA + NF S E C+ +I +A P + S + +T +S KD+R TSSGTF
Sbjct: 76 QILSWRPRAVYFPNFTSVEVCQQIIEMAKPKLEPSKLALRKGETAESTKDTR--TSSGTF 133
Query: 135 LARGRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-DEFNTKNG 191
++ DK I+ +E++IA T P +GE +L YE QKY+ H+D F DE+ T
Sbjct: 134 ISASEDKSGILDFVERKIAKVTMIPRTHGEKFNILKYEVAQKYDSHYDAFNPDEYGTVES 193
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
QR+A+ L+YLS+VE GGET+FP +G ++ + + +C GL +KP+ GD LLF+S
Sbjct: 194 -QRIASFLLYLSNVEAGGETMFP-YEGGLNIDKGYYDYKKC--IGLKVKPRQGDGLLFYS 249
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ P+ +D +SLHG CPVIKG KW +TKWI
Sbjct: 250 LLPNGKIDKTSLHGSCPVIKGEKWVATKWI 279
>gi|302842389|ref|XP_002952738.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300262082|gb|EFJ46291.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 281
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 94/197 (47%), Positives = 125/197 (63%), Gaps = 8/197 (4%)
Query: 91 NFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRI 150
+ + EE ++++ ++ + +S VV D S+ S +RTS G FL RG D+I++ +E+RI
Sbjct: 3 HLIFAEEADHIVKVSERRLERSGVVGGDG-GSETSNIRTSYGVFLDRGEDEIVKRVEERI 61
Query: 151 ADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGE 210
A +T P+ NGEGLQVL Y+ QKY+ H+DYF + NGG R ATVLMYL D EEGGE
Sbjct: 62 AAWTLMPVGNGEGLQVLRYQKEQKYDAHWDYFFHKDGITNGGNRYATVLMYLVDTEEGGE 121
Query: 211 TVFPNAQGNISAVPWWNEL--SECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCP 268
TVFPN A P + SEC + L+ KPK G A+LF S+KP L+ SLH CP
Sbjct: 122 TVFPNV-----AAPGGENVGFSECARYHLAAKPKKGTAILFHSIKPTGELERKSLHTACP 176
Query: 269 VIKGNKWSSTKWIRVNE 285
VI+G KWS+ KWI E
Sbjct: 177 VIRGIKWSAAKWIHHAE 193
>gi|159487421|ref|XP_001701721.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280940|gb|EDP06696.1| predicted protein [Chlamydomonas reinhardtii]
Length = 336
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 94/218 (43%), Positives = 135/218 (61%), Gaps = 9/218 (4%)
Query: 70 EGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRT 129
E A WV+ + PRA+ +HNFL+K E +L+ +A P +++STVV D +RT
Sbjct: 12 EEDATPWVQQVGLHPRAYYFHNFLTKAERAHLVRVAAPKLKRSTVVGGKGEGVVDD-IRT 70
Query: 130 SSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM--DEFN 187
S G F+ R D ++ IEKRI+ +T P+E+ E +Q+L Y GQ Y H+D D
Sbjct: 71 SYGMFIRRLSDPVVTRIEKRISLWTHLPVEHQEDIQILRYAHGQTYGAHYDSGASSDHVG 130
Query: 188 TKNGGQRMATVLMYLSDVEEGGETVFP-NAQGNISAVPWW--NELSECGKTGLSIKPKMG 244
K R+AT LMYLSDVEEGGET FP N+ ++P ++ S+C K ++ KPK G
Sbjct: 131 PK---WRLATFLMYLSDVEEGGETAFPHNSVWADPSIPEQVGDKFSDCAKGHVAAKPKAG 187
Query: 245 DALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
DA+LF+S P+ ++DP+S+H GCPVIKG KW++ W+
Sbjct: 188 DAVLFYSFYPNNTMDPASMHTGCPVIKGVKWAAPVWMH 225
>gi|388523073|gb|AFK49598.1| unknown [Lotus japonicus]
Length = 318
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 91/210 (43%), Positives = 129/210 (61%), Gaps = 11/210 (5%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR--VRTSSGTFL 135
+V+SW P A + NF + E+CE +I A ++ ST+V G++ +S +RTSSG F+
Sbjct: 93 QVLSWNPHALYFPNFATAEQCESIIETAKEGLKPSTLV-LRVGETDESTTGIRTSSGVFI 151
Query: 136 ARGRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
+ DK ++ IE++IA T P +GE VL Y+ GQKY H+D + Q
Sbjct: 152 SAFEDKTGVLDVIEEKIARATKIPRTHGEAFNVLRYKVGQKYSSHYDALHPDIYGPQKSQ 211
Query: 194 RMATVLMYLSDVEEGGETVFPNAQG-NISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
RMA+ L+YLSDV EGGET+FP G N+ ++ + GL +KP+ GD LLF+S+
Sbjct: 212 RMASFLLYLSDVPEGGETMFPFENGLNMDGSYYYEKC-----IGLKVKPRKGDGLLFYSL 266
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
P+ ++DP SLHG CPVIKG KW +TKWIR
Sbjct: 267 FPNGTIDPMSLHGSCPVIKGEKWVATKWIR 296
>gi|302764866|ref|XP_002965854.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
gi|300166668|gb|EFJ33274.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
Length = 231
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 93/211 (44%), Positives = 128/211 (60%), Gaps = 7/211 (3%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR-VRTSSGTFLA 136
+V+SW PRA ++ F S +CE +I+LA + S++ + +++ VRTS G FL+
Sbjct: 22 QVLSWTPRALLFPKFASPAQCEAIISLAKTKLTPSSLALRKGETATETQDVRTSHGCFLS 81
Query: 137 RGRDKI--IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQR 194
+DK + +E+++A T P +GE VL YE GQKY H+D F QR
Sbjct: 82 SRQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEIGQKYNSHYDVFNPAEYGPQKSQR 141
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
MA+ L+YLSDVEEGGET+FP N + + EC GL +KPK GDALLF+SM P
Sbjct: 142 MASFLLYLSDVEEGGETMFPFE--NYEHMNENYDYKEC--IGLKVKPKQGDALLFYSMFP 197
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
+ + D ++LHG CPVIKG KW +TKWIR E
Sbjct: 198 NGTFDKTALHGSCPVIKGEKWVATKWIRDKE 228
>gi|299532490|ref|ZP_07045880.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
gi|298719437|gb|EFI60404.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
Length = 299
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 88/209 (42%), Positives = 125/209 (59%), Gaps = 26/209 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR V+ N LS EEC+ +I A P MR+S VD+ +G + RTS+G F RG +++I
Sbjct: 112 PRVVVFGNLLSDEECDAIIAAARPRMRRSLTVDNQSGGEAVNDDRTSNGMFFQRGENELI 171
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-DEFNT----KNGGQRMATV 198
+E+RIA +PLENGEG+QVLHY G +Y+PH+DYF +E T K GGQR+ T+
Sbjct: 172 SLVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTL 231
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL++ GG T FP+ GL + P+ G+A+ F +PD +
Sbjct: 232 VMYLNEPARGGATTFPD-------------------VGLQVVPRRGNAVFFSYNRPDPAT 272
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV++G KW +TKW+R E+K
Sbjct: 273 --KTLHGGAPVLEGEKWIATKWLREREFK 299
>gi|384250599|gb|EIE24078.1| hypothetical protein COCSUDRAFT_47131 [Coccomyxa subellipsoidea
C-169]
Length = 327
Score = 172 bits (436), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 91/221 (41%), Positives = 131/221 (59%), Gaps = 8/221 (3%)
Query: 68 GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDT-GKSKDSR 126
G++ + Q ++ISW PR +Y F+ E C++ + +A + S + T G +
Sbjct: 100 GNDFYSVQPQQLISWYPRIILYPGFIDPERCKHFVKVAKARLAPSGLALRTTEGPQETEN 159
Query: 127 VRTSSGTFLARGRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD 184
VRTS GTF++R D +I +E++ A T P+ +GE VL Y+ GQ Y+ H+D F
Sbjct: 160 VRTSQGTFMSRKDDPAGVIAWVEEKAAQVTGLPVSHGEPFNVLRYQDGQHYDSHYDIFEP 219
Query: 185 EFNTKNGGQRMATVLMYLSDVEEGGETVFP---NAQGNISAVPWWNELSECGKTGLSIKP 241
E QRMAT+L YL+DVEEGGET+FP ++ + +N S C TG KP
Sbjct: 220 ESYGPQPSQRMATILFYLTDVEEGGETIFPLEGRYGPDLLKMTGFNYKS-C-TTGFKYKP 277
Query: 242 KMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+MGDAL+F+SM P+ + D +LHGGCPV+ G KW +TKWIR
Sbjct: 278 RMGDALMFYSMHPNGTFDKHALHGGCPVMAGEKWVATKWIR 318
>gi|264677094|ref|YP_003277000.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
gi|262207606|gb|ACY31704.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
Length = 306
Score = 172 bits (436), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 89/209 (42%), Positives = 124/209 (59%), Gaps = 26/209 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR V+ N LS EEC+ +I A P MR+S VD+ +G + RTS+G F RG + +I
Sbjct: 119 PRVVVFGNLLSDEECDAIIAAARPRMRRSLTVDNQSGGEAVNDDRTSNGMFFQRGENDLI 178
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-DEFNT----KNGGQRMATV 198
+E+RIA +PLENGEG+QVLHY G +Y+PH+DYF +E T K GGQR+ T+
Sbjct: 179 SLVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTL 238
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL++ GG T FP+ GL I P+ G+A+ F +PD +
Sbjct: 239 VMYLNEPARGGATTFPD-------------------VGLQIVPRRGNAVFFSYNRPDPAT 279
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV++G KW +TKW+R E+K
Sbjct: 280 --KTLHGGAPVLEGEKWIATKWLREREFK 306
>gi|145341735|ref|XP_001415959.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144576182|gb|ABO94251.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 254
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 84/216 (38%), Positives = 134/216 (62%), Gaps = 6/216 (2%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE +SW PRAF + L++ +CE ++ +R+STVVDS TG+SK +RTS TFL
Sbjct: 3 VEPLSWYPRAFALRDALTEAQCEAVLRATRARVRRSTVVDSVTGESKVDPIRTSKQTFLN 62
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT-----KNG 191
R ++++R+I ++ T P + E +QVL Y G+KY+ H D ++ + K+G
Sbjct: 63 RD-EEVVREIYDALSAVTMLPWTHNEDMQVLEYRVGEKYDAHEDVGAEDSLSGRELSKDG 121
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
G+R+ATVL+YL + E GGET FP+++ + S+C + +++KP+ GD L+FWS
Sbjct: 122 GKRVATVLLYLEEPEAGGETAFPDSEWIDPKMAEGTSWSKCAEHRVAMKPRRGDGLIFWS 181
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+ P+ +D +LH GCPV+ G KW++T W+ Y+
Sbjct: 182 VDPNGKIDHRALHVGCPVVAGVKWTATVWVHAEPYR 217
>gi|147823227|emb|CAN70872.1| hypothetical protein VITISV_009065 [Vitis vinifera]
Length = 276
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 91/210 (43%), Positives = 129/210 (61%), Gaps = 11/210 (5%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTGKSKDSRVRTSSGTFL 135
+V+SW+PRA + F + E+C+ +I +A H+R ST+ +T +S RTSSGTF+
Sbjct: 67 QVLSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLALRQGETDESTKG-TRTSSGTFI 125
Query: 136 ARGRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
+ DK I+ +E++IA T P +GE +L YE GQ+Y H+D F Q
Sbjct: 126 SASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQRYNSHYDAFNPAEYGPQTSQ 185
Query: 194 RMATVLMYLSDVEEGGETVFPNAQG-NISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+A+ L+YLSDVEEGGET+FP NI + +C GL +KP+ GD LLF+S+
Sbjct: 186 RVASFLLYLSDVEEGGETMFPFEHDLNIGT---GYDYKKC--IGLKVKPQRGDGLLFYSV 240
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
P+ ++D +SLHG CPVI G KW +TKWIR
Sbjct: 241 FPNGTIDRTSLHGSCPVIAGEKWVATKWIR 270
>gi|299115886|emb|CBN75895.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
[Ectocarpus siliculosus]
Length = 404
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 94/211 (44%), Positives = 129/211 (61%), Gaps = 13/211 (6%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTV--VDSDTGKSKDSRVRTSSGTF 134
++ +S EP F NFL EEC+++ A PHM+ S V +D D GK D+ RTS+ F
Sbjct: 193 MKTLSMEPLVFEARNFLLDEECKHIREKADPHMKPSPVSLMDHDKGKP-DTNWRTSTTYF 251
Query: 135 LARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKN--GG 192
+ RD +++ I++R+ +FT P + E +QVL Y+ GQ+Y H D F+DE +N GG
Sbjct: 252 MPSTRDPLLQGIDRRVEEFTRVPKSHQEQVQVLKYDKGQRYTAHHD-FLDERTMRNMDGG 310
Query: 193 Q--RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
+ RM TV YLSDVEEGGET+FP G V + S+C TGL +KP G +F+
Sbjct: 311 RKNRMITVFWYLSDVEEGGETIFPRYGGRTGRV----DFSDC-TTGLKVKPVEGKVAMFY 365
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
S+KPD D SLHG CPVI G KW++ KW+
Sbjct: 366 SLKPDGQFDDFSLHGACPVITGQKWAANKWV 396
>gi|115455509|ref|NP_001051355.1| Os03g0761900 [Oryza sativa Japonica Group]
gi|14488368|gb|AAK63935.1|AC084282_16 putative dioxygenase [Oryza sativa Japonica Group]
gi|17027263|gb|AAL34117.1|AC090713_4 putative hydroxylase subunit [Oryza sativa Japonica Group]
gi|108711218|gb|ABF99013.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
Group]
gi|113549826|dbj|BAF13269.1| Os03g0761900 [Oryza sativa Japonica Group]
gi|125545807|gb|EAY91946.1| hypothetical protein OsI_13633 [Oryza sativa Indica Group]
Length = 310
Score = 172 bits (435), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 86/209 (41%), Positives = 130/209 (62%), Gaps = 9/209 (4%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR--VRTSSGTFL 135
+++SW+PRA + F + ++CE ++ A + ST+ G++++S +RTSSGTFL
Sbjct: 101 QILSWQPRALYFPQFATSQQCENIVKTAKQRLMPSTLA-LRKGETEESTKGIRTSSGTFL 159
Query: 136 ARGRDKI--IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
+ D + ++EK+IA T P +GE +L YE GQ+Y H+D F Q
Sbjct: 160 SSDEDPTGTLAEVEKKIAKATMIPRHHGEPFNILRYEIGQRYASHYDAFDPAQYGPQKSQ 219
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R+A+ L+YL+DVEEGGET+FP G + + + +C GL +KP+ GD LLF+S+
Sbjct: 220 RVASFLLYLTDVEEGGETMFPYENGENMDIGY--DYEKC--IGLKVKPRKGDGLLFYSLM 275
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ ++DP+SLHG CPVIKG KW +TKWIR
Sbjct: 276 VNGTIDPTSLHGSCPVIKGEKWVATKWIR 304
>gi|125588006|gb|EAZ28670.1| hypothetical protein OsJ_12681 [Oryza sativa Japonica Group]
Length = 280
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 86/209 (41%), Positives = 130/209 (62%), Gaps = 9/209 (4%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR--VRTSSGTFL 135
+++SW+PRA + F + ++CE ++ A + ST+ G++++S +RTSSGTFL
Sbjct: 71 QILSWQPRALYFPQFATSQQCENIVKTAKQRLMPSTLA-LRKGETEESTKGIRTSSGTFL 129
Query: 136 ARGRDKI--IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
+ D + ++EK+IA T P +GE +L YE GQ+Y H+D F Q
Sbjct: 130 SSDEDPTGTLAEVEKKIAKATMIPRHHGEPFNILRYEIGQRYASHYDAFDPAQYGPQKSQ 189
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R+A+ L+YL+DVEEGGET+FP G + + + +C GL +KP+ GD LLF+S+
Sbjct: 190 RVASFLLYLTDVEEGGETMFPYENGENMDIGY--DYEKC--IGLKVKPRKGDGLLFYSLM 245
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ ++DP+SLHG CPVIKG KW +TKWIR
Sbjct: 246 VNGTIDPTSLHGSCPVIKGEKWVATKWIR 274
>gi|356536125|ref|XP_003536590.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 286
Score = 171 bits (434), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 111/286 (38%), Positives = 160/286 (55%), Gaps = 28/286 (9%)
Query: 13 KSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERS-----E 67
K SS L L + +F L G P D S I+++S+++
Sbjct: 5 KVKSSKLKLGVPTLFILCALFFFV-GFFVSPLLFQDLDDVGPRSRILQESVKKEYEPLEH 63
Query: 68 GDEGRAEQWV-----EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTG 120
G+ G E +V +++SW PRA + NF S E C+ +I +A P + S + +T
Sbjct: 64 GESG--EPFVDSIPSQILSWRPRAVFFPNFTSVEVCQQIIEMAKPKLEPSKLALRKGETA 121
Query: 121 KS-KDSRVRTSSGTFLARGRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEP 177
+S KD+R TSSGTF++ DK I+ +E++IA T P +GE +L YE GQKY+
Sbjct: 122 ESTKDTR--TSSGTFISASEDKSGILDLVERKIAKVTMIPRTHGEIFNILKYEVGQKYDS 179
Query: 178 HFDYFM-DEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQG-NISAVPWWNELSECGKT 235
H+D F DE+ + QR+A+ L+YLS+VE GGET+FP G NI + +C
Sbjct: 180 HYDAFNPDEYGSVE-SQRIASFLLYLSNVEAGGETMFPYEGGLNIDR---GYDYQKC--I 233
Query: 236 GLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
GL +KP+ GD LLF+S+ P+ +D +SLHG CPVIKG KW +TKWI
Sbjct: 234 GLKVKPRQGDGLLFYSLLPNGKIDKTSLHGSCPVIKGEKWVATKWI 279
>gi|297798522|ref|XP_002867145.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297312981|gb|EFH43404.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 288
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 87/218 (39%), Positives = 131/218 (60%), Gaps = 7/218 (3%)
Query: 68 GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR- 126
G+E +V+SW PRA + NF + E+C+ +I A +++ S + +++++
Sbjct: 69 GEESVGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKG 128
Query: 127 VRTSSGTFLARGRDKI--IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD 184
RTSSGTF++ D + +E++IA T P +GE +L YE GQKY+ H+D F
Sbjct: 129 TRTSSGTFISASEDSTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNP 188
Query: 185 EFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMG 244
QR+A+ L+YLSDVEEGGET+FP G+ + + +C GL +KP+ G
Sbjct: 189 TEYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGTGY--DYKQC--IGLKVKPRKG 244
Query: 245 DALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D LLF+S+ P+ ++D +SLHG CPV KG KW +TKWIR
Sbjct: 245 DGLLFYSVFPNGTIDQTSLHGSCPVTKGEKWVATKWIR 282
>gi|170690448|ref|ZP_02881615.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
gi|170144883|gb|EDT13044.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
Length = 307
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 88/208 (42%), Positives = 120/208 (57%), Gaps = 24/208 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P+ V+ N LS EEC+ +I + +++ST+VD TG+ R RTS G + RG D I
Sbjct: 118 PQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEDVIRNRTSEGIWYQRGEDAFI 177
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE-----FNTKNGGQRMATV 198
+++RIA +P+ENGEGLQ+LHY +Y PHFDYF + +T GGQR+AT+
Sbjct: 178 ERLDQRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSMVHTARGGQRVATL 237
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+DV +GGET+FP A GLS+ K G A+ F M L
Sbjct: 238 VIYLNDVPDGGETIFPEA-------------------GLSVAAKQGGAVYFRYMNGQRQL 278
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
DP +LHGG PV G+KW TKW+R Y
Sbjct: 279 DPLTLHGGAPVRAGDKWIMTKWMRERAY 306
>gi|407708877|ref|YP_006792741.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
gi|407237560|gb|AFT87758.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
Length = 300
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 88/208 (42%), Positives = 120/208 (57%), Gaps = 24/208 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P+ V+ N LS EEC+ +I + +++ST+VD TG+ R RTS G + RG D I
Sbjct: 111 PQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEGVIRNRTSEGIWYQRGEDAFI 170
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE-----FNTKNGGQRMATV 198
+++RIA +P+ENGEGLQ+LHY +Y PHFDYF + +T GGQR+AT+
Sbjct: 171 ERLDRRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSAVHTARGGQRVATL 230
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+DV +GGET+FP A GLS+ K G A+ F M L
Sbjct: 231 VVYLNDVADGGETIFPAA-------------------GLSVAAKQGGAVYFRYMNGQRQL 271
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
DP +LHGG PV G+KW TKW+R Y
Sbjct: 272 DPLTLHGGAPVRAGDKWIMTKWMRERAY 299
>gi|356496957|ref|XP_003517331.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 299
Score = 171 bits (432), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 89/209 (42%), Positives = 132/209 (63%), Gaps = 9/209 (4%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR--VRTSSGTFL 135
+V+SW PRA + NF+S E+CE +I +A ++ ST+V G++++S +RTS G F+
Sbjct: 90 QVLSWYPRALYFPNFVSAEQCETIIEMARGGLKPSTLV-LRKGETEESTKGIRTSYGVFM 148
Query: 136 ARGRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
+ D+ I+ IE++IA T P +GE +L YE GQKY PH+D F + Q
Sbjct: 149 SASEDETGILDSIEEKIAKATKIPRTHGEAFNILRYEVGQKYSPHYDAFDEAEFGPLQSQ 208
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R A+ L+YL+DV EGGET+FP G + + +C GL ++P+ GD LLF+S+
Sbjct: 209 RAASFLLYLTDVPEGGETLFPYENGFNRDGSY--DFEDC--IGLRVRPRKGDGLLFYSLL 264
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
P+ ++D +S+HG CPVIKG KW +TKWIR
Sbjct: 265 PNGTIDQTSVHGSCPVIKGEKWVATKWIR 293
>gi|413945803|gb|AFW78452.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
Length = 239
Score = 171 bits (432), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 89/174 (51%), Positives = 114/174 (65%), Gaps = 9/174 (5%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
IS +PR F+Y +FLS +E +LI+LA +++S V D+ +GKS S VRTSSGTFL +G+
Sbjct: 54 ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSEVRTSSGTFLRKGQ 113
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
D I+ IE +IA +TF P ENGE +QVL Y+ G+KYEPH+DYF D NT GG R ATVL
Sbjct: 114 DPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVRGGHRYATVL 173
Query: 200 MYLSDVEEGGETVFPNAQGN---ISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
+YL+DV EGGETVFP A+ N S + E+ E G I LFW
Sbjct: 174 LYLTDVPEGGETVFPLAEVNFFIFSVTFVFKEMVESGSEVFLI------FFLFW 221
>gi|385137888|gb|AFI41205.1| oxygenase protein, partial [Arabidopsis thaliana]
Length = 288
Score = 171 bits (432), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 86/218 (39%), Positives = 132/218 (60%), Gaps = 7/218 (3%)
Query: 68 GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR- 126
G+E +V+SW PRA + NF + E+C+ +I A +++ S + +++++
Sbjct: 69 GEESIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKG 128
Query: 127 VRTSSGTFLARGRDKI--IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD 184
RTSSGTF++ + + +E++IA T P +GE +L YE GQKY+ H+D F
Sbjct: 129 TRTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNP 188
Query: 185 EFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMG 244
QR+A+ L+YLSDVEEGGET+FP G+ + + + +C GL +KP+ G
Sbjct: 189 TEYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGY--DYKQC--IGLKVKPRKG 244
Query: 245 DALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D LLF+S+ P+ ++D +SLHG CPV KG KW +TKWIR
Sbjct: 245 DGLLFYSVFPNGTIDQTSLHGSCPVTKGEKWVATKWIR 282
>gi|241767624|ref|ZP_04765273.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
gi|241361463|gb|EER57922.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
Length = 318
Score = 171 bits (432), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 89/209 (42%), Positives = 121/209 (57%), Gaps = 26/209 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR V+ N LS EECE LI A P M +S V + TG + + RTS G F RG ++
Sbjct: 131 PRVVVFGNLLSPEECEALIAAAAPRMARSLTVATQTGGEEVNDDRTSHGMFFQRGESPLV 190
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMATV 198
+ IE+RIA +P+ENGEGLQVLHY G +Y+PH+DYF E T + GGQR+ T+
Sbjct: 191 QRIEERIASLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTVIQRGGQRVGTL 250
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL+ E+GG T FP+AQ + + P+ G+A F +P S
Sbjct: 251 VMYLNTPEQGGGTTFPDAQ-------------------IEVAPQRGNAAFFSYERPTPST 291
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV+ G+KW +TKW+R E+K
Sbjct: 292 --RTLHGGAPVLAGDKWIATKWLREREFK 318
>gi|18418321|ref|NP_567941.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
gi|17381226|gb|AAL36425.1| unknown protein [Arabidopsis thaliana]
gi|20465827|gb|AAM20018.1| unknown protein [Arabidopsis thaliana]
gi|21592377|gb|AAM64328.1| putative dioxygenase [Arabidopsis thaliana]
gi|332660892|gb|AEE86292.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
thaliana]
Length = 288
Score = 171 bits (432), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 86/218 (39%), Positives = 132/218 (60%), Gaps = 7/218 (3%)
Query: 68 GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR- 126
G+E +V+SW PRA + NF + E+C+ +I A +++ S + +++++
Sbjct: 69 GEESIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKG 128
Query: 127 VRTSSGTFLARGRDKI--IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD 184
RTSSGTF++ + + +E++IA T P +GE +L YE GQKY+ H+D F
Sbjct: 129 TRTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNP 188
Query: 185 EFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMG 244
QR+A+ L+YLSDVEEGGET+FP G+ + + + +C GL +KP+ G
Sbjct: 189 TEYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGY--DYKQC--IGLKVKPRKG 244
Query: 245 DALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D LLF+S+ P+ ++D +SLHG CPV KG KW +TKWIR
Sbjct: 245 DGLLFYSVFPNGTIDQTSLHGSCPVTKGEKWVATKWIR 282
>gi|356541677|ref|XP_003539300.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
Length = 297
Score = 171 bits (432), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 91/214 (42%), Positives = 132/214 (61%), Gaps = 21/214 (9%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR--VRTSSGTFL 135
+V+SW PRA + NF S E+CE +I +A ++ ST+ G++++S +RTSSG F+
Sbjct: 90 QVLSWYPRALYFPNFASAEQCESIIEMARGGLKSSTLA-LRKGETEESTKGIRTSSGVFM 148
Query: 136 ARGRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
+ D+ I+ IE++IA T P +GE +L YE GQKY H+D F + Q
Sbjct: 149 SASEDETGILDAIEEKIAKATKIPRTHGEAFNILRYEVGQKYNSHYDAFDEAEYGPLQSQ 208
Query: 194 RMATVLMYLSDVEEGGETVFP-----NAQGNISAVPWWNELSECGKTGLSIKPKMGDALL 248
R+A+ L+YL+DV EGGET+FP N GN+ +C GL ++P+ GDALL
Sbjct: 209 RVASFLLYLTDVPEGGETMFPYENGFNRDGNVE---------DC--IGLRVRPRKGDALL 257
Query: 249 FWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
F+S+ P+ ++D +S HG CPVIKG KW +TKWIR
Sbjct: 258 FYSLLPNGTIDQTSAHGSCPVIKGEKWVATKWIR 291
>gi|302845120|ref|XP_002954099.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
nagariensis]
gi|300260598|gb|EFJ44816.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
nagariensis]
Length = 231
Score = 171 bits (432), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 92/211 (43%), Positives = 122/211 (57%), Gaps = 10/211 (4%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD--SRVRTSSGTFL 135
+++SW PR V+ F+ K EY+I LA+ M S + G++ D + RTS+GTFL
Sbjct: 18 QILSWYPRVVVFPGFIDKARAEYVIKLASKFMYPSGLA-YRPGETVDPSQQTRTSTGTFL 76
Query: 136 ARGRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
A D ++ +E+RIA T P ENGE VLHYE Q Y+ H+D F + Q
Sbjct: 77 AAAMDPEGVLGWVEQRIAAATLLPAENGEAFNVLHYEKEQHYDSHYDTFDPKEFGPQPSQ 136
Query: 194 RMATVLMYLSDVEEGGETVFP--NAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
R+ATVL+YLS+V EGGETVF G + W C P+MGDA+LFW
Sbjct: 137 RIATVLLYLSEVLEGGETVFKREGVDGENRVIGDWR---NCDDGSFKYMPRMGDAVLFWG 193
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
KP+ +DP +LHGGCPV +G KW +TKWIR
Sbjct: 194 TKPNGDIDPHALHGGCPVKRGEKWVATKWIR 224
>gi|323528042|ref|YP_004230194.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
gi|323385044|gb|ADX57134.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
Length = 300
Score = 171 bits (432), Expect = 5e-40, Method: Compositional matrix adjust.
Identities = 88/208 (42%), Positives = 120/208 (57%), Gaps = 24/208 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P+ V+ N LS EEC+ +I + +++ST+VD TG+ R RTS G + RG D I
Sbjct: 111 PQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEGVIRNRTSEGIWYQRGEDAFI 170
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE-----FNTKNGGQRMATV 198
+++RIA +P+ENGEGLQ+LHY +Y PHFDYF + +T GGQR+AT+
Sbjct: 171 ERLDQRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSAVHTARGGQRVATL 230
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+DV +GGET+FP A GLS+ K G A+ F M L
Sbjct: 231 VVYLNDVADGGETIFPAA-------------------GLSVAAKQGGAVYFRYMNGQRQL 271
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
DP +LHGG PV G+KW TKW+R Y
Sbjct: 272 DPLTLHGGAPVHAGDKWIMTKWMRERAY 299
>gi|221068712|ref|ZP_03544817.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
gi|220713735|gb|EED69103.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
Length = 299
Score = 170 bits (430), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 87/209 (41%), Positives = 124/209 (59%), Gaps = 26/209 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR V+ N LS EEC+ +I A P M++S VD+ +G + RTS+G F RG + +I
Sbjct: 112 PRVVVFGNLLSDEECDAIIAAAGPRMQRSLTVDNQSGGEAVNDDRTSNGMFFQRGENDLI 171
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-DEFNT----KNGGQRMATV 198
+E+RIA +PLENGEG+QVLHY G +Y+PH+DYF +E T K GGQR+ T+
Sbjct: 172 CRVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTL 231
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL++ GG T FP+ GL + P+ G+A+ F +PD +
Sbjct: 232 VMYLNEPARGGATTFPD-------------------VGLQVVPRRGNAVFFSYNRPDPAT 272
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV++G KW +TKW+R E+K
Sbjct: 273 --KTLHGGAPVLEGEKWIATKWLREREFK 299
>gi|186474111|ref|YP_001861453.1| procollagen-proline dioxygenase [Burkholderia phymatum STM815]
gi|184196443|gb|ACC74407.1| Procollagen-proline dioxygenase [Burkholderia phymatum STM815]
Length = 305
Score = 170 bits (430), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 93/256 (36%), Positives = 136/256 (53%), Gaps = 42/256 (16%)
Query: 36 AFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSK 95
A+ + P ++G+ +A+D VR ER P+ V+ + LS+
Sbjct: 86 AYDYDACPVAAGNIVRAHDRDVAVRVRFER------------------PQVIVFDDVLSR 127
Query: 96 EECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTF 155
+EC+ LI A +++ST V+ ++G+ ++RTS G + R D I +++RI+
Sbjct: 128 DECDELIERARHRLKRSTTVNPESGREDVIQLRTSEGFWFQRCEDAFIERLDRRISALMN 187
Query: 156 FPLENGEGLQVLHYEAGQKYEPHFDYFMDE-----FNTKNGGQRMATVLMYLSDVEEGGE 210
+PLE+GEGLQ+LHY G +Y PHFDYF +T GGQR+AT+++YLSDV GGE
Sbjct: 188 WPLEHGEGLQILHYTKGGEYRPHFDYFPPSQSGSVLHTSRGGQRVATLIVYLSDVAGGGE 247
Query: 211 TVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVI 270
TVFPNA GL++ + G A+ F + LDP +LHGG PV
Sbjct: 248 TVFPNA-------------------GLAVMARQGGAIYFRYLNGHRQLDPLTLHGGAPVT 288
Query: 271 KGNKWSSTKWIRVNEY 286
G KW TKW+R Y
Sbjct: 289 NGEKWIMTKWMRERPY 304
>gi|255577610|ref|XP_002529682.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223530830|gb|EEF32693.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 165
Score = 169 bits (429), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 91/166 (54%), Positives = 106/166 (63%), Gaps = 10/166 (6%)
Query: 119 TGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPH 178
T + S VRTSSG FL+ K IEKRI+ ++ P+ENGE +QVL YE Q Y PH
Sbjct: 3 TNQGMKSNVRTSSGMFLSSEERKSPMAIEKRISVYSQVPIENGELVQVLRYEKSQFYRPH 62
Query: 179 FDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKT--G 236
DYF D FN K GGQR+AT+LMYLSD EGGET FP A E S GK G
Sbjct: 63 HDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPMAGS--------GECSCGGKIVKG 114
Query: 237 LSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
LS+KP GDA+LFWSM D DP S+HGGC V+ G KWS+TKW+R
Sbjct: 115 LSVKPIKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWMR 160
>gi|418530659|ref|ZP_13096582.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
gi|371452378|gb|EHN65407.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
Length = 299
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 86/209 (41%), Positives = 124/209 (59%), Gaps = 26/209 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR V+ N LS EEC+ +I A P M++S VD+ +G + RTS+G F RG + +I
Sbjct: 112 PRVVVFGNLLSNEECDAIIAAARPRMQRSLTVDNQSGGEAVNDDRTSNGMFFQRGENDLI 171
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-DEFNT----KNGGQRMATV 198
+E+RIA +PLENGEG+QVLHY G +Y+PH+DYF +E T K GGQR+ T+
Sbjct: 172 SRVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTL 231
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL++ GG T FP+ GL + P+ G+A+ F +P+ +
Sbjct: 232 VMYLNEPARGGATTFPD-------------------VGLQVVPRRGNAVFFSYNRPEPAT 272
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV++G KW +TKW+R E+K
Sbjct: 273 --KTLHGGAPVLEGEKWIATKWLREREFK 299
>gi|145347188|ref|XP_001418057.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144578285|gb|ABO96350.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 317
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 96/238 (40%), Positives = 138/238 (57%), Gaps = 14/238 (5%)
Query: 60 RKSMERSEGDEGRAE--QWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
R S GDE E + VE +SW PR F+ NFLS EECE+LI L + +STVV+S
Sbjct: 17 RTSTSGGGGDEDDVERSKVVETLSWSPRVFLLKNFLSDEECEHLIELGEKKLERSTVVNS 76
Query: 118 DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEP 177
D + S RTS GTF+ R + ++ +E R+A ++ P E+ E LQ+L Y GQ+Y
Sbjct: 77 DESGAV-STARTSFGTFVTRRLTETLQRVEDRVAKYSGIPWEHQEQLQLLRYRDGQEYVA 135
Query: 178 HFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNA----QGNISAVPWWNELSECG 233
H D + E NGG+R+ATVLM+L + GGET FP + + + ++LSECG
Sbjct: 136 HHDGIISE----NGGKRIATVLMFLREPTSGGETSFPQGTPLPETKAAFLANKDKLSECG 191
Query: 234 ---KTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
G S+ PK G+A+LF+S + + DP + H CP + G K+++TKWI N ++
Sbjct: 192 WNDGNGFSVIPKKGEAVLFFSFHINGTNDPFANHASCPTLGGTKYTATKWIHENPFET 249
>gi|302835042|ref|XP_002949083.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
nagariensis]
gi|300265828|gb|EFJ50018.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
nagariensis]
Length = 263
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 97/223 (43%), Positives = 127/223 (56%), Gaps = 28/223 (12%)
Query: 76 WVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
WVE +SW PRAFVYH FL+ EC++LI LATP + +S VV +D+ D +RTS +
Sbjct: 58 WVETVSWMPRAFVYHQFLTPAECDHLIELATPKLERSMVVGTDSDLIDD--IRTSFSASI 115
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGG-QR 194
G I+ IE+RIA +T VL Y GQKY+ H+D+F D K GG R
Sbjct: 116 MYGETSIVSSIEERIARWT-----------VLRYVNGQKYDAHWDWFDDNEVAKAGGSNR 164
Query: 195 MATVLMYLSDVE--EGGETVFPNAQ------GNISAVPWWNELSEC-GKTGLSIKPKMGD 245
MATVLMYLSDV+ GGET P A+ ++ S+C + G+SI+P+ GD
Sbjct: 165 MATVLMYLSDVDPAAGGETALPLAEPLDPHKQSVDG----QGYSQCAARMGISIRPRKGD 220
Query: 246 ALLFWSMKPDASL-DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
LLFW M P + D +LH CP G KW++TKWI Y+
Sbjct: 221 VLLFWDMDPAGLIPDRHALHASCPTFSGTKWTATKWIHNKPYR 263
>gi|242038031|ref|XP_002466410.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
gi|241920264|gb|EER93408.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
Length = 294
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 87/211 (41%), Positives = 129/211 (61%), Gaps = 9/211 (4%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTGKSKDSRVRTSSGTFL 135
+++SW+PRA + F + E+CE ++ A ++ ST+ +T +S +RTSSGTFL
Sbjct: 87 QILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKG-IRTSSGTFL 145
Query: 136 ARGRD--KIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
+ D + + +IEK+IA T P +GE VL Y GQ+Y H+D F Q
Sbjct: 146 SANEDPTRTLAEIEKKIARATMIPRNHGEPFNVLRYNIGQRYASHYDAFDPVQYGPQKSQ 205
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R+A+ L+YL++VEEGGET+FP G + + + +C GL +KP+ GD LLF+S+
Sbjct: 206 RVASFLLYLTNVEEGGETMFPYENGENMDIGY--DYEKC--IGLKVKPRKGDGLLFYSLM 261
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVN 284
+ ++D +SLHG CPVIKG KW +TKWIR N
Sbjct: 262 VNGTIDRTSLHGSCPVIKGEKWVATKWIRDN 292
>gi|223945827|gb|ACN26997.1| unknown [Zea mays]
gi|414872966|tpg|DAA51523.1| TPA: prolyl 4-hydroxylase [Zea mays]
Length = 294
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 87/211 (41%), Positives = 128/211 (60%), Gaps = 9/211 (4%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTGKSKDSRVRTSSGTFL 135
+++SW+PRA + F + E+CE ++ A ++ ST+ +T +S +RTSSGTFL
Sbjct: 87 QILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKG-IRTSSGTFL 145
Query: 136 ARGRD--KIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
+ D + + +IEK+IA T P +GE VL Y GQ+Y H+D F Q
Sbjct: 146 SANEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQRYASHYDAFDPAQYGPQKNQ 205
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R+A+ L+YL+DVEEGGET+FP + + + +C GL +KP+ GD LLF+S+
Sbjct: 206 RVASFLLYLTDVEEGGETMFPYENSENMDIGY--DYEKC--IGLKVKPRKGDGLLFYSLM 261
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVN 284
+ ++D +SLHG CPVIKG KW +TKWIR N
Sbjct: 262 VNGTIDRTSLHGSCPVIKGEKWVATKWIRDN 292
>gi|226499492|ref|NP_001150030.1| LOC100283657 [Zea mays]
gi|195636206|gb|ACG37571.1| prolyl 4-hydroxylase [Zea mays]
gi|347978804|gb|AEP37744.1| prolyl 4-hydroxylase 3 [Zea mays]
Length = 294
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 87/211 (41%), Positives = 128/211 (60%), Gaps = 9/211 (4%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTGKSKDSRVRTSSGTFL 135
+++SW+PRA + F + E+CE ++ A ++ ST+ +T +S +RTSSGTFL
Sbjct: 87 QILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKG-IRTSSGTFL 145
Query: 136 ARGRD--KIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
+ D + + +IEK+IA T P +GE VL Y GQ+Y H+D F Q
Sbjct: 146 SANEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQRYASHYDAFDPAQYGPQKNQ 205
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R+A+ L+YL+DVEEGGET+FP + + + +C GL +KP+ GD LLF+S+
Sbjct: 206 RVASFLLYLTDVEEGGETMFPYENSENMDIGY--DYEKC--IGLKVKPRKGDGLLFYSLM 261
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVN 284
+ ++D +SLHG CPVIKG KW +TKWIR N
Sbjct: 262 VNGTIDRTSLHGSCPVIKGEKWVATKWIRDN 292
>gi|295700439|ref|YP_003608332.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
gi|295439652|gb|ADG18821.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
Length = 296
Score = 168 bits (426), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 98/220 (44%), Positives = 121/220 (55%), Gaps = 26/220 (11%)
Query: 74 EQWVEVISWEPRAFVYH--NFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
E+ V V+S R H NFLS +ECE LI LA P + +S VVD TG+ + R+S
Sbjct: 90 ERKVRVLSRLQRPAAVHLANFLSADECEQLIALAQPRLDRSAVVDPVTGRDVIATHRSSH 149
Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM--DEFNTK 189
G F G +I IE RIA+ T P+ENGEGLQ+LHYE G + PH DY M +E N +
Sbjct: 150 GMFFRLGETPLIARIEARIAELTATPVENGEGLQMLHYEEGAESTPHVDYLMTGNEANRE 209
Query: 190 N---GGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDA 246
+ GQRM T+LMYL DVE GGETVFP + G SI P+ G A
Sbjct: 210 SIARSGQRMGTLLMYLKDVEGGGETVFP-------------------QVGWSIVPQRGHA 250
Query: 247 LLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
L F DPSSLH P+ G+KW +TKWIR +
Sbjct: 251 LYFEYGNRYGMCDPSSLHASTPLRTGDKWVATKWIRTRRF 290
>gi|159485424|ref|XP_001700744.1| hypothetical protein CHLREDRAFT_187378 [Chlamydomonas reinhardtii]
gi|158281243|gb|EDP06998.1| predicted protein [Chlamydomonas reinhardtii]
Length = 253
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 97/219 (44%), Positives = 126/219 (57%), Gaps = 11/219 (5%)
Query: 76 WVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
W+E ISW PRAF+YH FLS EC++LI LA P + +S VV + + + +RTS +
Sbjct: 37 WIETISWVPRAFIYHGFLSHAECDHLIGLALPKLERSLVVGNKSDEVDP--IRTSYSASI 94
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNTKNGGQR 194
++ DIE RIA +T P + E ++VL Y GQKY+ H+D+F E GG R
Sbjct: 95 GYNETDVVADIEGRIARWTHLPRSHQEPMEVLRYINGQKYDAHWDWFDETETGGTGGGNR 154
Query: 195 MATVLMYLSDVE--EGGETVFPNAQG---NISAVPWWNELSECG-KTGLSIKPKMGDALL 248
MAT LMYLSD+E GGET P AQ + V SEC K G+S++PK GD LL
Sbjct: 155 MATALMYLSDMEPAAGGETALPLAQPLDWEVQGVE-GRGYSECASKMGISVRPKKGDVLL 213
Query: 249 FWSMKPDA-SLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
FW M+P D +LH CP G KW++TKWI Y
Sbjct: 214 FWDMEPGGREPDRHALHASCPTFSGTKWTATKWIHNTPY 252
>gi|159489502|ref|XP_001702736.1| predicted protein [Chlamydomonas reinhardtii]
gi|158280758|gb|EDP06515.1| predicted protein [Chlamydomonas reinhardtii]
Length = 231
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 92/212 (43%), Positives = 125/212 (58%), Gaps = 12/212 (5%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR-VRTSSGTFLA 136
+++SW PR V+ F+ K E+++ LA M S + + + S+ RTS+GTFL+
Sbjct: 18 QILSWYPRIVVFPGFIDKARAEHIVKLAGKFMYPSGLAYRPGEQVESSQQTRTSTGTFLS 77
Query: 137 RGRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGG-- 192
G D ++ +E+RIA T P +NGE VLHYE Q H+D MD F+ K+ G
Sbjct: 78 SGMDTEGVLGWVEQRIAAATLLPADNGEAFNVLHYEHMQ----HYDSHMDSFDPKDFGPQ 133
Query: 193 --QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
QR+ATVL+YLS+V EGGETVF +G A + C P+MGDA+LFW
Sbjct: 134 PSQRIATVLLYLSEVLEGGETVF-KKEGVDGADRPIQDWRNCDDGSFKYAPRMGDAVLFW 192
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+P+ +DP SLHGGCPV KG KW +TKWIR
Sbjct: 193 GTRPNGEIDPHSLHGGCPVKKGEKWVATKWIR 224
>gi|420246706|ref|ZP_14750139.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
gi|398073616|gb|EJL64785.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
Length = 282
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 95/256 (37%), Positives = 132/256 (51%), Gaps = 42/256 (16%)
Query: 36 AFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSK 95
A+ + P ++G++ A+D VR ER P+ + + LS
Sbjct: 63 AYHYDACPVAAGNTVHAHDRDVTVRIRFER------------------PQVIAFDDVLSG 104
Query: 96 EECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTF 155
EEC LI A +++ST V+ + G ++RTS G + R D I ++ RI+
Sbjct: 105 EECAELIERARHRLKRSTTVNPENGSEDVIQLRTSEGFWFQRCEDAFIERLDHRISALMN 164
Query: 156 FPLENGEGLQVLHYEAGQKYEPHFDYFMDEFN-----TKNGGQRMATVLMYLSDVEEGGE 210
+PLE+GEGLQ+LHY G +Y PHFDYF N T GGQR+AT+++YLSDVE GGE
Sbjct: 165 WPLEHGEGLQILHYRQGGEYRPHFDYFPPGQNGSVLHTARGGQRVATLIVYLSDVEGGGE 224
Query: 211 TVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVI 270
TVFP+A GL++ + G A+ F M LDP +LHGG PV
Sbjct: 225 TVFPDA-------------------GLAVMARQGGAIYFRYMNGRRQLDPLTLHGGAPVT 265
Query: 271 KGNKWSSTKWIRVNEY 286
G+KW TKW+R Y
Sbjct: 266 SGDKWIMTKWMRERPY 281
>gi|390570433|ref|ZP_10250698.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
gi|389937613|gb|EIM99476.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
Length = 285
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 95/256 (37%), Positives = 132/256 (51%), Gaps = 42/256 (16%)
Query: 36 AFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSK 95
A+ + P ++G++ A+D VR ER P+ + + LS
Sbjct: 66 AYHYDACPVAAGNTVHAHDRDVTVRIRFER------------------PQVIAFDDVLSG 107
Query: 96 EECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTF 155
EEC LI A +++ST V+ + G ++RTS G + R D I ++ RI+
Sbjct: 108 EECAELIERARHRLKRSTTVNPENGSEDVIQLRTSEGFWFQRCEDAFIERLDHRISALMN 167
Query: 156 FPLENGEGLQVLHYEAGQKYEPHFDYFMDEFN-----TKNGGQRMATVLMYLSDVEEGGE 210
+PLE+GEGLQ+LHY G +Y PHFDYF N T GGQR+AT+++YLSDVE GGE
Sbjct: 168 WPLEHGEGLQILHYRQGGEYRPHFDYFPPGQNGSVLHTARGGQRVATLIVYLSDVEGGGE 227
Query: 211 TVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVI 270
TVFP+A GL++ + G A+ F M LDP +LHGG PV
Sbjct: 228 TVFPDA-------------------GLAVMARQGGAIYFRYMNGRRQLDPLTLHGGAPVT 268
Query: 271 KGNKWSSTKWIRVNEY 286
G+KW TKW+R Y
Sbjct: 269 SGDKWIMTKWMRERPY 284
>gi|255573113|ref|XP_002527486.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223533126|gb|EEF34884.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 286
Score = 167 bits (422), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 89/210 (42%), Positives = 130/210 (61%), Gaps = 11/210 (5%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTGKSKDSRVRTSSGTFL 135
+V+SW+PRA + +F + E+C+ +I +A ++ S + +T +S RTSSGTFL
Sbjct: 77 QVLSWKPRAVYFPDFATPEQCKNIIEMAKLRLKPSGLALRKGETAESTKG-TRTSSGTFL 135
Query: 136 ARGRDKI--IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
+ D + IE +IA T P +GE +L YE GQKY+ H+D F Q
Sbjct: 136 SASEDGTGTLDFIEHKIARATMIPRSHGEAFNILRYEIGQKYDSHYDSFNPAEYGPQMSQ 195
Query: 194 RMATVLMYLSDVEEGGETVFPNAQG-NISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+A+ L+YLSDVE+GGET+FP G IS+V + +C GL +KP+ GD +LF+S+
Sbjct: 196 RVASFLLYLSDVEKGGETMFPFENGVKISSV---YDYKKCA--GLKVKPRQGDGILFYSL 250
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
P+ ++D +SLHG CPVI+G KW +TKWIR
Sbjct: 251 LPNGTIDQTSLHGSCPVIEGEKWVATKWIR 280
>gi|449520827|ref|XP_004167434.1| PREDICTED: putative prolyl 4-hydroxylase-like, partial [Cucumis
sativus]
Length = 164
Score = 166 bits (421), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 88/162 (54%), Positives = 107/162 (66%), Gaps = 12/162 (7%)
Query: 125 SRVRTSSGTFLARGRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF 182
S RTSSG FL+ +++ IEKRI+ ++ P+ENGE +QVL YE Q Y+PH DYF
Sbjct: 4 SDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYF 63
Query: 183 MDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKT--GLSIK 240
D FN K GGQR+AT+LMYLS+ EGGET FP A E S GKT GLS+K
Sbjct: 64 SDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGS--------GECSCGGKTVPGLSVK 115
Query: 241 PKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
P GDA+LFWSM D DP S+HGGC V+ G KWS+TKW+R
Sbjct: 116 PAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWMR 157
>gi|302844281|ref|XP_002953681.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
nagariensis]
gi|300261090|gb|EFJ45305.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
nagariensis]
Length = 304
Score = 166 bits (421), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 86/221 (38%), Positives = 130/221 (58%), Gaps = 22/221 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E ++W+PR F+YHNF++ E +++I LA P M++STVV + G+S + RT +
Sbjct: 1 IEHVAWKPRVFIYHNFITDMEAKHMIELAAPQMKRSTVVGAG-GQSVEDSYRTLYTAGVR 59
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
R +D ++ IE R+A +T + + E +Q+L Y GQ+Y+ H D D+ G R+A
Sbjct: 60 RYQDDVVERIENRVAAWTQISVLHQEDMQILRYGIGQQYKVHADTLRDD----EAGVRVA 115
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWN---------ELSECGKTGLSIKPKMGDAL 247
TVL+YL++ E GGET FP++Q W N S C K ++ PK GDAL
Sbjct: 116 TVLIYLNEPEAGGETAFPDSQ-------WVNPKLAETIGANFSACAKNHVAFAPKRGDAL 168
Query: 248 LFWSMKPDASL-DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
LFWS+ PD + D + H GCPV+ G KW++TKWI ++
Sbjct: 169 LFWSIGPDGTTEDYHASHTGCPVLSGVKWTATKWIHAKPFR 209
>gi|388567209|ref|ZP_10153646.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
gi|388265592|gb|EIK91145.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
Length = 296
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 89/222 (40%), Positives = 126/222 (56%), Gaps = 28/222 (12%)
Query: 73 AEQWVEVISW--EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTS 130
++WV++I+ PR V N LS EEC+ +I A P + +S V + TG + + RTS
Sbjct: 96 GDRWVDIITHMNHPRVVVLGNLLSAEECDAIIESAKPKLARSLTVQTATGGEELNADRTS 155
Query: 131 SGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT- 188
SG F RG+ + +E+RIA +P+ENGEGLQVLHY G +Y+PH+DYF E T
Sbjct: 156 SGMFFTRGQTPEVTAVERRIARLVGWPVENGEGLQVLHYRPGAEYKPHYDYFDPKEAGTP 215
Query: 189 ---KNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGD 245
K GGQR+AT++MYL++ GG T FP+ GL + P G
Sbjct: 216 TILKRGGQRVATLVMYLNEPARGGGTTFPD-------------------VGLEVAPVKGS 256
Query: 246 ALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
A+ F +P + SLHGG PV++G KW +TKW+R E++
Sbjct: 257 AVFFSYDRPHPTT--RSLHGGAPVLEGEKWVATKWLREREFQ 296
>gi|407938132|ref|YP_006853773.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
gi|407895926|gb|AFU45135.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
Length = 303
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 88/210 (41%), Positives = 122/210 (58%), Gaps = 26/210 (12%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR V+ N LS EEC+ LI A P M +S V + TG + + RTS G F RG+ +
Sbjct: 115 QPRIVVFGNLLSPEECDALIAAAEPRMARSLTVATKTGGEEINADRTSDGMFFQRGQSPL 174
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMAT 197
I+ IE+RIA +P+ENGEGLQVLHY G +Y+PH+DYF E T K GGQR+ T
Sbjct: 175 IQRIEERIARLLQWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPSIIKRGGQRVGT 234
Query: 198 VLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS 257
++MYL+ ++GG T FP+ L + P+ G+A+ F +P S
Sbjct: 235 LVMYLNTPDKGGGTTFPDVH-------------------LEVAPQRGNAVFFSYERPHPS 275
Query: 258 LDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PVI G+KW +TKW+R E++
Sbjct: 276 T--RTLHGGAPVIAGDKWIATKWLREREFQ 303
>gi|330821584|ref|YP_004350446.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
gladioli BSR3]
gi|327373579|gb|AEA64934.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
gladioli BSR3]
Length = 302
Score = 166 bits (419), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 90/221 (40%), Positives = 119/221 (53%), Gaps = 24/221 (10%)
Query: 71 GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTS 130
G+ E V + P A + FLS EC LI LA P + +STVVD TG++ + R+S
Sbjct: 89 GQHETRVLLRLQRPAAVLLDGFLSAGECRQLIELARPRLNRSTVVDPVTGRNIVAGHRSS 148
Query: 131 SGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DE 185
G F G +I IE+RIA T FP+ENGEGLQ+LHYEAG + PH DY + +
Sbjct: 149 DGMFFRLGETPLISRIEQRIAALTGFPVENGEGLQMLHYEAGAESTPHVDYLVPGNPANA 208
Query: 186 FNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGD 245
+ GQR+ T+LMYL+DVE GGET+FP + G S+ P+ G
Sbjct: 209 ESIARSGQRVGTLLMYLNDVESGGETLFP-------------------QVGCSVVPRRGQ 249
Query: 246 ALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
A F DP+SLH P+ G+KW +TKWIR +
Sbjct: 250 AFYFEYGNGSGRSDPASLHASSPIGSGDKWVATKWIRTRRF 290
>gi|344169181|emb|CCA81504.1| putative Prolyl 4-hydroxylase alpha subunit [blood disease
bacterium R229]
Length = 289
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 84/209 (40%), Positives = 119/209 (56%), Gaps = 24/209 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR ++ +FLS EEC+ LI L +++S VV+ +TG+ RTS G G +I
Sbjct: 97 PRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLI 156
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DEFNTKNGGQRMATV 198
IE RIA T P+E+GEG QVLHY+ G +Y+PHFDYF + + GGQR+AT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+ V+ GG T FP K GL + P G+A+ F +PD +L
Sbjct: 217 VIYLNSVQAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGTL 257
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D ++LH G PV +G KW +TKW+R Y+
Sbjct: 258 DDNTLHAGLPVERGEKWIATKWLRERPYR 286
>gi|300690371|ref|YP_003751366.1| prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum PSI07]
gi|299077431|emb|CBJ50057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
PSI07]
Length = 289
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 84/209 (40%), Positives = 119/209 (56%), Gaps = 24/209 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR ++ +FLS EEC+ LI L +++S VV+ +TG+ RTS G G +I
Sbjct: 97 PRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLI 156
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DEFNTKNGGQRMATV 198
IE RIA T P+E+GEG QVLHY+ G +Y+PHFDYF + + GGQR+AT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+ V+ GG T FP K GL + P G+A+ F +PD +L
Sbjct: 217 VIYLNSVQAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGTL 257
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D ++LH G PV +G KW +TKW+R Y+
Sbjct: 258 DDNTLHAGLPVERGEKWIATKWLRERPYR 286
>gi|303279839|ref|XP_003059212.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226459048|gb|EEH56344.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 409
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 96/248 (38%), Positives = 142/248 (57%), Gaps = 39/248 (15%)
Query: 69 DEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDT----GKSKD 124
D G + VE +S PRA+++ FL+KEEC +LI ++TPH+++STVV D +
Sbjct: 75 DIGVGDARVEKLSDSPRAYLFREFLTKEECAHLIEISTPHLKRSTVVGDDALLGEADGRR 134
Query: 125 SRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQ---VLHYEAGQKYEPHFDY 181
S RTS+G FL + D ++ +E+R+ F+ P EN E LQ +L YE GQ+Y H
Sbjct: 135 SDYRTSTGAFLPKLYDDVVTRVERRVEAFSRLPFENQEQLQARSLLRYELGQEYRDH--- 191
Query: 182 FMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGN------ISAVPWWNELSECG-- 233
+D F T+NGG+R+ATVLM+L++ EEGGET FPN + + ++A ELS+C
Sbjct: 192 -VDGFATENGGKRVATVLMFLAEPEEGGETAFPNGEPSEAVAARVAAQRARGELSDCAWR 250
Query: 234 -------------KTGLSIKPKMGDALLFWS-------MKPDASLDPSSLHGGCPVIKGN 273
G ++KP++GDA+LF+S A + +S H CP +G
Sbjct: 251 GGGGGTAGGGRGNLRGFAVKPRLGDAVLFFSYDADDDGGYDGAEVSHASTHASCPTTRGV 310
Query: 274 KWSSTKWI 281
KW++TKWI
Sbjct: 311 KWTATKWI 318
>gi|344172475|emb|CCA85118.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
Length = 289
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 84/209 (40%), Positives = 119/209 (56%), Gaps = 24/209 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR ++ +FLS EEC+ LI L +++S VV+ +TG+ RTS G G +I
Sbjct: 97 PRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLI 156
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DEFNTKNGGQRMATV 198
IE RIA T P+E+GEG QVLHY+ G +Y+PHFDYF + + GGQR+AT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+ V+ GG T FP K GL + P G+A+ F +PD +L
Sbjct: 217 VIYLNSVQAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGTL 257
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D ++LH G PV +G KW +TKW+R Y+
Sbjct: 258 DDNTLHAGLPVERGEKWIATKWLRERPYR 286
>gi|351731158|ref|ZP_08948849.1| 2OG-Fe(II) oxygenase [Acidovorax radicis N35]
Length = 303
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 88/210 (41%), Positives = 121/210 (57%), Gaps = 26/210 (12%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR V+ N LS EEC+ LI A P M +S V + TG + + RTS G F RG+ +
Sbjct: 115 QPRVVVFGNLLSPEECDALIADAAPRMARSLTVATKTGGEEINDDRTSDGMFFQRGQSPL 174
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMAT 197
I+ IE+RIA +P+ENGEGLQVLHY G +Y+PH+DYF E T K GGQR+ T
Sbjct: 175 IQRIEERIARLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTIVKRGGQRVGT 234
Query: 198 VLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS 257
++MYL+ E+GG T FP+ + + P+ G+A+ F +P S
Sbjct: 235 LVMYLNTPEKGGGTTFPDVH-------------------VEVAPQRGNAVFFSYERPHPS 275
Query: 258 LDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV+ G KW +TKW+R E+K
Sbjct: 276 T--RTLHGGAPVLAGEKWIATKWLREREFK 303
>gi|91779740|ref|YP_554948.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
gi|91692400|gb|ABE35598.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
Length = 296
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 93/220 (42%), Positives = 121/220 (55%), Gaps = 26/220 (11%)
Query: 74 EQWVEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
E+ VIS P A + +FLS ECE LI LA P + +STVVD TG++ + R+S
Sbjct: 90 ERKTRVISRMQRPAAVLLDDFLSANECEQLIALARPRLSRSTVVDPVTGRNVVAGHRSSD 149
Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DEF 186
G F G +I +E RIA+ T P+ENGEGLQ+LHYEAG + PH DY + +
Sbjct: 150 GMFFRLGETPLIARLEARIAELTGLPVENGEGLQLLHYEAGAESTPHVDYLIAGNPANRE 209
Query: 187 NTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDA 246
+ GQR+ T+LMYL+DVE GGET+FP +TG S+ P+ G A
Sbjct: 210 SIARSGQRVGTLLMYLNDVEGGGETMFP-------------------QTGWSVVPRRGQA 250
Query: 247 LLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
L F DPSSLH P+ G KW +TKWIR +
Sbjct: 251 LYFEYGNRFGLADPSSLHTSTPLRAGEKWVATKWIRTRRF 290
>gi|414587755|tpg|DAA38326.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 244
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 83/153 (54%), Positives = 103/153 (67%), Gaps = 2/153 (1%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EVISW PR V+HNFLS EEC+YL+ +A P ++ STVVD TGK S VRTSSG F+
Sbjct: 58 EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSDVRTSSGMFVNS 117
Query: 138 GRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
K +++ IEKRI+ F+ P ENGE +QVL YEA Q Y PH DYF D FN K GGQR+
Sbjct: 118 EERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTFNLKRGGQRV 177
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNE 228
AT+LMYL+D GGET FP + + W++
Sbjct: 178 ATMLMYLTDGVVGGETHFPQEMESAAVEETWSK 210
>gi|254254263|ref|ZP_04947580.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
gi|124898908|gb|EAY70751.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
Length = 285
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 85/208 (40%), Positives = 118/208 (56%), Gaps = 24/208 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P+ V+ N L ++EC+ +I + + +ST V+++TG + R RTS GT+ G D +I
Sbjct: 96 PQIVVFGNVLDQDECDEMIQRSMHKLEQSTTVNAETGTQEVIRHRTSHGTWFQNGEDALI 155
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEF-----NTKNGGQRMATV 198
R IE R+A P+ENGEGLQVL Y G +Y H+DYF + + GGQR+AT+
Sbjct: 156 RRIETRLAALMNCPVENGEGLQVLRYTPGGEYRSHYDYFQPTAAGSLTHVRTGGQRVATL 215
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+DV GGETVFP A G+S+ P+ GDA+ F M L
Sbjct: 216 IVYLNDVPSGGETVFPEA-------------------GISVVPRRGDAVYFRYMNRLRQL 256
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
DP++LH G PV G KW TKW+R Y
Sbjct: 257 DPATLHAGAPVRDGEKWIMTKWVRERPY 284
>gi|187920106|ref|YP_001889137.1| procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
gi|187718544|gb|ACD19767.1| Procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
Length = 295
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 83/208 (39%), Positives = 120/208 (57%), Gaps = 24/208 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P+ V+ + LS +EC +I + +++ST V+ +TGK R RTS G + RG D I
Sbjct: 106 PQVIVFGDVLSPDECAEMIERSRHRLKRSTTVNPETGKEDVIRNRTSEGIWYQRGEDAFI 165
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE-----FNTKNGGQRMATV 198
+++RI+ +P+ENGEGLQ+LHY +Y PHFDYF + +T GGQR+AT+
Sbjct: 166 ERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQGGQRVATL 225
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+DV +GGET+FP A G+S+ + G A+ F M L
Sbjct: 226 VIYLNDVPDGGETIFPEA-------------------GISVAARQGGAVYFRYMNGQRQL 266
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
DP +LHGG PV+ G+KW TKW+R Y
Sbjct: 267 DPLTLHGGAPVLGGDKWIMTKWMRERAY 294
>gi|385206010|ref|ZP_10032880.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
gi|385185901|gb|EIF35175.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
Length = 296
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 92/220 (41%), Positives = 122/220 (55%), Gaps = 26/220 (11%)
Query: 74 EQWVEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
E+ VIS P A + +FLS ECE LI+LA P + +STVVD TG++ + R+S
Sbjct: 90 ERKTRVISRMQRPAAILLDDFLSANECEQLISLARPRLSRSTVVDPVTGRNVVAGHRSSD 149
Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DEF 186
G F G +I +E RIA+ T P+ENGEGLQ+LHYE G + PH DY + ++
Sbjct: 150 GMFFRLGETPLIARLEARIAELTGLPVENGEGLQLLHYEVGAESTPHVDYLIAGNPANQE 209
Query: 187 NTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDA 246
+ GQR+ T+LMYL+DVE GGET+FP +TG S+ P+ G A
Sbjct: 210 SIARSGQRVGTLLMYLNDVEGGGETMFP-------------------QTGWSVVPRRGQA 250
Query: 247 LLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
L F DPSSLH P+ G KW +TKWIR +
Sbjct: 251 LYFEYGNRFGLADPSSLHTSTPLRVGEKWVATKWIRTRRF 290
>gi|219121927|ref|XP_002181308.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217407294|gb|EEC47231.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 226
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 92/217 (42%), Positives = 120/217 (55%), Gaps = 19/217 (8%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKS--TVVDSDTGKSKDSRVRTSSGTF 134
+E +S P FLS +EC Y+ A PHM S T++D D G+ S RTS F
Sbjct: 7 LETLSLVPLVLSVEGFLSDDECTYIQETAEPHMEYSEVTLMDKDQGRPA-SDFRTSQSAF 65
Query: 135 LARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTK----- 189
+ D I+ DI+ R A P + E +QVL Y+ +KY+ H DYF TK
Sbjct: 66 IRAHDDAILTDIDYRTASLVRIPRRHQEDVQVLRYDVTEKYDSHADYFDPALYTKDKRTL 125
Query: 190 ----NGGQ-RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMG 244
NG + RMATV YLSDVE+GGETVFP G + +C KTGL +KP+ G
Sbjct: 126 ALIRNGHRNRMATVFWYLSDVEKGGETVFPRFNGAQE-----TSMKDC-KTGLKVKPEKG 179
Query: 245 DALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
++F+SM PD +LD SLHG CPV KG KW++ KW+
Sbjct: 180 KVIIFYSMTPDGALDEYSLHGACPVQKGTKWAANKWV 216
>gi|299065638|emb|CBJ36810.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
CMR15]
Length = 289
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 84/209 (40%), Positives = 117/209 (55%), Gaps = 24/209 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR ++ +FLS EEC+ LI L +++S VV+ +TG+ RTS G G +I
Sbjct: 97 PRIVLFQHFLSDEECDQLITLGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLI 156
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DEFNTKNGGQRMATV 198
IE RIA T P+E+GEG QVLHY+ G +Y+PHFDYF + + GGQR+AT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+ V GG T FP K GL + P G+A+ F +PD +L
Sbjct: 217 VIYLNSVPAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGTL 257
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D +LH G PV +G KW +TKW+R Y+
Sbjct: 258 DDKTLHAGLPVERGEKWIATKWLRERPYR 286
>gi|225428943|ref|XP_002263094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
gi|296083076|emb|CBI22480.3| unnamed protein product [Vitis vinifera]
Length = 282
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 89/210 (42%), Positives = 131/210 (62%), Gaps = 12/210 (5%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR--VRTSSGTFL 135
+V+SW+PRA + +F + E+C+ +I +A + ST+V G++++S +RTSSGTF+
Sbjct: 74 QVLSWKPRARYFPHFATAEQCQSIIEMAKSGLSPSTLV-LRKGETEESTKGIRTSSGTFI 132
Query: 136 ARGRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
+ DK I+ IE++IA T P +GE +L YE GQ+Y H+D Q
Sbjct: 133 SASEDKTGILDFIERKIAKATMIPRNHGEVFNILRYEIGQRYNSHYDAISPAEYGLQTSQ 192
Query: 194 RMATVLMYLSDVEEGGETVFPNAQG-NISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+A+ L+YLSDVEEGGET+FP NI N + GL +KP+ GD LLF+S+
Sbjct: 193 RIASFLLYLSDVEEGGETMFPFEHDLNI------NTFNSRKCIGLKVKPRRGDGLLFYSV 246
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
P+ ++D +S+HG CPVI+G KW +TKWIR
Sbjct: 247 FPNGTIDWTSMHGSCPVIEGEKWVATKWIR 276
>gi|17547533|ref|NP_520935.1| hypothetical protein RSc2814 [Ralstonia solanacearum GMI1000]
gi|17429837|emb|CAD16521.1| putative prolyl 4-hydroxylase alpha subunit homologue
oxidoreductase protein [Ralstonia solanacearum GMI1000]
Length = 289
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 83/209 (39%), Positives = 118/209 (56%), Gaps = 24/209 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR ++ +FLS EEC+ LI L +++S VV+ +TG+ RTS G G ++
Sbjct: 97 PRIVLFQHFLSDEECDQLIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLV 156
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DEFNTKNGGQRMATV 198
IE RIA T P+E+GEG QVLHY+ G +Y+PHFDYF + + GGQR+AT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+ V GG T FP K GL + P G+A+ F +PD +L
Sbjct: 217 VIYLNSVPAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGTL 257
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D ++LH G PV +G KW +TKW+R Y+
Sbjct: 258 DDNTLHAGLPVERGEKWIATKWLRERPYR 286
>gi|333981907|ref|YP_004511117.1| procollagen-proline dioxygenase [Methylomonas methanica MC09]
gi|333805948|gb|AEF98617.1| Procollagen-proline dioxygenase [Methylomonas methanica MC09]
Length = 286
Score = 164 bits (415), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 82/208 (39%), Positives = 118/208 (56%), Gaps = 24/208 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P V F+S EECE LI + + S +VD TGK + R+S GT+ RG +I
Sbjct: 96 PDIVVVDEFMSGEECEQLIEQSRRKLTPSAIVDPQTGKFQVIADRSSEGTYFQRGESPLI 155
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE-----FNTKNGGQRMATV 198
+++RI++ +P ++GEG+Q+LHY G +Y+PHFDYF++ GQR+AT+
Sbjct: 156 SRLDRRISELMNWPEDHGEGIQILHYGVGAQYKPHFDYFLENESGGALQMTQSGQRVATL 215
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL++V EGGETVFP+ G+SI PK G A F +
Sbjct: 216 VMYLNEVTEGGETVFPD-------------------VGISITPKRGSAAYFAYCNSLGQV 256
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
DP++LHGG PV+ G KW +TKW+R +Y
Sbjct: 257 DPATLHGGAPVLTGEKWIATKWMRQYKY 284
>gi|388520887|gb|AFK48505.1| unknown [Lotus japonicus]
Length = 187
Score = 164 bits (414), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 75/130 (57%), Positives = 99/130 (76%), Gaps = 1/130 (0%)
Query: 158 LENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQ 217
+ENGE +Q+LHYE G+KYEPH+DYF D N GG R+ATVLMYLSDV +GGET+FPNA+
Sbjct: 6 IENGESIQILHYENGRKYEPHYDYFHDRANQFMGGHRIATVLMYLSDVGKGGETIFPNAE 65
Query: 218 GNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSS 277
+S P SEC G ++KP+ GDALLF+S+ +A+ D +SLHG CPVI+G KWS+
Sbjct: 66 SKLSQ-PKDESWSECAHKGYAVKPRKGDALLFFSLHLNATTDSNSLHGSCPVIEGEKWSA 124
Query: 278 TKWIRVNEYK 287
TKWI V++++
Sbjct: 125 TKWIHVSDFE 134
>gi|83746819|ref|ZP_00943867.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
gi|83726588|gb|EAP73718.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
Length = 289
Score = 164 bits (414), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 83/209 (39%), Positives = 118/209 (56%), Gaps = 24/209 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR ++ +FLS EEC+ LI L +++S VV+ +TG+ RTS G G ++
Sbjct: 97 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 156
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DEFNTKNGGQRMATV 198
IE RIA T P+E+GEG QVLHY G +Y+PHFDYF + + GGQR+AT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+ V+ GG T FP K GL + P G+A+ F +PD +L
Sbjct: 217 VIYLNSVQAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGTL 257
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D ++LH G PV +G KW +TKW+R Y+
Sbjct: 258 DDNTLHAGLPVERGEKWIATKWLRERPYR 286
>gi|386332363|ref|YP_006028532.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
gi|334194811|gb|AEG67996.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
Length = 292
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 83/209 (39%), Positives = 118/209 (56%), Gaps = 24/209 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR ++ +FLS EEC+ LI L +++S VV+ +TG+ RTS G G ++
Sbjct: 100 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 159
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DEFNTKNGGQRMATV 198
IE RIA T P+E+GEG QVLHY G +Y+PHFDYF + + GGQR+AT+
Sbjct: 160 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 219
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+ V+ GG T FP K GL + P G+A+ F +PD +L
Sbjct: 220 VIYLNSVQAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGTL 260
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D ++LH G PV +G KW +TKW+R Y+
Sbjct: 261 DDNTLHAGLPVERGEKWIATKWLRERPYR 289
>gi|171059332|ref|YP_001791681.1| procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
gi|170776777|gb|ACB34916.1| Procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
Length = 287
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 87/206 (42%), Positives = 116/206 (56%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR V+ FLS +EC+ L+ LA P + +S VD+DTG S+ + RTS G F RG ++
Sbjct: 99 DPRVVVFGGFLSHDECDALVALAQPRLARSETVDNDTGGSEVNEARTSQGMFFMRGEGEL 158
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT------KNGGQRMA 196
I IE RIA +PLENGEG+QVLHY G +Y+PH+DYF D K GGQR+
Sbjct: 159 ISRIEARIAALLDWPLENGEGVQVLHYRPGAEYKPHYDYF-DPAQPGTPTILKRGGQRVG 217
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
T++MYL+ E GG T FP+ L + P G+A+ F +
Sbjct: 218 TLVMYLNTPERGGGTTFPD-------------------VNLEVAPIKGNAVFFSYERAHP 258
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIR 282
S SLHGG PV+ G KW +TKW+R
Sbjct: 259 ST--RSLHGGAPVLAGEKWVATKWLR 282
>gi|207744371|ref|YP_002260763.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum IPO1609]
gi|206595776|emb|CAQ62703.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum IPO1609]
Length = 280
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 83/209 (39%), Positives = 118/209 (56%), Gaps = 24/209 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR ++ +FLS EEC+ LI L +++S VV+ +TG+ RTS G G ++
Sbjct: 88 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 147
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DEFNTKNGGQRMATV 198
IE RIA T P+E+GEG QVLHY G +Y+PHFDYF + + GGQR+AT+
Sbjct: 148 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 207
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+ V+ GG T FP K GL + P G+A+ F +PD +L
Sbjct: 208 VIYLNSVQAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGTL 248
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D ++LH G PV +G KW +TKW+R Y+
Sbjct: 249 DDNTLHAGLPVERGEKWIATKWLRERPYR 277
>gi|300702992|ref|YP_003744594.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum
CFBP2957]
gi|299070655|emb|CBJ41950.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
CFBP2957]
Length = 289
Score = 163 bits (413), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 83/209 (39%), Positives = 118/209 (56%), Gaps = 24/209 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR ++ +FLS EEC+ LI L +++S VV+ +TG+ RTS G G ++
Sbjct: 97 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 156
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DEFNTKNGGQRMATV 198
IE RIA T P+E+GEG QVLHY G +Y+PHFDYF + + GGQR+AT+
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+ V+ GG T FP K GL + P G+A+ F +PD +L
Sbjct: 217 VIYLNSVQAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGTL 257
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D ++LH G PV +G KW +TKW+R Y+
Sbjct: 258 DDNTLHAGLPVERGEKWIATKWLRERPYR 286
>gi|337280547|ref|YP_004620019.1| hypothetical protein Rta_28970 [Ramlibacter tataouinensis TTB310]
gi|334731624|gb|AEG94000.1| conserved hypothetical protein [Ramlibacter tataouinensis TTB310]
Length = 286
Score = 163 bits (412), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 86/211 (40%), Positives = 119/211 (56%), Gaps = 26/211 (12%)
Query: 82 WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDK 141
+ PR V+ + LS +ECE LI LA P + +S V + TG + + RTSSG F RG ++
Sbjct: 97 YNPRVVVFGSLLSDQECEQLIGLAKPRLARSLTVATKTGGEEVNEDRTSSGMFFQRGENE 156
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMA 196
++ IE RIA +P+ENGEGLQVLHY G +Y+PH+DYF E T K GGQR+
Sbjct: 157 LVARIEARIARLVNWPVENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILKRGGQRVG 216
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
T++MYL + E+GG T FP+ L + PK G + F +P
Sbjct: 217 TLVMYLGEPEKGGGTTFPDVH-------------------LEVAPKRGHGVFFSYERPHP 257
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
S +LHGG PV+ G KW +TKW+R ++
Sbjct: 258 ST--RTLHGGAPVLAGEKWIATKWLRERRFE 286
>gi|357453665|ref|XP_003597113.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|357482683|ref|XP_003611628.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355486161|gb|AES67364.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
gi|355512963|gb|AES94586.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
Length = 294
Score = 163 bits (412), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 85/208 (40%), Positives = 125/208 (60%), Gaps = 7/208 (3%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR-VRTSSGTFLA 136
+V+SW PRA + NF S E+C+ +I +A + S ++ + + ++ +RTSSG F++
Sbjct: 84 QVLSWNPRALYFPNFASAEQCDRIIEMAKAELSPSRLMLREGETEEGTKGIRTSSGMFIS 143
Query: 137 RGRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQR 194
DK ++ I+++IA P +G +L Y+ GQKY H+D F QR
Sbjct: 144 ASEDKTGLLEVIDEKIARAAKIPKTHGGAYNILRYKVGQKYNSHYDAFNPAEYGPQESQR 203
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+A+ L+YL+DV EGGET+FP G S + +C GL IKP GD LLF+S+ P
Sbjct: 204 VASFLLYLTDVPEGGETMFPFENG--SNMDSSYNFEDC--IGLKIKPLKGDGLLFYSLFP 259
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ ++DP+SLHG CPVIKG KW +TKWIR
Sbjct: 260 NGTIDPTSLHGSCPVIKGEKWVATKWIR 287
>gi|209522122|ref|ZP_03270769.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
gi|209497434|gb|EDZ97642.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
Length = 296
Score = 163 bits (412), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 96/220 (43%), Positives = 122/220 (55%), Gaps = 26/220 (11%)
Query: 74 EQWVEVISWEPRAFVYH--NFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
E+ V V+S R H +FLS +ECE LI LA P + +STVVD TG++ + R+S
Sbjct: 90 ERKVRVLSRLQRPAAVHLADFLSADECEQLIALAQPRLDRSTVVDPVTGRNVVAGHRSSH 149
Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM--DEFNTK 189
G F G +I IE RIA T P+ENGEGLQ+LHYE G + PH DY + +E N +
Sbjct: 150 GMFFRLGETPLIVRIEARIAALTGTPVENGEGLQMLHYEEGAESTPHVDYLITGNEANRE 209
Query: 190 N---GGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDA 246
+ GQRM T+LMYL DVE GGETVFP + G S+ P+ G A
Sbjct: 210 SIARSGQRMGTLLMYLKDVEGGGETVFP-------------------QIGWSVAPQRGHA 250
Query: 247 LLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
L F DPSSLH P+ G+KW +TKWIR +
Sbjct: 251 LYFEYGNRFGLCDPSSLHASTPLRVGDKWVATKWIRTRRF 290
>gi|421890664|ref|ZP_16321519.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
K60-1]
gi|378964031|emb|CCF98267.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
K60-1]
Length = 288
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 83/209 (39%), Positives = 117/209 (55%), Gaps = 24/209 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR ++ +FLS EEC+ LI L +++S VV+ +TG+ RTS G G ++
Sbjct: 96 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 155
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DEFNTKNGGQRMATV 198
IE RIA T P+E+GEG QVLHY G +Y+PHFDYF + GGQR+AT+
Sbjct: 156 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLDVGGQRVATL 215
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+ V+ GG T FP K GL + P G+A+ F +PD +L
Sbjct: 216 VIYLNSVQAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGTL 256
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D ++LH G PV +G KW +TKW+R Y+
Sbjct: 257 DDNTLHAGLPVERGEKWIATKWLRERPYR 285
>gi|395003644|ref|ZP_10387769.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
gi|394318439|gb|EJE54870.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
Length = 299
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 87/210 (41%), Positives = 120/210 (57%), Gaps = 26/210 (12%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR V+ N LS EEC+ LI A P M +S V + TG + + RTS G F RG + +
Sbjct: 111 KPRIVVFGNLLSAEECDALIAAAAPRMARSLTVATKTGGEEVNDDRTSDGMFFQRGENPV 170
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMAT 197
++ IE+RIA +P+ENGEGLQVLHY G +Y+PH+DYF E T K GGQR+ T
Sbjct: 171 VQRIEERIARLLDWPIENGEGLQVLHYRPGAEYKPHYDYFDPGEPGTPTILKRGGQRVGT 230
Query: 198 VLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS 257
++MYL+ E+GG T FP+ + + P+ G+A+ F + A
Sbjct: 231 LVMYLNTPEKGGGTTFPDVH-------------------VEVAPQRGNAVFFSYER--AH 269
Query: 258 LDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PVI G KW +TKW+R E+K
Sbjct: 270 PATRTLHGGAPVIAGEKWIATKWLREREFK 299
>gi|91778899|ref|YP_554107.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
gi|91691559|gb|ABE34757.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
xenovorans LB400]
Length = 292
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 83/208 (39%), Positives = 118/208 (56%), Gaps = 24/208 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P+ V+ + LS +EC +I + +++ST V+ TGK R RTS G + RG D I
Sbjct: 103 PQVIVFADVLSPDECAEMIERSRHRLKRSTTVNPATGKEDVIRNRTSEGIWYQRGEDPFI 162
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE-----FNTKNGGQRMATV 198
+++RI+ +P+ENGEGLQ+LHY +Y PHFDYF + +T GGQR+AT+
Sbjct: 163 ERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQGGQRVATL 222
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+DV +GGET+FP A G+S+ G A+ F M L
Sbjct: 223 VIYLNDVPDGGETIFPEA-------------------GMSVAASQGGAVYFRYMNDRRQL 263
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
DP +LHGG PV+ G+KW TKW+R Y
Sbjct: 264 DPLTLHGGAPVLAGDKWIMTKWMRERAY 291
>gi|421895470|ref|ZP_16325871.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum MolK2]
gi|206586635|emb|CAQ17221.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
solanacearum MolK2]
Length = 283
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 83/209 (39%), Positives = 117/209 (55%), Gaps = 24/209 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR ++ +FLS EEC+ LI L +++S VV+ +TG+ RTS G G ++
Sbjct: 91 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 150
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DEFNTKNGGQRMATV 198
IE RIA T P+E+GEG QVLHY G +Y+PHFDYF + + GGQR+AT+
Sbjct: 151 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRGGEARQLEVGGQRVATL 210
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+ V+ GG T FP K GL + P G+A+ F +PD L
Sbjct: 211 VIYLNSVQAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGML 251
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D ++LH G PV +G KW +TKW+R Y+
Sbjct: 252 DDNTLHAGLPVERGEKWIATKWLRERPYR 280
>gi|365090417|ref|ZP_09328465.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
gi|363416516|gb|EHL23626.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
Length = 302
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 87/210 (41%), Positives = 121/210 (57%), Gaps = 26/210 (12%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR V+ N LS EEC+ LI A P + +S V + TG + + RTS G F RG+ +
Sbjct: 114 QPRIVVFGNLLSPEECDALIADAQPRLARSLTVATKTGGEEINDDRTSDGMFFQRGQSPL 173
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNTKN----GGQRMAT 197
I+ IE+RIA +P+ENGEGLQVLHY G +Y+PH+DYF E T + GGQR+ T
Sbjct: 174 IQRIEERIARLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPSIVNRGGQRVGT 233
Query: 198 VLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS 257
++MYL+ E+GG T FP+ L + P+ G+A+ F +P S
Sbjct: 234 LVMYLNTPEKGGGTTFPDVH-------------------LEVAPQRGNAVFFSYERPHPS 274
Query: 258 LDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PVI G KW +TKW+R E++
Sbjct: 275 T--RTLHGGAPVIAGEKWIATKWLREREFR 302
>gi|403238305|ref|ZP_10916891.1| procollagen-proline dioxygenase [Bacillus sp. 10403023]
Length = 296
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 88/213 (41%), Positives = 125/213 (58%), Gaps = 22/213 (10%)
Query: 73 AEQWVEVISWEPRAFVYH--NFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTS 130
+++ ++V+S + F+ H FLS+EEC+ LI ++ ++ STV+D TG+ K + RTS
Sbjct: 96 SDREIKVLSKVEKPFILHLDYFLSEEECDQLIEMSRERLKPSTVIDPKTGEEKAATGRTS 155
Query: 131 SGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNTK 189
G ++ I+ +EKRIA+ FP+ENGEGLQVL+Y G++Y+ HFDYF + +
Sbjct: 156 KGMSFYLQENEFIKKVEKRIAELIEFPVENGEGLQVLNYGIGEEYKSHFDYFPQSKVVPE 215
Query: 190 NGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLF 249
GGQR+ T L+YL+DV GGETVFP K G+SI PK G A+ F
Sbjct: 216 KGGQRVGTFLIYLNDVPAGGETVFP-------------------KAGVSIVPKKGSAVYF 256
Query: 250 WSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+D SLH PV +G KW +TKWIR
Sbjct: 257 QYGNSKGEVDRMSLHSSIPVSEGEKWVATKWIR 289
>gi|120609859|ref|YP_969537.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
gi|120588323|gb|ABM31763.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
Length = 309
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 83/210 (39%), Positives = 122/210 (58%), Gaps = 26/210 (12%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR ++ N LS EEC+ +I+ A P M +S V + TG + + RTS+G F R + +
Sbjct: 121 QPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGGEEVNDDRTSNGMFFQREENPV 180
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMAT 197
+ +E RIA +PLENGEGLQVLHY G +Y+PH+DYF E T + GGQR+AT
Sbjct: 181 VARLEARIARLVNWPLENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILRRGGQRVAT 240
Query: 198 VLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS 257
+++YL+D E+GG T FP+ L + P+ G+A+ F +P S
Sbjct: 241 IVIYLNDPEKGGGTTFPDVH-------------------LEVAPRRGNAVFFSYERPHPS 281
Query: 258 LDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV+ G+KW +TKW+R ++
Sbjct: 282 T--RTLHGGAPVVAGDKWIATKWLRERRFE 309
>gi|222111817|ref|YP_002554081.1| procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
gi|221731261|gb|ACM34081.1| Procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
Length = 289
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 84/209 (40%), Positives = 124/209 (59%), Gaps = 26/209 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR ++ N LS EEC+ +I+ A P M +S V + TG + + RTS G F RG ++
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMARSLTVQTTTGGEEVNADRTSDGMFFQRGETPVV 161
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMATV 198
+ +E+RIA +P++NGEGLQVLHY G +Y+PH+DYF D+ T + GGQR+AT+
Sbjct: 162 QRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQPGTSTIVRRGGQRVATL 221
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL++ +GG T FP+ VP L + P+ G+A+ F +P S
Sbjct: 222 VIYLNNPRKGGGTTFPD-------VP------------LEVAPRQGNAVFFSYERPHPST 262
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG VI+G KW +TKW+R E+K
Sbjct: 263 --RTLHGGASVIEGEKWIATKWLREREFK 289
>gi|372266874|ref|ZP_09502922.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
[Alteromonas sp. S89]
Length = 294
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 84/209 (40%), Positives = 115/209 (55%), Gaps = 24/209 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+P ++ NFL++ EC+ L+ ++ P++ S VV++ G + RTS GT ARG +
Sbjct: 102 QPNIVLFANFLAEWECDALVEMSRPNLSPSRVVNTQHGAFELKPSRTSGGTHFARGETPL 161
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKN-----GGQRMAT 197
I DIE RIA P +GE LQ+LHY +Y PH+D+F E GGQR+ T
Sbjct: 162 IADIEARIASLLKVPEAHGEPLQILHYPVSGEYRPHYDFFDPEKPGNQEVLAAGGQRVGT 221
Query: 198 VLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS 257
++MYLSDVE GG TVFP + GL ++P+ G AL F +
Sbjct: 222 LIMYLSDVESGGATVFP-------------------RVGLEVQPQKGAALFFSYVGEHGK 262
Query: 258 LDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
LD SLHGG PV+ G KW +TKW+R EY
Sbjct: 263 LDLQSLHGGSPVLAGEKWIATKWLRAAEY 291
>gi|326316001|ref|YP_004233673.1| procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
ATCC 19860]
gi|323372837|gb|ADX45106.1| Procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
ATCC 19860]
Length = 298
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 83/210 (39%), Positives = 122/210 (58%), Gaps = 26/210 (12%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR ++ N LS EEC+ +I+ A P M +S V + TG + + RTS+G F R + +
Sbjct: 110 QPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGGEEVNDDRTSNGMFFQREENPM 169
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMAT 197
+ +E RIA +PLENGEGLQVLHY G +Y+PH+DYF E T + GGQR+AT
Sbjct: 170 VAKLEARIARLVNWPLENGEGLQVLHYRPGAEYKPHYDYFDPTEPGTPTILRRGGQRVAT 229
Query: 198 VLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS 257
+++YL+D E+GG T FP+ L + P+ G+A+ F +P S
Sbjct: 230 IVIYLNDPEKGGGTTFPDVH-------------------LEVAPRRGNAVFFSYERPHPS 270
Query: 258 LDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV+ G+KW +TKW+R ++
Sbjct: 271 T--RTLHGGAPVVAGDKWIATKWLRERRFE 298
>gi|363814557|ref|NP_001242754.1| uncharacterized protein LOC100794585 [Glycine max]
gi|255628535|gb|ACU14612.1| unknown [Glycine max]
Length = 238
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 84/152 (55%), Positives = 104/152 (68%), Gaps = 3/152 (1%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EV++W PR + HNFLS EEC+YL LA P + STVVD+ TGK S VRTSSG FL
Sbjct: 82 EVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTKTGKGIKSDVRTSSGMFLNS 141
Query: 138 GRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
K +++ IEKRI+ ++ P+ENGE +QVL YE Q Y+PH DYF D FN K GGQR+
Sbjct: 142 KERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPHHDYFSDTFNLKRGGQRI 201
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWN 227
AT+LMYLSD E GET FP A G+++A N
Sbjct: 202 ATMLMYLSDNIERGETYFPLA-GSVNAAVVGN 232
>gi|357417854|ref|YP_004930874.1| procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
gi|355335432|gb|AER56833.1| Procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
Length = 283
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 84/216 (38%), Positives = 122/216 (56%), Gaps = 28/216 (12%)
Query: 74 EQWVEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
++ V+V++ PR V+ N L+ EEC+ LI LA +++S V D DTG+ + + RTS
Sbjct: 84 DRQVQVLASLLHPRVIVFGNLLAAEECDALIALARRQIKRSPVFDPDTGQDQQHQARTSE 143
Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-----MDEF 186
G F RG + + +E RIA +PLENGEGLQVL Y G +YEPH+DYF E
Sbjct: 144 GMFFGRGANPLCARVEARIAALLNWPLENGEGLQVLRYGPGAQYEPHYDYFDPARPGAEV 203
Query: 187 NTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDA 246
+ GGQR+A++++YL+ +GG T FP+A L + P G+A
Sbjct: 204 ALRRGGQRVASLVIYLNTPTQGGATTFPDAH-------------------LEVAPIKGNA 244
Query: 247 LLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ F +P +LHGG PV++G KW +TKW+R
Sbjct: 245 VYFSYDRPHPMT--GTLHGGAPVVEGEKWVATKWLR 278
>gi|121595595|ref|YP_987491.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
gi|120607675|gb|ABM43415.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
Length = 289
Score = 161 bits (407), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 84/209 (40%), Positives = 124/209 (59%), Gaps = 26/209 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR ++ N LS EEC+ +I+ A P M +S V + TG + + RTS G F RG ++
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMARSLTVQTTTGGEEVNADRTSDGMFFQRGETPVV 161
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMATV 198
+ +E+RIA +P++NGEGLQVLHY G +Y+PH+DYF D+ T + GGQR+AT+
Sbjct: 162 QRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQPGTSTIVRRGGQRVATL 221
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL++ +GG T FP+ VP L + P+ G+A+ F +P S
Sbjct: 222 VIYLNNPLKGGGTTFPD-------VP------------LEVAPRQGNAVFFSYERPHPST 262
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG VI+G KW +TKW+R E+K
Sbjct: 263 --RTLHGGASVIEGEKWIATKWLREREFK 289
>gi|412994121|emb|CCO14632.1| predicted protein [Bathycoccus prasinos]
Length = 341
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 87/225 (38%), Positives = 132/225 (58%), Gaps = 6/225 (2%)
Query: 68 GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDS-- 125
GDE E +++S PR+ +Y NF S +C+ ++ A + KS + G++ ++
Sbjct: 119 GDEYLTELKFQLLSTAPRSVMYRNFASDADCDAIVEAARSRLHKSGLA-LKRGETLETTK 177
Query: 126 RVRTSSGTFLARGRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM 183
+RTSSGTFL ++ ++ +E+++A T P +GE +L YE GQKY+ H+D F
Sbjct: 178 NIRTSSGTFLTSKMEQSGALKRVEEKMARATHIPATHGEAYNILRYEIGQKYDSHYDMFD 237
Query: 184 DEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKM 243
QR+A+ L+YL+ +EGGETVFP N + + C + GL +KP+
Sbjct: 238 PSQYGPQRSQRVASFLLYLTTPDEGGETVFPLEGQNGLYRLRGIDYTSC-EAGLKVKPRK 296
Query: 244 GDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
GDALLFWS+ P+ + D SSLHGGCPVI G K+ +TKWI N + +
Sbjct: 297 GDALLFWSVHPNNTFDRSSLHGGCPVISGTKFVATKWIHDNRWTL 341
>gi|319786559|ref|YP_004146034.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
gi|317465071|gb|ADV26803.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
Length = 289
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 86/204 (42%), Positives = 113/204 (55%), Gaps = 26/204 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR V LS EEC+ L+ L+ P +R+ST VD+ TG S+ RTS GTF RG +
Sbjct: 102 PRVVVLGGLLSDEECDALVELSRPRLRRSTTVDAQTGGSQVHADRTSRGTFFERGAHPVC 161
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-----MDEFNTKNGGQRMATV 198
IE RIA +P+ENGEGLQVLHY G ++ PH+DYF E + GGQR+ATV
Sbjct: 162 ATIEARIARLLEWPVENGEGLQVLHYPPGAEFRPHYDYFDPDEPGAEVLLRQGGQRVATV 221
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL+ GG T FP+A ++AV G+A+ F +P
Sbjct: 222 VMYLNTPARGGATTFPDAHLEVAAV-------------------KGNAVFFSYDRPHPMT 262
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
+LHGG PV +G KW +TKW+R
Sbjct: 263 --RTLHGGAPVTEGEKWIATKWLR 284
>gi|389770666|ref|ZP_10192118.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
gi|388429637|gb|EIL86932.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
Length = 286
Score = 160 bits (404), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 82/209 (39%), Positives = 116/209 (55%), Gaps = 24/209 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+P V LS EEC+ LI A +++ST+VD TGK + R+S GTF D
Sbjct: 94 QPVLAVLDGVLSHEECDELIRRAAAKLQRSTIVDPTTGKHETIADRSSEGTFFEINADDF 153
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DEFNTKNGGQRMAT 197
I +++RI+ P+++GEGLQ+LHY G +Y+PHFD+F GGQR++T
Sbjct: 154 IARLDRRISALMNLPVDHGEGLQILHYGPGGEYKPHFDFFPPGDPGSAVQMATGGQRVST 213
Query: 198 VLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS 257
++MYL++VE+GG T+FP + GLS+ PK G A+ F
Sbjct: 214 LVMYLNEVEDGGATIFP-------------------ELGLSVLPKKGSAVYFEYTNSRGQ 254
Query: 258 LDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
LDP +LHGG PV++G KW TKW+R Y
Sbjct: 255 LDPRTLHGGAPVLRGEKWIVTKWMRQRRY 283
>gi|160900716|ref|YP_001566298.1| procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
gi|160366300|gb|ABX37913.1| Procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
Length = 294
Score = 160 bits (404), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 90/223 (40%), Positives = 125/223 (56%), Gaps = 26/223 (11%)
Query: 69 DEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVR 128
D G + V V PR V+ N LS EEC+ +I A P M +S V + +G + + R
Sbjct: 92 DVGDRQVQVLVSMRNPRIVVFGNLLSHEECDAIIAAARPRMARSLTVATQSGGEEINDDR 151
Query: 129 TSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD-EFN 187
TS+G F RG I+ +E+RIA +PL++GEGLQVLHY G +Y+PH DYF E
Sbjct: 152 TSNGMFFQRGETGIVSQLEERIARLLRWPLDHGEGLQVLHYGPGAEYKPHHDYFAPGEPG 211
Query: 188 T----KNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKM 243
T K GGQR+ T+++YL++ E GG T+FP VP L + P+
Sbjct: 212 TPTILKRGGQRVGTLVIYLNEPERGGATIFPE-------VP------------LQVVPRR 252
Query: 244 GDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
G+A+ F +PD S +LHGG PV+ G KW +TKW+R E+
Sbjct: 253 GNAVFFSYERPDPST--RTLHGGAPVLAGEKWIATKWLREREF 293
>gi|333912984|ref|YP_004486716.1| procollagen-proline dioxygenase [Delftia sp. Cs1-4]
gi|333743184|gb|AEF88361.1| Procollagen-proline dioxygenase [Delftia sp. Cs1-4]
Length = 294
Score = 160 bits (404), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 90/223 (40%), Positives = 125/223 (56%), Gaps = 26/223 (11%)
Query: 69 DEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVR 128
D G + V V PR V+ N LS EEC+ +I A P M +S V + +G + + R
Sbjct: 92 DVGDRQVQVLVSMRNPRIVVFGNLLSHEECDAIIAAARPRMARSLTVATQSGGEEINDDR 151
Query: 129 TSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD-EFN 187
TS+G F RG I+ +E+RIA +PL++GEGLQVLHY G +Y+PH DYF E
Sbjct: 152 TSNGMFFQRGETGIVSQLEERIARLLRWPLDHGEGLQVLHYGPGAEYKPHHDYFAPGEPG 211
Query: 188 T----KNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKM 243
T K GGQR+ T+++YL++ E GG T+FP VP L + P+
Sbjct: 212 TPTILKRGGQRVGTLVIYLNEPERGGATIFPE-------VP------------LQVVPRR 252
Query: 244 GDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
G+A+ F +PD S +LHGG PV+ G KW +TKW+R E+
Sbjct: 253 GNAVFFSYERPDPST--RTLHGGAPVLAGEKWIATKWLREREF 293
>gi|332526359|ref|ZP_08402485.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
gi|332110495|gb|EGJ10818.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
Length = 224
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 84/204 (41%), Positives = 116/204 (56%), Gaps = 26/204 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR V+ LS++EC+ L+ LA P + +S VD+ TG S+ + RTS G F RG +I
Sbjct: 37 PRVVVFGGLLSEQECDELVALAQPRLLRSETVDNSTGGSEVNAARTSDGMFFERGETPLI 96
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEF-NTKN----GGQRMATV 198
IE+RIA+ +P+E GEGLQVLHY G +Y+PH D+F T N GGQR+ TV
Sbjct: 97 ERIERRIAELVHWPVERGEGLQVLHYRPGAQYKPHHDFFDPAHPGTANILRRGGQRVGTV 156
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+ GG T FP + GL ++P G+A+ F +P AS
Sbjct: 157 VIYLNTPAGGGATTFP-------------------EVGLEVQPIKGNAVFFSYERPLAST 197
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
+LHGG PV+ G KW +TKW+R
Sbjct: 198 --RTLHGGAPVLDGEKWVATKWLR 219
>gi|91789558|ref|YP_550510.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
JS666]
gi|91698783|gb|ABE45612.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
JS666]
Length = 277
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 93/234 (39%), Positives = 131/234 (55%), Gaps = 37/234 (15%)
Query: 62 SMERSEGDEGRAEQWVEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDT 119
SM GD +WV+V+ P V+ N LS ECE L+ +A P + +S V+ T
Sbjct: 71 SMSMDAGD-----RWVDVLQRLQLPDLVVFGNLLSDSECEALMEVAQPRLARSLTVNIKT 125
Query: 120 GKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHF 179
G + +R RTS G F ARG + +++ +E RIA +P++ GEGLQVL Y G +Y+PH+
Sbjct: 126 GGEERNRDRTSQGMFFARGENPLVQRVEARIARLVGWPVDRGEGLQVLRYRQGAQYKPHY 185
Query: 180 DYF-MDEFNT----KNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGK 234
DYF E T + GGQR+AT++MYL++ E+GG TVFP+
Sbjct: 186 DYFDPAEPGTPAILQRGGQRVATLIMYLNEPEQGGATVFPD------------------- 226
Query: 235 TGLSIKPKMGDALLFWSMKPDASLDPSSL--HGGCPVIKGNKWSSTKWIRVNEY 286
GL + P+ G A+ F P A +P+SL HGG PV G KW +TKW+R E+
Sbjct: 227 IGLQVTPRRGTAVFF--SYPAA--NPASLTRHGGEPVKAGEKWIATKWLREREF 276
>gi|385205097|ref|ZP_10031967.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
gi|385184988|gb|EIF34262.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
Length = 292
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 82/208 (39%), Positives = 117/208 (56%), Gaps = 24/208 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P+ V+ + LS +EC +I + +++ST V+ TGK R RTS G + RG D I
Sbjct: 103 PQMIVFADVLSPDECAEMIERSRHRLKRSTTVNPATGKEDVIRNRTSEGIWYQRGEDPFI 162
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE-----FNTKNGGQRMATV 198
+++RI+ +P+ENGEGLQ+L Y +Y PHFDYF + +T GGQR+AT+
Sbjct: 163 ERMDRRISSLMNWPVENGEGLQLLRYGTTGEYRPHFDYFPPDQPGSTVHTAQGGQRVATL 222
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+DV +GGET+FP A G+S+ G A+ F M L
Sbjct: 223 VIYLNDVPDGGETIFPEA-------------------GMSVAASQGGAVYFRYMNGRRQL 263
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
DP +LHGG PV+ G+KW TKW+R Y
Sbjct: 264 DPLTLHGGAPVLSGDKWIMTKWMRERAY 291
>gi|241664232|ref|YP_002982592.1| procollagen-proline dioxygenase [Ralstonia pickettii 12D]
gi|309783051|ref|ZP_07677770.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
gi|404397139|ref|ZP_10988932.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
gi|240866259|gb|ACS63920.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12D]
gi|308918159|gb|EFP63837.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
gi|348610674|gb|EGY60360.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
Length = 288
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 80/209 (38%), Positives = 117/209 (55%), Gaps = 24/209 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR ++ +FLS +EC+ LI + +++S VV+ DTG+ RTS G G +I
Sbjct: 96 PRIVLFQHFLSDQECDELIAIGRNRLKRSPVVNPDTGEENLISARTSQGGMFQVGEHPLI 155
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DEFNTKNGGQRMATV 198
IE RIA P+E+GEG QVL+Y+ G +Y+PHFD+F + + GGQR+AT+
Sbjct: 156 AKIEARIAQAVGVPVEHGEGFQVLNYQPGGEYQPHFDFFNPGRSGEARQLEVGGQRVATM 215
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+ V+ GG T FP K GL + P G+A+ F +PD +L
Sbjct: 216 VIYLNSVQAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGTL 256
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D +LH G PV +G KW +TKW+R Y+
Sbjct: 257 DEDTLHAGLPVERGEKWIATKWLRERPYR 285
>gi|388519941|gb|AFK48032.1| unknown [Lotus japonicus]
Length = 151
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 79/148 (53%), Positives = 99/148 (66%), Gaps = 10/148 (6%)
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
++ IEKRI+ ++ P+ENGE +QVL YE Q Y+PH DYF D FN K GGQR+AT+LMY
Sbjct: 12 MVHAIEKRISVYSQVPIENGELMQVLRYEKNQYYKPHHDYFADTFNLKRGGQRIATMLMY 71
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKT--GLSIKPKMGDALLFWSMKPDASLD 259
LSD EGGET FPN + S GKT GLS+KP G+A+LFWSM D D
Sbjct: 72 LSDNVEGGETYFPNIGS--------GQCSCGGKTVEGLSVKPTKGNAVLFWSMGLDGQSD 123
Query: 260 PSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
P S+HGGC V+ G KWS+TKW+R ++
Sbjct: 124 PLSVHGGCEVLAGEKWSATKWMRQKAHQ 151
>gi|383757171|ref|YP_005436156.1| putative prolyl 4-hydroxylase alpha subunit [Rubrivivax gelatinosus
IL144]
gi|381377840|dbj|BAL94657.1| putative prolyl 4-hydroxylase alpha subunit homologue
oxidoreductase protein [Rubrivivax gelatinosus IL144]
Length = 279
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 85/204 (41%), Positives = 114/204 (55%), Gaps = 26/204 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR V+ LS EEC+ L+ LA P + +S VD+ TG S+ + RTS G F RG +I
Sbjct: 92 PRVVVFGGLLSDEECDELVALARPRLARSETVDNSTGGSEVNAARTSDGMFFERGEKPLI 151
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEF-NTKN----GGQRMATV 198
IE+RIA+ +P+E GEGLQVL Y G +Y+PH D+F T N GGQR+ TV
Sbjct: 152 ERIERRIAELVRWPVERGEGLQVLRYRPGAQYKPHHDFFDPAHPGTANILRRGGQRVGTV 211
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL+ GG T FP + GL ++P G+A+ F +P AS
Sbjct: 212 VMYLNTPAGGGATTFP-------------------EVGLEVQPVKGNAVFFSYERPLAST 252
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
+LHGG PV+ G KW +TKW+R
Sbjct: 253 --RTLHGGAPVLDGEKWVATKWMR 274
>gi|187930127|ref|YP_001900614.1| procollagen-proline dioxygenase [Ralstonia pickettii 12J]
gi|187727017|gb|ACD28182.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12J]
Length = 288
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 80/209 (38%), Positives = 116/209 (55%), Gaps = 24/209 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR ++ +FLS EC+ LI + +++S VV+ DTG+ RTS G G +I
Sbjct: 96 PRIVLFQHFLSDAECDELIAIGRNRLKRSPVVNPDTGEENLISARTSQGGMFQVGEHPLI 155
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DEFNTKNGGQRMATV 198
IE RIA P+E+GEG QVL+Y+ G +Y+PHFD+F + + GGQR+AT+
Sbjct: 156 AKIEVRIAQAVGVPVEHGEGFQVLNYQPGGEYQPHFDFFNPGRSGEARQLEVGGQRVATM 215
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+ V+ GG T FP K GL + P G+A+ F +PD +L
Sbjct: 216 VIYLNSVQAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGTL 256
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D +LH G PV +G KW +TKW+R Y+
Sbjct: 257 DEDTLHAGLPVERGEKWIATKWLRERPYR 285
>gi|148653656|ref|YP_001280749.1| procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
gi|148572740|gb|ABQ94799.1| Procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
Length = 268
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 86/212 (40%), Positives = 117/212 (55%), Gaps = 24/212 (11%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
+ ++P V ++FLS EEC+ LI+ A ++ S VVD + G + RTS+ T RG
Sbjct: 75 VCYKPFVTVINDFLSPEECDALISDADQKLKASRVVDPEDGSFVEHSARTSTSTGYHRGE 134
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-----MDEFNTKNGGQR 194
II+ IE RIAD +P+++GEGLQVL YE G +Y PHFD+F TK GGQR
Sbjct: 135 IDIIKTIEARIADLINWPVDHGEGLQVLRYEDGGEYRPHFDFFDPAKKSSRLVTKQGGQR 194
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+ T LMYLS+V+ GG T FPN I+P G AL F +
Sbjct: 195 VGTFLMYLSEVDSGGSTRFPN-------------------LNFEIRPNKGSALYFANTNL 235
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
A ++P +LH G PV +G K+ +TKW+R Y
Sbjct: 236 KAEIEPLTLHAGMPVTEGVKYLATKWLREKPY 267
>gi|421749438|ref|ZP_16186877.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
gi|409771699|gb|EKN53918.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
Length = 319
Score = 156 bits (395), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 81/209 (38%), Positives = 114/209 (54%), Gaps = 24/209 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR ++ L +ECE LI L+ + +S VV+ DTG RTS G G +I
Sbjct: 127 PRIALFQRLLMPDECEALIALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQVGEHPLI 186
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE-----FNTKNGGQRMATV 198
+E RIA T P+E+GEGLQ+L+Y+ G +Y+PH+D+F + + GGQRMAT+
Sbjct: 187 ERLEARIAAVTGVPVEHGEGLQILNYKPGAEYQPHYDFFNPQRPGEARQLRVGGQRMATL 246
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+DV GG T FP K GL + P G+A+ F + D SL
Sbjct: 247 VIYLNDVPAGGATAFP-------------------KLGLRVNPVQGNAVFFAYLGEDGSL 287
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D +LH G PV +G KW +TKW+R Y+
Sbjct: 288 DERTLHAGLPVEQGEKWIATKWLREAPYR 316
>gi|319792090|ref|YP_004153730.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
gi|315594553|gb|ADU35619.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
Length = 280
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 85/209 (40%), Positives = 116/209 (55%), Gaps = 26/209 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR V+ N LS EECE LI A + +S V++ TG + RTS G F RG ++I+
Sbjct: 93 PRVIVFGNLLSTEECEGLIAAARVRLARSLTVETRTGGEVLNVDRTSDGMFFERGENEIV 152
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMATV 198
+E+R+A +PLE GEGLQ+L Y G +Y PH+DYF +E T K GGQR+AT+
Sbjct: 153 ARLEQRLAMLLRWPLEYGEGLQILRYAPGAQYRPHYDYFDPNEPGTPTILKRGGQRVATL 212
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL + E+GG T FP+ GL + P G + F +PD
Sbjct: 213 VMYLQEPEQGGATTFPD-------------------VGLEVAPVRGTGVFFSYDRPDPVT 253
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV+ G KW +TKW+R E+K
Sbjct: 254 --RTLHGGAPVLAGEKWVATKWLREREFK 280
>gi|413963357|ref|ZP_11402584.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
gi|413929189|gb|EKS68477.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
Length = 286
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 80/213 (37%), Positives = 118/213 (55%), Gaps = 24/213 (11%)
Query: 81 SWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
S +P + + L EC+ LI + H+++S+VVD D+GK R S G F+ D
Sbjct: 91 SEQPVIALVADVLDDTECDRLIEIGREHVQRSSVVDPDSGKEITIEERRSEGAFVNASTD 150
Query: 141 KIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE-----FNTKNGGQRM 195
++ I++RIA+ P+ENGE L +L Y G +Y PH+DYF +E + + GGQR+
Sbjct: 151 ALVETIDRRIAELFRQPVENGEDLHILRYGMGGEYRPHYDYFPEEQAGSKHHMQRGGQRI 210
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
ATV++YL++VE+GG+T FP+ GL+I P+ G AL F +
Sbjct: 211 ATVILYLNEVEQGGDTTFPD-------------------IGLAIHPRRGSALYFEYVNEL 251
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
DP +LH G PV KG KW +TKWIR ++
Sbjct: 252 GQSDPKTLHAGTPVEKGEKWIATKWIRRGRFRA 284
>gi|319763870|ref|YP_004127807.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
gi|330823866|ref|YP_004387169.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
gi|317118431|gb|ADV00920.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
gi|329309238|gb|AEB83653.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
Length = 284
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 85/209 (40%), Positives = 121/209 (57%), Gaps = 26/209 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR ++ N LS EEC+ +I A M +S V + +G + ++ RTS G F RG ++ +
Sbjct: 97 PRVVLFGNLLSPEECQAVIEAARTRMARSLTVQAASGGEEVNKDRTSDGMFFQRGENEAV 156
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMATV 198
+E+RIA +P+ENGEGLQVLHY G +Y+PH+DYF E T + GGQR+AT+
Sbjct: 157 ARLEERIARLVRWPVENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPRLLRRGGQRVATL 216
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+D GG T FP+ VP L I P+ G+A+ F + S
Sbjct: 217 VIYLNDPVRGGGTTFPD-------VP------------LEIGPRQGNAVFFSYGRAHPS- 256
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PVI+G KW +TKW+R E+K
Sbjct: 257 -SRTLHGGAPVIEGEKWIATKWLREREFK 284
>gi|319652187|ref|ZP_08006306.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
gi|317396176|gb|EFV76895.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
Length = 283
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 86/209 (41%), Positives = 118/209 (56%), Gaps = 22/209 (10%)
Query: 77 VEVISWEPRAFVYH--NFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTF 134
V+V+S + FV H LS EEC+ LI+L+ ++ S VVD +G+ + RTS
Sbjct: 87 VKVLSRNEKPFVLHLDQVLSSEECDELISLSRSRLQPSLVVDRGSGEERAGSGRTSKSMA 146
Query: 135 LARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEF-NTKNGGQ 193
++++ IE RIA+ T +P ENGEGLQ+L+Y G++Y+PHFD+F + GGQ
Sbjct: 147 FRLKENELVERIETRIAELTGYPAENGEGLQILNYGLGEEYKPHFDFFPPHMADASKGGQ 206
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R+ T L+YL+DVE+GGETVF K GLS PK G A+ F
Sbjct: 207 RVGTFLIYLNDVEDGGETVF-------------------SKAGLSFVPKKGAAIYFHYGN 247
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
LD S+H PV KG KW++TKWIR
Sbjct: 248 AQGQLDRLSVHSSVPVRKGEKWAATKWIR 276
>gi|124267278|ref|YP_001021282.1| hypothetical protein Mpe_A2091 [Methylibium petroleiphilum PM1]
gi|124260053|gb|ABM95047.1| conserved hypothetical protein [Methylibium petroleiphilum PM1]
Length = 289
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 83/209 (39%), Positives = 115/209 (55%), Gaps = 26/209 (12%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR V+ LS EC+ ++ LA + +S VD+ TG S+ + RTS G F RG +
Sbjct: 101 DPRVIVFSGLLSDAECDEIVALAGARLARSHTVDTATGASEVNAARTSDGMFFTRGEHPV 160
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMAT 197
E RIA +P+ENGEGLQVLHY G +Y+PH+DYF D+ T + GGQR+AT
Sbjct: 161 CARFEARIAALLNWPVENGEGLQVLHYRPGAEYKPHYDYFDPDQPGTPAVLRRGGQRVAT 220
Query: 198 VLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS 257
++ YL+ GG T FP+ GL + P G A+ F +P S
Sbjct: 221 LVTYLNTPTRGGGTTFPD-------------------IGLEVTPLKGHAVFFSYDRPHPS 261
Query: 258 LDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
SLHGG PV++G+KW +TKW+RV +
Sbjct: 262 T--RSLHGGAPVLEGDKWVATKWLRVGRF 288
>gi|354334983|gb|AER23925.1| procollagen-proline dioxygenase [Variovorax sp. HH01]
Length = 280
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 86/209 (41%), Positives = 114/209 (54%), Gaps = 26/209 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR V+ N LS EECE LI A + +S V++ TG + RTS G F RG ++I+
Sbjct: 93 PRVVVFGNLLSAEECEGLIAAARVRLARSLTVETRTGGEVLNVDRTSDGMFFERGENEIV 152
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMATV 198
+E+RIA +PLE GEGLQ+L Y G +Y PH+DYF E T K GGQR+AT+
Sbjct: 153 ARVEQRIAALLRWPLEFGEGLQILRYAPGAQYRPHYDYFDPSEPGTPTILKRGGQRVATL 212
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL + E GG T FP+ GL + P G + F +PD
Sbjct: 213 VMYLQEPEGGGATTFPD-------------------VGLEVAPARGCGVFFSYDRPDPVT 253
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV+ G KW +TKW+R E+K
Sbjct: 254 --RTLHGGAPVLAGEKWVATKWLREREFK 280
>gi|302850293|ref|XP_002956674.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
nagariensis]
gi|300258035|gb|EFJ42276.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
nagariensis]
Length = 325
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 88/212 (41%), Positives = 123/212 (58%), Gaps = 14/212 (6%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
++ ISW+PRA VYHNFLS +E ++I+LA M++STVV + D +RTS GTFL
Sbjct: 41 IQTISWKPRAVVYHNFLSDQEARHIIDLAHEQMKRSTVVGNKNEGVVDD-IRTSYGTFLR 99
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
R +D +I IE+R+A ++ P + E +QVL Y KY PH D G +R+A
Sbjct: 100 RAQDPVIMAIEERLALWSHMPPSHQEDMQVLRYGRTNKYGPHID----------GLERVA 149
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD- 255
TVLMYL E G + P + N S C K ++ KPK GDAL+F+ +KPD
Sbjct: 150 TVLMYLVG-ESPGPDLAPVSACECMYAEQSNP-SACAKGHVAYKPKRGDALMFFDVKPDY 207
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+ D S+H GCPV+ G KW++ KWI ++
Sbjct: 208 TTTDGHSMHTGCPVVAGVKWNAVKWIHGTPFR 239
>gi|398804098|ref|ZP_10563100.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
gi|398094921|gb|EJL85274.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
Length = 277
Score = 155 bits (391), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 90/221 (40%), Positives = 125/221 (56%), Gaps = 30/221 (13%)
Query: 74 EQWVEVISWE--PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
++WV V P +V+ N LS ECE LI A + +S VD TG + + RTS
Sbjct: 78 DKWVTVREHRSAPELWVFDNLLSAAECEALIAAAESRLARSLTVDIRTGGEELNHDRTSH 137
Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT-- 188
G F RG +++IR IE RIA +P++NGEGLQVL Y G +Y+PH+DYF E T
Sbjct: 138 GMFYTRGENEVIRRIEARIARLLNWPVQNGEGLQVLRYRRGAEYKPHYDYFDPGEPGTAA 197
Query: 189 --KNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDA 246
+ GGQR+A+++MYL + EGG TVFP+ GL ++P+ G A
Sbjct: 198 ILRRGGQRVASLIMYLREPGEGGATVFPD-------------------IGLKVRPQQGSA 238
Query: 247 LLF-WSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
+ F +++ ASL +LHGG PV G KW +TKW+R E+
Sbjct: 239 VFFSYALAHPASL---TLHGGEPVKSGEKWIATKWLREREF 276
>gi|377811809|ref|YP_005044249.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
gi|357941170|gb|AET94726.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
Length = 283
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 80/211 (37%), Positives = 116/211 (54%), Gaps = 24/211 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
EP + + LS EC+ LI + +R+S+VVD D+G R S G F+ D +
Sbjct: 90 EPVVALLADVLSPRECDRLIEIGRERVRRSSVVDPDSGGEVLIDARKSEGAFVNGSTDPL 149
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE-----FNTKNGGQRMAT 197
+ I++RIA+ P+ENGE L +L Y AG +Y PHFDYF +E + + GGQR+AT
Sbjct: 150 VATIDRRIAELVQQPVENGEDLHILRYGAGGEYRPHFDYFPEEQAGSKHHMQRGGQRIAT 209
Query: 198 VLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS 257
+++YL+ VEEGG+T FP+ GL+I P+ G AL F +
Sbjct: 210 LILYLNQVEEGGDTTFPD-------------------IGLTIHPRRGAALYFEYVNALGQ 250
Query: 258 LDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
DP +LH G PV +G KW +TKW+R ++
Sbjct: 251 TDPRTLHAGMPVERGEKWIATKWMRRGRFRA 281
>gi|423389445|ref|ZP_17366671.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
gi|401641536|gb|EJS59253.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
Length = 216
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 89/214 (41%), Positives = 123/214 (57%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EECE LI L+ M++S V G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECEELIELSKNKMKRSKV-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ T P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FLEE--NELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|319943342|ref|ZP_08017624.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
gi|319743157|gb|EFV95562.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
Length = 311
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 83/209 (39%), Positives = 113/209 (54%), Gaps = 24/209 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P V LS EEC+ +I L+ M+ S VVD ++G S +S VR S G+ RG ++++
Sbjct: 121 PNIAVIRGLLSDEECDEVIRLSRGKMKTSQVVDRESGGSYESSVRKSEGSHFERGENELV 180
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE-----FNTKNGGQRMATV 198
R IE R++ P+ GE LQ+LHY G +Y+ H D+F + T+ GGQR+ TV
Sbjct: 181 RRIEARLSALVDLPVNRGEPLQILHYGPGGEYKAHQDFFEPKDPGSAVLTRVGGQRIGTV 240
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL+DV EGGET FP+ G S KP G A+ F D L
Sbjct: 241 VMYLNDVPEGGETAFPD-------------------IGFSAKPIKGSAVYFEYQNADGQL 281
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D LH G PVI+G+KW TKW+R Y+
Sbjct: 282 DYRCLHAGMPVIRGDKWIMTKWLRERPYE 310
>gi|30681957|ref|NP_850038.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
gi|330252315|gb|AEC07409.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
[Arabidopsis thaliana]
Length = 274
Score = 154 bits (389), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 92/267 (34%), Positives = 138/267 (51%), Gaps = 18/267 (6%)
Query: 23 LLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEV--I 80
L +F F L + GI S R ND E D G + + +
Sbjct: 22 FLAIFGFCFFNLFSQGI-SFSEIPTTRRSVND---------ETDSLDHGSSVSNIPFHGL 71
Query: 81 SWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
SW PR F NF +K++CE +I++A P ++ ST+ ++ ++ S
Sbjct: 72 SWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSTLALRKGETAETTQNYRSLHQHTDEDES 131
Query: 141 KIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLM 200
++ IE++IA T FP + E +L Y+ GQKY+ H+D F QR+ T L+
Sbjct: 132 GVLAAIEEKIALATRFPKDYYESFNILRYQLGQKYDSHYDAFHSAEYGPLISQRVVTFLL 191
Query: 201 YLSDVEEGGETVFPNAQG-NISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
+LS VEEGGET+FP G N++ + +C GL +KP+ GDA+ F+++ P+ ++D
Sbjct: 192 FLSSVEEGGETMFPFENGRNMNGR---YDYEKC--VGLKVKPRQGDAIFFYNLFPNGTID 246
Query: 260 PSSLHGGCPVIKGNKWSSTKWIRVNEY 286
+SLHG CPVIKG KW +TKWIR Y
Sbjct: 247 QTSLHGSCPVIKGEKWVATKWIRDQTY 273
>gi|377810637|ref|YP_005043077.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
YI23]
gi|357939998|gb|AET93554.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
YI23]
Length = 297
Score = 154 bits (389), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 87/221 (39%), Positives = 117/221 (52%), Gaps = 26/221 (11%)
Query: 73 AEQWVEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTS 130
AE+ VI+ P A + FL+ EC+ LI LA P + +STVVD TG+ + R+S
Sbjct: 89 AERKTRVIARLQRPAAVLLDEFLTGSECDQLIALARPRLSRSTVVDPVTGRDVAAGHRSS 148
Query: 131 SGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM--DEFNT 188
GTF ++ +E RIA T ENGEGLQ+L Y+ G + PH DY + +E N
Sbjct: 149 DGTFFRLAETPLVARLEMRIAALTGLAAENGEGLQLLRYQPGAESTPHVDYLVAGNETNR 208
Query: 189 KN---GGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGD 245
++ GQR+ T+LMYL+DVE GGETVFP + G S+ P+ G
Sbjct: 209 ESIARSGQRVGTLLMYLNDVEGGGETVFP-------------------QVGCSVVPRRGQ 249
Query: 246 ALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
AL F DP+SLH P+ G KW +TKWIR +
Sbjct: 250 ALYFEYCNRAGVCDPASLHASTPLRSGEKWVATKWIRARRF 290
>gi|423521903|ref|ZP_17498376.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
gi|401176565|gb|EJQ83760.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
Length = 216
Score = 154 bits (388), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 86/206 (41%), Positives = 119/206 (57%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI L+ +M++S V G S+D + +RTSSG FL ++
Sbjct: 38 EPLIVVLANVLSDEECDKLIELSKNNMKRSKV-----GSSRDVNDIRTSSGAFLEE--NE 90
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ T P+ +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 91 LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANNRISTLVMY 149
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 150 LNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 190
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGAYK 216
>gi|423598444|ref|ZP_17574444.1| hypothetical protein III_01246 [Bacillus cereus VD078]
gi|423660914|ref|ZP_17636083.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
gi|401236714|gb|EJR43171.1| hypothetical protein III_01246 [Bacillus cereus VD078]
gi|401300955|gb|EJS06544.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
Length = 216
Score = 154 bits (388), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 88/214 (41%), Positives = 123/214 (57%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ M++S V G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECDELIELSKSKMKRSKV-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ T P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FLEE--NELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|398808448|ref|ZP_10567311.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
gi|398087480|gb|EJL78066.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
Length = 280
Score = 154 bits (388), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 85/209 (40%), Positives = 114/209 (54%), Gaps = 26/209 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR V+ N LS EECE LI A + +S V++ TG + RTS G F RG ++I+
Sbjct: 93 PRVVVFGNLLSAEECEGLIAAARVRLARSLTVETRTGGEVLNVDRTSDGMFFERGENEIV 152
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMATV 198
+E+R+A +PLE GEGLQ+L Y G +Y PH+DYF E T K GGQR+AT+
Sbjct: 153 ARLEQRLATLLRWPLEYGEGLQILRYAPGAQYRPHYDYFDPGEPGTPTILKRGGQRVATL 212
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL + E GG T FP+ GL + P G + F +PD
Sbjct: 213 VMYLQEPEGGGATTFPD-------------------VGLEVAPVRGCGVFFSYDRPDPVT 253
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV+ G KW +TKW+R E+K
Sbjct: 254 --RTLHGGAPVLAGEKWVATKWLREREFK 280
>gi|229019457|ref|ZP_04176278.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
gi|229025700|ref|ZP_04182104.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
gi|423417837|ref|ZP_17394926.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
gi|228735575|gb|EEL86166.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
gi|228741812|gb|EEL91991.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
gi|401107008|gb|EJQ14965.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
Length = 216
Score = 153 bits (387), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 88/214 (41%), Positives = 123/214 (57%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ M++S V G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECDELIELSKNKMKRSKV-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ T P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FLEE--NELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|423489423|ref|ZP_17466105.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
gi|402431659|gb|EJV63723.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
Length = 216
Score = 153 bits (386), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 88/214 (41%), Positives = 122/214 (57%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ M +S V G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECDELIELSKSKMERSKV-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ T P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FLEE--NELTSKIEKRISSITNVPVSHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|163941996|ref|YP_001646880.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
gi|229013455|ref|ZP_04170592.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
gi|423495146|ref|ZP_17471790.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
gi|423498060|ref|ZP_17474677.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
gi|163864193|gb|ABY45252.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
gi|228747867|gb|EEL97733.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
gi|401151239|gb|EJQ58691.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
gi|401161347|gb|EJQ68714.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
Length = 216
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 88/214 (41%), Positives = 122/214 (57%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ M +S V G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECDELIELSKSKMERSKV-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ T P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FLEE--NELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|229086310|ref|ZP_04218488.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
gi|228697005|gb|EEL49812.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
Length = 220
Score = 153 bits (386), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 87/209 (41%), Positives = 121/209 (57%), Gaps = 30/209 (14%)
Query: 77 VEVISW--EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EECE LI L+ M++S + G S++ +RTSSGT
Sbjct: 34 IQIISRVEEPLIVVLENVLSDEECESLIELSKDSMKRSKI-----GASREVDNIRTSSGT 88
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL ++ + IEKR++ P+E+GEGL +L Y GQ+Y+ H+DYF E +
Sbjct: 89 FLEE--NETVAIIEKRVSSIMNIPVEHGEGLHILKYTPGQEYKAHYDYFA-EHSRAAENN 145
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LSI PK G A+ F
Sbjct: 146 RISTLVMYLNDVEEGGETFFP-------------------KLNLSIAPKKGSAVYFEYFY 186
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D SL+ +LHGG PVIKG KW +T+W++
Sbjct: 187 NDKSLNELTLHGGAPVIKGEKWVATQWMK 215
>gi|388515007|gb|AFK45565.1| unknown [Lotus japonicus]
Length = 154
Score = 152 bits (384), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 108/152 (71%), Positives = 128/152 (84%), Gaps = 3/152 (1%)
Query: 1 MAKPRYSRFPTRKSSSS-TLILTLLIMFTFAILILLAFGILSMPSSSGDSR--KANDLSS 57
MAKPRYSR P+RKSSSS TLI L + FTF +LIL A GILS+PSSS + K NDL+S
Sbjct: 1 MAKPRYSRLPSRKSSSSSTLIFALFLAFTFLLLILFALGILSIPSSSSRDKFPKPNDLTS 60
Query: 58 IVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
I +++R++ D+GR EQWVEVISWEPRAFVYHNFL+KEECEYLI++A P+M KSTVVDS
Sbjct: 61 IAHNTLDRTDDDDGRGEQWVEVISWEPRAFVYHNFLTKEECEYLIDIAKPNMHKSTVVDS 120
Query: 118 DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKR 149
+TGKSKDSRVRTSSGTFL RGR KI+R+IEK+
Sbjct: 121 ETGKSKDSRVRTSSGTFLPRGRGKIVRNIEKK 152
>gi|239814309|ref|YP_002943219.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
gi|239800886|gb|ACS17953.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
Length = 279
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 83/209 (39%), Positives = 114/209 (54%), Gaps = 26/209 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR V+ N +S EECE LI A + +S V++ TG + RTS G F RG + I+
Sbjct: 92 PRVVVFGNLVSPEECEGLIAAARVRLARSLTVETRTGGEVLNVDRTSEGMFFERGENDIV 151
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMATV 198
+E+RIA +P+E GEGLQ+L Y G +Y PH+DYF E T K GGQR+AT+
Sbjct: 152 ARLEQRIAALLRWPVEFGEGLQILRYAPGAQYRPHYDYFDPGEPGTPTILKRGGQRVATL 211
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL + +GG T FP+ GL + P G + F +PD +
Sbjct: 212 VMYLQEPGQGGATTFPD-------------------VGLEVAPVRGTGVFFSYEEPDPAT 252
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV+ G KW +TKW+R E+K
Sbjct: 253 --RTLHGGAPVLAGEKWVATKWLREREFK 279
>gi|389795384|ref|ZP_10198508.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
gi|388430823|gb|EIL87950.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
Length = 293
Score = 152 bits (383), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 84/208 (40%), Positives = 110/208 (52%), Gaps = 24/208 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P V L EEC+ LI + +++ST VD G + R+S GTF D I
Sbjct: 97 PTIAVLDQVLDDEECDELIRRSADKLQRSTTVDPVNGGYEVIAARSSEGTFFPVNADDFI 156
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DEFNTKNGGQRMATV 198
+++RIA+ P+ENGEGLQVLHY G +Y+PHFDYF E GGQR++T+
Sbjct: 157 ARLDRRIAELMNCPVENGEGLQVLHYGEGGEYQPHFDYFSPGDPGSEAQMVVGGQRVSTL 216
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L+YL+DV +GG TVFP GL + P+ G A+ F D +
Sbjct: 217 LIYLNDVAQGGATVFPT-------------------LGLRVLPRKGMAVYFEYSNRDGQV 257
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
DP +LHGG PV KG KW TKW+R Y
Sbjct: 258 DPLTLHGGEPVEKGEKWIITKWMRQRSY 285
>gi|229135058|ref|ZP_04263863.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
gi|228648443|gb|EEL04473.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
Length = 216
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 87/214 (40%), Positives = 122/214 (57%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC LI L+ +M++S V G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECAELIELSKSNMKRSKV-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ T P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FLEE--NELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D L+ +LHGG PV KG KW +T+W+R YK
Sbjct: 183 QDQLLNELTLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|423512354|ref|ZP_17488885.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
gi|402449325|gb|EJV81162.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
Length = 216
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 87/214 (40%), Positives = 122/214 (57%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC LI L+ +M++S V G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECAELIELSKSNMKRSKV-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ T P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FLEE--NELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D L+ +LHGG PV KG KW +T+W+R YK
Sbjct: 183 QDQLLNELTLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|325925807|ref|ZP_08187179.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
91-118]
gi|325543793|gb|EGD15204.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
91-118]
Length = 286
Score = 151 bits (381), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 92/242 (38%), Positives = 127/242 (52%), Gaps = 31/242 (12%)
Query: 51 KANDLSSIVRKSMERSEGDE-----GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLA 105
+ANDL VR + + D G + V V PR V FLS EEC+ LI LA
Sbjct: 58 QANDLPMPVRVPALQQDADASLLALGDRDVRVLVSLLLPRVVVLGGFLSDEECDALIALA 117
Query: 106 TPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQ 165
PH+ +S VD+ G+ RTS L G+D + + IE RIA +P+++GEGLQ
Sbjct: 118 RPHLARSRTVDNANGEHVVHAARTSDSMCLRLGQDALCQRIEARIARLLDWPVDHGEGLQ 177
Query: 166 VLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMATVLMYLSDVEEGGETVFPNAQGNI 220
VL Y G +Y PH+DYF D T + GGQR+A+++MYL+ E GG T FP+A ++
Sbjct: 178 VLRYATGAEYRPHYDYFDPDAAGTPVLVQAGGQRVASLVMYLNTPERGGATRFPDAHLDV 237
Query: 221 SAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKW 280
+AV G+A+ F +P SLH G PV+ G+KW +TKW
Sbjct: 238 AAV-------------------KGNAVFFSYDRPHPMT--RSLHAGAPVLAGDKWVATKW 276
Query: 281 IR 282
+R
Sbjct: 277 LR 278
>gi|430808003|ref|ZP_19435118.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
gi|429499635|gb|EKZ98045.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
Length = 293
Score = 151 bits (381), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 78/209 (37%), Positives = 114/209 (54%), Gaps = 24/209 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR + N L EC+ ++ LA +++S VV+ DTG RTS G G ++
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPVVNPDTGDENLIDARTSMGAMFQVGEHALL 160
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DEFNTKNGGQRMATV 198
+ IE RIA T +P+E+GEG QVL+Y+ G +Y+PHFD+F + + GGQR+AT+
Sbjct: 161 QRIEARIAAVTGWPVEHGEGFQVLNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRVATM 220
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+ GG T FP + GL + P G+A+LF PD +L
Sbjct: 221 VIYLNSPASGGATAFP-------------------RIGLEVAPVKGNAVLFSYGLPDGAL 261
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D +LH G PV G KW +TKW+R + Y+
Sbjct: 262 DERTLHAGLPVEAGEKWIATKWLREHPYR 290
>gi|423368291|ref|ZP_17345723.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
gi|401081042|gb|EJP89322.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
Length = 216
Score = 151 bits (381), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 87/214 (40%), Positives = 122/214 (57%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC LI L+ +M++S V G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECAELIELSKNNMKRSKV-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ T P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FLEE--NELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D L+ +LHGG PV KG KW +T+W+R YK
Sbjct: 183 QDQLLNELTLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|94312029|ref|YP_585239.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
gi|93355881|gb|ABF09970.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
Length = 293
Score = 150 bits (380), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 78/209 (37%), Positives = 114/209 (54%), Gaps = 24/209 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR + N L EC+ ++ LA +++S VV+ DTG RTS G G ++
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPVVNPDTGDENLIDARTSMGAMFQVGEHALL 160
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DEFNTKNGGQRMATV 198
+ IE RIA T +P+E+GEG QVL+Y+ G +Y+PHFD+F + + GGQR+AT+
Sbjct: 161 QRIEARIAAVTGWPVEHGEGFQVLNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRVATM 220
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+ GG T FP + GL + P G+A+LF PD +L
Sbjct: 221 VIYLNSPASGGATAFP-------------------RIGLEVAPVKGNAVLFSYGLPDGAL 261
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D +LH G PV G KW +TKW+R + Y+
Sbjct: 262 DERTLHAGLPVEAGEKWIATKWLREHPYR 290
>gi|339327280|ref|YP_004686973.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
gi|338167437|gb|AEI78492.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
Length = 297
Score = 150 bits (379), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 77/209 (36%), Positives = 112/209 (53%), Gaps = 24/209 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P+ ++ L+ +EC+ L+ L+ + +S VV+ DTG RTS G +I
Sbjct: 105 PQVQLFQQLLTDDECDALVALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQVAEHALI 164
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKN-----GGQRMATV 198
IE RIA T P E+GEGLQ+L+Y+ G +Y+PHFDYF + + GGQR+AT+
Sbjct: 165 ARIEARIAAVTGVPAEHGEGLQILNYKPGGEYQPHFDYFNPQRPGEARQLSVGGQRIATL 224
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+ E GG T FP + GL + P G+A+ F + PD +L
Sbjct: 225 VIYLNTPEAGGATAFP-------------------RVGLEVAPVKGNAVYFSYLLPDGTL 265
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D +LH G PV G KW +TKW+R Y+
Sbjct: 266 DERTLHAGLPVASGEKWIATKWLRERPYR 294
>gi|229061929|ref|ZP_04199257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
gi|228717372|gb|EEL69042.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
Length = 216
Score = 150 bits (379), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 87/214 (40%), Positives = 122/214 (57%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC LI L+ +M++S V G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECAELIELSKSNMKRSKV-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ T P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FLEE--NELTSKIEKRISSITNVPVVHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D L+ +LHGG PV KG KW +T+W+R YK
Sbjct: 183 QDQLLNELTLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|423669823|ref|ZP_17644852.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
gi|423673973|ref|ZP_17648912.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
gi|401298950|gb|EJS04550.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
gi|401309524|gb|EJS14857.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
Length = 216
Score = 150 bits (379), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 87/214 (40%), Positives = 121/214 (56%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ M +S V G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECDELIELSKSKMERSKV-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ T P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FLEE--NELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D L+ +LHGG PV KG KW +T+W+R YK
Sbjct: 183 QDQLLNELTLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|239816557|ref|YP_002945467.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
gi|239803134|gb|ACS20201.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
Length = 296
Score = 150 bits (379), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 77/204 (37%), Positives = 115/204 (56%), Gaps = 24/204 (11%)
Query: 88 VYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIE 147
V + S EECE LI LA P + ST VD TG+++ R+S G F + + ++
Sbjct: 103 VLSDVFSAEECEALIALARPRLAPSTSVDPLTGRNRLGAQRSSLGMFFRLRENAFVARLD 162
Query: 148 KRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DEFNTKNGGQRMATVLMYL 202
+R+++ P+ENGEGLQVLHY AG + PHFD+ + ++ + + GQR++T++ YL
Sbjct: 163 ERLSELMNLPVENGEGLQVLHYPAGAQSLPHFDFLVPSNAANQASLQRSGQRVSTLVAYL 222
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
++VEEGGETVFP +TG S+ P+ G A+ F +D +S
Sbjct: 223 NEVEEGGETVFP-------------------ETGWSVSPQRGGAVYFEYCNSLGQVDHAS 263
Query: 263 LHGGCPVIKGNKWSSTKWIRVNEY 286
LH G PV+ G KW +TKW+R +
Sbjct: 264 LHAGAPVLSGEKWVATKWMRQRRF 287
>gi|423478381|ref|ZP_17455096.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
gi|402428543|gb|EJV60640.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
Length = 216
Score = 150 bits (379), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 85/206 (41%), Positives = 117/206 (56%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI L+ M++S V G S+D + +RTSSG FL ++
Sbjct: 38 EPLIVVLGNVLSDEECDELIELSKSKMKRSKV-----GSSRDVNDIRTSSGAFL--DDNE 90
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 91 LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 150 LNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 190
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGAYK 216
>gi|423400914|ref|ZP_17378087.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
gi|401653904|gb|EJS71447.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
Length = 216
Score = 150 bits (379), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 85/206 (41%), Positives = 117/206 (56%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI L+ M++S V G S+D + +RTSSG FL ++
Sbjct: 38 EPLIVVLGNVLSDEECDELIELSKSKMKRSKV-----GSSRDVNDIRTSSGAFL--DDNE 90
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 91 LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 150 LNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 190
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGAYK 216
>gi|224001336|ref|XP_002290340.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220973762|gb|EED92092.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 483
Score = 150 bits (379), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 86/234 (36%), Positives = 126/234 (53%), Gaps = 23/234 (9%)
Query: 63 MERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKS 122
++ S + + + +E +S P FLS EEC+Y+ +A+P ++ S+V D K
Sbjct: 247 LDPSASSKSQKQVTIETLSLRPLVVSVEGFLSDEECDYIAEIASPQVKYSSVSLKDADKG 306
Query: 123 KDSRV-RTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDY 181
KDS RTS FL+ D+++ +I+ R+A T P + E +QVL Y AG+KY+ H DY
Sbjct: 307 KDSSEWRTSQSAFLSARDDEVLTEIDHRVASLTRIPRNHQEYVQVLRYGAGEKYDSHHDY 366
Query: 182 F-------------MDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNE 228
F + E KN R ATV YL+DV +GGET+FP G P
Sbjct: 367 FDPSAYRSDKSTLRLIENGKKN---RYATVFWYLTDVHDGGETIFPRYGG----APAPRS 419
Query: 229 LSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGN-KWSSTKWI 281
+C GL +KP+ G ++F+S+ +DP SLHG CPV + N KW++ KWI
Sbjct: 420 HKDC-SIGLKVKPQKGKVVIFYSLDASGEMDPFSLHGACPVGENNLKWAANKWI 472
>gi|149180354|ref|ZP_01858859.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
gi|148852546|gb|EDL66691.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
Length = 212
Score = 150 bits (378), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 78/205 (38%), Positives = 119/205 (58%), Gaps = 28/205 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
EP V N LS EEC+ LI L+ +++S + ++ ++ +RTSS TF+ G ++
Sbjct: 36 EPLIVVLGNVLSDEECDALIGLSKDKLKRSKIGNT----RNENDMRTSSSTFMEEGESEV 91
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
+ +EKRI+ P ENGEGLQ+L+Y+ GQ+Y+ HFD+F + N R++T++MYL
Sbjct: 92 VTRVEKRISQIMNIPYENGEGLQILNYKIGQEYKAHFDFFKNASNP-----RISTLVMYL 146
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
+DVEEGGET FP K S+ P+ G A+ F + L+ +
Sbjct: 147 NDVEEGGETYFP-------------------KLNFSVSPQKGMAVYFEYFYDNQELNDLT 187
Query: 263 LHGGCPVIKGNKWSSTKWIRVNEYK 287
LHGG PVI G+KW++T+W+R + K
Sbjct: 188 LHGGAPVIIGDKWAATQWMRRKQVK 212
>gi|393200372|ref|YP_006462214.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
gi|327439703|dbj|BAK16068.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
Length = 211
Score = 150 bits (378), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 80/205 (39%), Positives = 115/205 (56%), Gaps = 25/205 (12%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EV+ EP + N LS EEC+ LI+ A+ + +S + K + S +RTSSG F
Sbjct: 24 EVLHEEPLIVKFLNVLSDEECQNLIDCASSRLERSKL-----AKKEISSIRTSSGMFFEE 78
Query: 138 GRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMAT 197
+ +I +IEKRI+ P+E+ EGLQVLHYE GQ+++PHFD+F + + R+ T
Sbjct: 79 NENPLISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKPHFDFFGPN-HPSSSNNRICT 137
Query: 198 VLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS 257
+++YL+DVEEGG T FPN G+ PK G A+ F D
Sbjct: 138 LVVYLNDVEEGGVTTFPNL-------------------GIVNVPKKGTAVYFEYFYNDQK 178
Query: 258 LDPSSLHGGCPVIKGNKWSSTKWIR 282
L+ +LH G PVI+G KW +T+W+R
Sbjct: 179 LNELTLHSGEPVIQGEKWVATQWMR 203
>gi|73542634|ref|YP_297154.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
eutropha JMP134]
gi|72120047|gb|AAZ62310.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
eutropha JMP134]
Length = 282
Score = 150 bits (378), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 76/210 (36%), Positives = 113/210 (53%), Gaps = 24/210 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P +Y + LS EC+ L+ LA + +S V++ DTG RTS G G +I
Sbjct: 90 PSIRLYQHLLSDAECDALVELARGRLARSPVINPDTGDENLIDARTSMGAMFQVGEHTLI 149
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DEFNTKNGGQRMATV 198
+ IE RIA P+++GEGLQ+L+Y+ G +Y+PHFD+F + + GGQR AT+
Sbjct: 150 QRIEDRIAAVLGVPVDHGEGLQILNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRTATL 209
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+ + GG T FP + GL + P G+A+ F ++PD L
Sbjct: 210 VIYLNTPQAGGATAFP-------------------RIGLEVAPVKGNAVYFSYLQPDGKL 250
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
D +LH G PV G KW +TKW+R + Y+
Sbjct: 251 DERTLHAGLPVQSGEKWIATKWLREHPYRA 280
>gi|423518940|ref|ZP_17495421.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
gi|401159995|gb|EJQ67374.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
Length = 216
Score = 149 bits (377), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 86/214 (40%), Positives = 122/214 (57%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC LI L+ +M++S V G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECAELIELSKNNMKRSKV-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ T P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FLEE--NELTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP + LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------QLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D L+ +LHGG PV KG KW +T+W+R YK
Sbjct: 183 QDQLLNELTLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|297825201|ref|XP_002880483.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
gi|297326322|gb|EFH56742.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
Length = 272
Score = 149 bits (377), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 78/213 (36%), Positives = 124/213 (58%), Gaps = 16/213 (7%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV-----DSDTGKSKDSRVRTSSGTF 134
+SW PR F NF +K++CE +I++A P ++ S + ++T ++ +R++ +
Sbjct: 69 LSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSLLALRKGETAETTQNVRTRLKKTD--- 125
Query: 135 LARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQR 194
I+ IE++IA T P++ E +L Y+ GQKY+ H+D F QR
Sbjct: 126 --EDESGILAAIEEKIALATRIPIDYYESFNILRYQLGQKYDSHYDAFHPAEYGPQISQR 183
Query: 195 MATVLMYLSDVEEGGETVFPNAQG-NISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
+ T +++LS VEEGGET+FP G N++ + C GL +KP+ GDA+ F+++
Sbjct: 184 VVTFILFLSSVEEGGETMFPFENGRNMNG---RYDYETC--IGLRVKPRQGDAIFFYNLL 238
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
P+ ++D +SLHG CPVIKG KW +TKWIR Y
Sbjct: 239 PNRTIDQTSLHGSCPVIKGEKWVATKWIRDQTY 271
>gi|194290782|ref|YP_002006689.1| prolyl 4-hydroxylase subunit alpha [Cupriavidus taiwanensis LMG
19424]
gi|193224617|emb|CAQ70628.1| putative Prolyl 4-hydroxylase alpha subunit [Cupriavidus
taiwanensis LMG 19424]
Length = 296
Score = 149 bits (377), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 77/209 (36%), Positives = 112/209 (53%), Gaps = 24/209 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P+ ++ LS +EC+ L+ L+ + +S VV+ DTG RTS G +I
Sbjct: 104 PQVQLFQQLLSDDECDALVALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQVAEHALI 163
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKN-----GGQRMATV 198
IE RIA T P ++GEGLQ+L+Y+ G +Y+PHFDYF + + GGQR+AT+
Sbjct: 164 ARIEARIAAVTGVPADHGEGLQILNYKPGGEYQPHFDYFNPQRPGEARQLSVGGQRIATL 223
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+ E GG T FP + GL + P G+A+ F + PD +L
Sbjct: 224 VIYLNTPEAGGATAFP-------------------RVGLEVAPVKGNAVYFSYLLPDGTL 264
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D +LH G PV G KW +TKW+R Y+
Sbjct: 265 DDRTLHAGLPVAAGEKWIATKWLRERPYR 293
>gi|403234403|ref|ZP_10912989.1| Procollagen-proline dioxygenase [Bacillus sp. 10403023]
Length = 217
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 79/200 (39%), Positives = 114/200 (57%), Gaps = 25/200 (12%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
EP V N LS EEC+ LI L+ + +S + +++ +RTSS TF+ + I
Sbjct: 38 EPLIVVLGNVLSDEECDELIRLSKDRINRSKIANANV-----DNMRTSSSTFIEENENII 92
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
+ IEKRI+ P E GEGLQ+L+Y+ GQ+Y+ HFD+F N N R++T++MYL
Sbjct: 93 VSRIEKRISQIMNIPTEYGEGLQILNYQVGQEYKSHFDFFSSPHNAIN-NPRISTLVMYL 151
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
SDVE+GGET FP K S+ P+ G A+ F D +L+ +
Sbjct: 152 SDVEQGGETYFP-------------------KLHFSVSPQKGMAVYFEYFYNDQTLNELT 192
Query: 263 LHGGCPVIKGNKWSSTKWIR 282
LHGG PVI G+KW++T+W+R
Sbjct: 193 LHGGAPVIVGDKWAATQWMR 212
>gi|308804269|ref|XP_003079447.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
gi|116057902|emb|CAL54105.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
Length = 363
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 82/215 (38%), Positives = 124/215 (57%), Gaps = 20/215 (9%)
Query: 79 VISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
+SW PRAF+Y NFL+++ECE+LI L + +STVV S + RTS GTF+ R
Sbjct: 93 TLSWSPRAFLYQNFLTEDECEHLIALGEKKLERSTVVGSKGKEGDVHSARTSFGTFITRR 152
Query: 139 RDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATV 198
+ +E R+A+++ P + E LQ+L YE GQ+Y NG +R+ATV
Sbjct: 153 LTPTLSAVEDRVAEYSGIPWRHQEQLQLLRYEKGQEY-------------GNGEKRIATV 199
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWW----NELSECG---KTGLSIKPKMGDALLFWS 251
LM+L + E GGET FP+A + + +LS+CG G S+ P+ GDA+LF+S
Sbjct: 200 LMFLREPEFGGETHFPDATPLPATRSEFLGSRAKLSDCGWNEGRGFSVIPRKGDAILFFS 259
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
+ + D ++ H CP ++G K+++TKWI E+
Sbjct: 260 HHINGTSDDAASHASCPTLRGIKYTATKWIHEKEF 294
>gi|423483822|ref|ZP_17460512.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
gi|401141373|gb|EJQ48928.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
Length = 216
Score = 149 bits (377), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 84/214 (39%), Positives = 123/214 (57%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N +S EEC+ LI ++ +++ST+ G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVISDEECDELIEMSKNKIKRSTI-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FLEE--NELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|307108817|gb|EFN57056.1| hypothetical protein CHLNCDRAFT_143796 [Chlorella variabilis]
Length = 334
Score = 149 bits (376), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 80/209 (38%), Positives = 124/209 (59%), Gaps = 12/209 (5%)
Query: 75 QWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTF 134
Q ++++S PRA++ FLS+++C+++I +A + S + ++++R G
Sbjct: 129 QPMQLLSLYPRAYLMPRFLSQKQCDHVIAMAERRLAPSGLAFKAGDTAENTRDEDPDG-- 186
Query: 135 LARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQR 194
++ IE ++A T P +GE VL YE Q Y+ H+D F +E QR
Sbjct: 187 -------VLAWIEDKLAAVTMIPAGHGEPFNVLRYEPSQHYDSHYDSFSEEEYGPQFSQR 239
Query: 195 MATVLMYLSDVEEGGETVF-PNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
+ATVL+YL+DVEEGGETVF +G ++ + + C TG+ +KP+ GDALLF+S+
Sbjct: 240 IATVLLYLADVEEGGETVFLLEGKGGLARLERID-YKAC-DTGIKVKPRQGDALLFFSVS 297
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ +LD SLHGGCPV+ G KW+ TKWIR
Sbjct: 298 VNGTLDKHSLHGGCPVVAGTKWAMTKWIR 326
>gi|229168980|ref|ZP_04296697.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
gi|423591765|ref|ZP_17567796.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
gi|228614572|gb|EEK71680.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
gi|401231898|gb|EJR38400.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
Length = 216
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 87/214 (40%), Positives = 122/214 (57%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC LI L+ +M++S V G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECAELIELSKSNMKRSKV-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ T P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FLEE--NELTWKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D L+ +LHGG PV KG KW +T+W+R YK
Sbjct: 183 QDQLLNELTLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|423558182|ref|ZP_17534484.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
gi|401191450|gb|EJQ98472.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
Length = 216
Score = 149 bits (376), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 122/214 (57%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ +++S + G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECDGLIELSKNKIKRSKI-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FLEE--NELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWVATQWVRRGTYK 216
>gi|228902749|ref|ZP_04066896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
4222]
gi|228967277|ref|ZP_04128313.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
sotto str. T04001]
gi|402564350|ref|YP_006607074.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus thuringiensis HD-771]
gi|434377355|ref|YP_006611999.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-789]
gi|228792646|gb|EEM40212.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
sotto str. T04001]
gi|228856936|gb|EEN01449.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
4222]
gi|401793002|gb|AFQ19041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-771]
gi|401875912|gb|AFQ28079.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
thuringiensis HD-789]
Length = 216
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 84/206 (40%), Positives = 116/206 (56%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI ++ M++S V G S+D + +RTSSG FL ++
Sbjct: 38 EPLIVVLANVLSDEECDKLIEMSKNKMKRSKV-----GSSRDVNDIRTSSGAFLED--NE 90
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 91 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 150 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 190
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|206978009|ref|ZP_03238895.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
H3081.97]
gi|423373947|ref|ZP_17351286.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
gi|206743809|gb|EDZ55230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
H3081.97]
gi|401094762|gb|EJQ02832.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
Length = 216
Score = 149 bits (375), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 86/214 (40%), Positives = 121/214 (56%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ + +S V G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECDKLIELSKNKLARSKV-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL D++ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FL--DDDELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R Y+
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYR 216
>gi|346723630|ref|YP_004850299.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346648377|gb|AEO41001.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 286
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 91/242 (37%), Positives = 126/242 (52%), Gaps = 31/242 (12%)
Query: 51 KANDLSSIVRKSMERSEGDE-----GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLA 105
+AN L VR + + D G + V V PR V FLS EEC+ LI LA
Sbjct: 58 QANGLPMPVRVPALQQDADASLLALGDRDVRVLVSLLLPRVVVLGGFLSDEECDALIALA 117
Query: 106 TPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQ 165
PH+ +S VD+ G+ RTS L G+D + + IE RIA +P+++GEGLQ
Sbjct: 118 QPHLARSRTVDNANGEHVVHAARTSDSMCLRLGQDALCQRIEARIARLLDWPVDHGEGLQ 177
Query: 166 VLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMATVLMYLSDVEEGGETVFPNAQGNI 220
VL Y G +Y PH+DYF D T + GGQR+A+++MYL+ E GG T FP+A ++
Sbjct: 178 VLRYATGAEYRPHYDYFDPDAAGTPVLVQAGGQRVASLVMYLNTPERGGATRFPDAHLDV 237
Query: 221 SAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKW 280
+AV G+A+ F +P SLH G PV+ G+KW +TKW
Sbjct: 238 AAV-------------------KGNAVFFSYDRPHPMT--RSLHAGAPVLAGDKWVATKW 276
Query: 281 IR 282
+R
Sbjct: 277 LR 278
>gi|229002593|ref|ZP_04160640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
gi|229003816|ref|ZP_04161625.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
gi|228757417|gb|EEM06653.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
gi|228758520|gb|EEM07660.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
Length = 219
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 85/208 (40%), Positives = 117/208 (56%), Gaps = 28/208 (13%)
Query: 77 VEVISW--EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTF 134
+++IS EP V N LS EECE LI ++ M++S + S K + +RTSSG F
Sbjct: 33 IQIISRLEEPLIVVLANVLSDEECETLIEMSKNKMKRSKIGIS----RKTNDIRTSSGAF 88
Query: 135 LARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQR 194
L +I IE+RIA P +GEGLQ+L Y GQ+Y+ H+D+F+ E + R
Sbjct: 89 LEE--SEITTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFFV-ENSAAASNNR 145
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
M+T++MYL+ VEEGGET FP K LS+ PK G A+ F
Sbjct: 146 MSTLVMYLNHVEEGGETFFP-------------------KLNLSVSPKKGMAVYFEYFYQ 186
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D S++ +LHGG PVIKG KW +T+W+R
Sbjct: 187 DESINKLTLHGGAPVIKGEKWVATQWMR 214
>gi|228990015|ref|ZP_04149988.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
12442]
gi|228769681|gb|EEM18271.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
12442]
Length = 219
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 85/208 (40%), Positives = 117/208 (56%), Gaps = 28/208 (13%)
Query: 77 VEVISW--EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTF 134
+++IS EP V N LS EECE LI ++ M++S + S K + +RTSSG F
Sbjct: 33 IQIISRLEEPLIVVLANVLSDEECETLIEMSKNKMKRSKIGVS----RKTNDIRTSSGAF 88
Query: 135 LARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQR 194
L +I IE+RIA P +GEGLQ+L Y GQ+Y+ H+D+F+ E + R
Sbjct: 89 LEE--SEITTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFFV-ENSAAASNNR 145
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
M+T++MYL+ VEEGGET FP K LS+ PK G A+ F
Sbjct: 146 MSTLVMYLNHVEEGGETFFP-------------------KLNLSVSPKKGMAVYFEYFYQ 186
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D S++ +LHGG PVIKG KW +T+W+R
Sbjct: 187 DESINKLTLHGGAPVIKGEKWVATQWMR 214
>gi|423615424|ref|ZP_17591258.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
gi|401259961|gb|EJR66134.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
Length = 216
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 83/214 (38%), Positives = 123/214 (57%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N +S EEC+ LI ++ +++ST+ G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVISDEECDELIEMSKNKIKRSTI-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FLEE--NELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R Y+
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYR 216
>gi|352086439|ref|ZP_08953941.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
gi|389799401|ref|ZP_10202396.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
gi|351679404|gb|EHA62545.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
gi|388442818|gb|EIL98985.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
Length = 284
Score = 148 bits (374), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 84/209 (40%), Positives = 112/209 (53%), Gaps = 29/209 (13%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P V N LS +ECE LI LA P ++++ VDS+ G+ + R RTS G F ++
Sbjct: 95 PALRVLENILSTQECEELIALARPRLQRALTVDSE-GRQQVDRRRTSEGMFFTLNEVPLV 153
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFN-----TKNGGQRMATV 198
IE+R+A P +GEGLQ+LHY GQ+YEPHFD+F E T GGQR+A+V
Sbjct: 154 GRIEQRLAALLRVPASHGEGLQILHYLPGQEYEPHFDWFDPEQPGYGAITAVGGQRIASV 213
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL+ GG T FP + GL++ + G A+ F
Sbjct: 214 VMYLNTPARGGGTAFP-------------------ELGLTVTARRGSAVYFAY----EGG 250
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
DPSSLH G PV+ G KW +TKW+R YK
Sbjct: 251 DPSSLHAGLPVLDGEKWIATKWLRERPYK 279
>gi|423527903|ref|ZP_17504348.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
gi|402451566|gb|EJV83385.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
Length = 248
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 84/206 (40%), Positives = 116/206 (56%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI ++ M++S V G S+D + +RTSSG FL ++
Sbjct: 70 EPLIVVLANVLSDEECDKLIEMSKNKMKRSKV-----GSSRDVNDIRTSSGAFLED--NE 122
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 123 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 181
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 182 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 222
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248
>gi|423541303|ref|ZP_17517694.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
gi|401172491|gb|EJQ79712.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
Length = 216
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 83/214 (38%), Positives = 123/214 (57%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N +S EEC+ LI ++ +++ST+ G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVISDEECDELIEMSKNKIKRSTI-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FLEE--NELTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R Y+
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYR 216
>gi|30022316|ref|NP_833947.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
gi|229129515|ref|ZP_04258486.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
gi|29897873|gb|AAP11148.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
gi|228654120|gb|EEL09987.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
Length = 232
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 84/206 (40%), Positives = 115/206 (55%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI ++ M +S + G S+D + +RTSSG FL +K
Sbjct: 54 EPLIVVLANVLSDEECDELIEMSKNKMERSKI-----GSSRDVNDIRTSSGAFLED--NK 106
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 107 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 165
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 166 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 206
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 207 TLHGGAPVTKGEKWIATQWVRRGTYK 232
>gi|113869198|ref|YP_727687.1| prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
gi|113527974|emb|CAJ94319.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
Length = 297
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 77/209 (36%), Positives = 112/209 (53%), Gaps = 24/209 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P+ ++ L+ +EC+ L+ L+ + +S VV+ DTG RTS G +I
Sbjct: 105 PQVQLFQQLLTDDECDALVALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQVAEHPLI 164
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKN-----GGQRMATV 198
IE RIA T P E+GEGLQ+L+Y+ G +Y+PHFDYF + + GGQR+AT+
Sbjct: 165 TRIEARIAAVTGVPAEHGEGLQILNYKPGGEYQPHFDYFNPQRPGEARQLSVGGQRIATL 224
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+ E GG T FP + GL + P G+A+ F + PD +L
Sbjct: 225 VIYLNTPEAGGATAFP-------------------RVGLEVAPVKGNAVYFSYLLPDGAL 265
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D +LH G PV G KW +TKW+R Y+
Sbjct: 266 DERTLHAGLPVAFGEKWIATKWLRERPYR 294
>gi|428170517|gb|EKX39441.1| hypothetical protein GUITHDRAFT_114401 [Guillardia theta CCMP2712]
Length = 322
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 92/279 (32%), Positives = 138/279 (49%), Gaps = 43/279 (15%)
Query: 29 FAILILLAFGILSM---------PSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEV 79
F L+++ + +LSM P S KA S+ R+SM + G + W+E
Sbjct: 22 FLFLVIVGYAVLSMLLQSLWMTGPKSDALLSKA---PSLERRSM-TNLGGMAKKSTWIET 77
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPH-MRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
+S +PR F+ HN L++EEC++L++LA + S + T K +S RT+ +L
Sbjct: 78 VSVDPRIFIVHNLLTEEECDHLVSLALQKGLSASLITPYGTNKLVESTTRTNKQAWLDFQ 137
Query: 139 RDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEF----NTKNGGQR 194
+D +++ +E +IA T E GE LQVLHY Q++ H DYF N + GG R
Sbjct: 138 QDDVVKRVEDKIAKLTKTTPEQGENLQVLHYAKSQQFTEHHDYFDPATDPPENYEKGGNR 197
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+ TV++YL EEGGET F G L + GDA++F+++K
Sbjct: 198 LITVIVYLQAAEEGGETHF-------------------GAANLKLTAAKGDAVMFYNLKH 238
Query: 255 DA------SLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+D +LH G P IKG KW +TKWI Y+
Sbjct: 239 GCDGIDPTCVDKQTLHAGLPPIKGEKWVATKWIHERGYQ 277
>gi|75760922|ref|ZP_00740932.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|423385740|ref|ZP_17362996.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
gi|423561293|ref|ZP_17537569.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
gi|74491592|gb|EAO54798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
israelensis ATCC 35646]
gi|401201550|gb|EJR08415.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
gi|401635796|gb|EJS53551.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
Length = 248
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 84/206 (40%), Positives = 116/206 (56%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI ++ M++S V G S+D + +RTSSG FL ++
Sbjct: 70 EPLIVVLANVLSDEECDKLIEMSKNKMKRSKV-----GSSRDVNDIRTSSGAFLED--NE 122
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 123 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 181
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 182 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 222
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248
>gi|423358724|ref|ZP_17336227.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
gi|401084596|gb|EJP92842.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
Length = 248
Score = 148 bits (374), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 84/206 (40%), Positives = 116/206 (56%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI ++ M++S V G S+D + +RTSSG FL ++
Sbjct: 70 EPLIVVLANVLSDEECDKLIEMSKNKMKRSKV-----GSSRDVNDIRTSSGAFLED--NE 122
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 123 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 181
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 182 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 222
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248
>gi|228960501|ref|ZP_04122151.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pakistani str. T13001]
gi|229047930|ref|ZP_04193506.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
gi|423630961|ref|ZP_17606708.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
gi|423650103|ref|ZP_17625673.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
gi|228723387|gb|EEL74756.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
gi|228799198|gb|EEM46165.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pakistani str. T13001]
gi|401264328|gb|EJR70440.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
gi|401282521|gb|EJR88420.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
Length = 248
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 84/206 (40%), Positives = 116/206 (56%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI ++ M++S V G S+D + +RTSSG FL ++
Sbjct: 70 EPLIVVLANVLSDEECDELIEMSKNKMKRSKV-----GSSRDVNDIRTSSGAFLED--NE 122
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 123 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 181
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 182 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 222
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248
>gi|206971296|ref|ZP_03232247.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH1134]
gi|229081494|ref|ZP_04213993.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
gi|423411965|ref|ZP_17389085.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
gi|423432249|ref|ZP_17409253.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
gi|206734068|gb|EDZ51239.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH1134]
gi|228701801|gb|EEL54288.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
gi|401104033|gb|EJQ12010.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
gi|401117005|gb|EJQ24843.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
Length = 216
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 116/206 (56%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI ++ M +S + G S+D + +RTSSG FL ++
Sbjct: 38 EPLIVVLANVLSDEECDELIEMSKNKMERSKI-----GSSRDVNDIRTSSGAFLED--NE 90
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 91 LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 150 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 190
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|423582447|ref|ZP_17558558.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
gi|401213326|gb|EJR20067.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
Length = 248
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 86/214 (40%), Positives = 120/214 (56%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI ++ M++S V G S+D + +RTSSG
Sbjct: 62 IQIISKFEEPLIVVLANVLSDEECDELIEMSKNKMKRSKV-----GSSRDVNDIRTSSGA 116
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL ++ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 117 FLEDS--ELTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 173
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 174 RISTLVMYLNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFY 214
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 215 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYK 248
>gi|228954520|ref|ZP_04116545.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. T03a001]
gi|449091198|ref|YP_007423639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. HD73]
gi|228805177|gb|EEM51771.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. T03a001]
gi|449024955|gb|AGE80118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
kurstaki str. HD73]
Length = 216
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 116/206 (56%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI ++ M++S V G ++D + +RTSSG FL ++
Sbjct: 38 EPLIVVLANVLSDEECDELIEMSKNKMKRSKV-----GSARDVNDIRTSSGAFLED--NE 90
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 91 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 150 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 190
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|423452458|ref|ZP_17429311.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
gi|401140096|gb|EJQ47653.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
Length = 216
Score = 148 bits (373), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 121/214 (56%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ + +S + G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECDGLIELSKNKIERSKI-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FLEE--NELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWVATQWVRRGTYK 216
>gi|423604110|ref|ZP_17580003.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
gi|401245796|gb|EJR52149.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
Length = 216
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 121/214 (56%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ + +S V G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FL--DDNELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFH 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R Y+
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYR 216
>gi|365158975|ref|ZP_09355162.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
gi|363625964|gb|EHL76973.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
Length = 248
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 116/206 (56%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI ++ M +S + G S+D + +RTSSG FL ++
Sbjct: 70 EPLIVVLANVLSDEECDELIEMSKNKMERSKI-----GSSRDVNDIRTSSGAFLED--NE 122
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 123 LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 181
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 182 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 222
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248
>gi|229180513|ref|ZP_04307855.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
gi|228602937|gb|EEK60416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
Length = 232
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 116/206 (56%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI ++ M +S + G S+D + +RTSSG FL ++
Sbjct: 54 EPLIVVLANVLSDEECDELIEMSKNKMERSKI-----GSSRDVNDIRTSSGAFLED--NE 106
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 107 LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 165
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 166 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 206
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 207 TLHGGAPVTKGEKWIATQWVRRGTYK 232
>gi|30264308|ref|NP_846685.1| prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. Ames]
gi|47529753|ref|YP_021102.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. 'Ames
Ancestor']
gi|65321616|ref|ZP_00394575.1| hypothetical protein Bant_01005109 [Bacillus anthracis str. A2012]
gi|165873278|ref|ZP_02217887.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0488]
gi|167634610|ref|ZP_02392930.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0442]
gi|167638693|ref|ZP_02396969.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0193]
gi|170687507|ref|ZP_02878724.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0465]
gi|170709341|ref|ZP_02899757.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0389]
gi|177655890|ref|ZP_02937082.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0174]
gi|190566156|ref|ZP_03019075.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Tsiankovskii-I]
gi|196034803|ref|ZP_03102210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
W]
gi|227817011|ref|YP_002817020.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
anthracis str. CDC 684]
gi|228929280|ref|ZP_04092307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pondicheriensis BGSC 4BA1]
gi|228935557|ref|ZP_04098373.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
andalousiensis BGSC 4AW1]
gi|229123754|ref|ZP_04252949.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
gi|229604260|ref|YP_002868528.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0248]
gi|254683996|ref|ZP_05147856.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. CNEVA-9066]
gi|254721830|ref|ZP_05183619.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A1055]
gi|254736344|ref|ZP_05194050.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Western North America USA6153]
gi|254741382|ref|ZP_05199069.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Kruger B]
gi|254753983|ref|ZP_05206018.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Vollum]
gi|254757854|ref|ZP_05209881.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Australia 94]
gi|386738126|ref|YP_006211307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
gi|421506493|ref|ZP_15953416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
gi|421638315|ref|ZP_16078911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
gi|30258953|gb|AAP28171.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Ames]
gi|47504901|gb|AAT33577.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. 'Ames Ancestor']
gi|164710995|gb|EDR16563.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0488]
gi|167513541|gb|EDR88911.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0193]
gi|167530062|gb|EDR92797.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0442]
gi|170125767|gb|EDS94678.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0389]
gi|170668702|gb|EDT19448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0465]
gi|172079923|gb|EDT65028.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0174]
gi|190563075|gb|EDV17041.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Tsiankovskii-I]
gi|195992342|gb|EDX56303.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
W]
gi|227005734|gb|ACP15477.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. CDC 684]
gi|228659889|gb|EEL15534.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
gi|228824095|gb|EEM69911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
andalousiensis BGSC 4AW1]
gi|228830570|gb|EEM76180.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pondicheriensis BGSC 4BA1]
gi|229268668|gb|ACQ50305.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. A0248]
gi|384387978|gb|AFH85639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
gi|401823486|gb|EJT22633.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
gi|403394741|gb|EJY91981.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
Length = 216
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 87/214 (40%), Positives = 120/214 (56%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ KS + S G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECDELIELS-----KSKLARSKVGSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FL--DDNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|228922987|ref|ZP_04086280.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
huazhongensis BGSC 4BD1]
gi|228836620|gb|EEM81968.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
huazhongensis BGSC 4BD1]
Length = 216
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 84/206 (40%), Positives = 115/206 (55%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI ++ M++S V G S+D + +RTSSG FL +
Sbjct: 38 EPLIVVLANVLSDEECDELIEMSKNKMKRSKV-----GSSRDVNDIRTSSGAFLEDS--E 90
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 91 LTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 150 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 190
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|423426372|ref|ZP_17403403.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
gi|401111119|gb|EJQ19018.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
Length = 248
Score = 147 bits (372), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 116/206 (56%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI ++ M +S + G S+D + +RTSSG FL ++
Sbjct: 70 EPLIVVLANVLSDEECDELIEISKNKMERSKI-----GSSRDVNDIRTSSGAFLED--NE 122
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 123 LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 181
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 182 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 222
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248
>gi|319652240|ref|ZP_08006358.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
gi|317396063|gb|EFV76783.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
Length = 216
Score = 147 bits (372), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 77/200 (38%), Positives = 116/200 (58%), Gaps = 24/200 (12%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
EP + N LS EEC+ LI + M++S V +S + +RTSS TF G ++I
Sbjct: 37 EPLIVILGNVLSDEECDQLIQQSKDRMQRSKVANS----LEVDELRTSSSTFFHEGENEI 92
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
+ IEKRI+ P+E+GEGLQ+L+Y+ GQ+Y+ HFD+F + R++T++MYL
Sbjct: 93 VARIEKRISQIMNIPVEHGEGLQILNYKIGQEYKAHFDFF-SSTSRAASNPRISTLVMYL 151
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
+DVE+GGET FP K S+ P+ G A+ F D +L+ +
Sbjct: 152 NDVEQGGETYFP-------------------KLNFSVSPQKGMAVYFEYFYNDQNLNDLT 192
Query: 263 LHGGCPVIKGNKWSSTKWIR 282
LHGG PV+ G+KW++T+W+R
Sbjct: 193 LHGGAPVVMGDKWAATQWMR 212
>gi|49187135|ref|YP_030387.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. Sterne]
gi|228947951|ref|ZP_04110238.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
monterrey BGSC 4AJ1]
gi|49181062|gb|AAT56438.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
anthracis str. Sterne]
gi|228811938|gb|EEM58272.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
monterrey BGSC 4AJ1]
Length = 232
Score = 147 bits (371), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 87/214 (40%), Positives = 120/214 (56%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ KS + S G S+D + +RTSSG
Sbjct: 46 IQIISKFEEPLIVVLGNVLSDEECDELIELS-----KSKLARSKVGSSRDVNDIRTSSGA 100
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 101 FL--DDNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 157
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 158 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 198
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 199 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYK 232
>gi|423437685|ref|ZP_17414666.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
gi|423503075|ref|ZP_17479667.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
gi|401120840|gb|EJQ28636.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
gi|402459296|gb|EJV91033.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
Length = 248
Score = 147 bits (371), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 116/206 (56%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI ++ M++S V G ++D + +RTSSG FL ++
Sbjct: 70 EPLIVVLANVLSDEECDELIEMSKNKMKRSKV-----GSARDVNDIRTSSGAFLED--NE 122
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 123 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 181
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 182 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 222
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248
>gi|218899396|ref|YP_002447807.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
G9842]
gi|218542449|gb|ACK94843.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
G9842]
Length = 216
Score = 147 bits (371), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 84/206 (40%), Positives = 116/206 (56%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI ++ M++S V G S+D + +RTSSG FL ++
Sbjct: 38 EPLIVVLANVLSDEECDELIEMSKNKMKRSKV-----GSSRDVNDIRTSSGAFLED--NE 90
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 91 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAVNNRISTLVMY 149
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 150 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 190
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|406665340|ref|ZP_11073114.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
gi|405387266|gb|EKB46691.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
Length = 211
Score = 147 bits (371), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 79/205 (38%), Positives = 115/205 (56%), Gaps = 25/205 (12%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EV+ EP + N LS EEC+ LI+ A+ + +S + K + S +RTSSG F
Sbjct: 24 EVLHEEPLIVKFLNVLSDEECQNLIDCASSRLERSKL-----AKKEISSIRTSSGMFFEE 78
Query: 138 GRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMAT 197
+ +I +IEKRI+ P+E+ EGLQVLHYE GQ+++ HFD+F + + R++T
Sbjct: 79 NENPLISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKAHFDFFGPN-HPSSSNNRIST 137
Query: 198 VLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS 257
+++YL+DVEEGG T FPN G+ PK G A+ F D
Sbjct: 138 LVVYLNDVEEGGVTTFPNL-------------------GIVNVPKKGTAVYFEYFYNDQK 178
Query: 258 LDPSSLHGGCPVIKGNKWSSTKWIR 282
L+ +LH G PVI+G KW +T+W+R
Sbjct: 179 LNELTLHSGEPVIQGEKWVATQWMR 203
>gi|423457579|ref|ZP_17434376.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
gi|401147963|gb|EJQ55456.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
Length = 216
Score = 147 bits (371), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 85/206 (41%), Positives = 115/206 (55%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI L+ KS + S G S+D + +RTSSG FL ++
Sbjct: 38 EPLIVVLGNVLSDEECDELIELS-----KSKLARSKVGSSRDVNDIRTSSGAFLED--NE 90
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 91 LTVKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 150 LNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 190
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|423634936|ref|ZP_17610589.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
gi|401278922|gb|EJR84852.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
Length = 248
Score = 147 bits (371), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 84/206 (40%), Positives = 115/206 (55%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI ++ M++S V G S+D + +RTSSG FL +
Sbjct: 70 EPLIVVLANVLSDEECDELIEMSKNKMKRSKV-----GSSRDVNDIRTSSGAFLEDS--E 122
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 123 LTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 181
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 182 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 222
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248
>gi|228910069|ref|ZP_04073889.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
gi|228849586|gb|EEM94420.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
Length = 248
Score = 147 bits (371), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 84/206 (40%), Positives = 116/206 (56%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI ++ M++S V G S+D + +RTSSG FL ++
Sbjct: 70 EPLIVVLANVLSDEECDELIEMSKNKMKRSKV-----GSSRDVNDIRTSSGAFLED--NE 122
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 123 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAVNNRISTLVMY 181
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 182 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 222
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248
>gi|228941395|ref|ZP_04103947.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
berliner ATCC 10792]
gi|228974327|ref|ZP_04134896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
thuringiensis str. T01001]
gi|228980919|ref|ZP_04141223.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|384188306|ref|YP_005574202.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
chinensis CT-43]
gi|410676625|ref|YP_006928996.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|452200698|ref|YP_007480779.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
thuringiensis serovar thuringiensis str. IS5056]
gi|228778855|gb|EEM27118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|228785377|gb|EEM33387.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
thuringiensis str. T01001]
gi|228818321|gb|EEM64394.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
berliner ATCC 10792]
gi|326942015|gb|AEA17911.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
chinensis CT-43]
gi|409175754|gb|AFV20059.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
gi|452106091|gb|AGG03031.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
thuringiensis serovar thuringiensis str. IS5056]
Length = 216
Score = 147 bits (371), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 84/206 (40%), Positives = 115/206 (55%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC LI ++ M++S V G S+D + +RTSSG FL ++
Sbjct: 38 EPLIVVLANVLSDEECGELIEMSKNKMKRSKV-----GSSRDVNDIRTSSGAFLED--NE 90
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 91 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 150 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 190
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|229192445|ref|ZP_04319408.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
gi|228591022|gb|EEK48878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
Length = 216
Score = 147 bits (371), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 82/206 (39%), Positives = 116/206 (56%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N +S EEC+ LI ++ M +S + G S+D + +RTSSG FL ++
Sbjct: 38 EPLIVVLANVISDEECDELIEMSKNKMERSKI-----GSSRDVNDIRTSSGAFLED--NE 90
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 91 LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 150 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 190
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|229071739|ref|ZP_04204954.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
gi|228711334|gb|EEL63294.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
Length = 232
Score = 147 bits (370), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 116/206 (56%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI ++ M +S + G S+D + +RTSSG FL ++
Sbjct: 54 EPLIVVLANVLSDEECDELIEMSKNKMERSKI-----GSSRDVNDIRTSSGAFLED--NE 106
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 107 LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 165
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 166 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 206
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 207 TLHGGAPVTKGEKWIATQWMRRGTYK 232
>gi|423470454|ref|ZP_17447198.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
gi|402436583|gb|EJV68613.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
Length = 216
Score = 147 bits (370), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 121/214 (56%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ + +S + G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLANVLSDEECDGLIELSKNKIERSKI-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FLEE--NELTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWVATQWMRRGTYK 216
>gi|217961727|ref|YP_002340297.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus cereus AH187]
gi|222097680|ref|YP_002531737.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
Q1]
gi|229198365|ref|ZP_04325071.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
gi|375286242|ref|YP_005106681.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
[Bacillus cereus NC7401]
gi|423354732|ref|ZP_17332357.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
gi|423566803|ref|ZP_17543050.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
gi|423574080|ref|ZP_17550199.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
gi|217067199|gb|ACJ81449.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH187]
gi|221241738|gb|ACM14448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
Q1]
gi|228585065|gb|EEK43177.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
gi|358354769|dbj|BAL19941.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NC7401]
gi|401086280|gb|EJP94507.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
gi|401212649|gb|EJR19392.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
gi|401215318|gb|EJR22035.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
Length = 216
Score = 147 bits (370), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 121/214 (56%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ + +S V G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECDKLIELSKNKLARSKV-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FL--DDNELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R Y+
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYR 216
>gi|423657194|ref|ZP_17632493.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
gi|401289937|gb|EJR95641.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
Length = 248
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 115/206 (55%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI ++ M +S + G S+D + +RTSSG FL ++
Sbjct: 70 EPLIVVLANVLSDEECDELIEMSKNKMERSKI-----GSSRDVNDIRTSSGAFLED--NE 122
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 123 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 181
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 182 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 222
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248
>gi|196041590|ref|ZP_03108882.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NVH0597-99]
gi|218905373|ref|YP_002453207.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
AH820]
gi|225866219|ref|YP_002751597.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB102]
gi|423550018|ref|ZP_17526345.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
gi|196027578|gb|EDX66193.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
NVH0597-99]
gi|218537435|gb|ACK89833.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
AH820]
gi|225786013|gb|ACO26230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB102]
gi|401189634|gb|EJQ96684.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
Length = 216
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 86/214 (40%), Positives = 120/214 (56%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ + +S V G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FL--DDNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|195061068|ref|XP_001995918.1| GH14106 [Drosophila grimshawi]
gi|193891710|gb|EDV90576.1| GH14106 [Drosophila grimshawi]
Length = 511
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 77/207 (37%), Positives = 116/207 (56%), Gaps = 22/207 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E++ P V+H+ LS +E +YL NLA P ++++TV GK RVRTS G +L
Sbjct: 310 MEIVLLNPFIVVFHDALSPQEIDYLQNLARPLLKRTTV--HVNGKYVSRRVRTSKGAWLE 367
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNTKNGGQRM 195
R + + R IE+R+ D T ++ E +++Y G Y H+D+F + T G R+
Sbjct: 368 RDLNNLTRRIERRVVDMTELSMQGSEAYNIMNYGLGGHYAAHYDFFNTTKQQTSETGDRI 427
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
ATVL YLSDVE+GG TVFPN + L++ P+ G AL ++++ +
Sbjct: 428 ATVLFYLSDVEQGGATVFPNLK-------------------LAVSPERGMALFWYNLLDN 468
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ D +LHGGCPV+ G+KW T WI
Sbjct: 469 GTGDTRTLHGGCPVLVGSKWVMTLWIH 495
>gi|49480949|ref|YP_038297.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis serovar
konkukian str. 97-27]
gi|49332505|gb|AAT63151.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis serovar
konkukian str. 97-27]
Length = 232
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 86/214 (40%), Positives = 120/214 (56%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ + +S V G S+D + +RTSSG
Sbjct: 46 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDVNDIRTSSGA 100
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 101 FL--DDNELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 157
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 158 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 198
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 199 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYK 232
>gi|229104864|ref|ZP_04235524.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
gi|228678581|gb|EEL32798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
Length = 216
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 83/214 (38%), Positives = 122/214 (57%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N +S EEC LI ++ +++ST+ G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVISDEECGELIEMSKNKIKRSTI-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FLEE--NELTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R Y+
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYR 216
>gi|229075940|ref|ZP_04208916.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
gi|229117732|ref|ZP_04247101.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
gi|407706764|ref|YP_006830349.1| alpha/beta fold family hydrolase [Bacillus thuringiensis MC28]
gi|423377905|ref|ZP_17355189.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
gi|423464099|ref|ZP_17440867.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
gi|423547540|ref|ZP_17523898.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
gi|423622677|ref|ZP_17598455.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
gi|228665709|gb|EEL21182.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
gi|228707255|gb|EEL59452.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
gi|401179261|gb|EJQ86434.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
gi|401260797|gb|EJR66965.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
gi|401636171|gb|EJS53925.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
gi|402420366|gb|EJV52637.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
gi|407384449|gb|AFU14950.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis MC28]
Length = 216
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 82/214 (38%), Positives = 122/214 (57%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N +S EEC LI ++ +++ST+ G ++D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVISDEECNELIEMSKNKIKRSTI-----GSARDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FLEE--NELTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R Y+
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYR 216
>gi|301055727|ref|YP_003793938.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus biovar
anthracis str. CI]
gi|300377896|gb|ADK06800.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus biovar
anthracis str. CI]
Length = 216
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 86/214 (40%), Positives = 120/214 (56%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ + +S V G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FL--DDNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|196046329|ref|ZP_03113555.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB108]
gi|376268135|ref|YP_005120847.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
F837/76]
gi|196022799|gb|EDX61480.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
03BB108]
gi|364513935|gb|AEW57334.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
F837/76]
Length = 216
Score = 147 bits (370), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 86/214 (40%), Positives = 120/214 (56%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ + +S V G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FL--DDNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|229093299|ref|ZP_04224414.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
gi|228690082|gb|EEL43879.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
Length = 232
Score = 147 bits (370), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 86/214 (40%), Positives = 120/214 (56%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ + +S V G S+D + +RTSSG
Sbjct: 46 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDVNDIRTSSGA 100
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 101 FL--DDNELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 157
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 158 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 198
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 199 QDQSLNELTLHGGAPVTKGEKWITTQWVRRGTYK 232
>gi|52141260|ref|YP_085568.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
gi|51974729|gb|AAU16279.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
Length = 232
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 86/214 (40%), Positives = 120/214 (56%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ + +S V G S+D + +RTSSG
Sbjct: 46 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDVNDIRTSSGA 100
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 101 FL--DDNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 157
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 158 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 198
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 199 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYK 232
>gi|229140971|ref|ZP_04269515.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
gi|228642547|gb|EEK98834.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
Length = 232
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 121/214 (56%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ + +S V G S+D + +RTSSG
Sbjct: 46 IQIISKFEEPLIVVLGNVLSDEECDKLIELSKNKLARSKV-----GSSRDVNDIRTSSGA 100
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 101 FL--DDNELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 157
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 158 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 198
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R Y+
Sbjct: 199 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYR 232
>gi|384182063|ref|YP_005567825.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
finitimus YBT-020]
gi|324328147|gb|ADY23407.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
finitimus YBT-020]
Length = 216
Score = 146 bits (369), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 121/214 (56%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ + +S V G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FL--DDNELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R Y+
Sbjct: 183 QDRSLNELTLHGGAPVTKGEKWIATQWVRRGTYR 216
>gi|42783360|ref|NP_980607.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10987]
gi|42739288|gb|AAS43215.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
ATCC 10987]
Length = 216
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 121/214 (56%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ + +S V G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FL--DDNELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R Y+
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWIATQWMRRGTYR 216
>gi|281307110|pdb|3ITQ|A Chain A, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
Anthracis
gi|281307111|pdb|3ITQ|B Chain B, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
Anthracis
Length = 216
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 86/214 (40%), Positives = 119/214 (55%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ KS + S G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECDELIELS-----KSKLARSKVGSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FL--DDNELTAKIEKRISSIXNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++ YL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVXYLNDVEEGGETFFP-------------------KLNLSVHPRKGXAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|229111709|ref|ZP_04241257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
gi|296504733|ref|YP_003666433.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis BMB171]
gi|423585282|ref|ZP_17561369.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
gi|423640681|ref|ZP_17616299.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
gi|228671703|gb|EEL26999.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
gi|296325785|gb|ADH08713.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis BMB171]
gi|401233925|gb|EJR40411.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
gi|401279742|gb|EJR85664.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
Length = 248
Score = 146 bits (368), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 114/206 (55%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI ++ M +S + G S+D + +RTSSG FL ++
Sbjct: 70 EPLIVVLANVLSDEECDELIEMSKNKMERSKI-----GSSRDVNDIRTSSGAFLED--NE 122
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 123 FTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 181
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 182 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 222
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248
>gi|229163182|ref|ZP_04291137.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
gi|228620245|gb|EEK77116.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
Length = 229
Score = 145 bits (367), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 84/206 (40%), Positives = 114/206 (55%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI L+ KS + S G S+D + +RTS G FL ++
Sbjct: 51 EPLIVVLGNVLSDEECDELIELS-----KSKLARSKVGSSRDVNDIRTSKGAFL--DDNE 103
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 104 LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 162
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 163 LNDVEEGGETFFP-------------------KLNLSVNPRKGMAVYFEYFYQDQSLNEL 203
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 204 TLHGGAPVTKGEKWIATQWVRRGTYK 229
>gi|423448819|ref|ZP_17425698.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
gi|401129413|gb|EJQ37096.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
Length = 216
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 82/214 (38%), Positives = 122/214 (57%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N +S EEC+ LI ++ +++ST+ G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVISDEECDELIEMSKNKIKRSTI-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FLEE--NELTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG V KG KW +T+W+R Y+
Sbjct: 183 QDQSLNELTLHGGASVTKGEKWIATQWVRRGTYR 216
>gi|47567794|ref|ZP_00238502.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
gi|47555471|gb|EAL13814.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
Length = 216
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 119/214 (55%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ + +S V G S+D + +RTS G
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDVNDIRTSKGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FL--DDNELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|375106426|ref|ZP_09752687.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
JOSHI_001]
gi|374667157|gb|EHR71942.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
JOSHI_001]
Length = 295
Score = 145 bits (367), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 81/210 (38%), Positives = 113/210 (53%), Gaps = 28/210 (13%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR V+ LS EEC+ +++LA P + +S V + +G S+ + RTS G F RG +
Sbjct: 108 PRVMVFGGLLSDEECDAMVDLARPRLARSETVHNGSGGSEVNAARTSDGMFFDRGEFPLC 167
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT------KNGGQRMAT 197
R IE+RIA +P+ENGEGLQVL Y G +Y+ H DYF D K GGQR+ T
Sbjct: 168 RTIEQRIAALVNWPVENGEGLQVLRYRPGSEYKAHHDYF-DPAQPGTPTILKRGGQRVGT 226
Query: 198 VLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS 257
V+MYL+ GG T FP+ GL + P G+A+ F + A
Sbjct: 227 VVMYLNHPIRGGGTAFPD-------------------VGLEVAPFKGNAVFFSYDR--AH 265
Query: 258 LDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LH G PV++G KW +TKW+R E++
Sbjct: 266 PMTRTLHAGTPVLEGEKWVATKWVREGEFR 295
>gi|229157835|ref|ZP_04285910.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
gi|228625792|gb|EEK82544.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
Length = 232
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 119/214 (55%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ + +S V G S+D + +RTS G
Sbjct: 46 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDVNDIRTSKGA 100
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 101 FL--DDNELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 157
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 158 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 198
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 199 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGAYK 232
>gi|325267002|ref|ZP_08133672.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
gi|324981502|gb|EGC17144.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
Length = 279
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 80/208 (38%), Positives = 108/208 (51%), Gaps = 24/208 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P V NF++ EEC LI LA + +TVVD TG+ + RTS AR +I
Sbjct: 91 PEVVVLDNFITAEECAQLIALAEGKVEDATVVDPATGEFVKHQDRTSMNAAFARAEHPLI 150
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEF-----NTKNGGQRMATV 198
+E RIA +P ENGEG+QVL Y +G +Y+ HFDYF + N + GGQR+ T
Sbjct: 151 ARLEARIAAAIHWPAENGEGMQVLRYRSGGEYKAHFDYFDTQSEGGRKNMQTGGQRVGTF 210
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L+YL DV+ GG T FP I+PK G AL F + P+
Sbjct: 211 LVYLCDVDAGGATRFP-------------------ALNFEIRPKKGMALFFANTLPNGEG 251
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
+P +LH G PV+ G K+ ++KW+R Y
Sbjct: 252 NPLTLHAGVPVVSGVKYLASKWLREKPY 279
>gi|228987427|ref|ZP_04147547.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
tochigiensis BGSC 4Y1]
gi|228772399|gb|EEM20845.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
tochigiensis BGSC 4Y1]
Length = 232
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 119/214 (55%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ + +S V G S+D + +RTS G
Sbjct: 46 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDVNDIRTSKGA 100
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 101 FL--DDNELTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 157
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 158 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 198
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 199 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYK 232
>gi|229186477|ref|ZP_04313640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
gi|228596991|gb|EEK54648.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
Length = 216
Score = 145 bits (366), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 120/214 (56%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ + +S V G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FL--DDNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T+++YL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVIYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|229174912|ref|ZP_04302432.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
gi|228608580|gb|EEK65882.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
Length = 216
Score = 145 bits (365), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 84/206 (40%), Positives = 114/206 (55%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI L+ KS + S G S+D + +RTS G FL ++
Sbjct: 38 EPLIVVLGNVLSDEECDELIELS-----KSKLARSKVGSSRDVNDIRTSKGAFL--DDNE 90
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 91 LTVKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 150 LNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 190
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|159481038|ref|XP_001698589.1| predicted protein [Chlamydomonas reinhardtii]
gi|158282329|gb|EDP08082.1| predicted protein [Chlamydomonas reinhardtii]
Length = 258
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 71/143 (49%), Positives = 95/143 (66%), Gaps = 3/143 (2%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E ISW PRAF+YHNFLS+ EC++L ++ + +S VVDS TG+SK +RTS G
Sbjct: 8 IETISWSPRAFIYHNFLSEAECDHLTDIGNKRVSRSLVVDSKTGQSKLDDIRTSYGAAFG 67
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTK---NGGQ 193
RG D +I +E+RIA++T P E GE +Q+L Y GQKY+ H+D+F D + + G
Sbjct: 68 RGEDPVIAAVEERIAEWTHLPPEYGEPMQILRYVDGQKYDAHWDWFDDPVHHAAYLHEGN 127
Query: 194 RMATVLMYLSDVEEGGETVFPNA 216
R ATVL+YLS VE GGET P A
Sbjct: 128 RYATVLLYLSGVEGGGETNLPLA 150
>gi|78046308|ref|YP_362483.1| 2OG-Fe(II) oxygenase [Xanthomonas campestris pv. vesicatoria str.
85-10]
gi|78034738|emb|CAJ22383.1| putative 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas
campestris pv. vesicatoria str. 85-10]
Length = 296
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 90/242 (37%), Positives = 125/242 (51%), Gaps = 31/242 (12%)
Query: 51 KANDLSSIVRKSMERSEGDE-----GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLA 105
+AN L VR + + D G + V V PR V FLS EEC+ LI LA
Sbjct: 68 QANGLPMPVRVPALQQDADASLLALGDRDVRVLVSLLLPRVVVLGGFLSDEECDALIALA 127
Query: 106 TPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQ 165
P + +S VD+ G+ RTS L G+D + + IE RIA +P+++GEGLQ
Sbjct: 128 RPRLARSRTVDNANGEHVVHAARTSDSMCLRLGQDALCQRIEARIARLLDWPVDHGEGLQ 187
Query: 166 VLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMATVLMYLSDVEEGGETVFPNAQGNI 220
VL Y G +Y PH+DYF D T + GGQR+A+++MYL+ E GG T FP+A ++
Sbjct: 188 VLRYATGAEYRPHYDYFDPDAAGTPVLVQAGGQRVASLVMYLNTPERGGATRFPDAHLDV 247
Query: 221 SAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKW 280
+AV G+A+ F +P SLH G PV+ G+KW +TKW
Sbjct: 248 AAV-------------------KGNAVFFSYDRPHPMT--RSLHAGAPVLAGDKWVATKW 286
Query: 281 IR 282
+R
Sbjct: 287 LR 288
>gi|195505255|ref|XP_002099425.1| GE23368 [Drosophila yakuba]
gi|194185526|gb|EDW99137.1| GE23368 [Drosophila yakuba]
Length = 528
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 73/211 (34%), Positives = 117/211 (55%), Gaps = 21/211 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E++ +P +YH+ LS +E + L +ATP ++++TV + +G+++ R RTS +
Sbjct: 325 MELVGLDPYMVLYHDVLSAKEIKELQGMATPGLKRATVFQAASGRNEVVRTRTSKVAWFP 384
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF--MDEFNTKNGGQR 194
G + + RI D T F L E LQ+++Y G Y+ H+DYF ++ T G R
Sbjct: 385 DGYSPLTVRLNARITDMTGFNLHGSEMLQLMNYGLGGHYDQHYDYFNTINSNLTAMSGDR 444
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+ATVL YL+DVE+GG TVFPN + ++ P+ G ++++++K
Sbjct: 445 IATVLFYLTDVEQGGATVFPNIRK-------------------AVFPQRGSVIMWYNLKD 485
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
D +D +LH CPVI G+KW KWIR E
Sbjct: 486 DGQIDTQTLHAACPVIVGSKWVCNKWIRERE 516
>gi|228916870|ref|ZP_04080433.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pulsiensis BGSC 4CC1]
gi|228842793|gb|EEM87878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
pulsiensis BGSC 4CC1]
Length = 232
Score = 145 bits (365), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 119/214 (55%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ + +S V G S+D + +RTS G
Sbjct: 46 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDVNDIRTSKGA 100
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 101 FL--DDNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 157
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 158 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 198
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 199 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYK 232
>gi|423406337|ref|ZP_17383486.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
gi|401660331|gb|EJS77813.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
Length = 216
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 114/206 (55%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI L+ + +S V G S+D + +RTS G FL ++
Sbjct: 38 EPLIVVLGNVLSDEECDKLIELSKNKLARSKV-----GSSRDVNDIRTSKGAFL--DDNE 90
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 91 LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 150 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 190
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|423612451|ref|ZP_17588312.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
gi|401246040|gb|EJR52392.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
Length = 254
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 86/214 (40%), Positives = 119/214 (55%), Gaps = 30/214 (14%)
Query: 77 VEVISW--EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
V +IS EP V N LS EEC+ LI L+ M +S + G S++ + +RTSSG
Sbjct: 68 VHIISRFEEPLIVVLANVLSDEECDELIELSKNKMERSKI-----GSSRNVNDIRTSSGA 122
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL ++ IEKRI+ T P+ +GEGL +L+Y Q+Y+ H+DYF E +
Sbjct: 123 FLEE--NEFTSKIEKRISSITNVPVAHGEGLHILNYAVDQEYKAHYDYFA-EHSRSAANN 179
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 180 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 220
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 221 QDQSLNELTLHGGAPVTKGEKWIATQWMRRGTYK 254
>gi|118479416|ref|YP_896567.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis str. Al
Hakam]
gi|118418641|gb|ABK87060.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis str. Al
Hakam]
Length = 232
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 120/214 (56%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC+ LI L+ + +S V G S+D + +RTSSG
Sbjct: 46 IQIISKFEEPLIVVLGNVLSDEECDELIELSKNKLARSKV-----GSSRDVNDIRTSSGA 100
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 101 FL--DDNELTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 157
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T+++YL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 158 RISTLVIYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 198
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R YK
Sbjct: 199 QDQSLNELTLHGGAPVTKGEKWIATQWVRRGTYK 232
>gi|423395462|ref|ZP_17372663.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
gi|401654873|gb|EJS72412.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
Length = 216
Score = 144 bits (364), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 83/206 (40%), Positives = 114/206 (55%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI L+ + +S V G S+D + +RTS G FL ++
Sbjct: 38 EPLIVVLGNVLSDEECDKLIELSKNKLARSKV-----GSSRDVNDIRTSKGAFL--DDNE 90
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 91 LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 150 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 190
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|218231188|ref|YP_002369041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
B4264]
gi|218159145|gb|ACK59137.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
B4264]
Length = 216
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 82/206 (39%), Positives = 114/206 (55%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC LI ++ M +S + G S+D + +RTSSG FL ++
Sbjct: 38 EPLIVVLANVLSDEECGELIEMSKNKMERSKI-----GSSRDVNDIRTSSGAFLED--NE 90
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 91 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D S++
Sbjct: 150 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSINEL 190
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|402555628|ref|YP_006596899.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus FRI-35]
gi|401796838|gb|AFQ10697.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus FRI-35]
Length = 216
Score = 144 bits (364), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 85/214 (39%), Positives = 120/214 (56%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N LS EEC LI L+ + +S V G S+D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVLSDEECGELIELSKNKLARSKV-----GSSRDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FL--DDNELTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG PV KG KW +T+W+R Y+
Sbjct: 183 QDQSLNELTLHGGAPVTKGEKWIATQWMRRGTYR 216
>gi|195113237|ref|XP_002001174.1| GI10637 [Drosophila mojavensis]
gi|193917768|gb|EDW16635.1| GI10637 [Drosophila mojavensis]
Length = 529
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 74/209 (35%), Positives = 118/209 (56%), Gaps = 22/209 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E+IS +P +YH+ +S E L +LA P ++++TV + + ++ + RTS T+L
Sbjct: 323 MELISLDPYMVIYHDVISPSEISELQSLAVPGLKRATVFNQQSMRNHVVKTRTSKVTWLL 382
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFN---TKNGGQ 193
+++ + +RI D T F + E LQV++Y G Y+ H+DYF T+ G
Sbjct: 383 DTLNQLTIRLNRRITDMTGFDMYGSEMLQVMNYGLGGHYDKHYDYFNSSVAADLTRLNGD 442
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R+ATVL YL+DVE+GG TVFPN + ++ PK G A+++++++
Sbjct: 443 RIATVLFYLTDVEQGGATVFPNIEK-------------------AVFPKSGTAVVWYNLR 483
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D + DP +LH CPVI G+KW KWIR
Sbjct: 484 HDGNGDPQTLHAACPVIVGSKWVCNKWIR 512
>gi|229152436|ref|ZP_04280628.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
gi|228631044|gb|EEK87681.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
Length = 248
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 82/206 (39%), Positives = 114/206 (55%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC LI ++ M +S + G S+D + +RTSSG FL ++
Sbjct: 70 EPLIVVLANVLSDEECGELIEMSKNKMERSKI-----GSSRDVNDIRTSSGAFLED--NE 122
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 123 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 181
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D S++
Sbjct: 182 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSINEL 222
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248
>gi|229031885|ref|ZP_04187873.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
gi|228729503|gb|EEL80492.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
Length = 216
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 84/206 (40%), Positives = 113/206 (54%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC LI L+ KS + S G S+D + +RTS G FL ++
Sbjct: 38 EPLIVVLGNVLSDEECGELIELS-----KSKLARSKVGSSRDVNDIRTSKGAFL--DDNE 90
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 91 LTTKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F D SL+
Sbjct: 150 LNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFYQDQSLNEL 190
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGAYK 216
>gi|418523362|ref|ZP_13089380.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410699993|gb|EKQ58573.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
Length = 286
Score = 144 bits (363), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 91/242 (37%), Positives = 124/242 (51%), Gaps = 31/242 (12%)
Query: 51 KANDLSSIVRKSMERSEGDE-----GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLA 105
+AN L VR + + D G E V V PR V FLS EC+ LI LA
Sbjct: 58 QANGLPMPVRVPALQQDTDASLLALGDREVRVLVSLLLPRVVVLGGFLSDGECDALIALA 117
Query: 106 TPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQ 165
P + +S VD+ G+ RTS G L G+D + + IE RIA +P+++GEGLQ
Sbjct: 118 RPRLARSRTVDNANGEHLVHAARTSDGMCLRVGQDALCQRIEARIARLFDWPVDHGEGLQ 177
Query: 166 VLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMATVLMYLSDVEEGGETVFPNAQGNI 220
VL Y G +Y PH+DYF D T + GGQR+A+++MYL+ E GG T FP+A ++
Sbjct: 178 VLRYATGAEYRPHYDYFDPDAVGTPILLQAGGQRVASLVMYLNTPERGGATRFPDAHLDV 237
Query: 221 SAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKW 280
+AV G+A+ F +P SLH G PV+ G KW +TKW
Sbjct: 238 AAV-------------------KGNAVFFSYDRPHPMT--RSLHAGAPVLAGEKWVATKW 276
Query: 281 IR 282
+R
Sbjct: 277 LR 278
>gi|229098707|ref|ZP_04229647.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
gi|423441025|ref|ZP_17417931.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
gi|423533441|ref|ZP_17509859.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
gi|228684786|gb|EEL38724.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
gi|402417686|gb|EJV49986.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
gi|402463660|gb|EJV95360.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
Length = 216
Score = 144 bits (363), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 81/214 (37%), Positives = 121/214 (56%), Gaps = 30/214 (14%)
Query: 77 VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
+++IS EP V N +S EEC LI ++ +++ST+ G ++D + +RTSSG
Sbjct: 30 IQIISKFEEPLIVVLGNVISDEECNELIEMSKNKIKRSTI-----GSARDVNDIRTSSGA 84
Query: 134 FLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ 193
FL +++ IEKRI+ P+ +GEGL +L+YE Q+Y+ H+DYF E +
Sbjct: 85 FLEE--NELTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANN 141
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R++T++MYL+DVEEGGET FP K LS+ P+ G A+ F
Sbjct: 142 RISTLVMYLNDVEEGGETFFP-------------------KLNLSVHPRKGMAVYFEYFY 182
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SL+ +LHGG V KG KW +T+W+R Y+
Sbjct: 183 QDQSLNELTLHGGASVTKGEKWIATQWVRRGTYR 216
>gi|325922187|ref|ZP_08183974.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
19865]
gi|325547306|gb|EGD18373.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
19865]
Length = 285
Score = 144 bits (362), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 78/204 (38%), Positives = 109/204 (53%), Gaps = 26/204 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR V +FLS EC+ LI LA P + +S VD+D G RTS L G+D +
Sbjct: 96 PRVVVLGDFLSDAECDALIALAQPRLARSRTVDNDNGAQIVHAARTSDSMCLQLGQDALC 155
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-----MDEFNTKNGGQRMATV 198
+ IE RIA +P+++GEGLQVL Y G +Y+PH+DYF + GGQR+A++
Sbjct: 156 QRIEARIARLLDWPVDHGEGLQVLRYATGAEYQPHYDYFDPTAAGTPVLLQAGGQRLASL 215
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL+ E GG T FP+ +++AV G+A+ F +P
Sbjct: 216 VMYLNTPERGGATRFPDVHLDVAAV-------------------KGNAVFFSYDRPHPMT 256
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
SLH G PV+ G KW +TKW+R
Sbjct: 257 --RSLHAGAPVLAGEKWVATKWLR 278
>gi|229146822|ref|ZP_04275187.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
gi|228636650|gb|EEK93115.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
Length = 216
Score = 144 bits (362), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 82/206 (39%), Positives = 114/206 (55%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI ++ M +S + G S+D + +RTSSG FL ++
Sbjct: 38 EPLIVVLANVLSDEECDELIEMSKNKMERSKI-----GSSRDVNDIRTSSGAFLED--NE 90
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P +GEGL +L+YE Q+Y+ H+DYF E + R++T++MY
Sbjct: 91 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K LS+ P+ G A+ F SL+
Sbjct: 150 LNDVEEGGETYFP-------------------KLNLSVHPRKGMAVYFEYFYQGQSLNEL 190
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV KG KW +T+W+R YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216
>gi|224056224|ref|XP_002298763.1| predicted protein [Populus trichocarpa]
gi|222846021|gb|EEE83568.1| predicted protein [Populus trichocarpa]
Length = 175
Score = 144 bits (362), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 74/156 (47%), Positives = 98/156 (62%), Gaps = 6/156 (3%)
Query: 129 TSSGTFLARGRDKI--IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEF 186
T+ TF+ DK + IE++IA T P +GE +L YE GQKY+ H+D F +
Sbjct: 18 TTESTFIGGSEDKTGTLDFIERKIAKATMIPQSHGEAFNILRYEIGQKYDSHYDAFNPDE 77
Query: 187 NTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDA 246
QR+A+ L+YLS VEEGGET+FP G SAV E +C GL +KP+ GD
Sbjct: 78 YGPQPSQRVASFLLYLSSVEEGGETMFPFENG--SAVSSGFEYKQC--VGLKVKPRQGDG 133
Query: 247 LLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
LLF+S+ P+ ++D +SLHG CPVIKG KW +TKWIR
Sbjct: 134 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWIR 169
>gi|363543363|ref|NP_001241692.1| prolyl 4-hydroxylase 8-2 [Zea mays]
gi|347978834|gb|AEP37759.1| prolyl 4-hydroxylase 8-2 [Zea mays]
Length = 184
Score = 144 bits (362), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 91/158 (57%), Positives = 108/158 (68%), Gaps = 11/158 (6%)
Query: 9 FPTR--KSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDL-------SSIV 59
FPTR ++S T+ LT L++ + A+L L+AFG+ S+P S+ ++ S+ V
Sbjct: 17 FPTRGGRASPYTVALTALLLVSAALLALIAFGVFSLPVSAPNAAATTGTAAGGETESADV 76
Query: 60 RKSMER--SEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
R R EG R QW EVISWEPRAFVYHNFLSK+ECEYLI LA PHM KSTVVDS
Sbjct: 77 RPRARRDLGEGLGERGAQWTEVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDS 136
Query: 118 DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTF 155
TGKSKDSRVRTSSG FL RGRDK+IR IE+ I TF
Sbjct: 137 TTGKSKDSRVRTSSGMFLQRGRDKVIRAIEELIKRSTF 174
>gi|224122338|ref|XP_002318810.1| predicted protein [Populus trichocarpa]
gi|222859483|gb|EEE97030.1| predicted protein [Populus trichocarpa]
Length = 310
Score = 143 bits (360), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 96/278 (34%), Positives = 151/278 (54%), Gaps = 23/278 (8%)
Query: 16 SSTLILTLLIMFTFAILILLAFGILSMPSSSGDSR-KANDLSSIVR--KSMERSEGDEGR 72
+S + L L ++ T L FG SS + R K L ++++ S++ + D R
Sbjct: 2 ASFVYLLLFMVLTLTTQFSLCFG----KSSRKELRNKEAHLETMIQFGSSIQTNWVDPSR 57
Query: 73 AEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSG 132
V +SW+PR FVY FL+ EEC++LI+LA S D D+G+ + +R+ SS
Sbjct: 58 ----VVTVSWQPRVFVYKGFLTDEECDHLISLAQGTKETSEGKDDDSGRIERNRLFASST 113
Query: 133 TFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHY--EAGQKYEPHFDYFMDEFNTKN 190
+ L D I+ IE+R++ +T P EN + LQV+HY E + Y FDYF ++ +
Sbjct: 114 SLLNMD-DNILSRIEERVSAWTLLPKENSKPLQVMHYGIEDAKNY---FDYFGNKSAIIS 169
Query: 191 GGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
MAT++ YLS+V +GGE FP ++ + W S+C K S++P G+A+LF+
Sbjct: 170 SEPLMATLVFYLSNVTQGGEIFFPKSE--VKNKIW----SDCTKISDSLRPIKGNAILFF 223
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
++ P+ S D S H CPV++G W +TK + KV
Sbjct: 224 TVHPNTSPDMGSSHSRCPVLEGEMWYATKKFYLRAIKV 261
>gi|428175714|gb|EKX44602.1| hypothetical protein GUITHDRAFT_71994 [Guillardia theta CCMP2712]
Length = 244
Score = 143 bits (360), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 86/222 (38%), Positives = 118/222 (53%), Gaps = 18/222 (8%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV---DSDTGKSK-DSRVRTSSG 132
V+ +S PR FV NFLS EECE +I ATP + STV+ D G+ K VRTS
Sbjct: 20 VKRLSSTPRLFVVENFLSAEECEEIIKTATPLLAPSTVLKQGDQSNGEEKVKDEVRTSET 79
Query: 133 TFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF---MDEFNTK 189
+L + I+ I +R+ + P+ E +QVL Y Q Y H+D+F M
Sbjct: 80 AWLMDKKVPIVAKIRQRVEELIRIPMSYAEDMQVLKYTFKQHYHVHYDFFDPKMYPGRWS 139
Query: 190 NGGQRMATVLMYLSDVEEGGETVFPNAQGNISA-----VPWWNELSECGKTGLSIKPKMG 244
+G R+ TV YL+ VE+GGET+FP GN SA + W ++ + +KP G
Sbjct: 140 SGHNRLVTVFFYLTSVEKGGETIFPF--GNTSAEEHHKIQSWGPCENAVESSIKVKPVRG 197
Query: 245 DALLFWSMKP----DASLDPSSLHGGCPVIKGNKWSSTKWIR 282
A++F+ MKP LD +SLHGGC I G KW++ WIR
Sbjct: 198 SAVIFYLMKPHGHTHGELDHTSLHGGCDPIVGEKWAANYWIR 239
>gi|301093292|ref|XP_002997494.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110636|gb|EEY68688.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 324
Score = 143 bits (360), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 82/223 (36%), Positives = 122/223 (54%), Gaps = 18/223 (8%)
Query: 71 GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRV-RT 129
G+ + +E +S P F FL +E + ++ L+ H++ STV D + + + RT
Sbjct: 102 GKGDVVLETLSLTPLVFSVDEFLKDDEIDIIMALSLEHLKPSTVTLMDGHEDRAATDWRT 161
Query: 130 SSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTK 189
S+ FL+ + + +I++R+AD T P+++ E +QVL YE QKY+ H DYF E + K
Sbjct: 162 STTYFLSSSKHSKLDEIDQRVADLTKVPVDHQEDVQVLRYEETQKYDHHTDYFPVEHH-K 220
Query: 190 NGGQ-----------RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLS 238
N RM TV Y+SDV +GG T+FP A G P + +C TGL
Sbjct: 221 NSPHVLESIDYGYKNRMITVFWYMSDVAKGGHTIFPRAGG----APRPQSMKDCS-TGLK 275
Query: 239 IKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ PK ++F+SM P+ DP SLHGGCPV G K+S KW+
Sbjct: 276 VSPKKRKVIVFYSMLPNGQGDPMSLHGGCPVEDGIKYSGNKWV 318
>gi|348683507|gb|EGZ23322.1| hypothetical protein PHYSODRAFT_310730 [Phytophthora sojae]
Length = 417
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 82/223 (36%), Positives = 123/223 (55%), Gaps = 18/223 (8%)
Query: 71 GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRV-RT 129
G+ + +E +S P F FL +E + ++NL+ H++ S V D +++ + RT
Sbjct: 195 GKGDVVLETLSMTPLVFSVEEFLKDDEIDIIMNLSLEHLKPSGVTLMDGHENRAATDWRT 254
Query: 130 SSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTK 189
S+ FL I +I++R++D T P+++ E +QVL YE QKY+ H DYF E + K
Sbjct: 255 STTYFLPSDAHPKIDEIDQRVSDLTKVPIDHQEDVQVLRYEKTQKYDHHTDYFPVEHH-K 313
Query: 190 NGGQ-----------RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLS 238
N RM TV Y+SDV +GG T+FP A G P + +C TGL+
Sbjct: 314 NAPHILESIDYGYKNRMITVFWYMSDVAKGGHTIFPRAGG----APRPTSMKDC-TTGLN 368
Query: 239 IKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ PK ++F+SM P+ DP SLHGGCPV +G K+S KW+
Sbjct: 369 VPPKKRKVIVFYSMLPNGEGDPMSLHGGCPVEEGVKYSGNKWV 411
>gi|195390835|ref|XP_002054073.1| GJ22993 [Drosophila virilis]
gi|194152159|gb|EDW67593.1| GJ22993 [Drosophila virilis]
Length = 525
Score = 142 bits (359), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 75/211 (35%), Positives = 117/211 (55%), Gaps = 22/211 (10%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
E+++ +P +YH+ ++ E L LA P ++++TV + G++ + RTS T+L
Sbjct: 323 ELLALDPYMVLYHDVITPSEIRELQYLAVPTLKRATVFNQKMGRNTVVKTRTSKVTWLTD 382
Query: 138 GRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFN---TKNGGQR 194
+ + + +RI+D T F L E LQV++Y G Y+ HFDYF TK G R
Sbjct: 383 SLNPLTVRLNRRISDMTGFDLYGSEMLQVMNYGLGGHYDLHFDYFNATIAKDLTKLNGDR 442
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+ATVL YL+DVE+GG TVFPN + +I PK G A+++++++
Sbjct: 443 IATVLFYLTDVEQGGATVFPNIKQ-------------------AIFPKKGTAVMWYNLRH 483
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
+ DP +LH CPVI G+KW KWIR ++
Sbjct: 484 NNDGDPQTLHAACPVIVGSKWVCNKWIREHQ 514
>gi|397568865|gb|EJK46391.1| hypothetical protein THAOC_34939 [Thalassiosira oceanica]
Length = 488
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 84/219 (38%), Positives = 116/219 (52%), Gaps = 21/219 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTGKSKDSRVRTSSGTF 134
+E +S +P FL+ EEC+Y++ A P M+ S V D+D G+ S RTS TF
Sbjct: 267 IETLSMKPLVLSISGFLADEECDYIMEKAAPTMKYSGVSLKDADKGRPA-SDWRTSQSTF 325
Query: 135 LARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGG-- 192
+A D I+RDIE R A T P+ + E +QVL Y +KY+ H D+F + G
Sbjct: 326 VAAMGDPILRDIELRTASLTRVPVTHQEFVQVLRYGVTEKYDAHHDFFDPSSYRSDPGTL 385
Query: 193 --------QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMG 244
R ATV YL+DV GGET FP G P + S C TGL +KP+ G
Sbjct: 386 QLIENGKKNRYATVFWYLTDVARGGETCFPRHGG----APPPRDFSMC--TGLKVKPQKG 439
Query: 245 DALLFWSMKPDASLDPSSLHGGCPVIKGN--KWSSTKWI 281
++F+S+ +DP SLHG CPV+ KW++ KW+
Sbjct: 440 KVIIFYSLDASGEMDPLSLHGACPVLGKEDIKWAANKWL 478
>gi|294499597|ref|YP_003563297.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
gi|294349534|gb|ADE69863.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
Length = 219
Score = 142 bits (357), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 78/201 (38%), Positives = 112/201 (55%), Gaps = 26/201 (12%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI L+ M++S + G +++ + +RTSSG F ++
Sbjct: 38 EPLVLVLGNVLSNEECDELIQLSKDKMQRSKI-----GAAREVNSIRTSSGMFFEESENE 92
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
++ IE+R++ +E EGLQVL Y Q+Y+ H DYF + + R++T++MY
Sbjct: 93 LVHQIERRLSKIMGPSIEYAEGLQVLKYLPDQEYKAHHDYFTSA-SKASKNNRISTLVMY 151
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K GLS+ P G A+ F DA L+
Sbjct: 152 LNDVEEGGETYFP-------------------KLGLSVSPTKGMAVYFEYFYSDAELNDR 192
Query: 262 SLHGGCPVIKGNKWSSTKWIR 282
+LHGG PVIKG KW +T+W+R
Sbjct: 193 TLHGGAPVIKGEKWVATQWMR 213
>gi|157111033|ref|XP_001651361.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
gi|108878552|gb|EAT42777.1| AAEL005714-PA, partial [Aedes aegypti]
Length = 522
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 75/210 (35%), Positives = 115/210 (54%), Gaps = 29/210 (13%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E + +P+ ++H+ LS E E L LA P + ++T+ + TGK++ S+ R S ++
Sbjct: 322 LEEMHLKPKIVIFHDVLSDTEIELLKRLAKPILERATIANQQTGKAERSKDRVSKSSWFP 381
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGG 192
IR I KR+AD T ++ E LQV++Y G +Y+PHFD+F + E N
Sbjct: 382 DEYHSTIRTITKRVADMTGLSMDTAEELQVVNYGLGGQYDPHFDFFHWGKLKEVN----- 436
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+ATVL Y+SDV GG TVFP K G++++ + G A ++++
Sbjct: 437 -RIATVLFYMSDVSIGGATVFP-------------------KLGVTLEARKGTAAFWYNL 476
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
LD S+LHG CPV+ G KW + KWIR
Sbjct: 477 HSSGELDYSTLHGACPVLIGEKWVANKWIR 506
>gi|325915062|ref|ZP_08177391.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
ATCC 35937]
gi|325538760|gb|EGD10427.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
ATCC 35937]
Length = 286
Score = 141 bits (356), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 79/204 (38%), Positives = 109/204 (53%), Gaps = 26/204 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR V FLS EC+ +I LA P + +S VD+ G RTS L G+D +
Sbjct: 96 PRVMVLGGFLSDAECDAMIALAQPRLARSRTVDNANGAHVVHAARTSDSMCLQLGQDALC 155
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMATV 198
+ IE RIA +P+ENGEGLQVL Y G +Y+PH+DYF D T + GGQR+A++
Sbjct: 156 QRIEARIARLLDWPVENGEGLQVLRYGTGAEYQPHYDYFDPDAAGTPVLLQAGGQRVASL 215
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL+ + GG T FP+ +I+A+ G+A+ F +P
Sbjct: 216 VMYLNTPDRGGATRFPDVHLDIAAI-------------------KGNAVFFSYDRPHPMT 256
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
SLH G PV+ G KW +TKW+R
Sbjct: 257 --RSLHAGAPVLAGEKWVATKWLR 278
>gi|384046522|ref|YP_005494539.1| prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
gi|345444213|gb|AEN89230.1| Prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
Length = 219
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 77/200 (38%), Positives = 110/200 (55%), Gaps = 24/200 (12%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
EP V N LS EEC+ LI L+ M++S + + + + +RTSSG F +++
Sbjct: 38 EPLVLVLGNVLSNEECDELIQLSKDKMQRSKI----GAEREVNSIRTSSGMFFEESENEL 93
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
+ IE+R++ +E EGLQ+L Y Q+Y+ H DYF + + R++T++MYL
Sbjct: 94 VHQIERRLSKIMGPSIEYAEGLQILKYLPDQEYKAHHDYFTSA-SKASKNNRISTLVMYL 152
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
+DVEEGGET FP K GLSI P G A+ F DA L+ +
Sbjct: 153 NDVEEGGETYFP-------------------KLGLSISPTKGMAVYFEYFYSDAELNDRT 193
Query: 263 LHGGCPVIKGNKWSSTKWIR 282
LHGG PVIKG KW +T+W+R
Sbjct: 194 LHGGAPVIKGEKWVATQWMR 213
>gi|77748547|ref|NP_641044.2| hypothetical protein XAC0691 [Xanthomonas axonopodis pv. citri str.
306]
gi|381169877|ref|ZP_09879039.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|380689647|emb|CCG35526.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
Length = 286
Score = 141 bits (355), Expect = 3e-31, Method: Compositional matrix adjust.
Identities = 90/242 (37%), Positives = 123/242 (50%), Gaps = 31/242 (12%)
Query: 51 KANDLSSIVRKSMERSEGDE-----GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLA 105
+AN L VR + + D G E V V PR V FLS EC+ LI LA
Sbjct: 58 QANGLPMPVRVPALQQDTDASLLALGDREVRVLVSLLLPRVVVLGGFLSDGECDALIALA 117
Query: 106 TPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQ 165
P + +S VD+ G+ RTS L G+D + + IE RIA +P+++GEGLQ
Sbjct: 118 RPRLARSRTVDNANGEHMVHAARTSDSMCLRVGQDALCQRIEARIARLFDWPVDHGEGLQ 177
Query: 166 VLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMATVLMYLSDVEEGGETVFPNAQGNI 220
VL Y G +Y PH+DYF D T + GGQR+A+++MYL+ E GG T FP+A ++
Sbjct: 178 VLRYATGAEYRPHYDYFDPDAAGTPILLQAGGQRVASLVMYLNTPERGGATRFPDAHLDV 237
Query: 221 SAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKW 280
+AV G+A+ F +P SLH G PV+ G KW +TKW
Sbjct: 238 AAV-------------------KGNAVFFSYDRPHPMT--RSLHAGAPVLAGEKWVATKW 276
Query: 281 IR 282
+R
Sbjct: 277 LR 278
>gi|21106803|gb|AAM35580.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 306
Score = 141 bits (355), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 90/242 (37%), Positives = 123/242 (50%), Gaps = 31/242 (12%)
Query: 51 KANDLSSIVRKSMERSEGDE-----GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLA 105
+AN L VR + + D G E V V PR V FLS EC+ LI LA
Sbjct: 78 QANGLPMPVRVPALQQDTDASLLALGDREVRVLVSLLLPRVVVLGGFLSDGECDALIALA 137
Query: 106 TPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQ 165
P + +S VD+ G+ RTS L G+D + + IE RIA +P+++GEGLQ
Sbjct: 138 RPRLARSRTVDNANGEHMVHAARTSDSMCLRVGQDALCQRIEARIARLFDWPVDHGEGLQ 197
Query: 166 VLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMATVLMYLSDVEEGGETVFPNAQGNI 220
VL Y G +Y PH+DYF D T + GGQR+A+++MYL+ E GG T FP+A ++
Sbjct: 198 VLRYATGAEYRPHYDYFDPDAAGTPILLQAGGQRVASLVMYLNTPERGGATRFPDAHLDV 257
Query: 221 SAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKW 280
+AV G+A+ F +P SLH G PV+ G KW +TKW
Sbjct: 258 AAV-------------------KGNAVFFSYDRPHPMT--RSLHAGAPVLAGEKWVATKW 296
Query: 281 IR 282
+R
Sbjct: 297 LR 298
>gi|195061074|ref|XP_001995919.1| GH14105 [Drosophila grimshawi]
gi|193891711|gb|EDV90577.1| GH14105 [Drosophila grimshawi]
Length = 513
Score = 140 bits (354), Expect = 5e-31, Method: Compositional matrix adjust.
Identities = 72/209 (34%), Positives = 113/209 (54%), Gaps = 19/209 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E++ +P +YH+ +S E E L LA P ++++ VVD T ++ + RTS T+L
Sbjct: 314 MELLQLDPYMVLYHDAISPREIEDLQFLAMPRLKRAKVVDQVTHRNMMVKERTSKVTWLG 373
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ + KRI D + F + E LQV++Y G Y H+D+ T+ G R+A
Sbjct: 374 DATNAFTMRLNKRIEDMSGFTMYGSEMLQVMNYGLGGHYASHYDFLNATSKTRLNGDRIA 433
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TV+ YLSDVE+GG TVFP Q ++ P+ G A++++++K +
Sbjct: 434 TVMFYLSDVEQGGATVFPKIQK-------------------AVFPQRGTAIIWYNLKENG 474
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
D +++H CPVI G+KW KWIR NE
Sbjct: 475 DFDTNTIHAACPVIVGSKWVCNKWIRENE 503
>gi|389809938|ref|ZP_10205598.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
gi|388441354|gb|EIL97635.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
Length = 284
Score = 140 bits (353), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 81/209 (38%), Positives = 109/209 (52%), Gaps = 29/209 (13%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P V N LS EC+ LI LA P ++++ VDS+ G+ + R RTS G F ++
Sbjct: 95 PALRVLENILSARECDELIALARPRLQRALTVDSE-GRQQVDRRRTSEGMFFTLDEVPLV 153
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD-----EFNTKNGGQRMATV 198
IE+R+A P +GEGLQ+LHY GQ YEPHFD+F E T GGQR+A+V
Sbjct: 154 GRIERRVAALLDVPASHGEGLQILHYLPGQAYEPHFDWFDPDQPGYETITAVGGQRIASV 213
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL+ GG T FP GL++ + G A+ F D
Sbjct: 214 VMYLNTPARGGGTAFP-------------------ALGLTVTARRGAAVYFAYEGGDC-- 252
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
SSLH G PV++G KW +TKW+R Y+
Sbjct: 253 --SSLHAGLPVLEGEKWIATKWLRERPYR 279
>gi|251794605|ref|YP_003009336.1| procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
gi|247542231|gb|ACS99249.1| Procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
Length = 209
Score = 140 bits (353), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 77/206 (37%), Positives = 110/206 (53%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP + N LS EC+ LI+LA+ M+++ + G S D S VRTSS F ++
Sbjct: 31 EPLILILDNVLSWAECDLLIDLASARMQRAKI-----GSSHDVSEVRTSSSMFFEESENE 85
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
I +E R+A+ P+ + E LQVL Y+ G++Y PHFDYF + N R++T++MY
Sbjct: 86 CIGQVEARVAELMNIPVSHAEPLQVLRYQPGEQYHPHFDYFTQGSSMNN---RISTLVMY 142
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP+ S+ PK G A+ F D L+
Sbjct: 143 LNDVEEGGETYFPSLH-------------------FSVTPKKGSAVYFEYFYNDTRLNEL 183
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LH G PV G KW +T+W+R Y+
Sbjct: 184 TLHAGHPVEAGEKWVATQWMRRQRYR 209
>gi|295704991|ref|YP_003598066.1| 2OG-Fe(II) oxygenase [Bacillus megaterium DSM 319]
gi|294802650|gb|ADF39716.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium DSM 319]
Length = 219
Score = 140 bits (353), Expect = 6e-31, Method: Compositional matrix adjust.
Identities = 77/201 (38%), Positives = 112/201 (55%), Gaps = 26/201 (12%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP V N LS EEC+ LI L+ M++S + G +++ + +RTSSG F ++
Sbjct: 38 EPLVLVLGNVLSNEECDELIRLSKDKMQRSKI-----GAAREVNSIRTSSGMFFDESENE 92
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
++ IE+R++ +E EGLQ+L Y Q+Y+ H DYF + + R++T++MY
Sbjct: 93 LVHQIERRLSKIMGPSIEYAEGLQILKYLPDQEYKAHHDYFTSA-SKASKNNRISTLVMY 151
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP K GLS+ P G A+ F DA L+
Sbjct: 152 LNDVEEGGETYFP-------------------KLGLSVSPTKGMAVYFEYFYSDAELNDR 192
Query: 262 SLHGGCPVIKGNKWSSTKWIR 282
+LHGG PVIKG KW +T+W+R
Sbjct: 193 TLHGGAPVIKGEKWVATQWMR 213
>gi|418515355|ref|ZP_13081536.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
gi|410708074|gb|EKQ66523.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
Length = 216
Score = 140 bits (352), Expect = 8e-31, Method: Compositional matrix adjust.
Identities = 80/204 (39%), Positives = 110/204 (53%), Gaps = 26/204 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR V FLS EC+ LI LA P + +S VD+ G+ RTS L G+D +
Sbjct: 26 PRVVVLGGFLSDGECDALIALARPRLARSRTVDNANGEHLVHAARTSDSMCLRVGQDALC 85
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMATV 198
+ IE RIA +P+++GEGLQVL Y G +Y PH+DYF D T + GGQR+A++
Sbjct: 86 QRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAVGTPILLQAGGQRVASL 145
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL+ E GG T FP+A +++AV G+A+ F +P
Sbjct: 146 VMYLNTPERGGATRFPDAHLDVAAV-------------------KGNAVFFSYDRPHPMT 186
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
SLH G PV+ G KW +TKW+R
Sbjct: 187 --RSLHAGAPVLAGEKWVATKWLR 208
>gi|384429387|ref|YP_005638747.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
campestris pv. raphani 756C]
gi|341938490|gb|AEL08629.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
campestris pv. raphani 756C]
Length = 286
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 78/204 (38%), Positives = 109/204 (53%), Gaps = 26/204 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR V LS +EC+ LI LA P + +S VD+ G RTS L G+D +
Sbjct: 96 PRVVVLGGLLSDDECDALIALARPQLARSRTVDNRDGSEIVHAARTSHSMALQPGQDALC 155
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMATV 198
+ IE RIA +P+E+GEGLQVL Y G +Y PH+DYF D T ++GGQR+A++
Sbjct: 156 QRIEARIARLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLLQHGGQRVASL 215
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL+ E GG T FP+ +++AV G+A+ F +P
Sbjct: 216 VMYLNTPERGGATRFPDVHLDVAAV-------------------KGNAVFFSYDRPHPMT 256
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
+LH G PV+ G KW +TKW+R
Sbjct: 257 --RTLHAGAPVLAGEKWVATKWLR 278
>gi|196011902|ref|XP_002115814.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
gi|190581590|gb|EDV21666.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
Length = 534
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 77/211 (36%), Positives = 113/211 (53%), Gaps = 24/211 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VEVIS +P +YHN L+ E E L LA P ++++TV + DTGK + + R S +L
Sbjct: 327 VEVISLQPYILIYHNLLNDLEVEALKTLAAPMLQRATVHNKDTGKLEYATYRISKSAWLN 386
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-----MDEFNTKNG 191
++R I I D T +E+ E LQ+ +Y G YEPHFD+ D F T G
Sbjct: 387 DDDHPLVRRISTLIEDVTGLTMESAEALQIANYGIGGHYEPHFDHADVRSGTDVFKTWKG 446
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
G R+AT+L+YLS VE GG TVF +A G+ I+P+ G A +++
Sbjct: 447 GNRIATMLIYLSSVELGGATVFSSA-------------------GVRIEPRQGSAAFWYN 487
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ + + + + H CPV+ G+KW + KWI
Sbjct: 488 LHRNGNGNNLTRHAACPVLIGSKWIANKWIH 518
>gi|205374182|ref|ZP_03226981.1| prolyl 4-hydroxylase alpha subunit [Bacillus coahuilensis m4-4]
Length = 210
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 74/200 (37%), Positives = 117/200 (58%), Gaps = 26/200 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
EP V N LS EEC+ LI+L+ M +S + ++++ +RTS+ FL ++
Sbjct: 32 EPFVAVLGNVLSDEECDELISLSKDRMNRSKI-----AGNQENDIRTSTSVFLPEDASEV 86
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
++ +EKRI+ P+E+GEGLQ+L+Y+ GQ+Y+ HFD+F + +N R++T+++YL
Sbjct: 87 VQRVEKRISQIMNIPVEHGEGLQLLNYQIGQEYKAHFDFFSPKKLIEN--PRISTLVLYL 144
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
+DVEEGG+T FPN + LS+ P G A+ F D L+ +
Sbjct: 145 NDVEEGGDTYFPNLK-------------------LSVSPHKGMAVYFEYFYDDPMLNELT 185
Query: 263 LHGGCPVIKGNKWSSTKWIR 282
LHGG PV G+KW++T W+R
Sbjct: 186 LHGGAPVTIGDKWAATMWMR 205
>gi|357135727|ref|XP_003569460.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 2
[Brachypodium distachyon]
Length = 314
Score = 139 bits (351), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 75/210 (35%), Positives = 124/210 (59%), Gaps = 16/210 (7%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSD----TGKSKDSRVRTSSGTFL 135
++W PR F+Y FLS EC++L+ +A ++ S +V++ T S D+R + L
Sbjct: 63 LAWHPRVFLYEGFLSGMECDHLVYVARLNIESSLLVNAGARNITQNSTDARFKFQ----L 118
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
A +D ++ IE RI+ ++F P E+GE +Q+L Y + Q D+ D + +GG R+
Sbjct: 119 ADSKDIVVSKIEDRISLWSFIPKEHGESMQILKYGSNQS-----DHNKDGTQSSSGGNRL 173
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
T+LMYLSDV++GGETVFP ++ + LSEC G ++KP GDA+L ++++PD
Sbjct: 174 VTILMYLSDVKQGGETVFPRSELKDTQAK-EGALSECA--GYAVKPVKGDAILLFNLRPD 230
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
D S + C V++G KW + K + +++
Sbjct: 231 GVTDSDSHYEDCSVLEGEKWLAIKHLHISK 260
>gi|159462456|ref|XP_001689458.1| predicted protein [Chlamydomonas reinhardtii]
gi|158283446|gb|EDP09196.1| predicted protein [Chlamydomonas reinhardtii]
Length = 221
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 79/201 (39%), Positives = 106/201 (52%), Gaps = 31/201 (15%)
Query: 87 FVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDI 146
VYHNFLS EC ++I+LA M++STVV S D +RTS GTFL R D +I I
Sbjct: 1 MVYHNFLSDRECRHIIDLAHAQMKRSTVVGSKNAGVVDD-IRTSYGTFLRRVPDPVIAAI 59
Query: 147 EKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVE 206
E R+A ++ P + E +QVL Y KY PH D G +R+ATVL+YL E
Sbjct: 60 EHRLALWSHLPASHQEDMQVLRYGPTNKYGPHID----------GLERVATVLIYLGQAE 109
Query: 207 EGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD-ASLDPSSLHG 265
LS+C + ++ KPK GDAL+F+ PD D S+H
Sbjct: 110 RA-------------------NLSQCARGRVAYKPKRGDALMFFDTMPDYKQTDVHSMHT 150
Query: 266 GCPVIKGNKWSSTKWIRVNEY 286
GCPV++G KW++ KW+ Y
Sbjct: 151 GCPVVEGVKWNAVKWLHGTPY 171
>gi|195341590|ref|XP_002037389.1| GM12139 [Drosophila sechellia]
gi|194131505|gb|EDW53548.1| GM12139 [Drosophila sechellia]
Length = 525
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 118/211 (55%), Gaps = 21/211 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E++ EP +YH+ LS +E L +ATP ++++TV + +G+++ + RTS +
Sbjct: 322 MELVGLEPYMVLYHDVLSPKEITELQGMATPGLKRATVYQASSGRNEVVKTRTSKVAWFP 381
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFN--TKNGGQR 194
G + + + RI+D T F L E LQ+++Y G Y+ H+D+F + + T G R
Sbjct: 382 DGYNPLTVRLNARISDMTGFNLYGSEMLQLMNYGLGGHYDQHYDFFNNTNSNMTAMSGDR 441
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+ATVL YL+DVE+GG TVFPN + ++ P+ G +++++++
Sbjct: 442 IATVLFYLTDVEQGGATVFPNIRK-------------------AVFPQRGSVVMWYNLRD 482
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
+ +D +LH CPVI G+KW KWIR E
Sbjct: 483 NGQIDTQTLHAACPVIVGSKWVCNKWIRERE 513
>gi|21711777|gb|AAM75079.1| RE70601p [Drosophila melanogaster]
Length = 316
Score = 139 bits (350), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 118/211 (55%), Gaps = 21/211 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E++ +P +YH+ LS +E + L +ATP ++++TV + +G+++ + RTS +
Sbjct: 113 MELVGLDPYMVLYHDVLSPKEIKELQGMATPSLKRATVYQASSGRNEVVKTRTSKVAWFP 172
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF--MDEFNTKNGGQR 194
G + + + RI+D T F L E LQ+++Y G Y+ H+D+F + T G R
Sbjct: 173 DGYNPLTVRLNARISDMTGFNLYGSEMLQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDR 232
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+ATVL YL+DVE+GG TVFPN + ++ P+ G ++++++K
Sbjct: 233 IATVLFYLTDVEQGGATVFPNIRK-------------------AVFPQRGSVVMWYNLKD 273
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
+ +D +LH CPVI G+KW KWIR E
Sbjct: 274 NGQIDTQTLHAACPVIVGSKWVCNKWIRERE 304
>gi|24651477|ref|NP_733395.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
gi|20269812|gb|AAM18061.1|AF495539_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]PV [Drosophila
melanogaster]
gi|23172718|gb|AAN14252.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
Length = 525
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 118/211 (55%), Gaps = 21/211 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E++ +P +YH+ LS +E + L +ATP ++++TV + +G+++ + RTS +
Sbjct: 322 MELVGLDPYMVLYHDVLSPKEIKELQGMATPGLKRATVYQASSGRNEVVKTRTSKVAWFP 381
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF--MDEFNTKNGGQR 194
G + + + RI+D T F L E LQ+++Y G Y+ H+D+F + T G R
Sbjct: 382 DGYNPLTVRLNARISDMTGFNLYGSEMLQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDR 441
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+ATVL YL+DVE+GG TVFPN + ++ P+ G ++++++K
Sbjct: 442 IATVLFYLTDVEQGGATVFPNIRK-------------------AVFPQRGSVVMWYNLKD 482
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
+ +D +LH CPVI G+KW KWIR E
Sbjct: 483 NGQIDTQTLHAACPVIVGSKWVCNKWIRERE 513
>gi|389775678|ref|ZP_10193553.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
gi|388437120|gb|EIL93940.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
Length = 284
Score = 139 bits (350), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 80/208 (38%), Positives = 108/208 (51%), Gaps = 29/208 (13%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P V N L+ EECE LI LA P ++++ V SD D R RTS G F ++
Sbjct: 95 PALRVLENLLAAEECEELIALAQPRLKRALTVASDGSNQVDQR-RTSEGMFFTLNELPLV 153
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFN-----TKNGGQRMATV 198
IE+R+A P+ +GEGLQ+LHY GQ+YEPHFD+F + T GGQR+A+V
Sbjct: 154 GRIEQRLATLLGMPVSHGEGLQILHYLPGQEYEPHFDWFDPQQPGYDTITAVGGQRVASV 213
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL+ +GG T FP + GL++ + G A+ F
Sbjct: 214 VMYLNTPAQGGGTAFP-------------------ELGLTVTARRGAAVYFAY----EGG 250
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
D SLH G PV +G KW +TKW+R Y
Sbjct: 251 DQQSLHAGLPVQRGEKWIATKWLRERPY 278
>gi|77761111|ref|YP_241833.2| hypothetical protein XC_0735 [Xanthomonas campestris pv. campestris
str. 8004]
Length = 288
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 77/204 (37%), Positives = 109/204 (53%), Gaps = 26/204 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR V L+ +EC+ LI LA P + +S VD+ G RTS L G+D +
Sbjct: 98 PRVVVLGGLLADDECDALIALARPQLARSRTVDNRDGSEIVHAARTSHSMALQPGQDALC 157
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMATV 198
+ IE RIA +P+E+GEGLQVL Y G +Y PH+DYF D T ++GGQR+A++
Sbjct: 158 QRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLLQHGGQRVASL 217
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL+ E GG T FP+ +++AV G+A+ F +P
Sbjct: 218 VMYLNTPERGGATRFPDVHLDVAAV-------------------KGNAVFFSYDRPHPMT 258
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
+LH G PV+ G KW +TKW+R
Sbjct: 259 --RTLHAGAPVLAGEKWVATKWLR 280
>gi|389793983|ref|ZP_10197143.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
gi|388433014|gb|EIL89992.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
Length = 282
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 78/204 (38%), Positives = 106/204 (51%), Gaps = 29/204 (14%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P V LS+ EC LI LA P ++++ VDSD + D R RTS G F G ++
Sbjct: 93 PALRVLDGLLSERECADLIELARPRLQRALTVDSDGKQQIDQR-RTSEGMFFRAGETPLV 151
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFN-----TKNGGQRMATV 198
IE+R+A P +GEGLQ+LHY GQ+YEPH+D+F T GQR+A+V
Sbjct: 152 AAIEQRLAQLLGVPASHGEGLQILHYGPGQEYEPHYDWFDPALPGYDKLTARAGQRIASV 211
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL+ E GG T FP + GL++ + G A+ F
Sbjct: 212 VMYLNTPERGGGTAFP-------------------EIGLTVTARRGAAVYFAY----EGG 248
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D SSLH G PV++G KW +T W+R
Sbjct: 249 DQSSLHAGLPVLQGEKWIATHWLR 272
>gi|66572403|gb|AAY47813.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. 8004]
Length = 308
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 77/204 (37%), Positives = 109/204 (53%), Gaps = 26/204 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR V L+ +EC+ LI LA P + +S VD+ G RTS L G+D +
Sbjct: 118 PRVVVLGGLLADDECDALIALARPQLARSRTVDNRDGSEIVHAARTSHSMALQPGQDALC 177
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMATV 198
+ IE RIA +P+E+GEGLQVL Y G +Y PH+DYF D T ++GGQR+A++
Sbjct: 178 QRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLLQHGGQRVASL 237
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL+ E GG T FP+ +++AV G+A+ F +P
Sbjct: 238 VMYLNTPERGGATRFPDVHLDVAAV-------------------KGNAVFFSYDRPHPMT 278
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
+LH G PV+ G KW +TKW+R
Sbjct: 279 --RTLHAGAPVLAGEKWVATKWLR 300
>gi|89096248|ref|ZP_01169141.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
NRRL B-14911]
gi|89089102|gb|EAR68210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
NRRL B-14911]
Length = 217
Score = 139 bits (349), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 74/200 (37%), Positives = 115/200 (57%), Gaps = 24/200 (12%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
EP + N LS EECE LI ++ +++S + ++ T +RTSS F G +++
Sbjct: 38 EPLIVILGNVLSDEECEGLIRMSEDKLKRSKIGNTRTVDD----IRTSSSMFFEEGENEL 93
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
+ IE+R++ P+E+GEGLQ+L+Y GQ+Y+ HFD+F + R++T++MYL
Sbjct: 94 VARIERRLSQIMNIPVEHGEGLQMLNYHIGQEYKAHFDFFS-SSSRAASNPRISTLVMYL 152
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
+DVEEGGET FP K S+ P+ G A+ F + L+ +
Sbjct: 153 NDVEEGGETYFP-------------------KLNFSVNPQKGSAVYFEYFYDNQDLNDLT 193
Query: 263 LHGGCPVIKGNKWSSTKWIR 282
LHGG PVIKG+KW++T+W+R
Sbjct: 194 LHGGAPVIKGSKWAATQWMR 213
>gi|430751569|ref|YP_007214477.1| 2OG-Fe(II) oxygenase [Thermobacillus composti KWC4]
gi|430735534|gb|AGA59479.1| 2OG-Fe(II) oxygenase superfamily enzyme [Thermobacillus composti
KWC4]
Length = 215
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 78/214 (36%), Positives = 113/214 (52%), Gaps = 27/214 (12%)
Query: 69 DEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVR 128
D+G E V+ EP + LS +EC LI A P +++S +V+ S +R
Sbjct: 17 DDGVVE--ATVLHQEPLIVRFERLLSDDECRQLIETAAPRLKESKLVNKVV-----SDIR 69
Query: 129 TSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT 188
TS G F I IE+RIA P+E+ EGLQVLHY GQ+Y+ H D+F +
Sbjct: 70 TSRGMFFEEEESPFIHRIERRIAQLMNVPIEHAEGLQVLHYGPGQEYKAHHDFFAPG-SP 128
Query: 189 KNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALL 248
R++T+++YL+DVEEGGETVFP G+++KPK G AL
Sbjct: 129 AARNNRISTLIVYLNDVEEGGETVFP-------------------LLGIAMKPKRGAALY 169
Query: 249 FWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
F + +L+ +LH PV++G KW +T+W+R
Sbjct: 170 FEYFYRNQALNDLTLHSSVPVVRGEKWVATQWMR 203
>gi|359490628|ref|XP_002271805.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Vitis
vinifera]
Length = 312
Score = 138 bits (348), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 85/226 (37%), Positives = 132/226 (58%), Gaps = 20/226 (8%)
Query: 62 SMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGK 121
S+E + D R Q +SW+PRAF+Y FLS EEC++LI+LA + D+G
Sbjct: 46 SIEYNRVDPSRVIQ----LSWQPRAFLYRGFLSDEECDHLISLALGKKEELATNGGDSGN 101
Query: 122 SKDSRV-RTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYE---AGQKYEP 177
R+ ++S G D++ IEKRI+ +TF P EN E L+V+ Y+ A QKY
Sbjct: 102 VVLKRLLKSSEGPLYID--DEVAARIEKRISAWTFLPKENSEPLEVVQYQFENAKQKY-- 157
Query: 178 HFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGL 237
+YF ++ +K G MATVL++LS+V GGE FP ++ S + LS+C ++
Sbjct: 158 --NYFSNKSTSKFGEPLMATVLLHLSNVTRGGELFFPESESK-SGI-----LSDCTESSS 209
Query: 238 SIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRV 283
++P G+A+LF+++ P+AS D SS + CPV++G W +TK+ +
Sbjct: 210 GLRPVKGNAILFFNVHPNASPDKSSSYARCPVLEGEMWCATKFFHL 255
>gi|386712780|ref|YP_006179102.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
2266]
gi|384072335|emb|CCG43825.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
2266]
Length = 211
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 77/204 (37%), Positives = 113/204 (55%), Gaps = 26/204 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P + N +S+EECE LI L+ M +S + + + S +RTSS TFL D +
Sbjct: 34 PLIAILGNVVSEEECEELIFLSKNKMNRSKI----GSQHEVSDIRTSSSTFLPE--DDLT 87
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLS 203
IEKR+A P+E+GEGL +L+Y+ GQ+Y+ H+DYF + N R++T+++YL+
Sbjct: 88 NRIEKRVAQIMNVPVEHGEGLHILNYKQGQEYKAHYDYFRSKAKAAN-NPRISTLVLYLN 146
Query: 204 DVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSL 263
DVEEGGET FP+ LSI P G A+ F D ++ +L
Sbjct: 147 DVEEGGETYFPH-------------------MNLSISPHKGMAVYFEYFYSDPLINERTL 187
Query: 264 HGGCPVIKGNKWSSTKWIRVNEYK 287
HGG PV G KW++T W+R +Y+
Sbjct: 188 HGGSPVTSGEKWAATMWVRRKQYR 211
>gi|428183249|gb|EKX52107.1| hypothetical protein GUITHDRAFT_150687 [Guillardia theta CCMP2712]
Length = 315
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 79/223 (35%), Positives = 117/223 (52%), Gaps = 26/223 (11%)
Query: 72 RAEQWVEVIS--WEPRAFVYHNFLSKEECEYLINLAT-PHMRKSTVVDSDTGKSKDSRVR 128
+ E W+E IS PR +V HN L+KEECE L +L M K+ ++ + +S R
Sbjct: 74 KNEFWIETISDLPGPRIYVLHNILTKEECESLKSLGVMAGMEKALIIPYGGKELVESSTR 133
Query: 129 TSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEF-- 186
T++ +L + ++ +E +A T ENGE LQ+LHY+ Q+++ H DYF
Sbjct: 134 TNTAAWLEYHQGPVVTKLENLLAKVTNTEPENGENLQILHYQTSQQFKEHHDYFDPATDP 193
Query: 187 --NTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMG 244
N + GG R+AT ++YL + EEGGET F K +KP+ G
Sbjct: 194 PENFEPGGNRLATAIIYLQNAEEGGETDF-------------------MKIDTKVKPEAG 234
Query: 245 DALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
A+LF+ +KPD S+D ++H G P G KW +TKWI Y+
Sbjct: 235 SAVLFYDLKPDGSVDKLTIHSGNPPKGGEKWVATKWIHERRYQ 277
>gi|317127314|ref|YP_004093596.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
2522]
gi|315472262|gb|ADU28865.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
2522]
Length = 229
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 70/200 (35%), Positives = 112/200 (56%), Gaps = 25/200 (12%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
EP + N LS+EEC+ LI+L+ + +S + + +RTSS F + +
Sbjct: 43 EPLIVLLGNVLSEEECDQLISLSKDRIERSKISNKSVHD-----LRTSSSMFFDDAENDV 97
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
+ +EKR++ P+++GEG+Q+L+Y GQ+Y+ H+DYF N+K R++T++MYL
Sbjct: 98 VSTVEKRVSQIMKIPVDHGEGIQILNYAIGQEYKAHYDYF-SSGNSKVNNPRISTLVMYL 156
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
+DVE GGET FP K + PK G A+ F D +L+ +
Sbjct: 157 NDVEAGGETYFP-------------------KLNFYVAPKKGMAVYFEYFYNDTTLNELT 197
Query: 263 LHGGCPVIKGNKWSSTKWIR 282
LHGG PV+ G+KW++T+W+R
Sbjct: 198 LHGGAPVVIGDKWAATQWMR 217
>gi|398810140|ref|ZP_10568970.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
gi|398083831|gb|EJL74535.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
Length = 296
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 74/204 (36%), Positives = 110/204 (53%), Gaps = 24/204 (11%)
Query: 88 VYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIE 147
V + +ECE LI LA P + ST VD +G+ R+S G F + I ++
Sbjct: 103 VLDDVFDPQECEELIALARPRLAPSTTVDPLSGRDLVGEQRSSLGMFFRLRENAFIARLD 162
Query: 148 KRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DEFNTKNGGQRMATVLMYL 202
+R+++ P+ENGEGLQVL Y AG + PHFD+ + ++ + GQR++T++ YL
Sbjct: 163 QRVSELMNLPVENGEGLQVLCYPAGAQSMPHFDFLVPSNAANKASLARSGQRVSTLVSYL 222
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
++VEEGGET+FP EC G S+ P+ G A+ F +D +S
Sbjct: 223 NEVEEGGETIFP----------------EC---GWSVPPRRGSAVYFEYCNSLGQVDHAS 263
Query: 263 LHGGCPVIKGNKWSSTKWIRVNEY 286
LH G PV+ G KW +TKW+R +
Sbjct: 264 LHAGGPVLHGEKWVATKWMRQRRF 287
>gi|195575145|ref|XP_002105540.1| GD16902 [Drosophila simulans]
gi|194201467|gb|EDX15043.1| GD16902 [Drosophila simulans]
Length = 525
Score = 138 bits (347), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 69/211 (32%), Positives = 117/211 (55%), Gaps = 21/211 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E++ +P +YH+ LS +E L +ATP ++++TV + +G+++ + RTS +
Sbjct: 322 MELVGLDPYMVLYHDVLSPKEITELQGMATPGLKRATVYQASSGRNEVVKTRTSKVAWFP 381
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF--MDEFNTKNGGQR 194
G + + + RI+D T F L E LQ+++Y G Y+ H+D+F + T G R
Sbjct: 382 DGYNPLTVRLNARISDMTGFNLYGSEMLQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDR 441
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+ATVL YL+DVE+GG TVFPN + ++ P+ G +++++++
Sbjct: 442 IATVLFYLTDVEQGGATVFPNIRK-------------------AVFPQRGSVVMWYNLRD 482
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
+ +D +LH CPVI G+KW KWIR E
Sbjct: 483 NGQIDTQTLHAACPVIVGSKWVCNKWIRERE 513
>gi|294627644|ref|ZP_06706226.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|292597996|gb|EFF42151.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
Length = 296
Score = 138 bits (347), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 89/242 (36%), Positives = 122/242 (50%), Gaps = 31/242 (12%)
Query: 51 KANDLSSIVRKSMERSEGDE-----GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLA 105
+AN L VR + + D G E V V P V FLS EC+ LI LA
Sbjct: 68 QANGLPMPVRVPALQQDTDASLLALGDREVRVLVSLLLPCVVVLGGFLSGGECDALIALA 127
Query: 106 TPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQ 165
P + +S VD+ G+ RTS L G+D + + IE RIA +P+++GEGLQ
Sbjct: 128 RPRLARSRTVDNANGEHVVHAARTSDSMCLRVGQDALCQRIEARIARLLDWPVDHGEGLQ 187
Query: 166 VLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMATVLMYLSDVEEGGETVFPNAQGNI 220
VL Y G +Y PH+DYF D T + GGQR+A+++MYL+ E GG T FP+A ++
Sbjct: 188 VLRYGTGAEYRPHYDYFDPDAAGTPVLLQAGGQRVASLVMYLNTPERGGATRFPDAHLDV 247
Query: 221 SAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKW 280
+AV G+A+ F +P SLH G PV+ G KW +TKW
Sbjct: 248 AAV-------------------KGNAVFFSYDRPHPMT--RSLHAGAPVLAGEKWVATKW 286
Query: 281 IR 282
+R
Sbjct: 287 LR 288
>gi|294666178|ref|ZP_06731433.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292604043|gb|EFF47439.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 296
Score = 137 bits (346), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 89/242 (36%), Positives = 122/242 (50%), Gaps = 31/242 (12%)
Query: 51 KANDLSSIVRKSMERSEGDE-----GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLA 105
+AN L VR + + D G E V V P V FLS EC+ LI LA
Sbjct: 68 QANGLPMPVRVPALQQDTDASLLALGDREVRVLVSLLLPCVVVLGGFLSGGECDALIALA 127
Query: 106 TPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQ 165
P + +S VD+ G+ RTS L G+D + + IE RIA +P+++GEGLQ
Sbjct: 128 RPRLARSRTVDNANGEHVVHAARTSDSMCLRVGQDALCQRIEARIARLLDWPVDHGEGLQ 187
Query: 166 VLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMATVLMYLSDVEEGGETVFPNAQGNI 220
VL Y G +Y PH+DYF D T + GGQR+A+++MYL+ E GG T FP+A ++
Sbjct: 188 VLRYGTGAEYRPHYDYFDPDAAGTPVLLQAGGQRVASLVMYLNTPERGGATRFPDAHLDV 247
Query: 221 SAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKW 280
+AV G+A+ F +P SLH G PV+ G KW +TKW
Sbjct: 248 AAV-------------------KGNAVFFSYDRPHPMT--RSLHAGAPVLAGEKWVATKW 286
Query: 281 IR 282
+R
Sbjct: 287 LR 288
>gi|340357957|ref|ZP_08680560.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
gi|339616017|gb|EGQ20677.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
Length = 211
Score = 137 bits (346), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 72/200 (36%), Positives = 114/200 (57%), Gaps = 25/200 (12%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
EP V N LS EEC+ LI LA +++S + + +++ +RTSS F+ + I
Sbjct: 32 EPLIVVLGNVLSDEECDELIQLAGDKVKRSKIGTT----REENELRTSSSMFIEDDENLI 87
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
+ ++KRI+ P+E+GEGLQ+L Y GQ+Y+ H D+F + N R++T++MYL
Sbjct: 88 VTRVKKRISAIMKIPMEHGEGLQILRYTPGQQYKAHHDFFSSDSKITNN--RISTLVMYL 145
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
+DVE+GGET FP+ + S+ P+ G A+ F D +L+ +
Sbjct: 146 NDVEQGGETFFPHLK-------------------FSVSPRKGMAVYFEYFYSDQTLNDFT 186
Query: 263 LHGGCPVIKGNKWSSTKWIR 282
LHGG PV++G KW +T+W+R
Sbjct: 187 LHGGAPVVEGEKWVATQWMR 206
>gi|302143843|emb|CBI22704.3| unnamed protein product [Vitis vinifera]
Length = 317
Score = 137 bits (344), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 85/226 (37%), Positives = 131/226 (57%), Gaps = 15/226 (6%)
Query: 62 SMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGK 121
S+E + D R Q +SW+PRAF+Y FLS EEC++LI+LA + D+G
Sbjct: 46 SIEYNRVDPSRVIQ----LSWQPRAFLYRGFLSDEECDHLISLALGKKEELATNGGDSGN 101
Query: 122 SKDSRV-RTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYE---AGQKYEP 177
R+ ++S G D++ IEKRI+ +TF P EN E L+V+ Y+ A QKY
Sbjct: 102 VVLKRLLKSSEGPLYID--DEVAARIEKRISAWTFLPKENSEPLEVVQYQFENAKQKY-- 157
Query: 178 HFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGL 237
+YF ++ +K G MATVL++LS+V GGE FP ++ S LS+C ++
Sbjct: 158 --NYFSNKSTSKFGEPLMATVLLHLSNVTRGGELFFPESELKNSQSKS-GILSDCTESSS 214
Query: 238 SIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRV 283
++P G+A+LF+++ P+AS D SS + CPV++G W +TK+ +
Sbjct: 215 GLRPVKGNAILFFNVHPNASPDKSSSYARCPVLEGEMWCATKFFHL 260
>gi|194765138|ref|XP_001964684.1| GF23317 [Drosophila ananassae]
gi|190614956|gb|EDV30480.1| GF23317 [Drosophila ananassae]
Length = 520
Score = 137 bits (344), Expect = 7e-30, Method: Compositional matrix adjust.
Identities = 69/207 (33%), Positives = 112/207 (54%), Gaps = 20/207 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E++ P +YH+ LS E + + +ATP ++++TV + GK++ + RTS +
Sbjct: 320 MEIVGLNPYMVIYHDVLSSAEIDEMKEMATPSLKRATVYKASLGKNEVVKTRTSKVAWFP 379
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNTKNGGQRM 195
+ + + RI D T F L E LQ+++Y G Y+ H+D+F E ++ G R+
Sbjct: 380 DSYNSLTLRLNARIHDMTGFDLSGSEMLQLMNYGLGGHYDKHYDFFNATEKSSSLTGDRI 439
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
ATVL Y+SDVE+GG TVFPN ++ P+ G A++++++K D
Sbjct: 440 ATVLFYMSDVEQGGATVFPNIYK-------------------TVYPQRGTAVMWYNLKDD 480
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D +LH CPV+ G+KW KWIR
Sbjct: 481 GQPDEQTLHAACPVLVGSKWVCNKWIR 507
>gi|77747935|ref|NP_638775.2| hypothetical protein XCC3429 [Xanthomonas campestris pv. campestris
str. ATCC 33913]
Length = 288
Score = 136 bits (343), Expect = 8e-30, Method: Compositional matrix adjust.
Identities = 76/204 (37%), Positives = 108/204 (52%), Gaps = 26/204 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR V L+ +EC+ LI LA P + +S VD+ G RTS L G+D +
Sbjct: 98 PRVVVLGGLLADDECDALIALARPQLARSRTVDNRDGSEIVHAARTSHSMALQPGQDALC 157
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMATV 198
+ IE RIA +P+E+GEGLQVL Y G +Y PH+DYF D T ++GGQR+A++
Sbjct: 158 QRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLLQHGGQRVASL 217
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL+ E GG T P+ +++AV G+A+ F +P
Sbjct: 218 VMYLNTPERGGATRVPDVHLDVAAV-------------------KGNAVFFSYDRPHPMT 258
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
+LH G PV+ G KW +TKW+R
Sbjct: 259 --RTLHAGAPVLAGEKWVATKWLR 280
>gi|433460968|ref|ZP_20418587.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
gi|432190746|gb|ELK47751.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
Length = 211
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 77/206 (37%), Positives = 112/206 (54%), Gaps = 28/206 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP+ + N +S+EECE LI L+ K V S G D S +RTSS FL D+
Sbjct: 33 EPKIAILGNVVSEEECEALIRLS-----KDKVNRSKIGSDHDVSDIRTSSSAFLPD--DE 85
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKR+A P+E+GEG+ +LHY+ GQ+Y+ H DYF R++T+++Y
Sbjct: 86 LTGRIEKRLAQIMNVPVEHGEGIHILHYKPGQEYKAHHDYFRSTSRAAK-NPRISTLVLY 144
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP + L++ P G A+ F D +++
Sbjct: 145 LNDVEEGGETYFP-------------------EMNLTVSPHKGMAVYFEYFYNDPAINER 185
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYK 287
+LHGG PV G KW++T W+R +Y+
Sbjct: 186 TLHGGSPVTAGEKWAATMWVRRQQYR 211
>gi|357459545|ref|XP_003600053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
gi|355489101|gb|AES70304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
truncatula]
Length = 156
Score = 136 bits (343), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 68/126 (53%), Positives = 91/126 (72%), Gaps = 4/126 (3%)
Query: 92 FLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIA 151
+ SKEECE+LI L P++ +S + D TGK ++R + G F+ +DKII++IE+RI
Sbjct: 25 YESKEECEHLIKLGKPYLERSRISDKRTGKGIENRFAYACGGFV---KDKIIKNIEQRIP 81
Query: 152 DFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGET 211
D P+ENGEGLQV+HY GQK+ PH+D +E + NGG R+AT LMYLSDVEEGGET
Sbjct: 82 DIISIPVENGEGLQVIHYGVGQKFVPHYDSRSNE-SFWNGGPRVATFLMYLSDVEEGGET 140
Query: 212 VFPNAQ 217
VFP+A+
Sbjct: 141 VFPSAK 146
>gi|21114687|gb|AAM42699.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. ATCC 33913]
Length = 308
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 76/204 (37%), Positives = 108/204 (52%), Gaps = 26/204 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR V L+ +EC+ LI LA P + +S VD+ G RTS L G+D +
Sbjct: 118 PRVVVLGGLLADDECDALIALARPQLARSRTVDNRDGSEIVHAARTSHSMALQPGQDALC 177
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT----KNGGQRMATV 198
+ IE RIA +P+E+GEGLQVL Y G +Y PH+DYF D T ++GGQR+A++
Sbjct: 178 QRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLLQHGGQRVASL 237
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+MYL+ E GG T P+ +++AV G+A+ F +P
Sbjct: 238 VMYLNTPERGGATRVPDVHLDVAAV-------------------KGNAVFFSYDRPHPMT 278
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
+LH G PV+ G KW +TKW+R
Sbjct: 279 --RTLHAGAPVLAGEKWVATKWLR 300
>gi|159490898|ref|XP_001703410.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
gi|158280334|gb|EDP06092.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
Length = 429
Score = 136 bits (342), Expect = 1e-29, Method: Compositional matrix adjust.
Identities = 84/209 (40%), Positives = 116/209 (55%), Gaps = 6/209 (2%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSK-DSRVRTSSGTFLA 136
+++S PR V+ NF+ K E +I LA+ M S + + + + +VRTS GTFL
Sbjct: 216 QILSLYPRIKVFPNFVDKARREEIIALASKFMYPSGLAYRPGEQVEAEQQVRTSKGTFLG 275
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +E +IA T P +NGE VL+Y+ Q Y+ H D F + + QR+A
Sbjct: 276 GDSSPALTWLESKIAAVTDIPRQNGEFWNVLNYKHTQHYDSHMDSFDPKEYGQQYSQRIA 335
Query: 197 TVLMYLSDVE-EGGETVFP-NAQGNIS-AVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
TV++ LSD GGETVF + NI + W + G GL KP+ GDA+LFWS
Sbjct: 336 TVIVVLSDEGLVGGETVFKREGKANIDKPITNWTDCDADG--GLRYKPRAGDAVLFWSAF 393
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
PD LD +LHG CPV+ GNKW + KWIR
Sbjct: 394 PDGRLDQHALHGSCPVVTGNKWVAVKWIR 422
>gi|253575459|ref|ZP_04852796.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
gi|251845106|gb|EES73117.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
Length = 215
Score = 135 bits (341), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 73/204 (35%), Positives = 106/204 (51%), Gaps = 25/204 (12%)
Query: 79 VISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
V+ EP + L+ +EC LI A P +R+S +V+ S +RTS G F
Sbjct: 25 VLHKEPLIMRFERLLTDDECRQLIEAAAPRLRESKLVNKVV-----SEIRTSRGMFFEEE 79
Query: 139 RDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATV 198
+ I IEKRI+ P+E+ EGLQVLHY GQ+Y+ H+D+F + R++T+
Sbjct: 80 ENPFIHRIEKRISALMNVPIEHAEGLQVLHYGPGQEYQAHYDFFGPN-SPSASNNRISTL 138
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+DVE GGETVFP L +KP+ G AL F L
Sbjct: 139 IIYLNDVEAGGETVFP-------------------LLDLEVKPERGSALYFEYFYRQQEL 179
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
+ +LH PV++G KW +T+W+R
Sbjct: 180 NNLTLHSSVPVVRGEKWVATQWMR 203
>gi|194905290|ref|XP_001981166.1| GG11918 [Drosophila erecta]
gi|190655804|gb|EDV53036.1| GG11918 [Drosophila erecta]
Length = 525
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 69/211 (32%), Positives = 116/211 (54%), Gaps = 21/211 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E++ +P +YH+ LS +E + L +ATP + ++TV + +G+++ + RTS +
Sbjct: 322 MELVGLDPYMVLYHDVLSAKEIKELQGMATPGLTRATVFQASSGRNEVVKTRTSKVAWFP 381
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF--MDEFNTKNGGQR 194
+ + + RIAD T F L E LQ+++Y G Y+ H+D+F ++ T G R
Sbjct: 382 DSYNPLTVRLNARIADMTGFNLYGSEMLQLMNYGLGGHYDQHYDFFNTINSNLTAMSGDR 441
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+ATVL YL+DVE+GG TVFPN + ++ P+ G +++++++
Sbjct: 442 IATVLFYLTDVEQGGATVFPNIRK-------------------AVFPQRGSVIMWYNLQD 482
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
+ D +LH CPVI G+KW KWIR E
Sbjct: 483 NGQTDNKTLHAACPVIVGSKWVCNKWIRERE 513
>gi|198449500|ref|XP_001357604.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
gi|198130634|gb|EAL26738.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
Length = 528
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 69/209 (33%), Positives = 115/209 (55%), Gaps = 22/209 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E++ +P +YH+ +S E L ++ATP ++++TV + +S+ + RTS +
Sbjct: 323 MELVGLDPYMVLYHDVISAPEISQLQDMATPGLKRATVYKASGRRSEVVKTRTSKVAWFP 382
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFN---TKNGGQ 193
+++ + +RIAD T F L E LQ ++Y G Y+ H+D+F T+ G
Sbjct: 383 DTFNELTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKHYDFFNASTATNLTQMNGD 442
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R+ATVL YL+DVE+GG TVFPN + ++ P+ G A++++++K
Sbjct: 443 RIATVLFYLTDVEQGGATVFPNIRK-------------------AVFPQRGSAIIWYNLK 483
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D +P +LH CPV+ G+KW KWIR
Sbjct: 484 DDGDPNPQTLHAACPVLVGSKWVCNKWIR 512
>gi|402813396|ref|ZP_10862991.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
gi|402509339|gb|EJW19859.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
Length = 215
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 76/207 (36%), Positives = 114/207 (55%), Gaps = 28/207 (13%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
EP + N LS EEC+ LI + +++S + G+ + +++RTSSG F ++
Sbjct: 35 EPLIVILGNVLSNEECDELIEHSKERLQRSKI-----GEERSVNQIRTSSGVFCEE--NE 87
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
+ IEKRI+ P+E+G+GLQVL Y GQ+Y+PHFD+F D + + R++T++MY
Sbjct: 88 TVAKIEKRISQIMNIPIEHGDGLQVLLYAPGQEYKPHFDFFADT-SRASANNRISTLVMY 146
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+DVEEGGET FP LS+ P G A+ F + L+
Sbjct: 147 LNDVEEGGETTFP-------------------MLNLSVFPSKGMAVYFEYFYSNHELNER 187
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEYKV 288
+LH G PV KG KW +T W+R ++V
Sbjct: 188 TLHAGAPVRKGEKWVATMWMRRQTFRV 214
>gi|195159142|ref|XP_002020441.1| GL13994 [Drosophila persimilis]
gi|194117210|gb|EDW39253.1| GL13994 [Drosophila persimilis]
Length = 493
Score = 135 bits (340), Expect = 2e-29, Method: Compositional matrix adjust.
Identities = 69/209 (33%), Positives = 115/209 (55%), Gaps = 22/209 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E++ +P +YH+ +S E L ++ATP ++++TV + +S+ + RTS +
Sbjct: 288 MELVGLDPYMVLYHDVISALEISQLQDMATPGLKRATVYKASGRRSEVVKTRTSKVAWFP 347
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFN---TKNGGQ 193
+++ + +RIAD T F L E LQ ++Y G Y+ H+D+F T+ G
Sbjct: 348 DTFNELTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKHYDFFNASTAANLTQMNGD 407
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R+ATVL YL+DVE+GG TVFPN + ++ P+ G A++++++K
Sbjct: 408 RIATVLFYLTDVEQGGATVFPNIRK-------------------AVFPQRGSAIIWYNLK 448
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D +P +LH CPV+ G+KW KWIR
Sbjct: 449 DDGDPNPQTLHAACPVLVGSKWVCNKWIR 477
>gi|428182311|gb|EKX51172.1| hypothetical protein GUITHDRAFT_92735 [Guillardia theta CCMP2712]
Length = 190
Score = 134 bits (338), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 78/190 (41%), Positives = 108/190 (56%), Gaps = 21/190 (11%)
Query: 109 MRKSTVVDS-DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVL 167
M +ST+ ++ + K+ RTSS +L++ D ++ I R+A+ P+E E +QVL
Sbjct: 1 MGRSTIAEAGNEAKNGVGSARTSSTAWLSKTADPLVAKIRTRVAELVKLPMELAEDMQVL 60
Query: 168 HYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAV 223
HY Q Y H D+F F T G R TV YLSDVEEGGETVFP A G+ V
Sbjct: 61 HYSKNQHYWAHHDFFDPNIYRGFVTSPGQNRFITVFFYLSDVEEGGETVFPFANGDDRRV 120
Query: 224 PWWNELSECGKTGLSIKPKMGDALLFWSM---------KPD---ASLDPSSLHGGCPVIK 271
+ ++C + GL +KPK G+A++F+SM PD +LD SLHGGC VIK
Sbjct: 121 ---TDFADCSR-GLKVKPKAGNAIIFYSMLAKRQQEICPPDDLGCNLDVRSLHGGCDVIK 176
Query: 272 GNKWSSTKWI 281
G+KW++ WI
Sbjct: 177 GDKWAANYWI 186
>gi|290243077|ref|YP_003494747.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
gi|288945582|gb|ADC73280.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
Length = 575
Score = 134 bits (338), Expect = 3e-29, Method: Compositional matrix adjust.
Identities = 86/228 (37%), Positives = 119/228 (52%), Gaps = 29/228 (12%)
Query: 64 ERSEGDEGRAEQW--VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGK 121
ER E + AE++ +E +S +P FL ECE LI+LA M+++ V S G
Sbjct: 35 ERVERNRMPAERYDGMETLSQDPLVVYLDEFLEPGECEALIHLAQGRMKRALV--SLDGS 92
Query: 122 SKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFD- 180
S S+ RT S +L + + R I +R+A FPLE E LQV+HY Q+Y PH+D
Sbjct: 93 SGVSQGRTGSNCWLRYQEEPLARRIGERVAKRVGFPLEYAEPLQVIHYGHEQEYRPHYDA 152
Query: 181 YFMDEFN----TKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTG 236
Y +D T+ GGQRM T L+YL++VEEGG T FPNA G
Sbjct: 153 YDLDTPRGLRCTRQGGQRMVTALLYLNEVEEGGATAFPNA-------------------G 193
Query: 237 LSIKPKMGDALLFWSMKPDASL-DPSSLHGGCPVIKGNKWSSTKWIRV 283
+ + P+ G +F ++ D P SLHGG PV G KW+++ W R
Sbjct: 194 VEVAPRKGRIAIFNNVGADPGRPHPRSLHGGMPVKSGEKWAASIWFRA 241
>gi|374370415|ref|ZP_09628419.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
gi|373098067|gb|EHP39184.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
Length = 454
Score = 134 bits (336), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 73/203 (35%), Positives = 103/203 (50%), Gaps = 24/203 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR ++ L+ EC+ L+ LA + +S V++ DTG RTS G G +I
Sbjct: 132 PRVTLFQQLLTDAECDALVALARGRLARSPVINPDTGDENLIEARTSLGAMFQVGEHPLI 191
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE-----FNTKNGGQRMATV 198
IE IA T E GEGLQ+L+Y+ G +Y+PH+D+F + K GGQR+ T+
Sbjct: 192 ERIEDCIAAVTGIAAERGEGLQILNYKPGGEYQPHYDFFNPQRPGEARQLKVGGQRVGTL 251
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+ GG T FP K GL + P G+A+ F K D +L
Sbjct: 252 VIYLNSPLAGGATAFP-------------------KLGLEVAPVKGNAVYFSYRKSDGAL 292
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D +LH G PV G KW +TKW+
Sbjct: 293 DERTLHAGLPVEAGEKWIATKWL 315
>gi|195113239|ref|XP_002001175.1| GI10638 [Drosophila mojavensis]
gi|193917769|gb|EDW16636.1| GI10638 [Drosophila mojavensis]
Length = 511
Score = 134 bits (336), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 70/205 (34%), Positives = 115/205 (56%), Gaps = 24/205 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+EV+ +P ++H+ LS E + L +A PH+ +S VV R+ S+GT++
Sbjct: 312 MEVLVLDPLVVIFHDVLSSREIDGLQEIARPHLERSMVVKYRANVQGKHRI--SAGTWVE 369
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
R + + IE+RIAD LE E V++Y G +Y+ H+D+F + N R+A
Sbjct: 370 RKYNNLTWRIERRIADMVDLNLEGSEPFYVINYGIGGQYKAHWDFFGADTVEDN---RLA 426
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVL Y++DVE+GG TVFP + G +++ K G+AL +++M+ +
Sbjct: 427 TVLFYMNDVEQGGATVFP-------------------RLGQTVRAKRGNALFWYNMQHNG 467
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWI 281
++D +LHGGCP++ G+KW T+WI
Sbjct: 468 TVDDRTLHGGCPILVGSKWIFTQWI 492
>gi|91091610|ref|XP_969386.1| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
putative [Tribolium castaneum]
gi|270001037|gb|EEZ97484.1| hypothetical protein TcasGA2_TC011321 [Tribolium castaneum]
Length = 536
Score = 133 bits (335), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 70/209 (33%), Positives = 110/209 (52%), Gaps = 23/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE P F++ + L+ E + +A P +++TV ++DTG+ + ++ R S +L
Sbjct: 325 VEEAHHRPDIFIFRDVLADSEIATIKRMAQPRFKRATVQNTDTGELEIAQYRISKSAWLK 384
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGG 192
K I D+ +R++D T + E LQV++Y G YEPHFD+ + F + G
Sbjct: 385 EEEHKHIADVSQRVSDMTGLTMSTAEELQVVNYGIGGHYEPHFDFARRDERNAFKSLGTG 444
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+ATVL Y+SDVE+GG TVFP+ Q +S+ P+ G A ++++
Sbjct: 445 NRIATVLFYMSDVEQGGATVFPSIQ-------------------VSLWPQKGSAAFWYNL 485
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
P D + H CPV+ G+KW S KWI
Sbjct: 486 HPSGDGDKMTRHAACPVLTGSKWVSNKWI 514
>gi|389728965|ref|ZP_10189244.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
gi|388441204|gb|EIL97500.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
Length = 285
Score = 133 bits (335), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 79/210 (37%), Positives = 107/210 (50%), Gaps = 31/210 (14%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P V+ LS +EC LI LA P ++++ V D + D RTS G F G +I
Sbjct: 95 PPLRVFDGLLSDDECAALIELAKPRLQRARTVAEDGAQQIDEH-RTSDGMFFGLGEQPLI 153
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFN------TKNGGQRMAT 197
IE RIA P+++GEGLQVLHY GQ+YEPH D+F D T GGQR+A+
Sbjct: 154 ERIEARIAALLGIPVDHGEGLQVLHYLPGQQYEPHQDWF-DPTQPGYAAITATGGQRIAS 212
Query: 198 VLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS 257
+++YL+ + GG T FP + GL++ G A+ F S
Sbjct: 213 LVIYLNTPDAGGGTAFP-------------------EIGLTVTALRGSAVCFTY----ES 249
Query: 258 LDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SLH G PV +G KW +TKW+R Y+
Sbjct: 250 GDVFSLHAGLPVTRGEKWIATKWLRERPYR 279
>gi|195425415|ref|XP_002061004.1| GK10713 [Drosophila willistoni]
gi|194157089|gb|EDW71990.1| GK10713 [Drosophila willistoni]
Length = 502
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 75/208 (36%), Positives = 114/208 (54%), Gaps = 22/208 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE+++ P +YH+ L E E L LA P + +ST+ D D + RTS+ FL
Sbjct: 280 VEILNNLPFVAIYHDVLYDREIEELKRLAVPTITRSTIYDYDKEGNVPVNFRTSNSVFLL 339
Query: 137 RGRDKIIRDIEKRIADFTFFPL--ENGEGLQVLHYEAGQKYEPHFDYF-MDEFNTKNGGQ 193
++ + +R+AD T + + + LQV++Y G Y HFD+F DE K G
Sbjct: 340 NNASYLVDILRQRVADMTHLNVFKNSSDDLQVMNYGLGGYYRYHFDFFGKDESPNKLLGD 399
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R+ TVL+Y++DV++GG TVFP + ++ PK G AL+F ++
Sbjct: 400 RIITVLIYMTDVQQGGATVFPALR-------------------ITNFPKKGSALIFRNLD 440
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ S DPS+LH GCPV+ G+KW++TKWI
Sbjct: 441 NNISPDPSTLHAGCPVLFGSKWAATKWI 468
>gi|3297815|emb|CAA19873.1| putative protein [Arabidopsis thaliana]
gi|7270340|emb|CAB80108.1| putative protein [Arabidopsis thaliana]
Length = 257
Score = 133 bits (334), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 70/195 (35%), Positives = 114/195 (58%), Gaps = 7/195 (3%)
Query: 68 GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR- 126
G+E +V+SW PRA + NF + E+C+ +I A +++ S + +++++
Sbjct: 18 GEESIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKG 77
Query: 127 VRTSSGTFLARGRDKI--IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD 184
RTSSGTF++ + + +E++IA T P +GE +L YE GQKY+ H+D F
Sbjct: 78 TRTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNP 137
Query: 185 EFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMG 244
QR+A+ L+YLSDVEEGGET+FP G+ + + + +C GL +KP+ G
Sbjct: 138 TEYGPQSSQRIASFLLYLSDVEEGGETMFPFENGSNMGIGY--DYKQC--IGLKVKPRKG 193
Query: 245 DALLFWSMKPDASLD 259
D LLF+S+ P+ ++D
Sbjct: 194 DGLLFYSVFPNGTID 208
>gi|268536692|ref|XP_002633481.1| C. briggsae CBR-PHY-2 protein [Caenorhabditis briggsae]
gi|94442973|emb|CAJ98659.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
Length = 539
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 75/209 (35%), Positives = 110/209 (52%), Gaps = 23/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE++ ++P A ++ N +S E E + LA+P ++++TV +S TG+ + + R S +L
Sbjct: 318 VEILRFDPLAVLFKNVISDSEIEVIKELASPKLKRATVQNSKTGELEHATYRISKSAWLK 377
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 192
D +I + +RI DFT E LQV +Y G Y+PHFD+ E F T N G
Sbjct: 378 GDLDPVIDRVNRRIEDFTGLNQATSEELQVANYGLGGHYDPHFDFARKEEKNAFKTLNTG 437
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+ATVL Y+S E GG TVF N L G ++ P DAL ++++
Sbjct: 438 NRIATVLFYMSQPERGGATVF-------------NHL------GTAVFPSKNDALFWYNL 478
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ D D + H CPV+ G KW S KWI
Sbjct: 479 RRDGEGDLRTRHAACPVLLGVKWVSNKWI 507
>gi|345481336|ref|XP_001600680.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Nasonia
vitripennis]
Length = 556
Score = 132 bits (333), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 68/203 (33%), Positives = 107/203 (52%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +YH+ + +E E + +A P +++TV + TG+ + + R S +L K
Sbjct: 348 DPRIVIYHDVIYDDEIETIKRMAQPRFKRATVQNYKTGELEIANYRISKSAWLQEHEHKH 407
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQRMATV 198
+R + +R+ T +E E LQV++Y G YEPHFD+ E F + G R+ATV
Sbjct: 408 VRAVSQRVEHMTSMSIETAEELQVVNYGIGGHYEPHFDFARREEKNAFKSLGTGNRIATV 467
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDVE+GG TVF K +S+ PK G A ++++KP+
Sbjct: 468 LYYMSDVEQGGGTVFT-------------------KINISLWPKKGSAAFWYNLKPNGEG 508
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D + H CPV+ G+KW + KW+
Sbjct: 509 DYKTRHAACPVLTGSKWVANKWL 531
>gi|329913962|ref|ZP_08276011.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
IMCC9480]
gi|327545257|gb|EGF30515.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
IMCC9480]
Length = 280
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 71/208 (34%), Positives = 109/208 (52%), Gaps = 24/208 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR V N LS +EC+ + ++ +ST +D+ +G ++ RTS + RG ++I
Sbjct: 92 PRIVVLGNVLSDDECDAIAAMSRTRFARSTTIDNASGINRFDDSRTSESAHIQRGETELI 151
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEF-----NTKNGGQRMATV 198
I+ R+A + +P+++GE LQ+ Y+AG +Y PHFD+F + + GQR+AT+
Sbjct: 152 ARIDARLAALSGWPVDHGEPLQLQKYQAGNEYRPHFDWFDPALAGTAKHLEKSGQRLATI 211
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
++YL+DVEEGG T FP GL + P+ G AL F + P
Sbjct: 212 ILYLTDVEEGGGTSFPG-------------------IGLDVHPQKGGALFFRNTTPYGVP 252
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
D + H G PV KG K + KW+R Y
Sbjct: 253 DRKTQHAGLPVEKGTKIIANKWLREKPY 280
>gi|195452776|ref|XP_002073495.1| GK13117 [Drosophila willistoni]
gi|194169580|gb|EDW84481.1| GK13117 [Drosophila willistoni]
Length = 487
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 74/216 (34%), Positives = 112/216 (51%), Gaps = 30/216 (13%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E+I +P +YH+ +S E L +A P ++++ V +S + S+ RT+ +
Sbjct: 281 MELIGLDPYMVLYHDVISPNEIAELQEMAKPQLKRARVYNSTKNTDQLSKTRTAKLAWFL 340
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG----- 191
+++ + +RI D T F L E LQV++Y G Y HFDY FNT G
Sbjct: 341 DTFNQLTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDY----FNTTKGPHITQ 396
Query: 192 --GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLF 249
G R+ATVL YL+DVE+GG TVFP + ++ PK G A+++
Sbjct: 397 INGDRIATVLFYLNDVEQGGATVFPEIKK-------------------AVFPKRGSAIMW 437
Query: 250 WSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
+++K D + +LH GCPVI G+KW KWIR E
Sbjct: 438 YNLKDDGEGNRDTLHAGCPVIVGSKWVCNKWIRERE 473
>gi|195575143|ref|XP_002105539.1| GD16913 [Drosophila simulans]
gi|194201466|gb|EDX15042.1| GD16913 [Drosophila simulans]
Length = 534
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 76/215 (35%), Positives = 113/215 (52%), Gaps = 29/215 (13%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E I +P +YH LS E LI A +M K+T V + G K +R RT+ G +
Sbjct: 319 IEQIGLDPYVVLYHEVLSAREISMLIGKAAQNM-KNTRVHKEQGVPKKNRGRTAKGFWFK 377
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNG- 191
+ +++ + I +RI D T F L + EG QV++Y G Y H DYF + +T++G
Sbjct: 378 KESNELTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYLLHMDYFDFASSNHTDTRSGY 437
Query: 192 ----GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDAL 247
G R+ATVL YL+DVE+GG TVF + G S+ P+ G A+
Sbjct: 438 SMDLGDRIATVLFYLTDVEQGGATVFAD-------------------VGYSVYPQAGTAI 478
Query: 248 LFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
++++ + DP + H CPVI G+KW T+WIR
Sbjct: 479 FWYNLDTNGKGDPRTRHAACPVIVGSKWVMTEWIR 513
>gi|195341588|ref|XP_002037388.1| GM12140 [Drosophila sechellia]
gi|194131504|gb|EDW53547.1| GM12140 [Drosophila sechellia]
Length = 534
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 77/217 (35%), Positives = 112/217 (51%), Gaps = 33/217 (15%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E I +P +YH LS E LI AT +M K+T V + G K +R RT+ G +
Sbjct: 319 IEQIGLDPYVVLYHEVLSAREISMLIGKATQNM-KNTRVHKEQGVPKKNRGRTAKGFWFK 377
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG----- 191
+ +++ + I +RI D T F L + EG QV++Y G Y H DYF +F + N
Sbjct: 378 KESNELTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYLLHMDYF--DFASSNHTDTRS 435
Query: 192 ------GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGD 245
G R+ATVL YL+DVE+GG TVF + G S+ P+ G
Sbjct: 436 SYSMDLGDRIATVLFYLTDVEQGGATVFADV-------------------GYSVYPQAGT 476
Query: 246 ALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
A+ ++++ + DP + H CPVI G+KW T+WIR
Sbjct: 477 AIFWYNLDTNGKGDPRTKHAACPVIVGSKWVMTEWIR 513
>gi|347964867|ref|XP_309164.4| AGAP000971-PA [Anopheles gambiae str. PEST]
gi|333466515|gb|EAA04901.5| AGAP000971-PA [Anopheles gambiae str. PEST]
Length = 553
Score = 132 bits (332), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 70/202 (34%), Positives = 105/202 (51%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P +YH+ +S E E + + A P R++TV + TG+ + + R S +L D++I
Sbjct: 349 PYIVIYHDVMSDREIERIKHYARPRFRRATVQNYKTGELEFANYRISKSAWLKDAEDEMI 408
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQRMATVL 199
R I +R+ D T +E E LQV++Y G YEPHFD+ E F + G R+ATVL
Sbjct: 409 RTISQRVEDMTGLTMETAEELQVVNYGIGGHYEPHFDFARREERNAFKSLGTGNRIATVL 468
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDV +GG TVFP+ L++ P+ G A ++++ D
Sbjct: 469 FYMSDVTQGGATVFPS-------------------LNLALWPRKGTAAFWFNLHASGRGD 509
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
++ H CPV+ G KW S KWI
Sbjct: 510 YATRHAACPVLTGTKWVSNKWI 531
>gi|195452778|ref|XP_002073496.1| GK13116 [Drosophila willistoni]
gi|194169581|gb|EDW84482.1| GK13116 [Drosophila willistoni]
Length = 521
Score = 132 bits (331), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 71/212 (33%), Positives = 112/212 (52%), Gaps = 22/212 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E+I +P +YH+ +S E L +A P ++++TV +S ++ + RT+ +
Sbjct: 318 MELIGLDPYMVLYHDVISPNEIAELQEMAKPELKRATVYNSTKNTNQFVKTRTAKVAWFL 377
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFN---TKNGGQ 193
+++ + +RI D T F L E LQV++Y G Y HFDYF N ++ G
Sbjct: 378 DTFNQLTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDYFNTTTNPHISQINGD 437
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R+ATVL YL+DVE+GG TVFP + ++ PK G A++++++K
Sbjct: 438 RIATVLFYLNDVEQGGATVFPEIKK-------------------AVFPKRGSAIMWYNLK 478
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
D + +LH CPVI G+KW KWIR E
Sbjct: 479 DDGEGNRDTLHAACPVIVGSKWVCNKWIRERE 510
>gi|357135725|ref|XP_003569459.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 1
[Brachypodium distachyon]
Length = 303
Score = 132 bits (331), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 72/206 (34%), Positives = 120/206 (58%), Gaps = 19/206 (9%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
++W PR F+Y FLS EC++L+ +A ++ S +V++ +R T + T
Sbjct: 63 LAWHPRVFLYEGFLSGMECDHLVYVARLNIESSLLVNAG------ARNITQNST-----D 111
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
D ++ IE RI+ ++F P E+GE +Q+L Y + Q D+ D + +GG R+ T+L
Sbjct: 112 DIVVSKIEDRISLWSFIPKEHGESMQILKYGSNQS-----DHNKDGTQSSSGGNRLVTIL 166
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
MYLSDV++GGETVFP ++ + LSEC G ++KP GDA+L ++++PD D
Sbjct: 167 MYLSDVKQGGETVFPRSELKDTQAK-EGALSECA--GYAVKPVKGDAILLFNLRPDGVTD 223
Query: 260 PSSLHGGCPVIKGNKWSSTKWIRVNE 285
S + C V++G KW + K + +++
Sbjct: 224 SDSHYEDCSVLEGEKWLAIKHLHISK 249
>gi|345305838|ref|XP_001508476.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Ornithorhynchus
anatinus]
Length = 493
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 74/203 (36%), Positives = 103/203 (50%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR YH +S E E + +LA P + ++TV D +TGK ++ R S +L+ D +
Sbjct: 293 KPRIVRYHEIISDAEIETVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPV 352
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 353 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 412
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 413 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 453
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D S+ H CPV+ GNKW S KWI
Sbjct: 454 DYSTRHAACPVLVGNKWVSNKWI 476
>gi|410295850|gb|JAA26525.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410295854|gb|JAA26527.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 131 bits (330), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 72/204 (35%), Positives = 105/204 (51%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P +R++TV D +TGK ++ R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|170064960|ref|XP_001867743.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
gi|167882146|gb|EDS45529.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
Length = 545
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 69/209 (33%), Positives = 106/209 (50%), Gaps = 23/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E EP +YH +S E E + LA P R++TV + TG+ + + R S +L
Sbjct: 335 LEEAHLEPYIVIYHEVMSDAEIEVIKRLAKPRFRRATVQNYKTGELEVANYRISKSAWLK 394
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 192
++R + +R+ D T + E LQV++Y G YEPHFD+ E F + G
Sbjct: 395 DEEHSVVRTVGQRVEDMTGLTMTTAEELQVVNYGIGGHYEPHFDFARREEKNAFKSLGTG 454
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+ATVL Y+SDV +GG TVFP+ + ++++PK G A ++++
Sbjct: 455 NRIATVLFYMSDVSQGGATVFPSIR-------------------VALRPKKGTAAFWYNL 495
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
D ++ H CPV+ G KW S KWI
Sbjct: 496 HASGHGDYATRHAACPVLTGTKWVSNKWI 524
>gi|321474898|gb|EFX85862.1| hypothetical protein DAPPUDRAFT_309117 [Daphnia pulex]
Length = 541
Score = 131 bits (329), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 73/208 (35%), Positives = 107/208 (51%), Gaps = 21/208 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E+ S +PR +YHN ++ EE E LA +R+STV +S TG S+ ++ R + FL
Sbjct: 335 MELASLKPRLVIYHNVVTDEEIETAKKLAQSRLRRSTVQNSLTGASEPTKYRIAKAAFLQ 394
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD-EFNTKNG-GQR 194
I + +RI D T + E LQV +Y G YEPH+D+ E G G R
Sbjct: 395 NSEHDHIVKMTRRIGDVTGLDMTTAEELQVCNYGIGGHYEPHYDHARKGEVQKDFGWGNR 454
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+AT + Y+SDVE GG TVFP + L++ P+ G A ++++ P
Sbjct: 455 IATWMFYMSDVEAGGATVFP-------------------QINLALWPQKGSAAFWFNLHP 495
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ D + H CPV+ G+KW S KWI
Sbjct: 496 NGEGDDLTQHAACPVLTGSKWVSNKWIH 523
>gi|17541712|ref|NP_502317.1| Protein PHY-2 [Caenorhabditis elegans]
gi|32171589|sp|Q20065.1|P4HA2_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|3876769|emb|CAA93469.1| Protein PHY-2 [Caenorhabditis elegans]
Length = 539
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 74/209 (35%), Positives = 109/209 (52%), Gaps = 23/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE++ ++P A ++ N + E E + LA+P ++++TV +S TG+ + + R S +L
Sbjct: 318 VEILRFDPLAVLFKNVIHDSEIEVIKELASPKLKRATVQNSKTGELEHATYRISKSAWLK 377
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 192
D +I + +RI DFT E LQV +Y G Y+PHFD+ E F T N G
Sbjct: 378 GDLDPVIDRVNRRIEDFTNLNQATSEELQVANYGLGGHYDPHFDFARKEEKNAFKTLNTG 437
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+ATVL Y+S E GG TVF N L G ++ P DAL ++++
Sbjct: 438 NRIATVLFYMSQPERGGATVF-------------NHL------GTAVFPSKNDALFWYNL 478
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ D D + H CPV+ G KW S KWI
Sbjct: 479 RRDGEGDLRTRHAACPVLLGVKWVSNKWI 507
>gi|354483223|ref|XP_003503794.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Cricetulus griseus]
Length = 534
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 72/204 (35%), Positives = 104/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFQELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|195505251|ref|XP_002099423.1| GE23370 [Drosophila yakuba]
gi|194185524|gb|EDW99135.1| GE23370 [Drosophila yakuba]
Length = 534
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 80/215 (37%), Positives = 111/215 (51%), Gaps = 30/215 (13%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
E I +P +YH LS E LI+ A +M K+T V +T K K +R RT+ G +L
Sbjct: 320 TEQIGLDPYVVLYHEVLSAREISMLISKAAQNM-KNTRVHRET-KPKTNRGRTAKGHWLK 377
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG----- 191
+ +++ R I +RI D T F L + E QV++Y G Y H DYF + G
Sbjct: 378 KESNELTRRITRRIVDMTGFDLADSEDFQVINYGIGGHYFLHMDYFDYASSNYTGPRSRQ 437
Query: 192 ----GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDAL 247
G R+ATVL YLSDVE+GG TVF G G S+ P+ G A+
Sbjct: 438 SKVLGDRIATVLFYLSDVEQGGATVF-------------------GNVGYSVYPQAGTAI 478
Query: 248 LFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
++++ D + DP + H CPVI G+KW T+WIR
Sbjct: 479 FWYNLDTDGNGDPLTRHASCPVIVGSKWVMTEWIR 513
>gi|307190793|gb|EFN74662.1| Prolyl 4-hydroxylase subunit alpha-2 [Camponotus floridanus]
Length = 476
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 107/203 (52%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR VYHN + EE E + +A P +++TV + TG + + R S +L K
Sbjct: 268 DPRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKH 327
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQRMATV 198
+ + KR+ T +E E LQV++Y G YEPHFD+ E F + G R+ATV
Sbjct: 328 VAAVSKRVEHMTSMSIETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATV 387
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDVE+GG TVF +A+ +S+ P+ G A ++++KP+
Sbjct: 388 LYYMSDVEQGGGTVF-------TAI------------NISLWPRKGSAAFWYNLKPNGEG 428
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D + H CPV+ G+KW + KW+
Sbjct: 429 DFKTRHAACPVLTGSKWVANKWL 451
>gi|26336999|dbj|BAC32183.1| unnamed protein product [Mus musculus]
gi|148700261|gb|EDL32208.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_b [Mus
musculus]
Length = 534
Score = 130 bits (328), Expect = 5e-28, Method: Compositional matrix adjust.
Identities = 72/204 (35%), Positives = 104/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|51036657|ref|NP_742059.2| prolyl 4-hydroxylase subunit alpha-1 precursor [Rattus norvegicus]
gi|90111077|sp|P54001.2|P4HA1_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|50927553|gb|AAH78703.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [Rattus norvegicus]
gi|149038787|gb|EDL93076.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_a
[Rattus norvegicus]
Length = 534
Score = 130 bits (328), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 72/204 (35%), Positives = 104/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|474940|emb|CAA55546.1| gamma-butyrobetaine,2-oxoglutarate dioxygenase [Rattus norvegicus]
Length = 534
Score = 130 bits (327), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 72/204 (35%), Positives = 104/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|445499353|ref|ZP_21466208.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
gi|444789348|gb|ELX10896.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
Length = 272
Score = 130 bits (327), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 73/209 (34%), Positives = 104/209 (49%), Gaps = 24/209 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+P+ + N LS EEC+ +I +STV G S RTS F+ RG ++
Sbjct: 82 QPQIILLGNVLSDEECDAIIAHCGTRYTRSTVTGEADGSSMVHEGRTSEMAFIQRGEAEV 141
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE-----FNTKNGGQRMAT 197
IE+R+A +P E E Q+ Y+A Q+Y PH+D+ + + GGQR+AT
Sbjct: 142 AERIERRLAALAHWPAECSEPFQLQKYDATQEYRPHYDWLDPDSSGHRSHLARGGQRLAT 201
Query: 198 VLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS 257
++YLSDVE+GG TVFP GL + PK G AL F + +
Sbjct: 202 FILYLSDVEQGGGTVFPG-------------------LGLEVYPKKGSALWFLNTDINHQ 242
Query: 258 LDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
D +LHGG PV++G K + KW+R Y
Sbjct: 243 PDKRTLHGGAPVVRGTKIIANKWLRQGRY 271
>gi|291190274|ref|NP_001167096.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide precursor [Salmo
salar]
gi|223648100|gb|ACN10808.1| Prolyl 4-hydroxylase subunit alpha-1 precursor [Salmo salar]
Length = 545
Score = 130 bits (326), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 72/202 (35%), Positives = 103/202 (50%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR YH+ LS E E + LA P +R++T+ + TG + + R S +L D ++
Sbjct: 346 PRIIRYHDVLSNSEIEKVKELAKPRLRRATISNPITGVLETAHYRISKSAWLTAYEDPVV 405
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATVL 199
I +RI D T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 406 DKINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWL 465
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
+Y+SDV GG TVF + G ++ PK G A+ ++++ P D
Sbjct: 466 IYMSDVPSGGATVFTDV-------------------GAAVWPKKGSAVFWYNLFPSGEGD 506
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
S+ H CPV+ GNKW S KWI
Sbjct: 507 YSTRHAACPVLVGNKWVSNKWI 528
>gi|74148153|dbj|BAE36242.1| unnamed protein product [Mus musculus]
Length = 454
Score = 130 bits (326), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 72/203 (35%), Positives = 104/203 (51%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ D +
Sbjct: 254 KPRIIRFHDIISDAENEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPV 313
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 314 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATW 373
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 374 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 414
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D S+ H CPV+ GNKW S KW+
Sbjct: 415 DYSTRHAACPVLVGNKWVSNKWL 437
>gi|395501518|ref|XP_003755140.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Sarcophilus
harrisii]
Length = 385
Score = 130 bits (326), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 73/203 (35%), Positives = 103/203 (50%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H +S E E + +LA P + ++TV D +TGK ++ R S +L+ D +
Sbjct: 185 KPRIVRFHEIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPV 244
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 245 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 304
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 305 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 345
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D S+ H CPV+ GNKW S KWI
Sbjct: 346 DYSTRHAACPVLVGNKWVSNKWI 368
>gi|334314085|ref|XP_001363658.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
[Monodelphis domestica]
Length = 537
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 73/204 (35%), Positives = 103/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H +S E E + +LA P + ++TV D +TGK ++ R S +L+ D +
Sbjct: 337 KPRIVRFHEIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPV 396
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 397 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 456
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 457 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 497
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KWI
Sbjct: 498 DYSTRHAACPVLVGNKWVSNKWIH 521
>gi|85857698|gb|ABC86384.1| IP10964p [Drosophila melanogaster]
Length = 534
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 76/216 (35%), Positives = 110/216 (50%), Gaps = 33/216 (15%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
E I +P +YH LS E LI A +M K+T + + K +R RT+ G +L +
Sbjct: 320 EQIGLDPYVVLYHEVLSAREISMLIGKAAQNM-KNTKIHKERAVPKKNRGRTAKGFWLKK 378
Query: 138 GRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG------ 191
+++ + I +RI D T F L + EG QV++Y G Y H DYF +F + N
Sbjct: 379 ESNELTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYF--DFASSNHTDTRSR 436
Query: 192 -----GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDA 246
G R+ATVL YL+DVE+GG TVF G G + P+ G A
Sbjct: 437 YSIDLGDRIATVLFYLTDVEQGGATVF-------------------GDVGYYVSPQAGTA 477
Query: 247 LLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ ++++ D + DP + H CPVI G+KW T+WIR
Sbjct: 478 IFWYNLDTDGNGDPRTRHAACPVIVGSKWVMTEWIR 513
>gi|221460681|ref|NP_733394.3| CG31013 [Drosophila melanogaster]
gi|220903261|gb|AAF57073.4| CG31013 [Drosophila melanogaster]
Length = 534
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 76/216 (35%), Positives = 110/216 (50%), Gaps = 33/216 (15%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
E I +P +YH LS E LI A +M K+T + + K +R RT+ G +L +
Sbjct: 320 EQIGLDPYVVLYHEVLSAREISMLIGKAAQNM-KNTKIHKERAVPKKNRGRTAKGFWLKK 378
Query: 138 GRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG------ 191
+++ + I +RI D T F L + EG QV++Y G Y H DYF +F + N
Sbjct: 379 ESNELTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYF--DFASSNHTDTRSR 436
Query: 192 -----GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDA 246
G R+ATVL YL+DVE+GG TVF G G + P+ G A
Sbjct: 437 YSIDLGDRIATVLFYLTDVEQGGATVF-------------------GDVGYYVSPQAGTA 477
Query: 247 LLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ ++++ D + DP + H CPVI G+KW T+WIR
Sbjct: 478 IFWYNLDTDGNGDPRTRHAACPVIVGSKWVMTEWIR 513
>gi|383864775|ref|XP_003707853.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Megachile
rotundata]
Length = 550
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 69/203 (33%), Positives = 107/203 (52%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +YHN + EE E + +A P +++TV + TG + + R S +L K
Sbjct: 342 DPRIVIYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKH 401
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQRMATV 198
+ + KR+ T +E E LQV++Y G YEPHFD+ E F + G R+ATV
Sbjct: 402 VAAVSKRVEHMTSLNVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATV 461
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDVE+GG TVF +A+ +S+ P+ G A ++++KP+
Sbjct: 462 LYYMSDVEQGGGTVF-------TAI------------NISLWPRKGSAAFWFNLKPNGEG 502
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D + H CPV+ G+KW + KW+
Sbjct: 503 DLRTRHAACPVLTGSKWVANKWL 525
>gi|357605723|gb|EHJ64752.1| prolyl 4-hydroxylase alpha subunit [Danaus plexippus]
Length = 235
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 69/207 (33%), Positives = 107/207 (51%), Gaps = 21/207 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E + P V+++ LS E +Y+ +A P R++TV D TG+ + R S +L
Sbjct: 15 MEYLYRNPDIIVFNDVLSDYEIDYIKRIAQPRFRRATVHDPATGELVPAHYRISKSAWLK 74
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQR 194
++ + +R+AD T + E LQV++Y G Y+PHFD+ E N K G R
Sbjct: 75 DEESAVVARVSRRVADITGLSMTTAEELQVVNYGIGGHYDPHFDFARKEENAFEKFNGNR 134
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+ATVL Y+SDV +GG TVF + GLS+ P+ G A+ + ++ P
Sbjct: 135 IATVLFYMSDVAQGGATVF-------------------TELGLSVFPRRGSAVFWLNLHP 175
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWI 281
D ++ H CPV++G+KW KWI
Sbjct: 176 SGEGDLATRHAACPVLRGSKWVCNKWI 202
>gi|432926124|ref|XP_004080841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
latipes]
Length = 523
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 74/202 (36%), Positives = 102/202 (50%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P YH+ S++E E + LA P +R++TV D TGK ++ R S +L I+
Sbjct: 324 PYIVRYHDVASEKEMETVKELAKPRLRRATVHDPQTGKLTTAQYRVSKSAWLGSHEHPIV 383
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
I +RI D T + E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 384 DRINQRIEDITGLDVSTAEDLQVANYGVGGQYEPHFDFGRKDEADAFEELGTGNRIATWL 443
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
+Y+SDV+ GG TVF +I AV W PK G A+ ++++ D
Sbjct: 444 LYMSDVQAGGNTVFT----DIGAVVW---------------PKKGTAVFWYNLHRSGEGD 484
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ GNKW S KWI
Sbjct: 485 YRTRHAACPVLVGNKWVSNKWI 506
>gi|398818543|ref|ZP_10577128.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
gi|398027481|gb|EJL21031.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
Length = 220
Score = 129 bits (325), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 75/206 (36%), Positives = 111/206 (53%), Gaps = 26/206 (12%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
EP V N LS EC+ LI + +++S + + + S +RTSSG F + +
Sbjct: 40 EPLVVVLGNVLSDSECDELIEHSRERLQRSKIGEDGSVNS----IRTSSGVFCEQ--TET 93
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
I IEKRI+ P+E+G+GLQVL Y GQ+Y+PH+D+F E + + R++T++MYL
Sbjct: 94 ITRIEKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFFA-ETSRASTNNRISTLVMYL 152
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
+DVE+GGETVFP LS+ P G A+ F + L+ +
Sbjct: 153 NDVEQGGETVFPLLH-------------------LSVFPTKGMAVYFEYFYSNQELNDFT 193
Query: 263 LHGGCPVIKGNKWSSTKWIRVNEYKV 288
LH G VI G KW +T W+R ++V
Sbjct: 194 LHAGTQVIHGEKWVATMWMRRQSFRV 219
>gi|218665910|ref|YP_002425647.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
gi|218518123|gb|ACK78709.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
ferrooxidans ATCC 23270]
Length = 248
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 69/197 (35%), Positives = 100/197 (50%), Gaps = 20/197 (10%)
Query: 91 NFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRI 150
L+ E C+ LI + +R +TV D TG+ R S + R I++ + + I
Sbjct: 70 GLLTPENCQNLIAIGQSLLRPATVTDEQTGQEVAHGERVSEMAWPKRDDYPILQSLAEGI 129
Query: 151 ADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT-KNGGQRMATVLMYLSDVEEGG 209
A T P++ E LQ+LHY G +Y+PH+D F + T + GG R AT+++YL+ VEEGG
Sbjct: 130 AQLTGIPIDCQEPLQILHYRPGGEYKPHYDAFAADAPTLRQGGNRQATLILYLNAVEEGG 189
Query: 210 ETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPV 269
ET FP + GL + P G + F ++ + P SLH G PV
Sbjct: 190 ETAFP-------------------ELGLQVSPIPGGGVFFRNLNEEGQRHPLSLHAGLPV 230
Query: 270 IKGNKWSSTKWIRVNEY 286
KG KW +T+WIR Y
Sbjct: 231 RKGEKWIATQWIRQEAY 247
>gi|292619367|ref|XP_001922562.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Danio rerio]
Length = 541
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 70/202 (34%), Positives = 104/202 (51%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR YH ++++E E + L+ P +R++T+ + TG + + R S +LA ++
Sbjct: 342 PRIIRYHEIITEQEIEKIKELSKPRLRRATISNPITGVLETAHYRISKSAWLAAYEHPVV 401
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATVL 199
I +RI D T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 402 DRINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWL 461
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDV GG TVFP + G ++KP G A+ ++++ P D
Sbjct: 462 FYMSDVAAGGATVFP-------------------EVGAAVKPLKGTAVFWYNLFPSGEGD 502
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
S+ H CPV+ GNKW S KWI
Sbjct: 503 YSTRHAACPVLVGNKWVSNKWI 524
>gi|90085216|dbj|BAE91349.1| unnamed protein product [Macaca fascicularis]
Length = 244
Score = 129 bits (324), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 104/203 (51%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 44 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 103
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 104 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 163
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 164 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 204
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D S+ H CPV+ GNKW S KW+
Sbjct: 205 DYSTRHAACPVLVGNKWVSNKWL 227
>gi|344274272|ref|XP_003408941.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
[Loxodonta africana]
Length = 534
Score = 129 bits (324), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/204 (34%), Positives = 104/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 334 KPRIVRFHDIISDAEIEVVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP+ G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFPDV-------------------GASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|836898|gb|AAC52197.1| prolyl 4-hydroxylase alpha(I)-subunit, partial [Mus musculus]
gi|1096887|prf||2112362A Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=I
Length = 526
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 72/204 (35%), Positives = 103/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + LA P + ++TV D +TGK ++ R S +L+ D +
Sbjct: 326 KPRIIRFHDIISDAEIEIVKYLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPV 385
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 386 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATW 445
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 446 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 486
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 487 DYSTRHAACPVLVGNKWVSNKWLH 510
>gi|296220402|ref|XP_002756291.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Callithrix
jacchus]
Length = 534
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/204 (34%), Positives = 104/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|395820524|ref|XP_003783614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Otolemur
garnettii]
Length = 534
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/204 (34%), Positives = 104/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|302830268|ref|XP_002946700.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
gi|300267744|gb|EFJ51926.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
Length = 186
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 74/165 (44%), Positives = 97/165 (58%), Gaps = 11/165 (6%)
Query: 124 DSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM 183
+ +VRTS GTFL +R +E +IA T P NGE VL+Y K+ H+D M
Sbjct: 17 EQQVRTSKGTFLGGDSSPALRWLEDKIAAVTLLPRTNGEFWNVLNY----KHSQHYDSHM 72
Query: 184 DEFNTKNGG----QRMATVLMYLSDVE-EGGETVFPNAQGNISAVPWWNELSEC-GKTGL 237
D F+ K G QR+ATV++ LSD GGETVF +G S + ++C GL
Sbjct: 73 DSFDPKEYGPQYSQRIATVIVVLSDDGLMGGETVF-KREGKSSINKPISNWTDCDADGGL 131
Query: 238 SIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
KP+ GDA+LFWS +PD LDP +LHG CPV+ GNKW + KW+R
Sbjct: 132 KYKPRAGDAVLFWSARPDGQLDPHALHGSCPVVTGNKWVAVKWLR 176
>gi|190786|gb|AAA36534.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
Length = 534
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/204 (34%), Positives = 104/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|410927705|ref|XP_003977281.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
rubripes]
Length = 531
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 75/223 (33%), Positives = 109/223 (48%), Gaps = 29/223 (13%)
Query: 69 DEGRAEQWV-----EVISWE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKS 122
D GR ++V + W+ P YH+ LS E E + LA P +R++TV D TG+
Sbjct: 311 DNGRHPKYVIGPVKQEDEWDRPHIVRYHDILSNREMETVKELAKPRLRRATVHDPQTGQL 370
Query: 123 KDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF 182
+ R S +L ++ I +RI D T + E LQV +Y G +YEPH+D+
Sbjct: 371 TTAPYRVSKSAWLGAFEHPVVDRINQRIEDITGLDVSTAEDLQVANYGVGGQYEPHYDFG 430
Query: 183 M----DEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLS 238
D F G R+AT L+Y+S+V+ GG TVF + G S
Sbjct: 431 RKDEPDAFKELGTGNRIATWLLYMSEVQAGGATVFTD-------------------IGAS 471
Query: 239 IKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ PK G A+ ++++ P D + H CPV+ GNKW S KWI
Sbjct: 472 VSPKKGSAVFWYNLHPSGDGDYRTRHAACPVLLGNKWVSNKWI 514
>gi|224006596|ref|XP_002292258.1| hypothetical protein THAPSDRAFT_263536 [Thalassiosira pseudonana
CCMP1335]
gi|220971900|gb|EED90233.1| hypothetical protein THAPSDRAFT_263536 [Thalassiosira pseudonana
CCMP1335]
Length = 206
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 77/214 (35%), Positives = 112/214 (52%), Gaps = 23/214 (10%)
Query: 83 EPRAFVYHNFLSKEECEYLINLA-TPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDK 141
+PR F HNFLS +E + L+ + P + + G + RTS F
Sbjct: 1 DPRVFYVHNFLSADEADELVAFSMAPSTGGTHKAWNQGGSNAKLTTRTSMNAF------D 54
Query: 142 IIRDIEKRIADFTFFPLENG-------EGLQVLHYEAGQKYEPHFDYF-MDEFN------ 187
I + RI F L G +G+Q+L YE GQ Y H DYF + + N
Sbjct: 55 ITTKLSFRIKRRAFRLLRMGAYKENLADGIQILRYELGQAYIAHHDYFPVRQSNDHLWDP 114
Query: 188 TKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDAL 247
+K G R AT+ +YLSDVE GG+T+ +A + A W ++L + + L++ P+ GDA+
Sbjct: 115 SKGGSNRFATIFLYLSDVEVGGQTLEKDA--GVDAGSWEDKLVDQCYSKLAVPPRRGDAI 172
Query: 248 LFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
LF+S PD LDP+SLHG CP++KG KW + W+
Sbjct: 173 LFYSQYPDGHLDPNSLHGACPILKGTKWGANLWV 206
>gi|380813208|gb|AFE78478.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
gi|384947330|gb|AFI37270.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
Length = 534
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/204 (34%), Positives = 104/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|332244067|ref|XP_003271193.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-1 [Nomascus leucogenys]
Length = 502
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/204 (34%), Positives = 104/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 302 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 361
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 362 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 421
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 422 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 462
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 463 DYSTRHAACPVLVGNKWVSNKWLH 486
>gi|383418721|gb|AFH32574.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
mulatta]
Length = 534
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/204 (34%), Positives = 104/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|301770069|ref|XP_002920453.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Ailuropoda
melanoleuca]
Length = 534
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/204 (34%), Positives = 104/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|397490069|ref|XP_003816032.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Pan paniscus]
Length = 488
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/204 (34%), Positives = 104/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 288 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 347
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 348 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 407
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 408 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 448
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 449 DYSTRHAACPVLVGNKWVSNKWLH 472
>gi|63252886|ref|NP_000908.2| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Homo
sapiens]
gi|114631173|ref|XP_508168.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 13 [Pan
troglodytes]
gi|602676|gb|AAA59069.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
gi|62897481|dbj|BAD96680.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I variant [Homo
sapiens]
gi|119574852|gb|EAW54467.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_a [Homo
sapiens]
gi|119574853|gb|EAW54468.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_b [Homo
sapiens]
gi|410349609|gb|JAA41408.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410349613|gb|JAA41410.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/204 (34%), Positives = 104/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|73952886|ref|XP_850682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Canis
lupus familiaris]
Length = 534
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/204 (34%), Positives = 104/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|410251924|gb|JAA13929.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 566
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/204 (34%), Positives = 104/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|291404182|ref|XP_002718471.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 1
[Oryctolagus cuniculus]
Length = 534
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/204 (34%), Positives = 104/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|402880501|ref|XP_003903839.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like, partial
[Papio anubis]
Length = 379
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 104/203 (51%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 179 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 238
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 239 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 298
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 299 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 339
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D S+ H CPV+ GNKW S KW+
Sbjct: 340 DYSTRHAACPVLVGNKWVSNKWL 362
>gi|432106758|gb|ELK32410.1| Prolyl 4-hydroxylase subunit alpha-1 [Myotis davidii]
Length = 534
Score = 129 bits (323), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/204 (34%), Positives = 104/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|195159311|ref|XP_002020525.1| GL13465 [Drosophila persimilis]
gi|194117294|gb|EDW39337.1| GL13465 [Drosophila persimilis]
Length = 578
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 67/207 (32%), Positives = 112/207 (54%), Gaps = 21/207 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHM-RKSTVVDSDTGKSKDSRVRTSSGTFL 135
E++S P +YH+ ++ E L NL+ PHM R++ + + RTS+ +L
Sbjct: 378 TELLSLAPYMVLYHDVITPLESLTLKNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNSVWL 437
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNTKNGGQR 194
+ ++ +E+R+ T F +EN E Q+++Y G Y+PH D+F + + GG R
Sbjct: 438 TSHENAVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFETPQLEHRGGGDR 497
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+ATVL YLSDV +GG T+FP + +S++P+ GDALL++++
Sbjct: 498 IATVLFYLSDVPQGGATLFP-------------------RLNISVQPRQGDALLWYNLND 538
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ ++H CP+IKG+KW+ KWI
Sbjct: 539 RGQGEIGTVHTSCPIIKGSKWALVKWI 565
>gi|332026992|gb|EGI67088.1| Prolyl 4-hydroxylase subunit alpha-1 [Acromyrmex echinatior]
Length = 415
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 106/203 (52%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR VYHN + EE E + +A P +++TV + TG + + R S +L K
Sbjct: 207 DPRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKH 266
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQRMATV 198
+ + KR+ T +E E LQV++Y G YEPHFD+ E F + G R+ATV
Sbjct: 267 VAAVSKRVEHMTSMSVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATV 326
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDVE+GG TVF +A+ +S+ P+ G A + ++KP+
Sbjct: 327 LYYMSDVEQGGGTVF-------TAI------------NISLWPRKGSAAFWHNLKPNGEG 367
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D + H CPV+ G+KW + KW+
Sbjct: 368 DFKTRHAACPVLTGSKWVANKWL 390
>gi|424863736|ref|ZP_18287648.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
SAR86A]
gi|400757057|gb|EJP71269.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
SAR86A]
Length = 205
Score = 128 bits (322), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 71/212 (33%), Positives = 108/212 (50%), Gaps = 27/212 (12%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V + S +P +V +NFLS +ECE + + M ++ V+ D + SR T+ +L
Sbjct: 10 VTLYSADPIVYVVNNFLSDDECEAFVEMGKGKMERAKVISDDESEFHASR--TNDFCWLE 67
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-----MDEFNTKNG 191
+I ++ KR + P+ N E Q+++Y G +Y+PHFD F + N G
Sbjct: 68 HSASDVIHEVSKRFSVLVKMPINNAEQFQLVYYGPGNEYKPHFDAFDKTTKEGQNNWFPG 127
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
GQRM T L YL+DVEEGG T FP K +S+KP GD ++F +
Sbjct: 128 GQRMVTALAYLNDVEEGGATDFP-------------------KINVSVKPNKGDVVVFHN 168
Query: 252 -MKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
++ ++P +LHGG PV+ G KW+ W R
Sbjct: 169 CIEGTTEINPQALHGGSPVVAGEKWAVNLWFR 200
>gi|348576112|ref|XP_003473831.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cavia
porcellus]
Length = 534
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 71/204 (34%), Positives = 104/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|115434812|ref|NP_001042164.1| Os01g0174500 [Oryza sativa Japonica Group]
gi|55296794|dbj|BAD68120.1| prolyl 4-hydroxylase -like [Oryza sativa Japonica Group]
gi|113531695|dbj|BAF04078.1| Os01g0174500 [Oryza sativa Japonica Group]
gi|222617830|gb|EEE53962.1| hypothetical protein OsJ_00571 [Oryza sativa Japonica Group]
Length = 303
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 85/264 (32%), Positives = 138/264 (52%), Gaps = 27/264 (10%)
Query: 30 AILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWV------EVISWE 83
A+L+L+A + P + S + DLS K + + G + V + +SW
Sbjct: 7 AVLVLVAAWLTFAPPGALASSRRFDLSIAQEKLVNSTGGSTASSSHLVFDPSKSKRLSWH 66
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKS-TVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
PR F+Y FLS EC++L+++ +M S D D R SS + D +
Sbjct: 67 PRIFLYEGFLSDMECDHLVSMGRGNMESSLAFTDGD---------RNSSYNNI---EDIV 114
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
+ IE RI+ ++F P ENGE +QVL Y + +E + +G R+AT+LMYL
Sbjct: 115 VSKIEDRISLWSFLPKENGESIQVLKYGVNRS-----GSIKEEPKSSSGAHRLATILMYL 169
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
SDV++GGETVFP ++ A S+C +G +++P G+A+L ++++PD D S
Sbjct: 170 SDVKQGGETVFPRSEMK-DAQAKEGAPSQC--SGYAVRPAKGNAILLFNLRPDGETDKDS 226
Query: 263 LHGGCPVIKGNKWSSTKWIRVNEY 286
+ CPV++G KW + K I + ++
Sbjct: 227 QYEECPVLEGEKWLAIKHINLRKF 250
>gi|416009427|ref|ZP_11561250.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
gi|339836568|gb|EGQ64151.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
Length = 196
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 68/200 (34%), Positives = 100/200 (50%), Gaps = 20/200 (10%)
Query: 88 VYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIE 147
+ L+ E C+ LI + +R +TV D TG+ R S + R I++ +
Sbjct: 15 AWAGLLTPENCQNLIAIGQSLLRPATVTDEQTGQEVAHGERVSEMAWPKRDDHPILQSLA 74
Query: 148 KRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT-KNGGQRMATVLMYLSDVE 206
+ IA T P++ E LQ+LHY G +Y+PH+D F + T + GG R T+++YL+ VE
Sbjct: 75 EGIAQLTGIPIDCQEPLQILHYRPGGEYKPHYDAFAADAPTLRQGGNRQGTLILYLNAVE 134
Query: 207 EGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGG 266
EGGET FP + GL + P G + F ++ + P SLH G
Sbjct: 135 EGGETAFP-------------------ELGLQVSPIPGGGVFFRNLNEEGQRHPLSLHAG 175
Query: 267 CPVIKGNKWSSTKWIRVNEY 286
PV KG KW +T+WIR Y
Sbjct: 176 LPVRKGEKWIATQWIRQEAY 195
>gi|307211752|gb|EFN87747.1| Prolyl 4-hydroxylase subunit alpha-1 [Harpegnathos saltator]
Length = 415
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 69/203 (33%), Positives = 106/203 (52%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR YHN + EE E + +A P +++TV + TG + + R S +L K
Sbjct: 207 DPRIVFYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKH 266
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQRMATV 198
+ + KR+ T +E E LQV++Y G YEPHFD+ E F + G R+ATV
Sbjct: 267 VAAVSKRVEHMTSMSVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATV 326
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDVE+GG TVF +A+ +S+ P+ G A ++++KP+
Sbjct: 327 LYYMSDVEQGGGTVF-------TAI------------NISLWPRKGSAAFWYNLKPNGEG 367
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D + H CPV+ G+KW + KW+
Sbjct: 368 DFKTRHAACPVLTGSKWVANKWL 390
>gi|24651407|ref|NP_733371.1| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
gi|20269806|gb|AAM18058.1|AF495536_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]EFB
[Drosophila melanogaster]
gi|15292529|gb|AAK93533.1| SD05564p [Drosophila melanogaster]
gi|23172692|gb|AAF57053.2| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
gi|220946562|gb|ACL85824.1| PH4alphaEFB-PA [synthetic construct]
Length = 550
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 70/209 (33%), Positives = 106/209 (50%), Gaps = 23/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E + +P +YH+ + E + + +A P R++TV +S TG + + R S +L
Sbjct: 339 LEEVHADPYIVIYHDAMYDSEIDLIKRMARPRFRRATVQNSVTGALETANYRISKSAWLK 398
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 192
D++I + +R AD T +++ E LQV++Y G YEPHFD+ E F N G
Sbjct: 399 TQEDRVIETVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDFARKEEQRAFEGLNLG 458
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+ATVL Y+SDVE+GG TVF + ++ PK G A + ++
Sbjct: 459 NRIATVLFYMSDVEQGGATVFTSLHT-------------------ALFPKKGTAAFWMNL 499
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
D D + H CPV+ G KW S KWI
Sbjct: 500 HRDGQGDVRTRHAACPVLTGTKWVSNKWI 528
>gi|195575089|ref|XP_002105512.1| GD21521 [Drosophila simulans]
gi|194201439|gb|EDX15015.1| GD21521 [Drosophila simulans]
Length = 550
Score = 128 bits (322), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 70/209 (33%), Positives = 106/209 (50%), Gaps = 23/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E + +P +YH+ + E + + +A P R++TV +S TG + + R S +L
Sbjct: 339 LEEVHADPYIVIYHDAMYDSEIDLIKRMARPRFRRATVQNSVTGALETANYRISKSAWLK 398
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 192
D++I + +R AD T +++ E LQV++Y G YEPHFD+ E F N G
Sbjct: 399 TQEDRVIETVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDFARKEEERAFEGLNLG 458
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+ATVL Y+SDVE+GG TVF + ++ PK G A + ++
Sbjct: 459 NRIATVLFYMSDVEQGGATVFTSLHT-------------------ALFPKKGTAAFWMNL 499
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
D D + H CPV+ G KW S KWI
Sbjct: 500 HRDGQGDVRTRHAACPVLTGTKWVSNKWI 528
>gi|349604936|gb|AEQ00344.1| Prolyl 4-hydroxylase subunit alpha-1-like protein, partial [Equus
caballus]
Length = 302
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 104/203 (51%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 102 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 161
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 162 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 221
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 222 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 262
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D S+ H CPV+ GNKW S KW+
Sbjct: 263 DYSTRHAACPVLVGNKWVSNKWL 285
>gi|195341536|ref|XP_002037362.1| GM12882 [Drosophila sechellia]
gi|194131478|gb|EDW53521.1| GM12882 [Drosophila sechellia]
Length = 550
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 70/209 (33%), Positives = 106/209 (50%), Gaps = 23/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E + +P +YH+ + E + + +A P R++TV +S TG + + R S +L
Sbjct: 339 LEEVHADPYIVIYHDAMYDSEIDLIKRMARPRFRRATVQNSVTGALETANYRISKSAWLK 398
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 192
D++I + +R AD T +++ E LQV++Y G YEPHFD+ E F N G
Sbjct: 399 TQEDRVIETVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDFARKEEERAFEGINLG 458
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+ATVL Y+SDVE+GG TVF + ++ PK G A + ++
Sbjct: 459 NRIATVLFYMSDVEQGGATVFTSLHT-------------------ALFPKKGTAAFWMNL 499
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
D D + H CPV+ G KW S KWI
Sbjct: 500 HRDGQGDVRTRHAACPVLTGTKWVSNKWI 528
>gi|350416719|ref|XP_003491070.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
impatiens]
Length = 557
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 68/203 (33%), Positives = 107/203 (52%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR VYHN + EE E + +A P +++TV + TG + + R S +L +
Sbjct: 349 DPRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHEH 408
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQRMATV 198
+ + +R+ T ++ E LQV++Y G YEPHFD+ E F + G R+ATV
Sbjct: 409 VAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATV 468
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDVE+GG TVF +A+ +S+ PK G A ++++KP+
Sbjct: 469 LYYMSDVEQGGGTVF-------TAI------------NISLWPKKGSAAFWYNLKPNGEG 509
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D + H CPV+ G+KW + KW+
Sbjct: 510 DFKTRHAACPVLTGSKWVANKWL 532
>gi|226314793|ref|YP_002774689.1| hypothetical protein BBR47_52080 [Brevibacillus brevis NBRC 100599]
gi|226097743|dbj|BAH46185.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599]
Length = 215
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 74/206 (35%), Positives = 111/206 (53%), Gaps = 26/206 (12%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
EP V N LS EC+ LI + +++S + + + S +RTSSG F + +
Sbjct: 35 EPLVVVLGNVLSDSECDELIEHSRERLQRSKIGEDRSVNS----IRTSSGVFCEQ--TET 88
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
I IEKRI+ P+E+G+GLQVL Y GQ+Y+PH+D+F E + + R++T++MYL
Sbjct: 89 ITRIEKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFFA-ETSRASTNNRISTLVMYL 147
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
+DVE+GGETVFP LS+ P G A+ F + ++ +
Sbjct: 148 NDVEQGGETVFPLLH-------------------LSVFPTKGMAVYFEYFYRNQEVNEFT 188
Query: 263 LHGGCPVIKGNKWSSTKWIRVNEYKV 288
LH G VI G KW +T W+R ++V
Sbjct: 189 LHAGAQVIHGEKWVATMWMRRQSFRV 214
>gi|340722330|ref|XP_003399560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
terrestris]
Length = 557
Score = 128 bits (321), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 68/203 (33%), Positives = 107/203 (52%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR VYHN + EE E + +A P +++TV + TG + + R S +L +
Sbjct: 349 DPRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHEH 408
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQRMATV 198
+ + +R+ T ++ E LQV++Y G YEPHFD+ E F + G R+ATV
Sbjct: 409 VAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATV 468
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDVE+GG TVF +A+ +S+ PK G A ++++KP+
Sbjct: 469 LYYMSDVEQGGGTVF-------TAI------------NISLWPKKGSAAFWYNLKPNGEG 509
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D + H CPV+ G+KW + KW+
Sbjct: 510 DFKTRHAACPVLTGSKWVANKWL 532
>gi|151556370|gb|AAI47868.1| P4HA1 protein [Bos taurus]
Length = 534
Score = 128 bits (321), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 71/204 (34%), Positives = 104/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEVVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVLAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|319795182|ref|YP_004156822.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
gi|315597645|gb|ADU38711.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
Length = 296
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 67/197 (34%), Positives = 101/197 (51%), Gaps = 24/197 (12%)
Query: 91 NFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRI 150
N + EC+ LI +A P + ST+VD +G+ S R S G F + ++ +++R+
Sbjct: 107 NVVDAHECKALIEMAKPRLAPSTLVDPMSGRDVVSDKRASWGMFFRLCENDLVARLDRRL 166
Query: 151 ADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-----MDEFNTKNGGQRMATVLMYLSDV 205
+ PLENGEGL +L+Y G EPH DY + + GQR++T++ YL+D
Sbjct: 167 SALMNLPLENGEGLHLLYYPTGAGSEPHHDYLAPTNAANRESIARSGQRVSTLVTYLNDA 226
Query: 206 EEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHG 265
EGG+TVFP + GL++ P G+A F + +D SLH
Sbjct: 227 PEGGQTVFP-------------------QLGLAVSPIRGNACYFEYCDGNGRVDARSLHA 267
Query: 266 GCPVIKGNKWSSTKWIR 282
PV +G+KW TKW+R
Sbjct: 268 SAPVTRGDKWVMTKWMR 284
>gi|426255746|ref|XP_004021509.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Ovis
aries]
Length = 534
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 71/204 (34%), Positives = 104/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVLAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|66820122|ref|XP_643703.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
gi|60471803|gb|EAL69758.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
Length = 221
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 77/211 (36%), Positives = 110/211 (52%), Gaps = 38/211 (18%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
+S PR + FL+ EECE+LI+ + +R + S R+ G F+ G
Sbjct: 28 LSQAPRIYRIPGFLTDEECEFLIDTSKNKLRPCNEISSGVH-------RSGWGLFMKEGE 80
Query: 140 D--KIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG------ 191
+ +I ++I ++ F E+ E +QV+ Y G++ HFDYF T NG
Sbjct: 81 EDHQITKNIFNKMKSFVNIS-ESCEVMQVIRYNQGEETSSHFDYFNPL--TTNGSMKIGL 137
Query: 192 -GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
GQR+ T+LMYL DVEEGGET FP + G+ +KP GDA+LF+
Sbjct: 138 YGQRVCTILMYLCDVEEGGETTFP-------------------EVGIKVKPIKGDAVLFY 178
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ KP+ +DP SLH G PV+KGNKW + K I
Sbjct: 179 NCKPNGDVDPLSLHQGDPVLKGNKWVAIKLI 209
>gi|390176896|ref|XP_002136934.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
gi|388858831|gb|EDY67492.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
Length = 513
Score = 127 bits (320), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 66/205 (32%), Positives = 112/205 (54%), Gaps = 21/205 (10%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHM-RKSTVVDSDTGKSKDSRVRTSSGTFLA 136
E++S P +YH+ ++ E L NL+ PHM R++ + + RTS+ +L
Sbjct: 316 EILSLSPYMVLYHDVITPLESLTLKNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNSVWLT 375
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ ++ +E+R+ T F +EN E Q+++Y G Y+PH D+F + + GG R+A
Sbjct: 376 SHENAVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHF-ETPQHRGGGDRIA 434
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVL YLSDV +GG T+FP + +S++P+ GDALL++++
Sbjct: 435 TVLFYLSDVPQGGATLFP-------------------RLNISVQPRQGDALLWYNLNDRG 475
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWI 281
+ ++H CP+I+G+KW+ KWI
Sbjct: 476 QGEIGTVHTSCPIIQGSKWALVKWI 500
>gi|374620441|ref|ZP_09692975.1| 2OG-Fe(II) oxygenase superfamily enzyme [gamma proteobacterium
HIMB55]
gi|374303668|gb|EHQ57852.1| 2OG-Fe(II) oxygenase superfamily enzyme [gamma proteobacterium
HIMB55]
Length = 570
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 77/214 (35%), Positives = 109/214 (50%), Gaps = 27/214 (12%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
E S +P V +N +S EC YLI LA PH++++ VV D G K+S RT S +L
Sbjct: 15 EAYSLDPLVGVRNNVISPVECAYLIELAKPHIKRAGVV-LDEGY-KESEGRTGSNHWLKY 72
Query: 138 GRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-----MDEFNTKNGG 192
D +++ + +RI+D PLE E +Q++HY Q+Y PHFD F + K GG
Sbjct: 73 DEDDVVQSVGQRISDIVGLPLEYAESMQIIHYGPEQEYRPHFDAFNLSLPKGQRAAKWGG 132
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
QR+ T L+YL+ VE GG T FP K G+++ G ++F +
Sbjct: 133 QRLVTALVYLNKVEAGGATQFP-------------------KLGITVPALPGRMVIFHNT 173
Query: 253 KPDAS-LDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
D S P SLH G PV G KW+ W R+ +
Sbjct: 174 THDISGPHPLSLHAGMPVEAGEKWAFNMWFRLQD 207
>gi|312032356|ref|NP_001185665.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Gallus
gallus]
Length = 536
Score = 127 bits (320), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 72/203 (35%), Positives = 102/203 (50%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR + + +S EE E + LA P + ++TV D +TGK + R S +L+ +
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPV 395
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 396 VSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 455
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++ P
Sbjct: 456 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFPSGEG 496
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D S+ H CPV+ GNKW S KW+
Sbjct: 497 DYSTRHAACPVLVGNKWVSNKWL 519
>gi|242018356|ref|XP_002429643.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
humanus corporis]
gi|212514628|gb|EEB16905.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
humanus corporis]
Length = 534
Score = 127 bits (319), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 68/203 (33%), Positives = 104/203 (51%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +YH+ LS E + + LA P +++TV +S+TGK + + R S +L
Sbjct: 333 DPRIVLYHDVLSDREIKTIQQLAVPRFKRATVQNSETGKLEVAHYRISKSAWLEDVDHPY 392
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQRMATV 198
+ + +R+ D T + E LQV++Y G YEPHFD+ E F + G R+AT+
Sbjct: 393 VAKVSQRVEDITGLNMATAESLQVVNYGIGGHYEPHFDFARKEEKNAFQSLGTGNRIATI 452
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV +GG TVFP + V W PK G A +++++ +
Sbjct: 453 LFYMSDVSQGGATVFPGIK-----VSLW--------------PKKGTAAFWYNLRKNGEG 493
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D + H CPV+ G+KW KWI
Sbjct: 494 DYLTRHAACPVLTGSKWVCNKWI 516
>gi|194765194|ref|XP_001964712.1| GF22904 [Drosophila ananassae]
gi|190614984|gb|EDV30508.1| GF22904 [Drosophila ananassae]
Length = 547
Score = 127 bits (319), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 72/209 (34%), Positives = 104/209 (49%), Gaps = 23/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E EP +YH+ + E E + +A P R++TV +S TG + + R S +L
Sbjct: 336 LEEAHQEPYIVIYHDAMYDSEIELIKRMARPRFRRATVQNSVTGALETANYRISKSAWLK 395
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 192
D +I + +R AD T +++ E LQV++Y G YEPHFD+ E F N G
Sbjct: 396 TEEDHVIGTVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDFARKEEKRAFEGLNLG 455
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+ATVL Y+SDVE+GG TVF + ++ PK G A + ++
Sbjct: 456 NRIATVLFYMSDVEQGGATVFTSLHT-------------------ALFPKKGTAAFWMNL 496
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
D D + H CPV+ G KW S KWI
Sbjct: 497 HRDGEGDVRTRHAACPVLTGTKWVSNKWI 525
>gi|195452726|ref|XP_002073473.1| GK14136 [Drosophila willistoni]
gi|194169558|gb|EDW84459.1| GK14136 [Drosophila willistoni]
Length = 550
Score = 127 bits (319), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 72/209 (34%), Positives = 106/209 (50%), Gaps = 23/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E +P +YH+ + E + + +A P R++TV +S TG + + R S +L
Sbjct: 339 LEEAHMDPYIVIYHDAMYDSEMDLIKRMARPRFRRATVQNSVTGALETANYRISKSAWLK 398
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 192
D++I + +R AD T +++ E LQV++Y G YEPHFD+ E F N G
Sbjct: 399 TEEDQVIGTVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDFARREEKRAFEGLNLG 458
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+ATVL Y+SDVE+GG TVF ++ A W PK G A + ++
Sbjct: 459 NRIATVLFYMSDVEQGGATVFT----SLHAALW---------------PKKGTAAFWMNL 499
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
D D + H CPV+ G KW S KWI
Sbjct: 500 HRDGEGDVRTRHAACPVLTGTKWVSNKWI 528
>gi|312032354|ref|NP_001185664.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Gallus
gallus]
Length = 536
Score = 127 bits (318), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 72/203 (35%), Positives = 102/203 (50%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR + + +S EE E + LA P + ++TV D +TGK + R S +L+ +
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPV 395
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 396 VSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 455
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++ P
Sbjct: 456 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFPSGEG 496
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D S+ H CPV+ GNKW S KW+
Sbjct: 497 DYSTRHAACPVLVGNKWVSNKWL 519
>gi|326923463|ref|XP_003207955.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Meleagris gallopavo]
Length = 536
Score = 127 bits (318), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 72/203 (35%), Positives = 102/203 (50%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR + + +S EE E + LA P + ++TV D +TGK + R S +L+ +
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPV 395
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 396 VSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 455
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++ P
Sbjct: 456 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFPSGEG 496
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D S+ H CPV+ GNKW S KW+
Sbjct: 497 DYSTRHAACPVLVGNKWVSNKWL 519
>gi|301613004|ref|XP_002936004.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
(Silurana) tropicalis]
Length = 526
Score = 127 bits (318), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 69/203 (33%), Positives = 104/203 (51%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR YH+ +S EE + LA P +R++T+ + TG + ++ R + +L+ D +
Sbjct: 326 KPRIVRYHDIISDEEISKVKELAKPRLRRATISNPITGVLETAQYRITKSAWLSGYEDPV 385
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ + +RI T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 386 VARLNRRIEGVTGLDMSTAEELQVANYGIGGQYEPHFDFLRKYEPDAFKKLGTGNRVATW 445
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDVE GG TVFP + G ++ PK G A+ ++++
Sbjct: 446 LFYMSDVEAGGATVFP-------------------EVGAAVYPKKGTAVFWYNLLESGEG 486
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D S+ H CPV+ GNKW S KWI
Sbjct: 487 DYSTRHAACPVLVGNKWVSNKWI 509
>gi|327267604|ref|XP_003218589.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Anolis
carolinensis]
Length = 542
Score = 127 bits (318), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 100/202 (49%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR + +S EE E + LA P + ++TV D TGK + R S +L+ + I+
Sbjct: 343 PRIVRFVEIISDEEIETVKELAKPRLSRATVHDPQTGKLTTAHYRVSKSAWLSGYENPIV 402
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATVL 199
I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 403 ARINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWL 462
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDV GG TVFP + G S+ P+ G A+ ++++ P D
Sbjct: 463 FYMSDVSAGGATVFP-------------------EVGASVWPRKGTAVFWYNLFPSGEGD 503
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
S+ H CPV+ GNKW S KWI
Sbjct: 504 YSTRHAACPVLVGNKWVSNKWI 525
>gi|242051901|ref|XP_002455096.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
gi|241927071|gb|EES00216.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
Length = 303
Score = 127 bits (318), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 73/204 (35%), Positives = 111/204 (54%), Gaps = 18/204 (8%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
+SW PR F+Y FLS EC++LI++A + S VV G + A
Sbjct: 62 LSWHPRVFLYEGFLSDMECDHLISMAHGKKQSSLVVGGSAGNNSQG----------ASIE 111
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
D I+ IE RI+ ++F P + GE +Q+L YE + DY E + +G R+ TVL
Sbjct: 112 DTIVSTIEDRISVWSFLPKDFGESMQILKYEVNKS-----DYNNYESQSSSGHDRLVTVL 166
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
MYLSDV+ GGET FP ++ + V SEC G +++P G+A+L +++KPD +D
Sbjct: 167 MYLSDVKRGGETAFPRSELKGTKVELAAP-SEC--AGYAVQPVRGNAILLFNLKPDGVID 223
Query: 260 PSSLHGGCPVIKGNKWSSTKWIRV 283
S + C V++G +W + K I +
Sbjct: 224 KDSQYEMCSVLEGEEWLAIKHIHL 247
>gi|212530|gb|AAA49002.1| prolyl 4-hydroxylase, alpha subunit (EC 1.14.11.2), partial [Gallus
gallus]
Length = 489
Score = 127 bits (318), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 72/203 (35%), Positives = 102/203 (50%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR + + +S EE E + LA P + ++TV D +TGK + R S +L+ +
Sbjct: 289 KPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPV 348
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 349 VSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 408
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++ P
Sbjct: 409 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFPSGEG 449
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D S+ H CPV+ GNKW S KW+
Sbjct: 450 DYSTRHAACPVLVGNKWVSNKWL 472
>gi|354483225|ref|XP_003503795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
[Cricetulus griseus]
Length = 534
Score = 126 bits (317), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 70/204 (34%), Positives = 102/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGNLETVHYRISKSAWLSGYEDPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFQELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|157114985|ref|XP_001658091.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
gi|108877086|gb|EAT41311.1| AAEL007038-PA [Aedes aegypti]
Length = 545
Score = 126 bits (317), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 71/209 (33%), Positives = 107/209 (51%), Gaps = 23/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E + +P +YH+ +S+ E E + LA P R++TV + TG+ + + R S +L
Sbjct: 335 LEEANLKPYIVIYHDVISEAEMELVKRLAKPRFRRATVQNYKTGELEVANYRISKSAWLK 394
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 192
I+ I +R+ D T + E LQV++Y G YEPHFD+ E F + G
Sbjct: 395 DHEHPYIKAIGERVEDMTGLTMSTAEELQVVNYGIGGHYEPHFDFARREETNAFKSLGTG 454
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+ATVL Y+SDV +GG TVFP+ + L++ PK G A ++++
Sbjct: 455 NRIATVLFYMSDVTQGGATVFPSLR-------------------LALWPKKGAAAFWFNL 495
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
D S+ H CPV+ G KW S KWI
Sbjct: 496 HASGQGDYSTRHAACPVLTGTKWVSNKWI 524
>gi|74224984|dbj|BAE38205.1| unnamed protein product [Mus musculus]
Length = 534
Score = 126 bits (317), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 70/204 (34%), Positives = 102/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVHYRISKSAWLSGYEDPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|129365|sp|P16924.1|P4HA1_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1
Length = 516
Score = 126 bits (317), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 72/203 (35%), Positives = 102/203 (50%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR + + +S EE E + LA P + ++TV D +TGK + R S +L+ +
Sbjct: 316 KPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPV 375
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 376 VSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 435
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++ P
Sbjct: 436 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFPSGEG 476
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D S+ H CPV+ GNKW S KW+
Sbjct: 477 DYSTRHAACPVLVGNKWVSNKWL 499
>gi|74225936|dbj|BAE28745.1| unnamed protein product [Mus musculus]
Length = 561
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 70/204 (34%), Positives = 102/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVHYRISKSAWLSGYEDPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|33859596|ref|NP_035160.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Mus musculus]
gi|20455506|sp|Q60715.2|P4HA1_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|16307134|gb|AAH09654.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide [Mus musculus]
gi|74144306|dbj|BAE36020.1| unnamed protein product [Mus musculus]
gi|74146660|dbj|BAE41331.1| unnamed protein product [Mus musculus]
gi|148700260|gb|EDL32207.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_a [Mus
musculus]
Length = 534
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 70/204 (34%), Positives = 102/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVHYRISKSAWLSGYEDPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|380025232|ref|XP_003696381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Apis florea]
Length = 537
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 67/203 (33%), Positives = 107/203 (52%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR VYHN + +E E + +A P +++TV + TG + + R S +L K
Sbjct: 329 DPRIVVYHNVIYDDEIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKH 388
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQRMATV 198
+ + +R+ T ++ E LQV++Y G YEPHFD+ E F + G R+ATV
Sbjct: 389 VAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATV 448
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDVE+GG TVF +A+ +++ PK G A ++++KP+
Sbjct: 449 LYYMSDVEQGGGTVF-------TAI------------NIALWPKKGSAAFWYNLKPNGEG 489
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D + H CPV+ G+KW + KW+
Sbjct: 490 DFKTRHAACPVLTGSKWVANKWL 512
>gi|449280261|gb|EMC87600.1| Prolyl 4-hydroxylase subunit alpha-1 [Columba livia]
Length = 536
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 102/203 (50%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR + + +S EE E + LA P + ++TV D +TGK + R S +L+ +
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPV 395
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 396 VSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 455
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ P+ G A+ ++++ P
Sbjct: 456 LFYMSDVSAGGATVFP-------------------EVGASVWPRKGTAVFWYNLFPSGEG 496
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D S+ H CPV+ GNKW S KW+
Sbjct: 497 DYSTRHAACPVLVGNKWVSNKWL 519
>gi|47213360|emb|CAF90979.1| unnamed protein product [Tetraodon nigroviridis]
Length = 511
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 78/230 (33%), Positives = 111/230 (48%), Gaps = 36/230 (15%)
Query: 69 DEGRAEQWV-----EVISWE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKS 122
D GR ++V + W+ PR YH+ LS E E + LA P +R++TV D TG+
Sbjct: 286 DNGRHPKYVIGPVKQEDEWDHPRIVRYHDVLSNREMEKVKELARPRLRRATVHDPRTGQL 345
Query: 123 KDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF 182
+ R S +L I+ I +RI D T + E LQV +Y G +YEPHFD+
Sbjct: 346 TTAPYRVSKSAWLGAFEHPIVDQINQRIEDITGLDVSTAEDLQVANYGVGGQYEPHFDFG 405
Query: 183 M----DEFNTKNGGQRMATVLMY-------LSDVEEGGETVFPNAQGNISAVPWWNELSE 231
D F G R+AT L+Y +SDV+ GG TVF +
Sbjct: 406 QKDEPDAFEELGTGNRIATWLLYVSAAVLRMSDVQAGGATVFTD---------------- 449
Query: 232 CGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
G S+ P+ G A+ +++++P D + H CPV+ GNKW S KWI
Sbjct: 450 ---IGASVLPQKGSAVFWYNLRPSGDGDYRTRHAACPVLLGNKWVSNKWI 496
>gi|149038788|gb|EDL93077.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha 1 polypeptide, isoform CRA_b
[Rattus norvegicus]
Length = 534
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 70/204 (34%), Positives = 102/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVHYRISKSAWLSGYEDPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|328790718|ref|XP_392392.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Apis mellifera]
Length = 415
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 67/203 (33%), Positives = 107/203 (52%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR VYHN + +E E + +A P +++TV + TG + + R S +L K
Sbjct: 207 DPRIVVYHNVIYDDEIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKH 266
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQRMATV 198
+ + +R+ T ++ E LQV++Y G YEPHFD+ E F + G R+ATV
Sbjct: 267 VAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATV 326
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDVE+GG TVF +A+ +++ PK G A ++++KP+
Sbjct: 327 LYYMSDVEQGGGTVF-------TAI------------NIALWPKKGSAAFWYNLKPNGEG 367
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D + H CPV+ G+KW + KW+
Sbjct: 368 DFKTRHAACPVLTGSKWVANKWL 390
>gi|344254200|gb|EGW10304.1| Prolyl 4-hydroxylase subunit alpha-1 [Cricetulus griseus]
Length = 507
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 102/203 (50%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ D +
Sbjct: 307 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGNLETVHYRISKSAWLSGYEDPV 366
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 367 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFQELGTGNRIATW 426
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 427 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 467
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D S+ H CPV+ GNKW S KW+
Sbjct: 468 DYSTRHAACPVLVGNKWVSNKWL 490
>gi|255545252|ref|XP_002513687.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223547595|gb|EEF49090.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 309
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 75/204 (36%), Positives = 114/204 (55%), Gaps = 12/204 (5%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
+SW PR F+Y FL+ EEC+ LI+LA H K + G + ++ +S +
Sbjct: 61 LSWRPRVFLYKGFLTDEECDRLISLA--HGAKE--ISKGKGDGSRNNIQLASSESRSHIY 116
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
D ++ IE+RI+ +TF P EN + LQV+HY + E HFDYF D + MAT++
Sbjct: 117 DDLLARIEERISAWTFIPKENSKPLQVMHYGIEEARE-HFDYF-DNKTLISNVSLMATLV 174
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
+YLS+V GGE +FP ++ + W S+C K ++P G+A+L ++ +AS D
Sbjct: 175 LYLSNVTRGGEILFPKSE--LKDKVW----SDCTKDSSILRPVKGNAVLIFNAHLNASAD 228
Query: 260 PSSLHGGCPVIKGNKWSSTKWIRV 283
S HG CPV++G W +TK V
Sbjct: 229 SRSTHGRCPVLEGEMWCATKQFLV 252
>gi|321474875|gb|EFX85839.1| hypothetical protein DAPPUDRAFT_309105 [Daphnia pulex]
Length = 545
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 71/202 (35%), Positives = 102/202 (50%), Gaps = 21/202 (10%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR VYH+ +S EE E + LA P ++TV ++G+ + SR R + +L
Sbjct: 345 KPRIVVYHDIISDEEIETIKRLAQPRFERATVQKKESGEREFSRYRIAKSAWLKHEEHDY 404
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD-EFNTKNG-GQRMATVLM 200
+ DI R+ D T + E LQV +Y G YEPH+DY E G G R+AT L
Sbjct: 405 VSDINFRVGDITGLDMATSEDLQVCNYGIGGHYEPHYDYARKGEVQQDFGWGGRIATWLF 464
Query: 201 YLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDP 260
Y+SDVE GG TVFP K LS+ P+ G A ++++ P+ +
Sbjct: 465 YMSDVEAGGATVFP-------------------KLNLSLWPQKGSAAFWFNLYPNGEGNE 505
Query: 261 SSLHGGCPVIKGNKWSSTKWIR 282
+ H GCPV+ G+KW + WI
Sbjct: 506 MTQHAGCPVLTGSKWVANYWIH 527
>gi|321474876|gb|EFX85840.1| hypothetical protein DAPPUDRAFT_309107 [Daphnia pulex]
Length = 528
Score = 126 bits (316), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 105/211 (49%), Gaps = 25/211 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+P +YH + E + + LA P +++ V D DTG+S + R + FL +
Sbjct: 328 KPLLVIYHGVIFDAEIDVVKKLAQPRFKRTGVTDRDTGRSMPVQYRIAKAAFLKDSEHNL 387
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD-EFNTKNG---GQRMATV 198
I + +R+ D T + E LQV +Y G Y PHFDY E + G R+AT
Sbjct: 388 IVKMSRRVGDITGLDMAASEDLQVCNYGIGGHYVPHFDYARQGEIHGPRDLDWGNRIATW 447
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDVE GG TVFP G ++ P+ G A +++++P+ +
Sbjct: 448 LFYMSDVEAGGATVFP-------------------AVGAALWPQKGSAAFWYNLRPNGNG 488
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI--RVNEYK 287
D +LH GCPV+ G+KW S KWI R E++
Sbjct: 489 DEDTLHAGCPVLTGSKWVSNKWIHERSQEFR 519
>gi|334314087|ref|XP_003339988.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
[Monodelphis domestica]
Length = 537
Score = 125 bits (315), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 71/204 (34%), Positives = 102/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H +S E E + +LA P +R++T+ + TG + + R S +L+ D +
Sbjct: 337 KPRIVRFHEIISDAEIEIVKDLAKPRLRRATISNPITGVLETAHYRISKSAWLSGYEDPV 396
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 397 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 456
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 457 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 497
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KWI
Sbjct: 498 DYSTRHAACPVLVGNKWVSNKWIH 521
>gi|195505190|ref|XP_002099397.1| GE10881 [Drosophila yakuba]
gi|194185498|gb|EDW99109.1| GE10881 [Drosophila yakuba]
Length = 487
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 70/209 (33%), Positives = 105/209 (50%), Gaps = 23/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E +P +YH+ + E + + +A P R++TV +S TG + + R S +L
Sbjct: 276 LEEAHADPYIVIYHDAMYDSEIDVIKRMARPRFRRATVQNSVTGALETANYRISKSAWLK 335
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 192
D++I + +R AD T +E+ E LQV++Y G YEPHFD+ E F N G
Sbjct: 336 THEDRVIGTVVQRTADMTGLDMESAEELQVVNYGIGGHYEPHFDFARKEEERAFEGLNLG 395
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+ATVL Y+SDVE+GG TVF + ++ P+ G A + ++
Sbjct: 396 NRIATVLFYMSDVEQGGATVFTSLH-------------------TALFPRKGTAAFWMNL 436
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
D D + H CPV+ G KW S KWI
Sbjct: 437 HRDGQGDVRTRHAACPVLTGTKWVSNKWI 465
>gi|432949777|ref|XP_004084253.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Oryzias
latipes]
Length = 532
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 74/200 (37%), Positives = 100/200 (50%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y N LS +E E + LA P + ++TV D TG + R S +L D +I
Sbjct: 335 PHIVRYLNILSDQEIEKIKELAKPRLARATVRDPKTGVLTTAPYRVSKSAWLEGEDDPVI 394
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEF--NTKNGGQRMATVLMY 201
+ +RI D T +E E LQV +Y G +YEPHFD+ F N K G R+AT L Y
Sbjct: 395 DRVNQRIQDITGLTVETAELLQVANYGVGGQYEPHFDFSRRPFDSNLKVDGNRLATFLNY 454
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G SI P+ G A+ ++++ D
Sbjct: 455 MSDVEAGGATVFPD-------------------FGASIWPRKGTAVFWYNLFRSGEGDYR 495
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G+KW S KWI
Sbjct: 496 TRHAACPVLVGSKWVSNKWI 515
>gi|387016440|gb|AFJ50339.1| Prolyl 4-hydroxylase subunit alpha-1-like [Crotalus adamanteus]
Length = 543
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 70/202 (34%), Positives = 103/202 (50%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR + + +S EE E + L+ P +R++T+ + TG + + R S +L+ + ++
Sbjct: 344 PRIVRFLDIISNEEIEKVKELSKPRLRRATISNPITGVLETAHYRISKSAWLSGYENPVV 403
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATVL 199
I +RI D T + E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 404 ARINQRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWL 463
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDV GG TVFP + G S+ PK G A+ ++++ P D
Sbjct: 464 FYMSDVAAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFPSGEGD 504
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
S+ H CPV+ GNKW S KWI
Sbjct: 505 YSTRHAACPVLVGNKWVSNKWI 526
>gi|194905294|ref|XP_001981167.1| GG11919 [Drosophila erecta]
gi|190655805|gb|EDV53037.1| GG11919 [Drosophila erecta]
Length = 533
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 75/214 (35%), Positives = 109/214 (50%), Gaps = 30/214 (14%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
E I +P +YH LS E L+ A +M K+T V S+ + + R RT+ G +L +
Sbjct: 320 EQIGLKPYVVLYHEVLSAREISMLMGKAAQNM-KNTRVQSEKAVNTN-RERTAKGYWLKK 377
Query: 138 GRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG------ 191
+++ R I +RI D T F L + E QV++Y G Y HFDYF + G
Sbjct: 378 ESNEMTRRITRRIVDMTGFDLADSEDFQVINYGIGGHYSLHFDYFGFASSNYTGERSHHS 437
Query: 192 ---GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALL 248
G R+ATVL YL+DVE+GG TVF G G S+ P+ G A+
Sbjct: 438 IVLGDRIATVLFYLTDVEQGGATVF-------------------GNVGYSVYPQAGTAIF 478
Query: 249 FWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
++++ D + DP + H CPV+ G+KW T+WI
Sbjct: 479 WYNLDTDGNGDPLTRHASCPVVVGSKWVMTEWIH 512
>gi|195390833|ref|XP_002054072.1| GJ22994 [Drosophila virilis]
gi|194152158|gb|EDW67592.1| GJ22994 [Drosophila virilis]
Length = 496
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 76/207 (36%), Positives = 109/207 (52%), Gaps = 22/207 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E+ P V+H+ LS E + L LA P + ++TVV + KDSR RTS GT++
Sbjct: 296 MEIRLLNPFIIVFHDVLSPREIDELQKLARPLLERTTVVKFKKYE-KDSR-RTSKGTWIE 353
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEF-NTKNGGQRM 195
R + + + IE+RI D L E QV++Y G Y H D+ D + + K R+
Sbjct: 354 RDHNNLTKRIERRITDMVELDLRYSEPFQVMNYGLGGHYAAHEDFLGDTWADKKEEDDRI 413
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
ATVL YL+DVE+GG TVF +S PK G AL ++++ +
Sbjct: 414 ATVLFYLTDVEQGGATVFTILNQAVS-------------------PKRGTALFWYNLHRN 454
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ D +LHGGCPV+ G+KW T WIR
Sbjct: 455 GTGDTRTLHGGCPVLVGSKWIMTLWIR 481
>gi|398806116|ref|ZP_10565064.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
gi|398089832|gb|EJL80333.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
Length = 294
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 72/207 (34%), Positives = 100/207 (48%), Gaps = 24/207 (11%)
Query: 79 VISWE---PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
V+++E PR V NFLS EEC+ L A P +TVVD + R++ L
Sbjct: 87 VVTFEQLAPRIVVLDNFLSSEECDGLCEEARPAFAPATVVDPHQDAVHAAHFRSNDSAQL 146
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
+++R +E RI T +P E LQ+ Y GQ Y PH+D+F + GGQR+
Sbjct: 147 PAAGSELVRRVEARIERLTGWPSAFCETLQLQRYAQGQDYRPHYDFFGQDMVEAQGGQRL 206
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
AT+++YL E GG T F N G+ I P+ G AL F PD
Sbjct: 207 ATLILYLRAPEAGGATYFAN-------------------LGMRIAPRKGSALFF--TYPD 245
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ +LHGG V+ G KW +T+W R
Sbjct: 246 PGNNSGTLHGGEAVLAGEKWIATQWFR 272
>gi|328876967|gb|EGG25330.1| putative prolyl 4-hydroxylase alpha subunit [Dictyostelium
fasciculatum]
Length = 244
Score = 125 bits (314), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 80/229 (34%), Positives = 118/229 (51%), Gaps = 40/229 (17%)
Query: 64 ERSEGDEGRAEQWVEVI--SWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGK 121
E G++ E+ ++I S PR + +FLS ECE+LI+++ +R + S
Sbjct: 11 EVVSGEKCETEKLPKLIEMSQCPRVYRVPDFLSPAECEHLIDISKNKLRPCNEISSGVH- 69
Query: 122 SKDSRVRTSSGTFLARGRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHF 179
R+ G F+ G + +++ I +R+ EN E +QV+ Y G++ H+
Sbjct: 70 ------RSGWGLFMKEGEEDHDVVKKIFQRMKMLVNL-TENCEVMQVIRYHPGEETSAHY 122
Query: 180 DYFMDEFNTKNG-------GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSEC 232
DYF T NG GQR+ T+LMYLS+VEEGGET FP
Sbjct: 123 DYFNPL--TTNGAMKIGLYGQRVCTILMYLSEVEEGGETSFP------------------ 162
Query: 233 GKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ G+ +KP GDA+LF++ KP+ +DP SLH G PVIKG KW + K I
Sbjct: 163 -EVGVKVKPVKGDAVLFYNCKPNGEVDPLSLHQGDPVIKGTKWVAIKLI 210
>gi|291230950|ref|XP_002735430.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saccoglossus
kowalevskii]
Length = 533
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 67/209 (32%), Positives = 107/209 (51%), Gaps = 23/209 (11%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EV+ +P+ ++H+ + E + LA+P +R++T+ +S TG + + R S +L+
Sbjct: 327 EVVFDKPKLIIFHDAILTNEIRKVKALASPRLRRATIQNSVTGNLEFAEYRISKSAWLSE 386
Query: 138 GRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQ 193
++ + RI +T ++ E LQV +Y G YEPHFD+ E F + N G
Sbjct: 387 DDGDVVHRLNHRIEQYTGLTMDTAEELQVANYGLGGHYEPHFDFARKEEINAFKSLNTGN 446
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R+AT L Y+SDVE GG TVFP + G + P+ G A ++++
Sbjct: 447 RIATFLFYMSDVEAGGATVFP-------------------QVGARLIPEKGSAAFWYNLL 487
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ D S+ H CPV+ G+KW S KWI
Sbjct: 488 KNGEGDYSTRHAACPVLVGSKWVSNKWIH 516
>gi|321474952|gb|EFX85916.1| hypothetical protein DAPPUDRAFT_45616 [Daphnia pulex]
Length = 537
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 67/204 (32%), Positives = 103/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+P VYH+ +S +E E + +A P +++T+ +S TG+ + + R S +L
Sbjct: 335 KPMIVVYHDVMSDDEIETVKKMAKPRFKRATIRNSKTGELEPANYRISKSAWLKSEEHDH 394
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQRMATV 198
I + +R+ D T + E LQV++Y G YEPHFDY E F G R+AT
Sbjct: 395 ILKVTRRVGDITGLDMSTAEDLQVVNYGIGGHYEPHFDYARTETTEAFKELGWGNRIATW 454
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDVE GG TVFP TG ++ P+ G A ++++ P+
Sbjct: 455 LFYMSDVEAGGATVFP-------------------PTGAAVWPRKGSAAFWYNLYPNGKG 495
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
+ + H CPV+ G+KW S +WI
Sbjct: 496 NELTRHAACPVLSGSKWVSNRWIH 519
>gi|195110919|ref|XP_002000027.1| GI24860 [Drosophila mojavensis]
gi|193916621|gb|EDW15488.1| GI24860 [Drosophila mojavensis]
Length = 487
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 71/203 (34%), Positives = 105/203 (51%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+P +YH+ + E E L +A P R++TV ++ TG + + R S +L ++
Sbjct: 282 DPYIVIYHDAMFDSEIEVLKRMARPRFRRATVQNAVTGALETANYRISKSAWLKTAEHRV 341
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQRMATV 198
I + +R AD T +++ E LQV++Y G YEPHFD+ E F N G R+ATV
Sbjct: 342 IGTVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDFARREEIRAFEGLNLGNRIATV 401
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDVE+GG TVF ++ AV +KPK G A + ++
Sbjct: 402 LFYMSDVEQGGATVFT----SLHAV---------------LKPKKGTAAFWMNLHRSGEG 442
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D + H CPV+ G+KW S KWI
Sbjct: 443 DVRTRHAACPVLTGSKWVSNKWI 465
>gi|410914996|ref|XP_003970973.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Takifugu
rubripes]
Length = 538
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 73/202 (36%), Positives = 100/202 (49%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y +FLS EE E + LA P + ++TV D +G + R S +L D II
Sbjct: 339 PNIVRYLDFLSNEEIEKIKELAKPKLARATVRDPKSGVLTTASYRVSKSAWLEGEEDPII 398
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATVL 199
+ +RI D T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 399 ARVNQRIEDLTGLTVKTAELLQVANYGVGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFL 458
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I P+ G A+ ++++ D
Sbjct: 459 NYMSDVEAGGATVFPD-------------------FGAAIWPRKGTAVFWYNLFKSGEGD 499
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ GNKW S KWI
Sbjct: 500 YRTRHAACPVLVGNKWVSNKWI 521
>gi|198477152|ref|XP_002136738.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
gi|198145043|gb|EDY71755.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
Length = 517
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 69/209 (33%), Positives = 115/209 (55%), Gaps = 25/209 (11%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSK---DSRVRTSSGTF 134
E++S P +YH+ ++ E L NL+ P M++ +V + K + DS RTS+ +
Sbjct: 316 EILSLSPYMVLYHDVITPLESLTLKNLSKPLMKRRAMVMVNNLKVRPFIDSG-RTSNSVW 374
Query: 135 LARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM--DEFNTKNGG 192
LA + ++ +E+R+ T F +EN E Q+++Y G Y+PH D+F + GG
Sbjct: 375 LASHENAVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFETPQAPEHRGGG 434
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+ATVL YLSDV +GG T+FP + +S++P+ GDALL++++
Sbjct: 435 DRIATVLFYLSDVPQGGATLFP-------------------RLNISVQPRQGDALLWYNL 475
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ ++H CP+I+G+KW+ KWI
Sbjct: 476 NDRGQGEIGTVHTSCPIIQGSKWALVKWI 504
>gi|312032360|ref|NP_001185667.1| prolyl 4-hydroxylase subunit alpha-1 isoform 4 precursor [Gallus
gallus]
Length = 536
Score = 125 bits (313), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 102/203 (50%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR + + +S EE E + LA P +R++T+ + TG + + R S +L+ +
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALETAHYRISKSAWLSGYESPV 395
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 396 VSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 455
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++ P
Sbjct: 456 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFPSGEG 496
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D S+ H CPV+ GNKW S KW+
Sbjct: 497 DYSTRHAACPVLVGNKWVSNKWL 519
>gi|414587754|tpg|DAA38325.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
Length = 169
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 62/109 (56%), Positives = 75/109 (68%), Gaps = 2/109 (1%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EVISW PR V+HNFLS EEC+YL+ +A P ++ STVVD TGK S VRTSSG F+
Sbjct: 58 EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSDVRTSSGMFVNS 117
Query: 138 GRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD 184
K +++ IEKRI+ F+ P ENGE +QVL YEA Q Y PH DYF D
Sbjct: 118 EERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSD 166
>gi|344274274|ref|XP_003408942.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
[Loxodonta africana]
Length = 534
Score = 124 bits (312), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 69/204 (33%), Positives = 102/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ + +
Sbjct: 334 KPRIVRFHDIISDAEIEVVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP+ G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFPDV-------------------GASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|125772807|ref|XP_001357662.1| GA15946 [Drosophila pseudoobscura pseudoobscura]
gi|54637394|gb|EAL26796.1| GA15946 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 100/203 (49%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+P +YH+ + E + + +A P R++TV +S TG + + R S +L D +
Sbjct: 344 DPYIVIYHDAMYDSEMDLIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTEEDSV 403
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQRMATV 198
I + +R AD T +E+ E LQV++Y G Y PHFD+ E F N G R+ATV
Sbjct: 404 IAKVVQRTADMTGLDMESAEELQVVNYGIGGHYAPHFDFARREEKRAFEGLNLGNRIATV 463
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDVE+GG TVF + W PK G A + ++ D
Sbjct: 464 LFYMSDVEQGGATVFT----TLRTALW---------------PKRGTAAFWMNLHRDGEG 504
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D + H CPV+ G KW S KWI
Sbjct: 505 DKRTQHAACPVLTGTKWVSNKWI 527
>gi|301104296|ref|XP_002901233.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262101167|gb|EEY59219.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 535
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 86/253 (33%), Positives = 116/253 (45%), Gaps = 57/253 (22%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRV--------- 127
+E IS PR F HNF S EE + LI +++ +D + K + S V
Sbjct: 181 IESISESPRTFRLHNFFSGEEADKLI-------KRTLEIDDPSNKLQQSTVGANDNKNKK 233
Query: 128 -----RTSSGTFLARGRDKIIRDIEKRIAD---FTFFPLENGEGLQVLHYEAGQKYEPHF 179
RTS F + DI KR+ D F + +GLQ+L Y+ Q Y H
Sbjct: 234 KKSKHRTSENAFDTVSEAAV--DIRKRVFDVLSLGEFQADMADGLQLLRYQQKQAYIAHE 291
Query: 180 DYF----MDEFN---TKNGGQRMATVLMYLSDVEEGGETVFP----------------NA 216
DYF +FN K G R ATV +YLSDV GG+TVFP N+
Sbjct: 292 DYFPVGAAKDFNFDPHKGGSNRFATVFLYLSDVPRGGQTVFPLAEMPEGLPTEYQHPPNS 351
Query: 217 QGNISAV--------PWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCP 268
+ A+ W ++ T L+ P G A+LF+S KP+ LDP SLHGGCP
Sbjct: 352 AQDYEAIGAELFEPGSWEMDMVRKCSTKLASYPSKGGAVLFYSQKPNGELDPKSLHGGCP 411
Query: 269 VIKGNKWSSTKWI 281
V++G KW + W+
Sbjct: 412 VLEGTKWGANLWV 424
>gi|147834798|emb|CAN75013.1| hypothetical protein VITISV_039948 [Vitis vinifera]
Length = 282
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 73/194 (37%), Positives = 95/194 (48%), Gaps = 39/194 (20%)
Query: 127 VRTSSGTFLARGRDKI--IRDIEKRIADFTFFPLENGE---------------------- 162
+R SG F++ DK + IE++IA P +GE
Sbjct: 90 IRLCSGVFISASEDKTGTLDLIEQKIARVIMIPRTHGEIKPKENCLNWLGQVPPFEFVVM 149
Query: 163 -----------GLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGET 211
+L YE GQ+Y H+D F R+AT L+YLSDVEEGGET
Sbjct: 150 KRFLTDVVYHVAFNILRYEIGQRYNSHYDAFDPAEYGPQKSHRIATFLVYLSDVEEGGET 209
Query: 212 VFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIK 271
+FP G + + C GL +KP GD LLF+SM P+ ++DP+SLHG CPVIK
Sbjct: 210 MFPFENGLNMDKDY--DFQRC--IGLKVKPHQGDGLLFYSMFPNGTIDPTSLHGSCPVIK 265
Query: 272 GNKWSSTKWIRVNE 285
G KW +TKWIR E
Sbjct: 266 GEKWVATKWIRDQE 279
>gi|190788|gb|AAA36535.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
Length = 534
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 69/204 (33%), Positives = 102/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|380813206|gb|AFE78477.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
gi|384947328|gb|AFI37269.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
Length = 534
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 69/204 (33%), Positives = 102/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|355562502|gb|EHH19096.1| hypothetical protein EGK_19739 [Macaca mulatta]
gi|355782842|gb|EHH64763.1| hypothetical protein EGM_18071 [Macaca fascicularis]
gi|383418719|gb|AFH32573.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
mulatta]
Length = 534
Score = 124 bits (312), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 69/204 (33%), Positives = 102/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|395820526|ref|XP_003783615.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Otolemur
garnettii]
Length = 534
Score = 124 bits (311), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 69/204 (33%), Positives = 102/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|348523976|ref|XP_003449499.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
niloticus]
Length = 594
Score = 124 bits (311), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 67/202 (33%), Positives = 103/202 (50%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P YHN +S+++ E + LA P +R++T+ + TG + + R S +L ++
Sbjct: 395 PHIVRYHNIVSEKDMEKVKELAKPRLRRATISNPVTGVLETAHYRISKSAWLGAYEHPVV 454
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATVL 199
I + I D T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 455 DKINQLIEDVTGLNVKTAEDLQVANYGLGGQYEPHFDFGRKDEPDAFEELGTGNRIATWL 514
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
+Y++DV+ GG TVF + G ++KPK G A+ ++++ P D
Sbjct: 515 LYMTDVQAGGATVFTD-------------------IGAAVKPKKGTAVFWYNLYPSGEGD 555
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ GNKW S KWI
Sbjct: 556 YRTRHAACPVLLGNKWVSNKWI 577
>gi|63252888|ref|NP_001017962.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
sapiens]
gi|197099666|ref|NP_001125733.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Pongo abelii]
gi|217272849|ref|NP_001136067.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
sapiens]
gi|114631177|ref|XP_001140234.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Pan
troglodytes]
gi|114631181|ref|XP_001140652.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 7 [Pan
troglodytes]
gi|2507090|sp|P13674.2|P4HA1_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|75061858|sp|Q5RAG8.1|P4HA1_PONAB RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|602675|gb|AAA59068.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
gi|23271226|gb|AAH34998.1| Prolyl 4-hydroxylase, alpha polypeptide I [Homo sapiens]
gi|55729010|emb|CAH91242.1| hypothetical protein [Pongo abelii]
gi|56403853|emb|CAI29712.1| hypothetical protein [Pongo abelii]
gi|119574854|gb|EAW54469.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_c [Homo
sapiens]
gi|119574855|gb|EAW54470.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I, isoform CRA_d [Homo
sapiens]
gi|123981532|gb|ABM82595.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [synthetic
construct]
gi|123996359|gb|ABM85781.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [synthetic
construct]
gi|261861532|dbj|BAI47288.1| prolyl 4-hydroxylase, alpha polypeptide I [synthetic construct]
gi|410295852|gb|JAA26526.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
gi|410349611|gb|JAA41409.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 534
Score = 124 bits (311), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 69/204 (33%), Positives = 102/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|291404184|ref|XP_002718472.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 2
[Oryctolagus cuniculus]
Length = 534
Score = 124 bits (311), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 69/204 (33%), Positives = 102/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|449673565|ref|XP_002167120.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
magnipapillata]
Length = 571
Score = 124 bits (311), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 72/211 (34%), Positives = 108/211 (51%), Gaps = 22/211 (10%)
Query: 75 QWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTF 134
Q VE + +P F+ N +S+++ + A+P +R++T+ D TGK + + R S +
Sbjct: 364 QKVERVWVDPEIFILRNIISEKQINLIKEAASPMLRRATIQDPITGKLRHADYRISKSAW 423
Query: 135 LARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM---DEFNTKNG 191
L+ + ++ +E R T L E LQV +Y G YEPHFD+ D F
Sbjct: 424 LSTNKYNFLQALEARTQATTGLDLSYAEQLQVANYGLGGHYEPHFDHSRENEDRFTDLGM 483
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
G R+ATVL YLSDVE GG TVF GKT ++ P GDA+ +++
Sbjct: 484 GNRIATVLFYLSDVEAGGATVFT-----------------VGKT--AVFPSKGDAVFWFN 524
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+K + +P++ H CPV+ G KW S WI
Sbjct: 525 LKRNGKGNPNTRHAACPVLVGQKWVSNWWIH 555
>gi|410251926|gb|JAA13930.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
Length = 566
Score = 124 bits (311), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 69/204 (33%), Positives = 102/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|326923461|ref|XP_003207954.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
[Meleagris gallopavo]
Length = 536
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 102/203 (50%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR + + +S EE E + LA P +R++T+ + TG + + R S +L+ +
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALETAHYRISKSAWLSGYESPV 395
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 396 VSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 455
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++ P
Sbjct: 456 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFPSGEG 496
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D S+ H CPV+ GNKW S KW+
Sbjct: 497 DYSTRHAACPVLVGNKWVSNKWL 519
>gi|312032358|ref|NP_001185666.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Gallus
gallus]
Length = 536
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 102/203 (50%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR + + +S EE E + LA P +R++T+ + TG + + R S +L+ +
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALETAHYRISKSAWLSGYESPV 395
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 396 VSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 455
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++ P
Sbjct: 456 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFPSGEG 496
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D S+ H CPV+ GNKW S KW+
Sbjct: 497 DYSTRHAACPVLVGNKWVSNKWL 519
>gi|195159323|ref|XP_002020531.1| GL13463 [Drosophila persimilis]
gi|194117300|gb|EDW39343.1| GL13463 [Drosophila persimilis]
Length = 487
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 100/203 (49%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+P +YH+ + E + + +A P R++TV +S TG + + R S +L D +
Sbjct: 282 DPYIVIYHDAMYDSEMDLIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTEEDSV 341
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQRMATV 198
I + +R AD T +E+ E LQV++Y G Y PHFD+ E F N G R+ATV
Sbjct: 342 IAKVVQRTADMTGLDMESAEELQVVNYGIGGHYAPHFDFARREEKRAFEGLNLGNRIATV 401
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDVE+GG TVF + W PK G A + ++ D
Sbjct: 402 LFYMSDVEQGGATVFT----TLRTALW---------------PKRGTAAFWMNLHRDGEG 442
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D + H CPV+ G KW S KWI
Sbjct: 443 DKRTQHAACPVLTGTKWVSNKWI 465
>gi|218187602|gb|EEC70029.1| hypothetical protein OsI_00603 [Oryza sativa Indica Group]
Length = 549
Score = 124 bits (311), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 73/207 (35%), Positives = 115/207 (55%), Gaps = 21/207 (10%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKS-TVVDSDTGKSKDSRVRTSSGTFLARG 138
+SW PR F+Y FLS EC++L++ +M S D D R SS +
Sbjct: 309 LSWHPRIFLYEGFLSDMECDHLVSTGRGNMDSSLAFTDGD---------RNSSYNNI--- 356
Query: 139 RDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATV 198
D ++ IE RI+ ++F P ENGE +QVL Y ++ +E + GG +AT+
Sbjct: 357 EDIVVSKIEDRISLWSFLPKENGENIQVLKYGVNRR-----GSIKEEPKSSTGGHWLATI 411
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L+YLSDV++GGETVFP ++ A S+C +G +++P G+ALL ++++PD +
Sbjct: 412 LIYLSDVKQGGETVFPRSEMK-DAQAKEGAPSQC--SGYAVRPAKGNALLLFNLRPDGEI 468
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIRVNE 285
D S + CPV++G KW + K I + +
Sbjct: 469 DKDSQYEECPVLEGEKWLAIKHIHLRK 495
>gi|239792190|dbj|BAH72464.1| ACYPI007079 [Acyrthosiphon pisum]
Length = 249
Score = 124 bits (310), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 64/202 (31%), Positives = 106/202 (52%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR +Y + L E E + +A P ++++TV + TG+ + + R S +L D ++
Sbjct: 44 PRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFADYRISKSAWLKEHEDVVV 103
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQRMATVL 199
++ KR+ T E E LQV++Y G Y+PH+D+ E F + G R+ATVL
Sbjct: 104 ANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIATVL 163
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDV +GG TVF PW G++++P G A +++++ P + D
Sbjct: 164 FYMSDVAQGGATVF----------PW---------LGVALQPVKGTAAVWFNLYPSGNGD 204
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV++G+KW KW+
Sbjct: 205 LRTRHAACPVLQGSKWVCNKWL 226
>gi|224052167|ref|XP_002191912.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Taeniopygia
guttata]
Length = 536
Score = 124 bits (310), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 70/203 (34%), Positives = 101/203 (49%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR + + +S EE E + LA P + ++TV D +TGK + R S +L+ +
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPV 395
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 396 VSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 455
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ P+ G A+ ++++ P
Sbjct: 456 LFYMSDVSAGGATVFP-------------------EVGASVWPRKGTAVFWYNLFPSGEG 496
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D S+ H CPV+ GNKW KW+
Sbjct: 497 DYSTRHAACPVLVGNKWVFNKWL 519
>gi|328696638|ref|XP_003240086.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Acyrthosiphon pisum]
Length = 534
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 64/203 (31%), Positives = 106/203 (52%), Gaps = 23/203 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR +Y + L E E + +A P ++++TV + TG+ + + R S +L D ++
Sbjct: 329 PRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFADYRISKSAWLKEHEDVVV 388
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQRMATVL 199
++ KR+ T E E LQV++Y G Y+PH+D+ E F + G R+ATVL
Sbjct: 389 ANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIATVL 448
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDV +GG TVF PW G++++P G A +++++ P + D
Sbjct: 449 FYMSDVAQGGATVF----------PW---------LGVALQPVKGTAAVWFNLYPSGNGD 489
Query: 260 PSSLHGGCPVIKGNKWSSTKWIR 282
+ H CPV++G+KW KW+
Sbjct: 490 LRTRHAACPVLQGSKWVCNKWLH 512
>gi|346724248|ref|YP_004850917.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346648995|gb|AEO41619.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 418
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 74/212 (34%), Positives = 98/212 (46%), Gaps = 31/212 (14%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR Y LS +EC L+ LA PH+R S V+D + + + VRTS G L D II
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKVIDPNDASTGRAPVRTSHGATL----DPII 283
Query: 144 RDI-----EKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD---EFNTKNGGQRM 195
D + R+A PL + E L VL Y G++Y H DY + G R
Sbjct: 284 EDFAARAAQSRLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQ 343
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
TV +YL+DV GGET FP A G+ ++P+ G + F ++ D
Sbjct: 344 RTVCVYLNDVGAGGETEFPVA-------------------GVRVRPRPGTLVCFDNLHAD 384
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SLH G PV G+KW T W R Y+
Sbjct: 385 GRPDADSLHAGLPVTAGSKWLGTLWFRQQRYR 416
>gi|193688213|ref|XP_001943683.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Acyrthosiphon pisum]
Length = 552
Score = 124 bits (310), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 64/202 (31%), Positives = 106/202 (52%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR +Y + L E E + +A P ++++TV + TG+ + + R S +L D ++
Sbjct: 347 PRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFADYRISKSAWLKEHEDVVV 406
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQRMATVL 199
++ KR+ T E E LQV++Y G Y+PH+D+ E F + G R+ATVL
Sbjct: 407 ANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIATVL 466
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDV +GG TVF PW G++++P G A +++++ P + D
Sbjct: 467 FYMSDVAQGGATVF----------PW---------LGVALQPVKGTAAVWFNLYPSGNGD 507
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV++G+KW KW+
Sbjct: 508 LRTRHAACPVLQGSKWVCNKWL 529
>gi|260825357|ref|XP_002607633.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
gi|229292981|gb|EEN63643.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
Length = 520
Score = 123 bits (309), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 71/209 (33%), Positives = 107/209 (51%), Gaps = 22/209 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E + +P+ +V HN L+ E E + LA P +R++ V TG+ + + R S +L
Sbjct: 315 LEQVFDKPKLWVLHNILTDPEMEVIKKLAQPRLRRARVESPTTGEGELASYRISKSAWLY 374
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM--DEFNTK-NGGQ 193
++IR + +R+ D T +E E LQV++Y G YEPHFD +EF N G
Sbjct: 375 DWEHRVIRRVNQRVEDVTGLTMETAELLQVVNYGIGGHYEPHFDCATKDEEFALDPNEGD 434
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R+AT+L Y+SDVE GG TVFP + G + P+ G ++++
Sbjct: 435 RIATMLFYMSDVEAGGATVFP-------------------QVGARVVPEKGAGAFWYNLL 475
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D + H GCPV+ G+KW S KWI
Sbjct: 476 KSGEGDMLTEHAGCPVLVGSKWVSNKWIH 504
>gi|78046960|ref|YP_363135.1| hypothetical protein XCV1404 [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|78035390|emb|CAJ23035.1| conserved hypothetical protein [Xanthomonas campestris pv.
vesicatoria str. 85-10]
Length = 418
Score = 123 bits (309), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 74/212 (34%), Positives = 98/212 (46%), Gaps = 31/212 (14%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR Y LS +EC L+ LA PH+R S V+D + + + VRTS G L D II
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKVIDPNDASTGRAPVRTSHGATL----DPII 283
Query: 144 RDI-----EKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD---EFNTKNGGQRM 195
D + R+A PL + E L VL Y G++Y H DY + G R
Sbjct: 284 EDFAARAAQSRLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQ 343
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
TV +YL+DV GGET FP A G+ ++P+ G + F ++ D
Sbjct: 344 RTVCVYLNDVGAGGETEFPVA-------------------GVRVRPRPGTLVCFDNLHAD 384
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SLH G PV G+KW T W R Y+
Sbjct: 385 GRPDADSLHAGLPVTAGSKWLGTLWFRQQRYR 416
>gi|115495019|ref|NP_001069238.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
gi|122144801|sp|Q1RMU3.1|P4HA1_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; Flags: Precursor
gi|92097479|gb|AAI14709.1| Prolyl 4-hydroxylase, alpha polypeptide I [Bos taurus]
gi|296472132|tpg|DAA14247.1| TPA: prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
gi|440892721|gb|ELR45796.1| Prolyl 4-hydroxylase subunit alpha-1 [Bos grunniens mutus]
Length = 534
Score = 123 bits (309), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 69/204 (33%), Positives = 102/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEVVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVLAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|47550697|ref|NP_999856.1| prolyl 4-hydroxylase, alpha polypeptide I b precursor [Danio rerio]
gi|28277826|gb|AAH45890.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide I [Danio rerio]
Length = 536
Score = 123 bits (309), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 70/202 (34%), Positives = 100/202 (49%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR YH +S E E + +A P +R++T+ + TG + + R S +L+ I
Sbjct: 337 PRIVRYHEIISDSEIETVKEMAKPRLRRATISNPITGVLETAPYRISKSAWLSGYEHSTI 396
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATVL 199
I +RI D T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 397 ERINQRIEDVTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWL 456
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDV GG TVF + G ++ PK G A+ ++++ P D
Sbjct: 457 FYMSDVSAGGATVFTD-------------------VGAAVWPKKGTAVFWYNLFPSGEGD 497
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
S+ H CPV+ GNKW S KWI
Sbjct: 498 YSTRHAACPVLVGNKWVSNKWI 519
>gi|156370133|ref|XP_001628326.1| predicted protein [Nematostella vectensis]
gi|156215300|gb|EDO36263.1| predicted protein [Nematostella vectensis]
Length = 526
Score = 123 bits (309), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 75/209 (35%), Positives = 112/209 (53%), Gaps = 23/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E++S P+ ++HN LS+ E E ++ LA P +R++ V + +TG+ +D R S +L+
Sbjct: 319 MEIVSVNPQITLFHNVLSEMEIEQMLELARPRLRRARVNNLETGEIEDVDYRISQIAWLS 378
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTK----NGG 192
I+R I +R+ T GE LQV +Y G YEPHFD+ +D N+ G
Sbjct: 379 DSDGDIVRRINRRVGFITGLNTNTGECLQVNNYGVGGHYEPHFDHSLDMENSPIASLGQG 438
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT + YLS+VE GG TVF KTG+ P G A+ ++++
Sbjct: 439 NRIATFMFYLSEVEAGGSTVFI-------------------KTGVKTNPFKGGAVFWYNL 479
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
K D SLH GCPV+ GNKW + KW+
Sbjct: 480 KKSGEGDWDSLHAGCPVLIGNKWVANKWL 508
>gi|383642155|ref|ZP_09954561.1| hypothetical protein SeloA3_06917 [Sphingomonas elodea ATCC 31461]
Length = 327
Score = 123 bits (308), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 70/201 (34%), Positives = 104/201 (51%), Gaps = 25/201 (12%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDK- 141
+PR + FLS+EEC ++ A + S V+D ++G+ +RTS G + +
Sbjct: 139 DPRVEHFPGFLSREECAHVATTAQDLLEPSFVLDPNSGRPIPHPIRTSDGGAIGPTNENL 198
Query: 142 IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMY 201
++R I RIA T +E GE L VL Y GQ+Y H D N QR+AT ++Y
Sbjct: 199 VVRAINLRIAAATGTAVEQGESLTVLRYARGQEYRRHLDTIAGAEN-----QRIATFIVY 253
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
L+D EGGET FP + ++P++GDA+ F +++PD + DP
Sbjct: 254 LNDGFEGGETHFP-------------------LLNIQVRPRIGDAIRFDTIRPDGTPDPR 294
Query: 262 SLHGGCPVIKGNKWSSTKWIR 282
+H G PV G KW +T+WIR
Sbjct: 295 LVHAGQPVRNGVKWIATRWIR 315
>gi|332140647|ref|YP_004426385.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
'Deep ecotype']
gi|327550669|gb|AEA97387.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
'Deep ecotype']
Length = 376
Score = 123 bits (308), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 74/200 (37%), Positives = 103/200 (51%), Gaps = 25/200 (12%)
Query: 88 VYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR-DKIIRDI 146
VY + LS+ EC YLI ++ S VVD TG+ K VRTS + D I R +
Sbjct: 180 VYESILSEYECRYLITKFNALLKPSMVVDPVTGRGKIDSVRTSYVAVIEPAHCDWITRKL 239
Query: 147 EKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT----KNGGQRMATVLMYL 202
+K I+ T +NGE L +L Y GQ+Y+PH+D ++E N K+G QR+ T L+YL
Sbjct: 240 DKTISQITHTLRQNGEALNLLRYSPGQQYKPHYD-GLNEINDALMFKDGKQRIKTALVYL 298
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
+ + EGGET+FP K + I PK G ++F + + L +S
Sbjct: 299 NTISEGGETLFP-------------------KLDIRIAPKSGTMVVFSNSDENGKLLLNS 339
Query: 263 LHGGCPVIKGNKWSSTKWIR 282
H G P + NKW TKWIR
Sbjct: 340 YHAGAPTVSENKWLVTKWIR 359
>gi|426255744|ref|XP_004021508.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Ovis
aries]
Length = 534
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 69/204 (33%), Positives = 102/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVLAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|343172438|gb|AEL98923.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
gi|343172440|gb|AEL98924.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
[Silene latifolia]
Length = 120
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 60/120 (50%), Positives = 78/120 (65%), Gaps = 5/120 (4%)
Query: 166 VLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPW 225
VL YE GQKY H+D F QR+A+ L+YLSDVEEGGET+FP NI +
Sbjct: 4 VLRYEVGQKYNSHYDAFHPAEYGPQKSQRIASFLLYLSDVEEGGETMFPYENDNIDSN-- 61
Query: 226 WNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
+ +C GL +KP+ GD LLF+S+ + ++DP+S+HG CPVIKG KW +TKWIR E
Sbjct: 62 -YDYVQC--IGLKVKPRQGDGLLFYSLFSNGTIDPTSIHGSCPVIKGEKWVATKWIRNEE 118
>gi|321463241|gb|EFX74258.1| hypothetical protein DAPPUDRAFT_22132 [Daphnia pulex]
Length = 523
Score = 123 bits (308), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 75/218 (34%), Positives = 106/218 (48%), Gaps = 31/218 (14%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E S+EP + +H+ LS EE E + LA P + +S V + S VRTS +L
Sbjct: 312 IEQHSFEPAIYTFHDVLSDEEIETIKELAKPLLARSMVQGKLGVGHEVSNVRTSKTAWLP 371
Query: 137 RGRDKIIRDIEKRIADFTFF---PLEN-GEGLQVLHYEAGQKYEPHFDYFMDE------- 185
G ++ + +RI T P+ + E LQV +Y G Y PH DY M +
Sbjct: 372 EGLHPLLNRLSRRIGLITGLKTDPIRDEAELLQVANYGIGGHYSPHHDYLMKDKADFEYM 431
Query: 186 -FNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMG 244
G R+AT + YL+DVE GG T FP A G+++KP G
Sbjct: 432 HHRELQAGDRIATFMFYLNDVERGGSTAFPRA-------------------GVAVKPVKG 472
Query: 245 DALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
A ++++K DP +LHG CPV+ G+KW S KWIR
Sbjct: 473 GAAFWFNLKRSGKPDPLTLHGACPVLLGHKWVSNKWIR 510
>gi|195391754|ref|XP_002054525.1| GJ24502 [Drosophila virilis]
gi|194152611|gb|EDW68045.1| GJ24502 [Drosophila virilis]
Length = 487
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 69/203 (33%), Positives = 103/203 (50%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+P +YH+ + E E + +A P R++TV +S TG + + R S +L ++
Sbjct: 282 DPYIVIYHDAMYDSEIEIIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTAEHRV 341
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQRMATV 198
I + +R AD T +++ E LQV++Y G YEPHFD+ E F N G R+AT+
Sbjct: 342 IGTVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDFARREEKRAFEGLNLGNRIATM 401
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDVE+GG TVF ++ A W PK G A + ++
Sbjct: 402 LFYMSDVEQGGATVFT----SLHAALW---------------PKKGTAAFWMNLHRSGEG 442
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D + H CPV+ G+KW S KWI
Sbjct: 443 DVRTRHAACPVLTGSKWVSNKWI 465
>gi|356559784|ref|XP_003548177.1| PREDICTED: uncharacterized protein LOC100795761 [Glycine max]
Length = 264
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 77/202 (38%), Positives = 117/202 (57%), Gaps = 18/202 (8%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
ISW+PR F+Y FLS +EC+YL++LA KS+ + G S+ V TS
Sbjct: 20 ISWQPRVFLYKGFLSDKECDYLVSLAYAVKEKSS---GNGGLSEG--VETSLDM-----E 69
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
D I+ IE+R++ + F P E + LQV+HY Q + DYF ++ + G MAT++
Sbjct: 70 DDILARIEERLSVWAFLPKEYSKPLQVMHYGPEQNGR-NLDYFTNKTQLELSGPLMATII 128
Query: 200 MYLS-DVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
+YLS DV +GG+ +FP + VP + S C + ++P G+A+LF+S+ P AS
Sbjct: 129 LYLSNDVTQGGQILFPES------VPGSSSWSSCSNSSNILQPVKGNAILFFSLHPSASP 182
Query: 259 DPSSLHGGCPVIKGNKWSSTKW 280
D SS H CPV++G+ WS+ K+
Sbjct: 183 DKSSFHARCPVLEGDMWSAIKY 204
>gi|113682363|ref|NP_001038463.1| prolyl 4-hydroxylase, alpha polypeptide I a precursor [Danio rerio]
Length = 522
Score = 122 bits (307), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 73/224 (32%), Positives = 105/224 (46%), Gaps = 45/224 (20%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKST----------------------VVDSDTGK 121
PR YH ++++E E + L+ P +R++T V D TGK
Sbjct: 301 PRIIRYHEIITEQEIEKIKELSKPRLRRATISNPITGVLETAHYRISKRRATVHDPQTGK 360
Query: 122 SKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDY 181
++ R S +LA ++ I +RI D T ++ E LQV +Y G +YEPHFD+
Sbjct: 361 LTTAQYRVSKSAWLAAYEHPVVDRINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDF 420
Query: 182 FM----DEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGL 237
D F G R+AT L Y+SDV GG TVFP + G
Sbjct: 421 GRKDEPDAFKELGTGNRIATWLFYMSDVAAGGATVFP-------------------EVGA 461
Query: 238 SIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
++KP G A+ ++++ P D S+ H CPV+ GNKW S KWI
Sbjct: 462 AVKPLKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWI 505
>gi|291190128|ref|NP_001167431.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
gi|223649060|gb|ACN11288.1| Prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
Length = 538
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 72/202 (35%), Positives = 97/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y N LS E E + LA P + ++TV D TG + R S +L D +I
Sbjct: 339 PHIVRYLNALSDSEIEKIKELAKPRLARATVRDPKTGVLTTANYRVSKSAWLEGEEDPVI 398
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATVL 199
+ +RI D T + E LQ+ +Y G +YEPHFD+ D F T G R+AT L
Sbjct: 399 ERVNQRIEDITGLTTQTAELLQIANYGVGGQYEPHFDFSRKDEPDAFKTLGTGNRVATFL 458
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 459 NYMSDVEAGGATVFPD-------------------FGAAIYPKKGTAVFWYNLFRSGEGD 499
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KWI
Sbjct: 500 YRTRHAACPVLVGCKWVSNKWI 521
>gi|112984520|ref|NP_001037195.1| prolyl 4-hydroxylase alpha subunit precursor [Bombyx mori]
gi|37543673|gb|AAM21932.1| prolyl 4-hydroxylase alpha subunit [Bombyx mori]
Length = 550
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 69/207 (33%), Positives = 106/207 (51%), Gaps = 21/207 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE + +P F++H ++ +E E++ A P +++ V D TG+ + R S ++L
Sbjct: 330 VEQMYVKPDIFMFHEVMTDDEIEFIKKRAKPRFKRAVVHDPKTGELTPAHYRISKSSWLR 389
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFN--TKNGGQR 194
+I I +R+ D T + + E LQV++Y G YEPHFD+ N TK GG R
Sbjct: 390 DEESPVIARITQRVTDMTGLSMLHAEELQVVNYGIGGHYEPHFDFARKRENPFTKFGGNR 449
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+ATVL Y+SDV +GG TVF + GLS+ P A + ++
Sbjct: 450 IATVLFYMSDVAQGGATVF-------------------TELGLSLFPIKRAAAFWLNLHA 490
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWI 281
D ++ H CPV++G+KW S KWI
Sbjct: 491 SGEGDLATRHAACPVLRGSKWVSNKWI 517
>gi|321474877|gb|EFX85841.1| hypothetical protein DAPPUDRAFT_208740 [Daphnia pulex]
Length = 545
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 66/204 (32%), Positives = 102/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+P +YHN ++ +E E + +A P +++TV +S TG + + R S +L
Sbjct: 343 KPLIVIYHNVINDDEIETVKKMAQPRFKRATVQNSVTGNLEPANYRISKSAWLKSEEHDH 402
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQRMATV 198
+ + +R+ D T + E LQV++Y G YEPHFDY E F G R+AT
Sbjct: 403 VFKVTRRVGDVTGLDMATAEDLQVVNYGIGGHYEPHFDYARKEEVNAFKDLGWGNRVATW 462
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+S+VE GG TVFP K L++ P+ G A ++++ P+
Sbjct: 463 LFYMSEVEAGGATVFP-------------------KLNLALWPQKGSAAFWYNLHPNGEG 503
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
+ + H CPV+ G+KW S KWI
Sbjct: 504 NELTRHAACPVLTGSKWVSNKWIH 527
>gi|347972274|ref|XP_001237637.3| AGAP004611-PA [Anopheles gambiae str. PEST]
gi|333469330|gb|EAU76664.3| AGAP004611-PA [Anopheles gambiae str. PEST]
Length = 514
Score = 122 bits (306), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 67/211 (31%), Positives = 110/211 (52%), Gaps = 26/211 (12%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
++ ++ +P +YH+ +S +E + +I+++ P M +S V D + S+ RTSS +L
Sbjct: 309 LQEVNHDPMIVMYHDVISNKEIDAIISISKPLMHRSMV--GDDHEKAVSKTRTSSNAWLD 366
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE-----FNTKNG 191
++R + +R D T + E LQV +Y G Y PH+DY + E + +
Sbjct: 367 DVMHPVVRTLSQRTEDMTNLAMTAAERLQVGNYGIGGHYLPHYDYAVAEEGKEVYPSIGK 426
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
G R+ATV+ YLSDV GG TVFP + GL + P+ G A+ +++
Sbjct: 427 GNRIATVMYYLSDVAIGGATVFP-------------------QLGLGVFPQKGSAIFWYN 467
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ + ++D +LHG CPV G+KW KWI
Sbjct: 468 LHANGTVDHRTLHGACPVFVGSKWVGNKWIH 498
>gi|148233143|ref|NP_001090904.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Sus scrofa]
gi|83778522|gb|ABC47142.1| procollagen-proline 2-oxoglutarate-4-dioxygenase [Sus scrofa]
Length = 534
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 67/204 (32%), Positives = 102/204 (50%), Gaps = 23/204 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E + + +LA P +R++T+ + TG + R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIDIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ + RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRLNMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CPV+ GNKW S KW+
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518
>gi|407699315|ref|YP_006824102.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii str.
'Black Sea 11']
gi|407248462|gb|AFT77647.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
'Black Sea 11']
Length = 354
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 83/262 (31%), Positives = 129/262 (49%), Gaps = 26/262 (9%)
Query: 27 FTFAI--LILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEP 84
F++ + L+ LA I + + +++ +D+ + R ++E G+ EV+
Sbjct: 96 FSYIVKRLMSLAPNICNQLNLVPQAQRQSDVK-VYRLAVEEISSALGKIYAPTEVLDQTL 154
Query: 85 RAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR-DKII 143
+Y + LS+ EC YLI + ++ S VVD TG K VRTS +A D I
Sbjct: 155 PVELYVDVLSEYECAYLITKFSSLLQPSMVVDPLTGNGKVDNVRTSYVAIIAPSYCDWIT 214
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFN---TKNGGQRMATVLM 200
R ++K I+ T P NGE L +L Y GQ+Y+PH+D ++ + K+G QR+ T L+
Sbjct: 215 RKLDKVISQVTHTPRCNGEALNLLRYTPGQQYKPHYDALNEDHDGSMYKDGKQRIKTALV 274
Query: 201 YLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDP 260
YL+ V +GGET FP K +S+ P +G+ ++F + L
Sbjct: 275 YLNTVRQGGETRFP-------------------KLDISVSPTLGNMVVFSNSDESGKLLL 315
Query: 261 SSLHGGCPVIKGNKWSSTKWIR 282
+S H G P NKW TKWIR
Sbjct: 316 NSYHLGAPTFSENKWLVTKWIR 337
>gi|410860761|ref|YP_006975995.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii AltDE1]
gi|410818023|gb|AFV84640.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii AltDE1]
Length = 376
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 74/200 (37%), Positives = 104/200 (52%), Gaps = 25/200 (12%)
Query: 88 VYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR-DKIIRDI 146
VY + LS+ EC YLI + ++ S VVD TG+ K VRTS + D I R +
Sbjct: 180 VYESILSEYECRYLIAKFSALLKPSMVVDPVTGRGKIDSVRTSYVAVIEPTHCDWITRKL 239
Query: 147 EKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT----KNGGQRMATVLMYL 202
+K I+ T +NGE L +L Y GQ+Y+PH+D ++E N K+G QR+ T L+YL
Sbjct: 240 DKIISQITHTLRQNGEALNLLRYSPGQQYKPHYD-GLNEINDALMFKDGKQRIKTALVYL 298
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
+ + EGGET+FP K + I PK G ++F + + L +S
Sbjct: 299 NTINEGGETLFP-------------------KLDIRIAPKSGTMVVFSNSDENGKLLLNS 339
Query: 263 LHGGCPVIKGNKWSSTKWIR 282
H G P + NKW TKWIR
Sbjct: 340 YHAGAPTVSENKWLVTKWIR 359
>gi|344175386|emb|CCA88057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
Length = 331
Score = 122 bits (305), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 72/229 (31%), Positives = 114/229 (49%), Gaps = 26/229 (11%)
Query: 61 KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
+++E++ G + V+ +S PRA + + LS +EC+ LI A + S V++ ++G
Sbjct: 109 RTLEQNRLCVGDRQVSVQFVSHHPRAALISDLLSTQECDALIEQARSRLTTSYVIEYESG 168
Query: 121 KS--KDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPH 178
+ ++ S +F + + I +R A P + EG+ Y G+++ PH
Sbjct: 169 QEVVNEATRSCSCASFPPEEMSMLQKRIVERAARLVGQPGAHCEGVTFARYLPGEQFRPH 228
Query: 179 FDYFMDEF--NTK---NGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECG 233
DYF N K + G R+ATVL+YL++VE GG T FPN
Sbjct: 229 VDYFRGAVLNNDKIMGSSGHRIATVLLYLNEVEAGGATFFPN------------------ 270
Query: 234 KTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
G ++P+ G AL F + D S+DP+SLH GC V +G KW +T W R
Sbjct: 271 -PGFEVRPQKGGALYFAYQQADGSMDPTSLHEGCAVTQGEKWIATLWFR 318
>gi|281206564|gb|EFA80750.1| putative prolyl 4-hydroxylase alpha subunit [Polysphondylium
pallidum PN500]
Length = 251
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 74/211 (35%), Positives = 108/211 (51%), Gaps = 38/211 (18%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
+S +PR + FL+ EECE+LI + ++ + S R+ G F+ G
Sbjct: 60 VSQKPRIYRIPKFLTDEECEHLIETSKNKLKPCNEISSGVH-------RSGWGLFMKEGE 112
Query: 140 DK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG------ 191
+ + ++I R+ F E+ E +QV+ Y G++ HFDYF T NG
Sbjct: 113 EDHPVTQNIFNRMKTFVNL-TESSEVMQVIRYNPGEETSAHFDYFNPL--TTNGAMKIGL 169
Query: 192 -GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
GQR+ T+LMYL+DVEEGGET FP + + +KP GDA+LF+
Sbjct: 170 YGQRICTILMYLADVEEGGETSFP-------------------EVNVKVKPIKGDAVLFY 210
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ KP+ +DP SLH G PVIKG KW + K +
Sbjct: 211 NCKPNGEVDPLSLHQGDPVIKGTKWIAIKLV 241
>gi|432904500|ref|XP_004077362.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
latipes]
Length = 555
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 68/202 (33%), Positives = 101/202 (50%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y + +S+ E + + LA P +R++T+ + TG + + R S +L D ++
Sbjct: 351 PYIVRYIDIISEAEMDKIKQLAKPRLRRATISNPVTGVLETAPYRISKSAWLTAYEDPVV 410
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATVL 199
I +RI D T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 411 EKINQRIEDLTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWL 470
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDV GG TVFP+ G S+ P+ G A+ ++++ D
Sbjct: 471 FYMSDVSAGGATVFPDV-------------------GASVGPQKGTAVFWYNLFASGEGD 511
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
S+ H CPV+ GNKW S KWI
Sbjct: 512 YSTRHAACPVLVGNKWVSNKWI 533
>gi|170591592|ref|XP_001900554.1| prolyl 4-hydroxylase [Brugia malayi]
gi|16415740|emb|CAC82616.1| prolyl 4-hydroxylase [Brugia malayi]
gi|21425621|emb|CAD19314.1| prolyl 4-hydroxylase [Brugia malayi]
gi|158592166|gb|EDP30768.1| prolyl 4-hydroxylase, putative [Brugia malayi]
Length = 541
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 68/209 (32%), Positives = 105/209 (50%), Gaps = 23/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE++ + P A ++ + ++ EE + LATP +R++TV +S TG+ + + RTS +L
Sbjct: 325 VEILRFNPLAVLFRDVITDEEVTMIQMLATPRLRRATVQNSITGELETASYRTSKSAWLK 384
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 192
+++ I KRI T E E LQV +Y G Y+PHFD+ E F + N G
Sbjct: 385 DEEHEVVHRINKRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAFQSLNTG 444
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT+L Y++ E GG TVF + ++ P DAL ++++
Sbjct: 445 NRLATLLFYMTQPESGGATVFTEVKT-------------------TVMPSKNDALFWYNL 485
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
D + H CPV+ G KW S KWI
Sbjct: 486 LRSGEGDLRTRHAACPVLTGTKWVSNKWI 514
>gi|239915958|ref|NP_001070123.2| prolyl 4-hydroxylase alpha II-like precursor [Danio rerio]
Length = 490
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 73/200 (36%), Positives = 103/200 (51%), Gaps = 32/200 (16%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
EP+ YH+ +S E E L ++A P + +S TG S +RTS FL
Sbjct: 307 EPKIIRYHDVISDTEIETLKDIARPELTRS-----QTGWGVISDIRTSQSVFLEEV--GT 359
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
+ I +RIAD T +E+ E L V +Y G +Y PHFD DE N +R AT L+Y+
Sbjct: 360 VARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDT-GDEVN-----ERTATFLIYM 413
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
SDVE GG TVF N G+++KP+ G A+ ++++ + LD +
Sbjct: 414 SDVEVGGATVFTNV-------------------GVAVKPEKGSAVFWYNLHKNGELDLKT 454
Query: 263 LHGGCPVIKGNKWSSTKWIR 282
H GCPV+ GNKW + KWI
Sbjct: 455 KHAGCPVLVGNKWVANKWIH 474
>gi|301115862|ref|XP_002905660.1| conserved hypothetical protein [Phytophthora infestans T30-4]
gi|262110449|gb|EEY68501.1| conserved hypothetical protein [Phytophthora infestans T30-4]
Length = 215
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 74/212 (34%), Positives = 109/212 (51%), Gaps = 22/212 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRV-RTSSGTFLARGRDKI 142
P F FL +E + ++ L+ PH+ S V D +++ + RTS+ +L +
Sbjct: 3 PLVFSVEEFLRDDEIDVILELSMPHLAPSGVTLQDGHENRPATDWRTSTTYWLDSSSHPV 62
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE-------------FNTK 189
++ I+KR AD P+ + E +QVL YE Q Y+ H DYF E + K
Sbjct: 63 VQTIDKRTADLVKVPISHQESVQVLRYEPTQHYDQHLDYFSAERHRNSPDVLKRIEYGYK 122
Query: 190 NGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLF 249
N RM TV Y+SDV +GG T F + G +P + +C + G+S+ PK ++F
Sbjct: 123 N---RMITVFWYMSDVAKGGHTNFARSGG----LPRPSSNKDCSQ-GISVAPKKRKVVVF 174
Query: 250 WSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
+SM P+ DP SLH GCPV +G K S KWI
Sbjct: 175 YSMLPNGEGDPMSLHAGCPVEEGIKLSGNKWI 206
>gi|92096574|gb|AAI15350.1| LOC557059 protein [Danio rerio]
Length = 508
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 73/200 (36%), Positives = 103/200 (51%), Gaps = 32/200 (16%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
EP+ YH+ +S E E L ++A P + +S TG S +RTS FL
Sbjct: 325 EPKIIRYHDVISDTEIETLKDIARPELTRS-----QTGWGVISDIRTSQSVFLEEV--GT 377
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
+ I +RIAD T +E+ E L V +Y G +Y PHFD DE N +R AT L+Y+
Sbjct: 378 VARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDT-GDEVN-----ERTATFLIYM 431
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
SDVE GG TVF N G+++KP+ G A+ ++++ + LD +
Sbjct: 432 SDVEVGGATVFTNV-------------------GVAVKPEKGSAVFWYNLHKNGELDLKT 472
Query: 263 LHGGCPVIKGNKWSSTKWIR 282
H GCPV+ GNKW + KWI
Sbjct: 473 KHAGCPVLVGNKWVANKWIH 492
>gi|330799463|ref|XP_003287764.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
gi|325082219|gb|EGC35708.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
Length = 220
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 74/224 (33%), Positives = 117/224 (52%), Gaps = 40/224 (17%)
Query: 69 DEGRAEQWVEVI--SWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR 126
D + E+ +++I S +PR + FL++EEC +LI+ + +R + S
Sbjct: 15 DSCKVEKPIKLIELSQKPRVYRIPEFLTEEECNHLIDTSKNKLRPCNEISSGVH------ 68
Query: 127 VRTSSGTFLARGRDK--IIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD 184
R+ G F+ G ++ + ++I ++ +F ++ E +Q++ Y G++ H+DYF
Sbjct: 69 -RSGWGLFMKEGEEEHPVTKNIFNKMKNFVNIS-DSCEVMQIIRYNPGEETSAHYDYFNP 126
Query: 185 EFNTKNG-------GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGL 237
T NG GQR+ T+LMYL DVEEGGET FP + G+
Sbjct: 127 L--TTNGSMKIGLYGQRICTILMYLCDVEEGGETSFP-------------------EVGI 165
Query: 238 SIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
+KP GDA+LF++ KP+ +DP SLH G PV KG KW + K I
Sbjct: 166 KVKPIRGDAVLFYNCKPNGDVDPLSLHQGDPVTKGTKWVAIKLI 209
>gi|348688210|gb|EGZ28024.1| hypothetical protein PHYSODRAFT_321730 [Phytophthora sojae]
Length = 487
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 80/222 (36%), Positives = 113/222 (50%), Gaps = 16/222 (7%)
Query: 71 GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRV-RT 129
GR + +E IS P F FL +E + ++ L+ PH+ S V D +++ + RT
Sbjct: 262 GRGDLVMETISMTPLVFSVEEFLRDDEIDVVLELSMPHLAPSGVTLQDGHENRPATDWRT 321
Query: 130 SSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF------- 182
S+ +L +++DI+KR AD P+ + E +QVL YE Q Y+ H DYF
Sbjct: 322 STTYWLESSSHPVVQDIDKRTADLVKVPISHQESVQVLRYEHTQHYDQHLDYFSVKRHRN 381
Query: 183 -MDEFNTKNGG--QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSI 239
D G RM TV Y+SDV +GG T F A G +P C + GLS+
Sbjct: 382 SADVLKKIEHGYKNRMITVFWYMSDVAKGGHTNFARAGG----LPPPPTNKGCTQ-GLSV 436
Query: 240 KPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
PK ++F+SM P+ DP SLH GCPV +G K S KW+
Sbjct: 437 VPKKRKVVVFYSMLPNGEGDPMSLHAGCPVEEGIKMSGNKWV 478
>gi|402593814|gb|EJW87741.1| hypothetical protein WUBG_01349 [Wuchereria bancrofti]
Length = 541
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 68/209 (32%), Positives = 105/209 (50%), Gaps = 23/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE++ + P A ++ + ++ EE + LATP +R++TV +S TG+ + + RTS +L
Sbjct: 325 VEILRFNPLAVLFRDVITDEEITMIQMLATPRLRRATVQNSITGELETASYRTSKSAWLK 384
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 192
+++ I KRI T E E LQV +Y G Y+PHFD+ E F + N G
Sbjct: 385 DEEHEVVHRINKRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAFQSLNTG 444
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT+L Y++ E GG TVF + ++ P DAL ++++
Sbjct: 445 NRLATLLFYMTQPESGGATVFTEVKT-------------------TVMPSKNDALFWYNL 485
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
D + H CPV+ G KW S KWI
Sbjct: 486 LRSGEGDLRTRHAACPVLTGTKWVSNKWI 514
>gi|196011900|ref|XP_002115813.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
gi|190581589|gb|EDV21665.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
Length = 581
Score = 121 bits (304), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 69/214 (32%), Positives = 107/214 (50%), Gaps = 28/214 (13%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS-DTGKSKDSRVRTSSGTFL 135
VEV+S +P +YHN L+ E L LA+P ++++ VV D +++ R S +L
Sbjct: 345 VEVLSLQPYIVIYHNLLTNSEVVLLKTLASPLLKRAVVVGKPDKEYGEETTYRISKTAWL 404
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-------MDEFNT 188
+ ++ I I D E E LQ+ +Y G YEPH D+ + E+ T
Sbjct: 405 DKEDHPAVKRITTLIGDIIGLTSETAEPLQIANYGIGGHYEPHLDFIESEDKEALSEY-T 463
Query: 189 KNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALL 248
G R+ATVL+YLS+VE GG TVFP K G+ ++P+ G A
Sbjct: 464 SRIGNRIATVLIYLSNVEAGGATVFP-------------------KAGVRVEPRQGSAAF 504
Query: 249 FWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+++M + + S+H CPV+ G+KW++ W R
Sbjct: 505 WYNMHRNGEGNKLSVHAACPVLIGSKWAANLWFR 538
>gi|443709454|gb|ELU04126.1| hypothetical protein CAPTEDRAFT_167710 [Capitella teleta]
Length = 535
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 69/208 (33%), Positives = 108/208 (51%), Gaps = 22/208 (10%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
E ++++P VYH +S ++ + + LATP + ++TVV+S TG+ + ++ R S +L
Sbjct: 331 ETMNFDPWIAVYHQLMSDKDIDDIKALATPRLARATVVNSVTGELEFAKYRISKSGWLKD 390
Query: 138 GRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNTKNG--GQR 194
+ I R + T L E LQ+ +Y G YEPHFDY + E + + G R
Sbjct: 391 EEHPTVAKISNRCSALTNLSLSTVEELQIANYGIGGHYEPHFDYSRLAEVTSFDHWRGNR 450
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+ TV+ YLSDVE GG TVF A G ++P+ G A +++++ P
Sbjct: 451 ILTVIFYLSDVEAGGGTVFMTA-------------------GTKLRPEKGAAAVWYNLHP 491
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D + D + H CPV+ GNKW + KW
Sbjct: 492 DGTGDDETKHAACPVLTGNKWVANKWFH 519
>gi|194905436|ref|XP_001981196.1| GG11753 [Drosophila erecta]
gi|190655834|gb|EDV53066.1| GG11753 [Drosophila erecta]
Length = 550
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 68/209 (32%), Positives = 104/209 (49%), Gaps = 23/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E +P ++H+ + E + + +A P R++TV +S TG + + R S +L
Sbjct: 339 LEEAHADPYIVIFHDAMYDGEIDLIKRMARPRFRRATVQNSVTGALETANYRISKSAWLK 398
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 192
++I + +R AD T +++ E LQV++Y G YEPHFD+ E F N G
Sbjct: 399 TPEHRVIETVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDFARKEEQRAFEGLNLG 458
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+ATVL Y+SDVE+GG TVF + ++ PK G A + ++
Sbjct: 459 NRIATVLFYMSDVEQGGATVFTSLH-------------------TALFPKKGTAAFWMNL 499
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
D D + H CPV+ G KW S KWI
Sbjct: 500 HRDGQGDVRTRHAACPVLTGTKWVSNKWI 528
>gi|198449502|ref|XP_001357605.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
gi|198130635|gb|EAL26739.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
Length = 510
Score = 121 bits (303), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 68/206 (33%), Positives = 106/206 (51%), Gaps = 23/206 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E++ P VYH+ LS E ++ +A M +++ V S S RT+ G +L
Sbjct: 312 MELLGEHPYVVVYHDVLSDSEIAEILEMAERRMARTSTVAQPNRTS--SPTRTAMGAWLK 369
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
R + + R I +R+ D + LE E +QV++Y G Y PH D+F + + G R+A
Sbjct: 370 RSSNALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKDWFTQ--HPEVMGNRLA 427
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVL YL+DVE+GG T+F A+ + P+ G AL ++++ D
Sbjct: 428 TVLFYLTDVEQGGATMFNKAEHKVL-------------------PRRGTALFWYNLHTDG 468
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CP+I G+KW T+WIR
Sbjct: 469 EGDWSTTHAACPIIVGSKWVLTQWIR 494
>gi|195159144|ref|XP_002020442.1| GL13995 [Drosophila persimilis]
gi|194117211|gb|EDW39254.1| GL13995 [Drosophila persimilis]
Length = 535
Score = 120 bits (302), Expect = 5e-25, Method: Compositional matrix adjust.
Identities = 68/206 (33%), Positives = 106/206 (51%), Gaps = 23/206 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E++ P VYH+ LS E ++ +A M +++ V S S RT+ G +L
Sbjct: 337 MELLGEHPYVVVYHDVLSDSEIAEILEMAERRMARTSTVAQPNRTS--SPTRTALGAWLK 394
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
R + + R I +R+ D + LE E +QV++Y G Y PH D+F + + G R+A
Sbjct: 395 RSSNALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKDWFTQ--HPEVMGNRLA 452
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVL YL+DVE+GG T+F A+ + P+ G AL ++++ D
Sbjct: 453 TVLFYLTDVEQGGATMFNKAEHKVL-------------------PRRGTALFWYNLHTDG 493
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIR 282
D S+ H CP+I G+KW T+WIR
Sbjct: 494 EGDWSTTHAACPIIVGSKWVLTQWIR 519
>gi|395509387|ref|XP_003758979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Sarcophilus harrisii]
Length = 534
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 69/200 (34%), Positives = 102/200 (51%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ LS EE E + LA P + ++TV D TG + R S ++L G D +I
Sbjct: 337 PHIVRYYDVLSDEEIERIKELAKPKLARATVRDPKTGVLTVANYRVSKSSWLEEGDDPVI 396
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 397 AQLNRRMHYITGLSVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 456
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G ++ ++++ D
Sbjct: 457 MSDVEAGGATVFPDF-------------------GATIWPKKGTSVFWYNLFRSGEGDYR 497
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G+KW S KW
Sbjct: 498 TRHAACPVLVGSKWVSNKWF 517
>gi|195444366|ref|XP_002069834.1| GK11733 [Drosophila willistoni]
gi|194165919|gb|EDW80820.1| GK11733 [Drosophila willistoni]
Length = 517
Score = 120 bits (302), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 75/207 (36%), Positives = 110/207 (53%), Gaps = 24/207 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE+++ P Y++ L+ E E L +++P +R+S + + + RTS+ F+
Sbjct: 317 VELLNRSPYVAAYYDVLNDSEIEELKLMSSPQIRRSLLYNHTLDIDQADVDRTSNSVFME 376
Query: 137 RGRDKIIRDIEKRIADFT--FFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQR 194
++ I +R AD T + + E LQV++Y G +Y PH DYF DE N +NG R
Sbjct: 377 ETGITLLETISQRAADMTDLYVTAISSEDLQVINYGLGGQYTPHCDYF-DE-NAENGD-R 433
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+ATVL YL+DV++GG TVFP + LS PK G AL+F ++
Sbjct: 434 LATVLFYLTDVQQGGATVFPFLR-------------------LSYFPKKGSALIFRNLDN 474
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWI 281
S D S H CPV+ GNKW +TKWI
Sbjct: 475 AMSGDKDSTHSACPVLFGNKWVATKWI 501
>gi|195110931|ref|XP_002000033.1| GI24862 [Drosophila mojavensis]
gi|193916627|gb|EDW15494.1| GI24862 [Drosophila mojavensis]
Length = 549
Score = 120 bits (301), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 71/212 (33%), Positives = 108/212 (50%), Gaps = 27/212 (12%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE +S +P ++H+ + + E + L+ LA + ++TV ++ S S RTS TFL
Sbjct: 327 VEELSHDPLLVLFHDVIYQSEIDTLMRLAKNKIHRATVTGHNS--SVVSNARTSQFTFLP 384
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD-EFNTKN----- 190
+ R K++R I++R+AD T LE E Q+ +Y G Y H D+F F TK
Sbjct: 385 KTRHKVLRTIDQRVADMTDLHLEYAEDHQLANYGIGGHYAQHMDWFYPITFETKQVSNPE 444
Query: 191 GGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
G R+ TVL YLSDVE+GG T FP + ++PK A ++
Sbjct: 445 MGNRIGTVLFYLSDVEQGGATAFPALKQ-------------------LLRPKKHAAAFWY 485
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
++ D ++HG CP+I G+KW +WIR
Sbjct: 486 NLHASGVGDARTMHGACPIIVGSKWVLNRWIR 517
>gi|403183473|gb|EJY58123.1| AAEL017524-PA, partial [Aedes aegypti]
Length = 212
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 113/211 (53%), Gaps = 26/211 (12%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E S +P +YHN +S +E E +I ++ P +++S V +S + + S RTS +LA
Sbjct: 6 LEEASLDPLIVIYHNAISDKEIEQIIQVSKPMLKRSMVGESFSKEV--SNERTSQNAWLA 63
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DEFNTKNG 191
++++ + R D T ++ E LQV +Y G Y PHFD+ + +
Sbjct: 64 DYDFELVKVLSLRTEDMTGLDRKSYESLQVNNYGIGGFYLPHFDWVRTNGTEEPYKDMGL 123
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
G R+AT++ YLSDVE+GG TVFP + G+ + PK G A+ +++
Sbjct: 124 GNRIATLMYYLSDVEQGGATVFP-------------------QIGVGVFPKKGSAIFWYN 164
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ PD + D +LHG CPV+ G+KW + KWI
Sbjct: 165 LLPDGTGDERTLHGACPVLLGSKWVANKWIH 195
>gi|324507368|gb|ADY43128.1| Prolyl 4-hydroxylase subunit alpha-2 [Ascaris suum]
Length = 534
Score = 120 bits (301), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 72/209 (34%), Positives = 104/209 (49%), Gaps = 23/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE++ + P ++ +S E E + LA P ++++TV ++ TG + + R S +L
Sbjct: 318 VEILRFSPLVVLFKQVISDYEIEVIEKLAIPKLKRATVQNARTGDLEYANYRISKSAWLK 377
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 192
I I KRI T E E LQ +Y G Y+PHFD+ E F T N G
Sbjct: 378 GTDHPAIDRINKRIDLMTNLNQETAEELQAQNYGIGGHYDPHFDFARKEDINAFKTLNTG 437
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT+L+Y+SDVE GG TVF N GN ++ P DAL ++++
Sbjct: 438 NRIATILIYMSDVESGGATVF-NHLGN------------------AVFPSKYDALFWYNL 478
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ D D + H CPV+ G KW S KWI
Sbjct: 479 RRDGEGDLRTRHAACPVLTGIKWVSNKWI 507
>gi|195055779|ref|XP_001994790.1| GH14110 [Drosophila grimshawi]
gi|193892553|gb|EDV91419.1| GH14110 [Drosophila grimshawi]
Length = 487
Score = 120 bits (300), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 69/203 (33%), Positives = 102/203 (50%), Gaps = 23/203 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+P +YH+ + E E L +A P R++TV +S TG + + R S +L +I
Sbjct: 282 DPYIVIYHDAMYDSEIEVLKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTPEHEI 341
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQRMATV 198
I + +R AD T +++ E LQV++Y G YEPHFD+ E F N G R+AT+
Sbjct: 342 IGTVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDFARREEKLAFEGLNLGNRIATM 401
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV++GG TVF ++ W PK G A + ++
Sbjct: 402 LFYMSDVQQGGATVFT----SLRTALW---------------PKKGTAAFWMNLHRSGEG 442
Query: 259 DPSSLHGGCPVIKGNKWSSTKWI 281
D + H CPV+ G+KW S KWI
Sbjct: 443 DARTRHAACPVLTGSKWVSNKWI 465
>gi|195055773|ref|XP_001994787.1| GH17427 [Drosophila grimshawi]
gi|193892550|gb|EDV91416.1| GH17427 [Drosophila grimshawi]
Length = 538
Score = 120 bits (300), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 66/207 (31%), Positives = 106/207 (51%), Gaps = 20/207 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E S +P YH+ LS ++ L +A PHM++STV +SK S R S +L
Sbjct: 325 LEEHSLDPLVVSYHDMLSPQQIIELRQMAVPHMKRSTVNPLPGRQSKKSAFRVSKNAWLE 384
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD-EFNTKNGGQRM 195
++ + + ++D T + E LQV +Y G YEPH+D+F+D + G R+
Sbjct: 385 YDTHPMMGRMLRDLSDATGLDMTYCEQLQVANYGVGGHYEPHWDFFVDSQHYPAEEGNRI 444
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
AT + YLSDVE+GG T FP +++P++G+ L ++++
Sbjct: 445 ATAIFYLSDVEQGGATAFPF-------------------LNFAVRPQLGNILFWYNLHRS 485
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+D + H GCPV+KG+KW + WI
Sbjct: 486 LDMDYRTKHAGCPVLKGSKWIANIWIH 512
>gi|312080225|ref|XP_003142509.1| prolyl 4-hydroxylase 2 [Loa loa]
Length = 541
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 68/209 (32%), Positives = 105/209 (50%), Gaps = 23/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE++ + P A + + ++ EE + LATP +R++TV +S TG+ + + RTS +L
Sbjct: 325 VEILRFSPLAVFFRDVITDEEVTIIQMLATPRLRRATVQNSITGELETASYRTSKSAWLK 384
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 192
+I+ I +RI T E E LQV +Y G Y+PHFD+ E F + N G
Sbjct: 385 DEEHEIVHRINRRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAFQSLNTG 444
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT+L Y++ E GG TVF + ++ P DAL ++++
Sbjct: 445 NRLATLLFYMTQPESGGATVFTEVKT-------------------TVMPSKNDALFWYNL 485
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
D + H CPV+ G+KW S KWI
Sbjct: 486 LRSGEGDLRTRHAACPVLIGSKWVSNKWI 514
>gi|195452734|ref|XP_002073476.1| GK13124 [Drosophila willistoni]
gi|194169561|gb|EDW84462.1| GK13124 [Drosophila willistoni]
Length = 536
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 66/207 (31%), Positives = 105/207 (50%), Gaps = 20/207 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E S +P YH+ LS + L +A PHMR+STV G++K S R S +LA
Sbjct: 323 MEEHSLDPFVVTYHDMLSPNKIAQLREMAVPHMRRSTVNPLPGGQNKKSSFRVSKNAWLA 382
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD-EFNTKNGGQRM 195
+ + + ++D T + E LQV +Y G YEPH+D+F + + G R+
Sbjct: 383 YETHPTMGKMLRDLSDTTGLDMTYCEQLQVANYGVGGHYEPHWDFFRNPDHYPAEEGNRI 442
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
AT + YLS+VE+GG T FP +++P++G+ L ++++
Sbjct: 443 ATAIYYLSEVEQGGATAFP-------------------FLNFAVRPQLGNVLFWYNLHRS 483
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ +D + H GCPV+KG+KW WI
Sbjct: 484 SDMDYRTKHAGCPVLKGSKWIGNVWIH 510
>gi|393909803|gb|EFO21561.2| prolyl 4-hydroxylase 2 [Loa loa]
Length = 542
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 68/209 (32%), Positives = 105/209 (50%), Gaps = 23/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE++ + P A + + ++ EE + LATP +R++TV +S TG+ + + RTS +L
Sbjct: 326 VEILRFSPLAVFFRDVITDEEVTIIQMLATPRLRRATVQNSITGELETASYRTSKSAWLK 385
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 192
+I+ I +RI T E E LQV +Y G Y+PHFD+ E F + N G
Sbjct: 386 DEEHEIVHRINRRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAFQSLNTG 445
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT+L Y++ E GG TVF + ++ P DAL ++++
Sbjct: 446 NRLATLLFYMTQPESGGATVFTEVKT-------------------TVMPSKNDALFWYNL 486
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
D + H CPV+ G+KW S KWI
Sbjct: 487 LRSGEGDLRTRHAACPVLIGSKWVSNKWI 515
>gi|410900628|ref|XP_003963798.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
rubripes]
Length = 548
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 68/202 (33%), Positives = 101/202 (50%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y + +S +E E + LA P +R++T+ + TG + + R S +L +I
Sbjct: 349 PYIVRYIDIISDKEIETVKKLAKPRLRRATISNPITGVLETASYRISKSAWLTGYEHPVI 408
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATVL 199
I +RI D T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 409 EIINQRIEDLTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWL 468
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDV GG TVFP+ G ++ P+ G A+ ++++ + D
Sbjct: 469 FYMSDVAAGGATVFPDV-------------------GAAVWPQKGTAVFWYNLFANGEGD 509
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
S+ H CPV+ GNKW S KWI
Sbjct: 510 YSTRHAACPVLVGNKWVSNKWI 531
>gi|326436053|gb|EGD81623.1| p4ha2 protein [Salpingoeca sp. ATCC 50818]
Length = 548
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 65/197 (32%), Positives = 103/197 (52%), Gaps = 23/197 (11%)
Query: 85 RAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIR 144
R V+ F S EEC +L + + ++ V G+ + R S+ +L D I++
Sbjct: 339 RLQVFRQFASPEECRHLQHAGKRRLERA--VAWTDGRFQPVEFRISTAAWLQPDHDAIVK 396
Query: 145 DIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSD 204
I RI D T +E E LQ+ +Y G YEPHFD+ T G+R+AT ++YL+
Sbjct: 397 RIHGRIEDATQVDIEYAEALQISNYGMGGFYEPHFDH--SSRGTNPDGERLATFMIYLNP 454
Query: 205 VEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLH 264
V++GG T FP + G +++P GDA+ +++++P DP +LH
Sbjct: 455 VKQGGFTAFP-------------------RLGAAVQPGYGDAVFWYNLQPSGVGDPLTLH 495
Query: 265 GGCPVIKGNKWSSTKWI 281
G CPV++G+KW + KWI
Sbjct: 496 GACPVLRGSKWVANKWI 512
>gi|170064953|ref|XP_001867740.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
gi|167882143|gb|EDS45526.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
Length = 509
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 69/207 (33%), Positives = 110/207 (53%), Gaps = 24/207 (11%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EV++ +P VYH+ S E LI LA + ++T+ D G+ + S RTS +L
Sbjct: 303 EVLNLDPFITVYHDVASDREISKLIELAKSRISRATI--RDDGEPQVSNARTSQNAWLDA 360
Query: 138 GRDKIIRDIEKRIADFTF-FPLENGEGLQVLHYEAGQKYEPHFDYFMDE--FNTKNGGQR 194
G D+++ +++R+ D T ++ E LQV +Y G Y H D+ M+ + G R
Sbjct: 361 GDDRVVTTLDRRVGDMTGGLRQQSYEMLQVNNYGVGGHYVAHHDWAMEAVPYAGLRVGNR 420
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+ATV+ YLSDVE GG TVFP + GL++ P+ G A+L++++
Sbjct: 421 IATVMFYLSDVEIGGATVFP-------------------QLGLAVFPRKGSAILWYNLYR 461
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ D +LH CPV+ G+KW + +WI
Sbjct: 462 NGKGDRRTLHAACPVLSGSKWVANQWI 488
>gi|170064951|ref|XP_001867739.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
gi|167882142|gb|EDS45525.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
Length = 516
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 70/208 (33%), Positives = 106/208 (50%), Gaps = 24/208 (11%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EV++ +P VYH+ S E +I L P + +S V D K + S+ RTS ++L
Sbjct: 314 EVVNLDPFVAVYHDAASDAEINKVIELGRPQINRSMV--GDAAKKEVSKSRTSQNSWLTD 371
Query: 138 GRDKIIRDIEKRIADFTFFPLENG-EGLQVLHYEAGQKYEPHFDYFMDE--FNTKNGGQR 194
++ + +R D E E LQV +Y G Y PH+D+ +E + N G R
Sbjct: 372 YDHPVVAALSRRTKDMALGLDETAYESLQVNNYGIGGHYLPHYDWSREENPYPELNTGNR 431
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+AT++ YLSDVEEGG TVFP+ G+ + PK G A+ +++++
Sbjct: 432 IATLMFYLSDVEEGGATVFPH-------------------LGVGVFPKKGTAIFWYNLRA 472
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D +LHG CPV+ G+KW + KWI
Sbjct: 473 SGKGDEKTLHGACPVLIGSKWVANKWIH 500
>gi|321474953|gb|EFX85917.1| hypothetical protein DAPPUDRAFT_309108 [Daphnia pulex]
Length = 549
Score = 119 bits (299), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 66/205 (32%), Positives = 102/205 (49%), Gaps = 24/205 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+P +YH+ + EE E + LA P +++TV++S TGK + ++ R S FL
Sbjct: 346 KPLLVIYHDVIFDEEIETVKKLAHPRFKRTTVMNSATGKLETAKYRISKAAFLKNKEHHH 405
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNG-GQRMAT 197
+ + +R+ T + E LQV +Y G YEPHFDY FN +G R+AT
Sbjct: 406 VLKMSRRVGAITGLDMSTAEDLQVCNYGIGGHYEPHFDYARKNETIGFNKDSGWRNRIAT 465
Query: 198 VLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS 257
L Y+SDVE GG TVFP +++ P+ G A ++++ P+
Sbjct: 466 WLFYMSDVEAGGATVFP-------------------ALNVALWPQKGSAAFWYNLFPNGE 506
Query: 258 LDPSSLHGGCPVIKGNKWSSTKWIR 282
+ + H CPV+ G+KW + KWI
Sbjct: 507 GNELTRHAACPVLTGSKWVANKWIH 531
>gi|326928728|ref|XP_003210527.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Meleagris
gallopavo]
Length = 535
Score = 119 bits (298), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 70/207 (33%), Positives = 105/207 (50%), Gaps = 23/207 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + LA P + ++TV D TG + R S ++L D ++
Sbjct: 338 PHIVRYYDVMSDEEIEKIKQLAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 397
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K+ G R+AT L Y
Sbjct: 398 AKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSTLKSEGNRLATFLNY 457
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 458 MSDVEAGGATVFPDF-------------------GAAIWPKKGTAVFWYNLFRSGEGDYR 498
Query: 262 SLHGGCPVIKGNKWSSTKWI--RVNEY 286
+ H CPV+ G KW S KW R NE+
Sbjct: 499 TRHAACPVLVGCKWVSNKWFHERGNEF 525
>gi|57525020|ref|NP_001006155.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Gallus gallus]
gi|82082587|sp|Q5ZLK5.1|P4HA2_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|53129464|emb|CAG31388.1| hypothetical protein RCJMB04_5l17 [Gallus gallus]
Length = 534
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 70/207 (33%), Positives = 105/207 (50%), Gaps = 23/207 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + LA P + ++TV D TG + R S ++L D ++
Sbjct: 337 PHIVRYYDVMSDEEIEKIKQLAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 396
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K+ G R+AT L Y
Sbjct: 397 AKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSTLKSEGNRLATFLNY 456
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 457 MSDVEAGGATVFPDF-------------------GAAIWPKKGTAVFWYNLFRSGEGDYR 497
Query: 262 SLHGGCPVIKGNKWSSTKWI--RVNEY 286
+ H CPV+ G KW S KW R NE+
Sbjct: 498 TRHAACPVLVGCKWVSNKWFHERGNEF 524
>gi|323454062|gb|EGB09933.1| hypothetical protein AURANDRAFT_14928, partial [Aureococcus
anophagefferens]
Length = 182
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 77/204 (37%), Positives = 105/204 (51%), Gaps = 30/204 (14%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P + NFL++EEC+ LI+ A HM + VV G+ SR TSS +LAR + +
Sbjct: 1 PPIYTVQNFLTEEECDALIDSAKDHMTPAPVVGPGNGEVSVSR--TSSTCYLAR---EDL 55
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFN-----TKNGGQRMATV 198
+ ++ T PLE+ E QV Y G+ Y+PH+D F +NGGQR+ATV
Sbjct: 56 PSVCTKVCALTGKPLEHLELPQVGRYRGGEFYKPHYDAFDTSSADGRRFAQNGGQRVATV 115
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L+YL+DVE GGET F K G+ IKP+ G+AL+F+ D L
Sbjct: 116 LVYLNDVERGGETSF-------------------SKLGVRIKPRKGNALIFFPATLDGVL 156
Query: 259 DPSSLHGGCPVIKGNKWSSTKWIR 282
D + LH P + KW S WIR
Sbjct: 157 DQNYLHAAEPAVD-PKWVSQIWIR 179
>gi|308476969|ref|XP_003100699.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
gi|308264511|gb|EFP08464.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
Length = 573
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 72/227 (31%), Positives = 109/227 (48%), Gaps = 41/227 (18%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE++ ++P A ++ N +S E + + LA+P ++++TV +S TG+ + + R S +L
Sbjct: 334 VEILRFDPLAVLFKNVISDSEIKVIKELASPKLKRATVQNSKTGELEHATYRISKSAWLK 393
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM------------- 183
+I + +RI DFT E LQV +Y G Y+PHFD+
Sbjct: 394 GDLHPVIERVNRRIEDFTGLYQGTSEELQVANYGLGGHYDPHFDFARIANYGLGGHYEPH 453
Query: 184 ---------DEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGK 234
+ F T N G R+ATVL Y+S E GG TVF N L
Sbjct: 454 YDMSLKEEKNAFKTLNTGNRIATVLFYMSQPERGGATVF-------------NHL----- 495
Query: 235 TGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
G ++ P DAL +++++ D D + H CPV+ G KW S KWI
Sbjct: 496 -GTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWI 541
>gi|428178571|gb|EKX47446.1| hypothetical protein GUITHDRAFT_152114 [Guillardia theta CCMP2712]
Length = 262
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 74/224 (33%), Positives = 115/224 (51%), Gaps = 31/224 (13%)
Query: 76 WVEVISWEPRAFVYHNFLSKEECEYLINLA-TPHMRKSTVVDSDTGKSKDSRVRTSSGTF 134
++E I+ PR F N L+K+ECE+L+ LA + K+ ++ T K +S RT+ G +
Sbjct: 56 YLEQINASPRVFRIRNLLTKQECEHLMLLAFRKGLSKTMIMPYGTHKLVESTTRTNDGAW 115
Query: 135 LARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAG-QKYEPHFDYFMDEFNT----K 189
L +D ++R +E+ + T + GE LQVLHY G Q ++ H+DYF + +
Sbjct: 116 LDFLQDDVVRRLEETLGKLTKTTPQQGENLQVLHYSNGAQFFQEHYDYFDPARDPPESFE 175
Query: 190 NGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLF 249
GG R TV++YL EGGET FP + GL + + GDAL+F
Sbjct: 176 QGGNRYITVIVYLEAALEGGETHFP-------------------ELGLKLTAQPGDALMF 216
Query: 250 WSMKPDAS-LDP-----SSLHGGCPVIKGNKWSSTKWIRVNEYK 287
+++K S DP ++H P ++G KW + KWI Y+
Sbjct: 217 YNLKEHCSGTDPDCVEKKTIHAALPPVRGEKWVAVKWIHEKPYQ 260
>gi|115313004|gb|AAI24075.1| Zgc:152670 [Danio rerio]
Length = 235
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 72/201 (35%), Positives = 103/201 (51%), Gaps = 36/201 (17%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
EP+ YH+ +S E E L ++A P + +S TG S +RTS FL D++
Sbjct: 52 EPKIIRYHDVISDTEIETLKDIARPELTRS-----QTGWGVISEIRTSQSVFL----DEV 102
Query: 143 --IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLM 200
+ I +RIAD T +E+ E L V +Y G +Y PHFD D +R AT L+
Sbjct: 103 GTVARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDAGGDV------NERTATFLI 156
Query: 201 YLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDP 260
Y+SDVE GG TVF N G+++KP+ G A+ + ++ + LD
Sbjct: 157 YMSDVEVGGATVFTNV-------------------GVAVKPEKGSAVFWNNLHKNGELDL 197
Query: 261 SSLHGGCPVIKGNKWSSTKWI 281
+ H GCPV+ GNKW + KWI
Sbjct: 198 KTKHAGCPVLVGNKWVANKWI 218
>gi|224068121|ref|XP_002191580.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Taeniopygia
guttata]
Length = 539
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 70/207 (33%), Positives = 105/207 (50%), Gaps = 23/207 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + LA P + ++TV D TG + R S ++L D ++
Sbjct: 342 PHIVRYYDVMSDEEIEKIKQLAKPRLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 401
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K+ G R+AT L Y
Sbjct: 402 AKVNQRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSTLKSEGNRLATFLNY 461
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 462 MSDVEAGGATVFPDF-------------------GAAIWPKKGTAVFWYNLFRSGEGDYR 502
Query: 262 SLHGGCPVIKGNKWSSTKWI--RVNEY 286
+ H CPV+ G KW S KW R NE+
Sbjct: 503 TRHAACPVLVGCKWVSNKWFHERGNEF 529
>gi|452752943|ref|ZP_21952682.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
proteobacterium JLT2015]
gi|451959765|gb|EMD82182.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
proteobacterium JLT2015]
Length = 314
Score = 119 bits (297), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 72/217 (33%), Positives = 101/217 (46%), Gaps = 26/217 (11%)
Query: 74 EQW-VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSG 132
E W E +S P + + S EC YL ++ P +R ST++D TG + VRTS G
Sbjct: 118 ENWRTEPVSETPSIRMVRHLFSSAECAYLQQMSAPRLRPSTILDPQTGARRPDPVRTSVG 177
Query: 133 TFLAR-GRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG 191
L+ D ++ + +RIA T GE L +L Y Q+Y PH D N
Sbjct: 178 AALSPVEEDLVVGMLNRRIAAATGTDRMQGEPLHILRYSGAQEYRPHHDAVAGLEN---- 233
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
QR T+++YL+ EGGET FP + G ++ + GDALLF +
Sbjct: 234 -QRSHTLIVYLTADYEGGETAFP-------------------ELGFRLRGRQGDALLFAN 273
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
++ D D H G P G KW +T+WIR Y V
Sbjct: 274 LREDGRPDLRMRHAGLPATSGAKWIATRWIRTRPYHV 310
>gi|395509389|ref|XP_003758980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Sarcophilus harrisii]
Length = 536
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 69/202 (34%), Positives = 100/202 (49%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ LS EE E + LA P + ++TV D TG + R S ++L G D +I
Sbjct: 337 PHIVRYYDVLSDEEIERIKELAKPKLARATVRDPKTGVLTVANYRVSKSSWLEEGDDPVI 396
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 397 AQLNRRMHYITGLSVKTAELLQVANYGMGGQYEPHFDFSRKGEQDAFKHLGTGNRVATFL 456
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G ++ ++++ D
Sbjct: 457 NYMSDVEAGGATVFPDF-------------------GATIWPKKGTSVFWYNLFRSGEGD 497
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G+KW S KW
Sbjct: 498 YRTRHAACPVLVGSKWVSNKWF 519
>gi|325920649|ref|ZP_08182559.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
[Xanthomonas gardneri ATCC 19865]
gi|325548839|gb|EGD19783.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
[Xanthomonas gardneri ATCC 19865]
Length = 422
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 70/204 (34%), Positives = 95/204 (46%), Gaps = 31/204 (15%)
Query: 92 FLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIA 151
LS +EC L+ LA PH+R S VVD + + + +RTS G L D I+ D R A
Sbjct: 240 VLSADECRLLMLLARPHLRASQVVDPNDASTHRTPIRTSRGATL----DPILEDFAARAA 295
Query: 152 DFTF-----FPLENGEGLQVLHYEAGQKYEPHFDYFMD---EFNTKNGGQRMATVLMYLS 203
PL + E L VL Y G+ Y H DY + G R+ T +YL+
Sbjct: 296 QARVAACAQLPLTHAEALSVLCYAPGEHYRAHRDYLPPGTIAADRPGAGNRLRTACVYLN 355
Query: 204 DVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSL 263
DV+ GGET FP A G+ ++P+ G + F +++ D DP SL
Sbjct: 356 DVDAGGETEFPVA-------------------GIRVQPRAGSVVCFDNLQADGCPDPDSL 396
Query: 264 HGGCPVIKGNKWSSTKWIRVNEYK 287
H G PV G+KW T W R Y+
Sbjct: 397 HAGLPVTTGSKWLGTLWFRQQRYR 420
>gi|395736139|ref|XP_003776705.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 575
Score = 118 bits (296), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 100/200 (50%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 378 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 437
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 438 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 497
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 498 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 538
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 539 TRHAACPVLVGCKWVSNKWF 558
>gi|356530852|ref|XP_003533993.1| PREDICTED: uncharacterized protein LOC100775928 [Glycine max]
Length = 302
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 77/209 (36%), Positives = 118/209 (56%), Gaps = 24/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V ISW+PR F+Y FLS +EC+YL++LA KS+ + G S+ TFL
Sbjct: 55 VVQISWQPRVFLYKGFLSDKECDYLVSLAYAVKEKSS---GNGGFSEGVE------TFLD 105
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPH---FDYFMDEFNTKNGGQ 193
D I+ IE+R++ + F P E + LQV+HY EP+ DYF ++ + G
Sbjct: 106 I-EDDILARIEERLSLWAFLPKEYSKPLQVMHYGP----EPNGRNLDYFTNKTQLELSGP 160
Query: 194 RMATVLMYLSDVE-EGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
MAT+++YLS+ +GG+ +FP + VP + S C + ++P G+A+LF+S+
Sbjct: 161 LMATIVLYLSNAATQGGQILFPES------VPRSSSWSSCSNSSNILQPVKGNAILFFSL 214
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
P AS D +S H CPV++GN WS+ K+
Sbjct: 215 HPSASPDKNSFHARCPVLEGNMWSAIKYF 243
>gi|332221662|ref|XP_003259982.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Nomascus
leucogenys]
Length = 556
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 100/200 (50%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 359 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 418
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 419 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 478
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 479 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 519
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 520 TRHAACPVLVGCKWVSNKWF 539
>gi|426349879|ref|XP_004042513.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Gorilla gorilla
gorilla]
Length = 565
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 100/200 (50%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 427
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 428 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 487
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 488 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 528
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 529 TRHAACPVLVGCKWVSNKWF 548
>gi|119582748|gb|EAW62344.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_c
[Homo sapiens]
Length = 565
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 100/200 (50%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 427
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 428 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 487
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 488 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 528
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 529 TRHAACPVLVGCKWVSNKWF 548
>gi|403255937|ref|XP_003920661.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Saimiri
boliviensis boliviensis]
gi|403255939|ref|XP_003920662.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Saimiri
boliviensis boliviensis]
gi|403255943|ref|XP_003920664.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Saimiri
boliviensis boliviensis]
Length = 533
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 100/200 (50%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 TRHAACPVLVGCKWVSNKWF 516
>gi|63252891|ref|NP_001017973.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|63252893|ref|NP_001017974.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|217272861|ref|NP_001136070.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
sapiens]
gi|18073925|emb|CAC85688.1| Prolyl 4-hydroxylase alpha IIa subunit [Homo sapiens]
gi|23274221|gb|AAH35813.1| Prolyl 4-hydroxylase, alpha polypeptide II [Homo sapiens]
gi|37183058|gb|AAQ89329.1| P4HA2 [Homo sapiens]
gi|119582745|gb|EAW62341.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_a
[Homo sapiens]
gi|119582750|gb|EAW62346.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_a
[Homo sapiens]
gi|123983232|gb|ABM83357.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II [synthetic
construct]
gi|157928048|gb|ABW03320.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II [synthetic
construct]
Length = 533
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 100/200 (50%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 TRHAACPVLVGCKWVSNKWF 516
>gi|194765178|ref|XP_001964704.1| GF23330 [Drosophila ananassae]
gi|190614976|gb|EDV30500.1| GF23330 [Drosophila ananassae]
Length = 537
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 67/207 (32%), Positives = 101/207 (48%), Gaps = 20/207 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E + +P YH+ LS ++ L +A P MR+STV G++K S R S +LA
Sbjct: 324 LEEHNLDPYVVTYHDMLSAQKIRDLRQMAVPRMRRSTVNPLPGGQNKKSAFRVSKNAWLA 383
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD-EFNTKNGGQRM 195
+ + + + D T E LQV +Y G YEPH+D+F D G R+
Sbjct: 384 YESHPTMEGMLRDLKDATGLDTTYCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRI 443
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
AT + YLSDVE+GG T FP ++KP++G+ L ++++
Sbjct: 444 ATAIFYLSDVEQGGATAFPFLD-------------------FAVKPQLGNVLFWYNLHRS 484
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+D + H GCPV+KG+KW WI
Sbjct: 485 LDMDYRTKHAGCPVLKGSKWIGNVWIH 511
>gi|297675927|ref|XP_002815905.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pongo
abelii]
gi|395736137|ref|XP_003776704.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 533
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 100/200 (50%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 TRHAACPVLVGCKWVSNKWF 516
>gi|332221656|ref|XP_003259979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Nomascus
leucogenys]
gi|332221658|ref|XP_003259980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Nomascus
leucogenys]
Length = 535
Score = 118 bits (296), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 100/200 (50%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 397
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 398 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 457
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 458 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 498
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 499 TRHAACPVLVGCKWVSNKWF 518
>gi|194765168|ref|XP_001964699.1| GF22909 [Drosophila ananassae]
gi|190614971|gb|EDV30495.1| GF22909 [Drosophila ananassae]
Length = 525
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 68/212 (32%), Positives = 107/212 (50%), Gaps = 27/212 (12%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
E +S +P +YH+ + + E + + L T ++++T+ + T +S S VRTS TFL
Sbjct: 295 AEELSRDPLLILYHDVIYQSEIDTIRKLTTNKLKRATI--TSTNESVVSNVRTSQFTFLP 352
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE------FNTKN 190
DK++ I++R+AD T F + E Q +Y G Y H D+F ++
Sbjct: 353 VTEDKVLATIDRRVADMTNFNMRYAEDHQFANYGIGGHYGQHMDWFYQPSFDAGLVSSPE 412
Query: 191 GGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
G R+ATVL YLSDV +GG T FP+ + + +KPK A ++
Sbjct: 413 MGNRIATVLFYLSDVTQGGGTAFPHLR-------------------VLLKPKKYAAAFWY 453
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
++ DP + HG CP+I G+KW +WIR
Sbjct: 454 NLHASGVGDPRTQHGACPIISGSKWVQNRWIR 485
>gi|386780652|ref|NP_001247763.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Macaca mulatta]
gi|383422579|gb|AFH34503.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
mulatta]
gi|384939466|gb|AFI33338.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
mulatta]
Length = 533
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 100/200 (50%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 TRHAACPVLVGCKWVSNKWF 516
>gi|344264849|ref|XP_003404502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Loxodonta africana]
Length = 534
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 100/200 (50%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 337 PHIVRYYDVMSDEEIERIKQIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 396
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 397 AQVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 456
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 457 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 497
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 498 TRHAACPVLVGCKWVSNKWF 517
>gi|291387300|ref|XP_002710241.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 1 [Oryctolagus
cuniculus]
Length = 533
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 67/200 (33%), Positives = 100/200 (50%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
I +R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 396 ARINRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 TRHAACPVLVGCKWVSNKWF 516
>gi|114601548|ref|XP_001162501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 9 [Pan
troglodytes]
gi|114601562|ref|XP_001162805.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 16 [Pan
troglodytes]
gi|114601564|ref|XP_517917.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 19 [Pan
troglodytes]
gi|397518354|ref|XP_003829356.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Pan
paniscus]
gi|397518356|ref|XP_003829357.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
paniscus]
gi|397518360|ref|XP_003829359.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Pan
paniscus]
gi|410215942|gb|JAA05190.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410255606|gb|JAA15770.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331277|gb|JAA34585.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331281|gb|JAA34587.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
Length = 533
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 100/200 (50%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 TRHAACPVLVGCKWVSNKWF 516
>gi|57997558|emb|CAI46066.1| hypothetical protein [Homo sapiens]
Length = 533
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 100/200 (50%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 TRHAACPVLVGCKWVSNKWF 516
>gi|443709455|gb|ELU04127.1| hypothetical protein CAPTEDRAFT_149240 [Capitella teleta]
Length = 532
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 69/206 (33%), Positives = 105/206 (50%), Gaps = 22/206 (10%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
E S EP VYH ++ E E + +ATP + ++TV +S TG+ + ++ R S +L
Sbjct: 330 ETASLEPWIAVYHQLMNDHEIERIKEMATPRLARATVHNSATGQLEHAKYRISKSGWLRD 389
Query: 138 GRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT---KNGGQR 194
D +I I +R + T L E LQV++Y G +YEPHFD+ T K G R
Sbjct: 390 EEDPLIARISERCSALTNLSLTTVEELQVVNYGIGGQYEPHFDFSRRSEPTAFEKWRGNR 449
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+ TV+ Y++DVE GG TVF +A G+ + P+ G A ++ ++ P
Sbjct: 450 ILTVIYYMTDVEAGGATVFLDA-------------------GVKVYPEKGSAAVWHNLLP 490
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKW 280
D + H CPV+ G+KW + KW
Sbjct: 491 SGEGDMRTRHAACPVLTGSKWVANKW 516
>gi|348518914|ref|XP_003446976.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Oreochromis
niloticus]
Length = 536
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 70/202 (34%), Positives = 98/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y + LS EE E + LA P + ++TV D TG + R S +L D +I
Sbjct: 337 PHIVRYLDLLSDEEIEKIKELAKPRLARATVRDPKTGVLTTANYRVSKSAWLEGEEDPVI 396
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATVL 199
+ +RI T +E E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 397 DRVNQRIEAITGLTVETAELLQVANYGVGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFL 456
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I P+ G ++ ++++ D
Sbjct: 457 NYMSDVEAGGATVFPD-------------------FGAAIWPRKGTSVFWYNLFRSGEGD 497
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G+KW S KWI
Sbjct: 498 YRTRHAACPVLVGSKWVSNKWI 519
>gi|195391760|ref|XP_002054528.1| GJ22757 [Drosophila virilis]
gi|194152614|gb|EDW68048.1| GJ22757 [Drosophila virilis]
Length = 534
Score = 118 bits (295), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 64/206 (31%), Positives = 104/206 (50%), Gaps = 20/206 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E S +P +H+ LS+ L +A PHM++STV G+ + S R S +L
Sbjct: 321 LEEHSLDPLVVTFHDMLSQHRIAELREMAVPHMQRSTVNPLPGGQRRKSAFRVSKNAWLP 380
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG-GQRM 195
+ + + ++D T + E LQV +Y G YEPH+D+F D + G R+
Sbjct: 381 YSTHPTMGRMLRDVSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDSRHYPAAEGNRI 440
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
AT + YLSDVE+GG T FP +++P++G+ L ++++
Sbjct: 441 ATAIFYLSDVEQGGATAFP-------------------FLNFAVRPQLGNILFWYNLHRS 481
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ +D + H GCPV+KG+KW + WI
Sbjct: 482 SDMDFRTKHAGCPVLKGSKWIANIWI 507
>gi|312383453|gb|EFR28539.1| hypothetical protein AND_03427 [Anopheles darlingi]
Length = 341
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 66/208 (31%), Positives = 109/208 (52%), Gaps = 10/208 (4%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+P +YH+ +S E E + + A P R++TV + TG+ + + R S +L ++
Sbjct: 113 QPDIVIYHDVMSDREIELIKHYARPRFRRATVQNYKTGELEFANYRISKSAWLKDTEHEV 172
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQRMATV 198
IR + +R+ D T + E LQV++Y G YEPHFD+ E F + G R+ATV
Sbjct: 173 IRTVNQRVEDMTGLTMATAEELQVVNYGIGGHYEPHFDFARREERNAFKSLGTGNRIATV 232
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKT-----GLSIKPKMGDALLFWSMK 253
L Y+SD+ T NA +V +++++ G T L+++P+ G A + ++
Sbjct: 233 LFYVSDL-CLCHTSHTNADFRFLSVGQMSDVTQGGATVFPSLNLALRPRKGTAAFWHNLH 291
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ D ++ H CPV+ G KW S KWI
Sbjct: 292 ASGNGDYATRHAACPVLTGTKWVSNKWI 319
>gi|327265288|ref|XP_003217440.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Anolis
carolinensis]
Length = 554
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 72/209 (34%), Positives = 102/209 (48%), Gaps = 25/209 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y+N LS EE E + LA P + ++TV D TG + R S ++L D ++
Sbjct: 355 PHIVRYYNVLSDEEIEKIKELAKPKLARATVRDPKTGVLTVANYRVSKSSWLEEEDDLVV 414
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 415 AKVNQRMEHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKEEPDAFKRLGTGNRVATFL 474
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 475 NYMSDVEAGGATVFPDF-------------------GAAIWPKKGTAVFWYNLFRSGEGD 515
Query: 260 PSSLHGGCPVIKGNKWSSTKWI--RVNEY 286
+ H CPV+ G KW S KW R NE+
Sbjct: 516 YRTRHAACPVLVGCKWVSNKWFHERGNEF 544
>gi|116283554|gb|AAH17062.1| P4HA2 protein [Homo sapiens]
Length = 504
Score = 118 bits (295), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 100/200 (50%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 307 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 366
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 367 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 426
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 427 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 467
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 468 TRHAACPVLVGCKWVSNKWF 487
>gi|260825355|ref|XP_002607632.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
gi|229292980|gb|EEN63642.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
Length = 519
Score = 117 bits (294), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 70/208 (33%), Positives = 104/208 (50%), Gaps = 22/208 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E + +P+ +V HN LS E E + LA P +R + + TG + S R S +L
Sbjct: 314 LEQVFDKPKLWVLHNILSDPEMEVIKKLAQPRLRPAATQNPTTGGAVLSSYRISKNAWLY 373
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM--DEFNTK-NGGQ 193
++I +++R+ D T +E E LQV++Y G YEPHFD +EF N G
Sbjct: 374 YWEHRLINRVKQRVEDATGLTMETAEPLQVINYGIGGHYEPHFDCATKDEEFALDPNEGD 433
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R+AT+L Y+SDVE GG TVFP + G + P+ G ++++
Sbjct: 434 RIATMLFYMSDVEAGGATVFP-------------------QVGARVVPEKGAGAFWYNLL 474
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWI 281
D + H GCPV+ G+KW S WI
Sbjct: 475 KSGEGDMLTEHAGCPVLVGSKWVSNMWI 502
>gi|410948132|ref|XP_003980795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Felis
catus]
gi|410948136|ref|XP_003980797.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Felis
catus]
Length = 533
Score = 117 bits (294), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 100/200 (50%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 TRHAACPVLVGCKWVSNKWF 516
>gi|119582749|gb|EAW62345.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_d
[Homo sapiens]
Length = 488
Score = 117 bits (294), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 100/200 (50%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 291 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 350
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 351 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 410
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 411 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 451
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 452 TRHAACPVLVGCKWVSNKWF 471
>gi|209862961|ref|NP_001129548.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Mus
musculus]
gi|17390970|gb|AAH18411.1| P4ha2 protein [Mus musculus]
gi|18073922|emb|CAC85690.1| Prolyl 4-hydroxylase alpha IIa subunit [Mus musculus]
gi|74211515|dbj|BAE26490.1| unnamed protein product [Mus musculus]
Length = 535
Score = 117 bits (294), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 100/200 (50%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 397
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 398 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 457
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 458 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 498
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 499 TRHAACPVLVGCKWVSNKWF 518
>gi|354474415|ref|XP_003499426.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
[Cricetulus griseus]
Length = 533
Score = 117 bits (294), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 100/200 (50%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 TRHAACPVLVGCKWVSNKWF 516
>gi|348501574|ref|XP_003438344.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
niloticus]
Length = 615
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 68/202 (33%), Positives = 98/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y + +S E E + LA P +R++T+ + TG + + R S +L D +I
Sbjct: 416 PYIVRYLDIISDAEIERVKQLAKPRLRRATISNPITGVLETASYRISKSAWLTEYDDPMI 475
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATVL 199
I RI T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 476 EKINDRIEGVTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWL 535
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDV GG TVFP+ G ++ P+ G A+ ++++ D
Sbjct: 536 FYMSDVSAGGATVFPDV-------------------GAAVWPQKGTAVFWYNLFASGEGD 576
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
S+ H CPV+ GNKW S KWI
Sbjct: 577 YSTRHAACPVLVGNKWVSNKWI 598
>gi|148701600|gb|EDL33547.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_e [Mus
musculus]
Length = 593
Score = 117 bits (294), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 100/200 (50%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 396 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 455
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 456 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 515
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 516 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 556
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 557 TRHAACPVLVGCKWVSNKWF 576
>gi|323452216|gb|EGB08091.1| hypothetical protein AURANDRAFT_26622 [Aureococcus anophagefferens]
Length = 190
Score = 117 bits (293), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 74/204 (36%), Positives = 101/204 (49%), Gaps = 28/204 (13%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR F+ LS+ EC+++I L T +RKS V G S+ RTS +L R I+
Sbjct: 1 PRVFLVREMLSEFECDHIIELGTKVVRKSMV---GQGGGFTSKTRTSENGWLRRSASPIL 57
Query: 144 RDIEKRIADFTFFPLE------NGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMAT 197
+I KR D + N E LQV+ Y+ Q+Y PH D F D+ + QR T
Sbjct: 58 ENIYKRFGDVLGIDHDLLRSGKNAEELQVVRYDRSQEYAPHHD-FGDDGTPQ---QRFLT 113
Query: 198 VLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS 257
+L+Y+ EEGG T FP A + G+ + P GDA+LF+SM PD +
Sbjct: 114 LLLYIQLPEEGGATSFPKANDGM---------------GVQVVPARGDAVLFYSMLPDGN 158
Query: 258 LDPSSLHGGCPVIKGNKWSSTKWI 281
D +LH G PV KG KW W+
Sbjct: 159 ADDLALHAGMPVRKGQKWVCNLWV 182
>gi|167519971|ref|XP_001744325.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777411|gb|EDQ91028.1| predicted protein [Monosiga brevicollis MX1]
Length = 492
Score = 117 bits (293), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 64/198 (32%), Positives = 99/198 (50%), Gaps = 21/198 (10%)
Query: 85 RAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIR 144
R ++ NF S +EC +L + S V G + R S+ +L D ++
Sbjct: 305 RLQIFRNFASAQECAHLREEGRKKL--SRAVAWTDGAFRPVEFRISTAAWLQPDHDDVVT 362
Query: 145 DIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSD 204
++ RIAD T LE E LQV +Y G YE H+D+ G R+AT ++YL+
Sbjct: 363 NLHTRIADATQLDLEFAEALQVSNYGIGGFYETHYDHHASRERELPEGDRIATFMIYLNQ 422
Query: 205 VEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLH 264
VE+GG T FP + G +++P GDA+ ++++ PD D ++LH
Sbjct: 423 VEQGGYTAFP-------------------RLGAAVEPGHGDAVFWYNLLPDGESDNNTLH 463
Query: 265 GGCPVIKGNKWSSTKWIR 282
G CPV++G+KW + KWI
Sbjct: 464 GACPVLQGSKWVANKWIH 481
>gi|195110925|ref|XP_002000030.1| GI22756 [Drosophila mojavensis]
gi|193916624|gb|EDW15491.1| GI22756 [Drosophila mojavensis]
Length = 533
Score = 117 bits (292), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 65/206 (31%), Positives = 103/206 (50%), Gaps = 20/206 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E S +P YH+ LS ++ L +A PHM++STV G+ S R S +L
Sbjct: 320 LEEHSLDPLVVSYHDMLSPQQIGELRAMAVPHMQRSTVNPLSGGQRMKSAFRVSKNAWLP 379
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG-GQRM 195
++ + + + D T + E LQV +Y G YEPH+D+F D + G R+
Sbjct: 380 YSTHPMMGRMLRDVGDATGLDMTYCEQLQVANYGVGGHYEPHWDFFRDSRHYPAAEGNRI 439
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
AT + YLSDVE+GG T FP +++P++G+ L ++++
Sbjct: 440 ATAIFYLSDVEQGGATAFPF-------------------LNFAVRPQLGNILFWYNLHRS 480
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ D + H GCPV+KG+KW + WI
Sbjct: 481 SDEDYRTKHAGCPVLKGSKWIANIWI 506
>gi|224009604|ref|XP_002293760.1| prolyl 4-hydroxylase alpha subunit [Thalassiosira pseudonana
CCMP1335]
gi|220970432|gb|EED88769.1| prolyl 4-hydroxylase alpha subunit [Thalassiosira pseudonana
CCMP1335]
Length = 206
Score = 117 bits (292), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 74/210 (35%), Positives = 110/210 (52%), Gaps = 18/210 (8%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATP-HMRKSTVVDSD--TGKSKDS--RVRTSS 131
++V+S PRAF NFLS+ E ++++ L T + +ST SD T +DS RTS
Sbjct: 3 LKVLSCAPRAFEIENFLSQTEVDHIMYLTTGMKLHRSTTAGSDQITADERDSTRNTRTSL 62
Query: 132 GTFLARGRDKIIRDIEKRIADFTFF-PLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKN 190
T++ R + II I +R AD E LQ++HY+ GQ+Y H D+ + + +
Sbjct: 63 NTWVYREKSAIIDTIYRRAADLQLMNEALIAEALQLVHYDVGQEYTAHHDWGHPDIDNEY 122
Query: 191 GGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
R T+L+YL++ EGG T FP W N + + GL ++PK+G A+LF+
Sbjct: 123 QPARYCTLLLYLNEGMEGGATQFPR---------WVNAET---RNGLDVEPKIGKAVLFY 170
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKW 280
S PD ++D S H PV G KW W
Sbjct: 171 SQLPDGNMDDWSHHAAMPVRVGEKWLMNLW 200
>gi|348557544|ref|XP_003464579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Cavia porcellus]
Length = 533
Score = 117 bits (292), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 65/200 (32%), Positives = 100/200 (50%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEEDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 396 ARVNRRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G ++ PK G A+ ++++ D
Sbjct: 456 MSDVEAGGATVFPD-------------------LGAALWPKKGTAVFWYNLLRSGEGDYR 496
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 TRHAACPVLVGCKWVSNKWF 516
>gi|281350467|gb|EFB26051.1| hypothetical protein PANDA_009188 [Ailuropoda melanoleuca]
Length = 511
Score = 117 bits (292), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 66/197 (33%), Positives = 98/197 (49%), Gaps = 23/197 (11%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P +R++T+ + TG + R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATV 198
+ I RI D T + E LQV +Y G +YEPHFD+ D F G R+AT
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453
Query: 199 LMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASL 258
L Y+SDV GG TVFP + G S+ PK G A+ ++++
Sbjct: 454 LFYMSDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEG 494
Query: 259 DPSSLHGGCPVIKGNKW 275
D S+ H CPV+ GNKW
Sbjct: 495 DYSTRHAACPVLVGNKW 511
>gi|54792285|emb|CAG28668.1| prolyl 4-hydroxylase alpha-2 subunit [Gallus gallus]
Length = 538
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 70/209 (33%), Positives = 102/209 (48%), Gaps = 25/209 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + LA P + ++TV D TG + R S ++L D ++
Sbjct: 340 PHIVRYYDVMSDEEIEKIKQLAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 399
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 400 AKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFL 459
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 460 NYMSDVEAGGATVFPDF-------------------GAAIWPKKGTAVFWYNLFRSGEGD 500
Query: 260 PSSLHGGCPVIKGNKWSSTKWI--RVNEY 286
+ H CPV+ G KW S KW R NE+
Sbjct: 501 YRTRHAACPVLVGCKWVSNKWFHERGNEF 529
>gi|449267219|gb|EMC78185.1| Prolyl 4-hydroxylase subunit alpha-2 [Columba livia]
Length = 538
Score = 117 bits (292), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 70/209 (33%), Positives = 102/209 (48%), Gaps = 25/209 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + LA P + ++TV D TG + R S ++L D ++
Sbjct: 339 PHIVRYYDVMSDEEIEKIKQLAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 398
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 399 AKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFL 458
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 459 NYMSDVEAGGATVFPDF-------------------GAAIWPKKGTAVFWYNLFRSGEGD 499
Query: 260 PSSLHGGCPVIKGNKWSSTKWI--RVNEY 286
+ H CPV+ G KW S KW R NE+
Sbjct: 500 YRTRHAACPVLVGCKWVSNKWFHERGNEF 528
>gi|190402274|gb|ACE77683.1| prolyl 4-hydroxylase subunit alpha-2 precursor (predicted) [Sorex
araneus]
Length = 533
Score = 116 bits (291), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 99/200 (49%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTTASYRVSKSSWLEETDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 396 ARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 TRHAACPVLVGCKWVSNKWF 516
>gi|148226320|ref|NP_001087703.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
laevis]
gi|51703693|gb|AAH81114.1| MGC83530 protein [Xenopus laevis]
Length = 533
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 70/200 (35%), Positives = 97/200 (48%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR Y + LS EE E + LA P + ++TV D TG + R S +L D +I
Sbjct: 336 PRIVRYLDVLSDEEIEKIKELAKPRLARATVRDPKTGVLTVANYRVSKSAWLEEYDDPVI 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEF--NTKNGGQRMATVLMY 201
+ R+ T + E LQV +Y G +YEPHFD+ F N K G R+AT L Y
Sbjct: 396 GRVNSRMQAITGLTKDTAELLQVANYGMGGQYEPHFDFSRRPFDSNLKTEGNRLATYLNY 455
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I P+ G A+ ++++ D
Sbjct: 456 MSDVEAGGATVFPDF-------------------GAAIWPRKGTAVFWYNLFRSGEGDYR 496
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G+KW S KW
Sbjct: 497 TRHAACPVLVGSKWVSNKWF 516
>gi|395817618|ref|XP_003782262.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Otolemur
garnettii]
Length = 538
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 99/200 (49%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 341 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 400
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 401 ARVNHRMQHITGLSVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRVATFLNY 460
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 461 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 501
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 502 TRHAACPVLVGCKWVSNKWF 521
>gi|344264847|ref|XP_003404501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Loxodonta africana]
Length = 536
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 98/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 337 PHIVRYYDVMSDEEIERIKQIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 396
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 397 AQVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSHEQDAFKRLGTGNRVATFL 456
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 457 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 497
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 498 YRTRHAACPVLVGCKWVSNKWF 519
>gi|156370129|ref|XP_001628324.1| predicted protein [Nematostella vectensis]
gi|156215298|gb|EDO36261.1| predicted protein [Nematostella vectensis]
Length = 541
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 68/230 (29%), Positives = 111/230 (48%), Gaps = 44/230 (19%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG---------------- 120
+E + +P ++ NF++ E + + LATP ++++TV D TG
Sbjct: 303 IERVFVKPEVLIFRNFITDSEIKRIKELATPRLKRATVKDPVTGELIFANYRISKRRATI 362
Query: 121 ------KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQK 174
K + + R S +L D++++ I R+ ++ + E LQV++Y G
Sbjct: 363 QHPVTGKLEFANYRISKSGWLRDEEDELVKRISYRVQAYSGLNMTTSEDLQVVNYGIGGH 422
Query: 175 YEPHFDYFM---DEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSE 231
YEPH+D+ D+F + G R+AT L YLSDVE GG TVF
Sbjct: 423 YEPHYDFARDGEDKFTSLGTGNRIATFLSYLSDVEAGGGTVFT----------------- 465
Query: 232 CGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ G ++ P+ GDA ++++K D S+ H CPV+ G+KW + KWI
Sbjct: 466 --RVGATVWPQKGDAAFWYNLKRSGDGDSSTRHAACPVLVGSKWVANKWI 513
>gi|291387304|ref|XP_002710243.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 3 [Oryctolagus
cuniculus]
Length = 535
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 67/202 (33%), Positives = 98/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
I +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 396 ARINRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRNNERDAFKRLGTGNRVATFL 455
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWF 518
>gi|289662828|ref|ZP_06484409.1| hypothetical protein XcampvN_06993, partial [Xanthomonas campestris
pv. vasculorum NCPPB 702]
Length = 301
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 70/207 (33%), Positives = 96/207 (46%), Gaps = 31/207 (14%)
Query: 89 YHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEK 148
Y LS +EC L+ LA PH+R S V+D + ++ + VRTS G L D II D
Sbjct: 116 YAGVLSADECRLLMLLARPHLRDSQVIDPNDASTQRAPVRTSRGATL----DPIIEDFAA 171
Query: 149 RIADFTF-----FPLENGEGLQVLHYEAGQKYEPHFDYFMD---EFNTKNGGQRMATVLM 200
R+A L + E L VL Y G++Y H DY + N G R TV +
Sbjct: 172 RVAQARLAACAQLTLTHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADHPNAGNRQRTVCV 231
Query: 201 YLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDP 260
YL+ V+ GGET FP A G+ ++P+ G + F ++ D +
Sbjct: 232 YLNVVDAGGETEFPLA-------------------GVRVQPRPGALVCFDNLHADGRPNA 272
Query: 261 SSLHGGCPVIKGNKWSSTKWIRVNEYK 287
SLH G PV G+KW T W R Y+
Sbjct: 273 DSLHAGLPVTAGSKWLGTLWFRQQRYR 299
>gi|354474413|ref|XP_003499425.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
[Cricetulus griseus]
Length = 535
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 98/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRVATFL 455
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWF 518
>gi|74353841|gb|AAI03334.1| Prolyl 4-hydroxylase, alpha polypeptide II [Bos taurus]
Length = 487
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/201 (32%), Positives = 99/201 (49%), Gaps = 21/201 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 290 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 349
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 350 ARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 409
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 410 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 450
Query: 262 SLHGGCPVIKGNKWSSTKWIR 282
+ H CPV+ G KW S KW
Sbjct: 451 TRHAACPVLVGCKWVSNKWFH 471
>gi|426229221|ref|XP_004008689.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
[Ovis aries]
Length = 487
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/201 (32%), Positives = 99/201 (49%), Gaps = 21/201 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 290 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 349
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 350 ARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 409
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 410 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 450
Query: 262 SLHGGCPVIKGNKWSSTKWIR 282
+ H CPV+ G KW S KW
Sbjct: 451 TRHAACPVLVGCKWVSNKWFH 471
>gi|334311009|ref|XP_001371555.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Monodelphis
domestica]
Length = 534
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 67/200 (33%), Positives = 100/200 (50%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ LS EE E + ++ P + ++TV D TG R S ++L D II
Sbjct: 337 PHIVRYYDVLSDEEIEKIKEISKPKLSRATVRDPKTGHLIVVSYRISKSSWLKEDDDPII 396
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ +R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 397 AQVNRRMQYITGLSVKTAELLQVSNYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 456
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G ++ ++++ D
Sbjct: 457 MSDVEAGGATVFPDF-------------------GAAIWPKKGTSVFWYNLFRSGECDYR 497
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G+KW S KW
Sbjct: 498 TRHAACPVLVGSKWVSNKWF 517
>gi|607947|gb|AAA62207.1| prolyl 4-hydroxylase alpha subunit [Caenorhabditis elegans]
Length = 558
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 69/217 (31%), Positives = 105/217 (48%), Gaps = 25/217 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE+ + P A ++ + +S +E + LA P + ++TV DS TGK + R S +L
Sbjct: 320 VEIKRFNPLAVLFKDVISDDEVAAIQELAKPKLARATVHDSVTGKLVTATYRISKSAWLK 379
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 192
++ + KRI T +E E LQ+ +Y G Y+PHFD+ E F + G
Sbjct: 380 EWEGDVVETVNKRIGYMTNLEMETAEELQIANYGIGGHYDPHFDHAKKEESKSFESLGTG 439
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+ATVL Y+S GG TVF A+ +I P DAL ++++
Sbjct: 440 NRIATVLFYMSQPSHGGGTVFTEAKS-------------------TILPTKNDALFWYNL 480
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI--RVNEYK 287
+P + H CPV+ G KW S KWI + NE++
Sbjct: 481 YKQGDGNPDTRHAACPVLVGIKWVSNKWIHEKGNEFR 517
>gi|410447164|ref|ZP_11301266.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [SAR86 cluster
bacterium SAR86E]
gi|409980151|gb|EKO36903.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [SAR86 cluster
bacterium SAR86E]
Length = 214
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 74/223 (33%), Positives = 114/223 (51%), Gaps = 40/223 (17%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRV--RTSSGTF 134
V + S P ++ NFLS EC+ IN A ++ STV+ G + + ++ RTS +
Sbjct: 14 VYLYSVNPIVYLVKNFLSDLECDAFINEAEGRLQDSTVI----GANDEIKLGARTSQNCW 69
Query: 135 LARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT----KN 190
+ ++++ ++ KR++ P+ N E Q+ YE ++Y+P FD F +F+T KN
Sbjct: 70 IEHDANELVHEVSKRLSILAQIPIRNAEQYQLACYEKDEEYKPRFDSF--DFDTLEGKKN 127
Query: 191 ---GGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDAL 247
GGQRM T+++YL+DV+ GG T FP K G +I PK GD +
Sbjct: 128 WEPGGQRMLTIIVYLNDVQSGGGTDFP-------------------KLGFTIPPKKGDVV 168
Query: 248 LFWSMKPDAS------LDPSSLHGGCPVIKGNKWSSTKWIRVN 284
+ + D S + P+SLH G PV+ G KW T W R N
Sbjct: 169 VLNNTCDDDSQNGHPNIHPNSLHAGMPVLSGKKWIVTLWFRQN 211
>gi|226874876|ref|NP_035161.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Mus
musculus]
gi|148701601|gb|EDL33548.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_f [Mus
musculus]
Length = 537
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 98/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 397
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 398 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRVATFL 457
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 458 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 498
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 499 YRTRHAACPVLVGCKWVSNKWF 520
>gi|410948134|ref|XP_003980796.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Felis
catus]
Length = 535
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 98/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKNEQDAFKRLGTGNRVATFL 455
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWF 518
>gi|345326417|ref|XP_001510155.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like
[Ornithorhynchus anatinus]
Length = 888
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 68/202 (33%), Positives = 98/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ LS EE E + LA P + ++TV D TG + R S ++L D ++
Sbjct: 689 PHIVRYYDVLSDEEIEKIKELAKPKLARATVRDPKTGVLTVANYRVSKSSWLEEEDDPVV 748
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 749 AQVNRRMQYITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFL 808
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 809 NYMSDVEAGGATVFPD-------------------FGAAIWPKKGTAVFWYNLFRSGEGD 849
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 850 YRTRHAACPVLVGCKWVSNKWF 871
>gi|332221664|ref|XP_003259983.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Nomascus
leucogenys]
Length = 558
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 98/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 359 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 418
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 419 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFL 478
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 479 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 519
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 520 YRTRHAACPVLVGCKWVSNKWF 541
>gi|403255941|ref|XP_003920663.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Saimiri
boliviensis boliviensis]
gi|403255945|ref|XP_003920665.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Saimiri
boliviensis boliviensis]
Length = 535
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 98/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDAFKHLGTGNRVATFL 455
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWF 518
>gi|395736141|ref|XP_003776706.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
Length = 577
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 98/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 378 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 437
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 438 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFL 497
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 498 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 538
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 539 YRTRHAACPVLVGCKWVSNKWF 560
>gi|355709025|gb|AES03456.1| prolyl 4-hydroxylase, alpha polypeptide II [Mustela putorius furo]
Length = 532
Score = 116 bits (290), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 99/200 (49%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 396 ARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 TRHAACPVLVGCKWVSNKWF 516
>gi|440912197|gb|ELR61789.1| Prolyl 4-hydroxylase subunit alpha-2, partial [Bos grunniens mutus]
Length = 535
Score = 115 bits (289), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 66/201 (32%), Positives = 99/201 (49%), Gaps = 21/201 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 397
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 398 ARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 457
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 458 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 498
Query: 262 SLHGGCPVIKGNKWSSTKWIR 282
+ H CPV+ G KW S KW
Sbjct: 499 TRHAACPVLVGCKWVSNKWFH 519
>gi|195390831|ref|XP_002054071.1| GJ22995 [Drosophila virilis]
gi|194152157|gb|EDW67591.1| GJ22995 [Drosophila virilis]
Length = 485
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 70/206 (33%), Positives = 105/206 (50%), Gaps = 35/206 (16%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR-VRTSSGTFL 135
+EV+ +P +H+ LS E L LA P ++++TV DS+ G + RTS G +L
Sbjct: 297 MEVLVVKPFIVAFHDVLSPHEIGELQQLAMPLLKRTTVYDSNAGLHGSVKGTRTSKGIWL 356
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRM 195
+R + + + I +RI+D T F LE LQV++Y Y H DY FNT
Sbjct: 357 SRSHNNLTKRIGRRISDMTGFHLEGSTSLQVMNYGLSGHYALHTDY----FNTAE----- 407
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
LSDVE+GG+TVFP + + KP+ G ALL++++ +
Sbjct: 408 ------LSDVEQGGDTVFPRIEQ-------------------AFKPERGKALLWYNLHRN 442
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ D + HG CPV+ G+KW T+WI
Sbjct: 443 GTGDKRTEHGACPVLVGSKWIMTQWI 468
>gi|73970649|ref|XP_850109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Canis
lupus familiaris]
Length = 533
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/200 (33%), Positives = 99/200 (49%), Gaps = 21/200 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 396 ARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 262 SLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 TRHAACPVLVGCKWVSNKWF 516
>gi|195159317|ref|XP_002020528.1| GL14042 [Drosophila persimilis]
gi|194117297|gb|EDW39340.1| GL14042 [Drosophila persimilis]
Length = 534
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 68/207 (32%), Positives = 101/207 (48%), Gaps = 20/207 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E S +P YH+ LS + L +A P M +STV G++K S R S +LA
Sbjct: 321 LEEHSLDPFVVTYHDMLSPRKIADLRLMAVPRMHRSTVNPLPGGQNKKSSFRVSKNAWLA 380
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD-EFNTKNGGQRM 195
+ + ++D T + E LQV +Y G YEPH+D+F D + G RM
Sbjct: 381 YDSHPTMGGMLSDLSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDPDHYPAEEGNRM 440
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
AT + YLSDVE+GG T FP ++KP++G+ L ++++
Sbjct: 441 ATAIFYLSDVEQGGATAFP-------------------FLNFAVKPQLGNVLFWYNVHRS 481
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+D + H GCPV+KG+KW WI
Sbjct: 482 LDVDYRTKHAGCPVLKGSKWIGNVWIH 508
>gi|17552840|ref|NP_499464.1| Protein DPY-18 [Caenorhabditis elegans]
gi|20455505|sp|Q10576.2|P4HA1_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
alpha-1; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1; AltName: Full=Protein dumpy-18; Flags:
Precursor
gi|3881011|emb|CAA21045.1| Protein DPY-18 [Caenorhabditis elegans]
gi|6900013|emb|CAB71298.1| prolyl 4-hydroxylase alpha subunit 1 [Caenorhabditis elegans]
Length = 559
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 69/217 (31%), Positives = 105/217 (48%), Gaps = 25/217 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE+ + P A ++ + +S +E + LA P + ++TV DS TGK + R S +L
Sbjct: 321 VEIKRFNPLAVLFKDVISDDEVAAIQELAKPKLARATVHDSVTGKLVTATYRISKSAWLK 380
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 192
++ + KRI T +E E LQ+ +Y G Y+PHFD+ E F + G
Sbjct: 381 EWEGDVVETVNKRIGYMTNLEMETAEELQIANYGIGGHYDPHFDHAKKEESKSFESLGTG 440
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+ATVL Y+S GG TVF A+ +I P DAL ++++
Sbjct: 441 NRIATVLFYMSQPSHGGGTVFTEAKS-------------------TILPTKNDALFWYNL 481
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI--RVNEYK 287
+P + H CPV+ G KW S KWI + NE++
Sbjct: 482 YKQGDGNPDTRHAACPVLVGIKWVSNKWIHEKGNEFR 518
>gi|119582752|gb|EAW62348.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_f
[Homo sapiens]
Length = 567
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 98/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 427
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 428 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFL 487
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 488 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 528
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 529 YRTRHAACPVLVGCKWVSNKWF 550
>gi|195391766|ref|XP_002054531.1| GJ24504 [Drosophila virilis]
gi|194152617|gb|EDW68051.1| GJ24504 [Drosophila virilis]
Length = 545
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 69/212 (32%), Positives = 108/212 (50%), Gaps = 27/212 (12%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE +S +P +YH+ + + E + L L + ++TV ++ S S RTS TF+
Sbjct: 323 VEELSHDPLLVLYHDVIYQSEIDTLAKLTKNKIHRATVTGNNA--SVVSNARTSQFTFIP 380
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-DEFNTKN----- 190
+ R K++R I++R+AD T + E Q+ +Y G Y H D+F + F TK
Sbjct: 381 KTRHKVLRTIDQRVADMTDLNMVFAEDHQLANYGIGGHYAQHMDWFSPNAFETKQVANSE 440
Query: 191 GGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
G R+ATVL YL+DVE+GG T FP + +KPK A ++
Sbjct: 441 MGNRIATVLFYLTDVEQGGGTAFPVLKQ-------------------LLKPKKYAAAFWY 481
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
++ + D ++HG CP+I G+KW +WIR
Sbjct: 482 NLHASGAGDVRTMHGACPIIVGSKWVLNRWIR 513
>gi|340367965|ref|XP_003382523.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Amphimedon
queenslandica]
Length = 525
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 67/210 (31%), Positives = 108/210 (51%), Gaps = 25/210 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
EV +P+ +++++ ++ E E L LA P + ++TV + G+ + R S +L+
Sbjct: 319 TEVAFVKPKIYIFYDIVTDREIERLKELANPKLNRATV-HGENGELLHATYRISKSGWLS 377
Query: 137 RGRDKI--IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM---DEFNTKNG 191
D + + I++RI D T + E LQV++Y G +YEPH+D+ D F +
Sbjct: 378 GSDDPLGYVDRIDQRIEDVTGLTMSTAEQLQVVNYGIGGQYEPHYDFARTGEDTFTSLGS 437
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
G R++T+L+Y+SDVE+GG TVFP G + P A +W+
Sbjct: 438 GNRISTLLIYMSDVEKGGATVFPGV-------------------GARLVPIKRAAAYWWN 478
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
+K D S+ H GCPV+ G+KW KWI
Sbjct: 479 LKRSGDGDYSTRHAGCPVLVGSKWVCNKWI 508
>gi|148701597|gb|EDL33544.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_b [Mus
musculus]
Length = 506
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 98/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 307 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 366
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 367 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRVATFL 426
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 427 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 467
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 468 YRTRHAACPVLVGCKWVSNKWF 489
>gi|332221660|ref|XP_003259981.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Nomascus
leucogenys]
Length = 537
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 98/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 397
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 398 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFL 457
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 458 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 498
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 499 YRTRHAACPVLVGCKWVSNKWF 520
>gi|221126103|ref|XP_002165259.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
magnipapillata]
Length = 533
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/210 (31%), Positives = 106/210 (50%), Gaps = 22/210 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+EV+ +P +Y+ ++ +E +++I A P +R++ V D TG + R S T++A
Sbjct: 326 MEVLHHDPYIELYYELITDDEAKHIIKFAKPLLRRAFVHDMVTGDLIYADYRVSKNTWIA 385
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGG 192
D I I +R+ D T + E LQV +Y +YEPHFD+ F+ + GG
Sbjct: 386 EDMDVIAAKIIRRVGDVTGLNMRYAEHLQVANYGIAGQYEPHFDHSTGTRPKHFD-RWGG 444
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT+L+YLSDV+ GG TVF N G+ P G + ++++
Sbjct: 445 NRIATMLLYLSDVDWGGRTVFTNTA-----------------PGVGTDPIKGAGVFWYNL 487
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ +P + H GCPV+ G KW + WI
Sbjct: 488 LRNGKSNPKTQHAGCPVVLGQKWVANLWIH 517
>gi|114601566|ref|XP_001162222.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
troglodytes]
gi|114601568|ref|XP_001162843.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 17 [Pan
troglodytes]
gi|397518358|ref|XP_003829358.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pan
paniscus]
gi|397518362|ref|XP_003829360.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Pan
paniscus]
gi|410215944|gb|JAA05191.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410255608|gb|JAA15771.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
gi|410331279|gb|JAA34586.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
Length = 535
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 98/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFL 455
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWF 518
>gi|312092237|ref|XP_003147267.1| hypothetical protein LOAG_11701 [Loa loa]
Length = 553
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 64/207 (30%), Positives = 103/207 (49%), Gaps = 23/207 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE++ P A ++H+ +S EE + LA P + ++TV + +TG + + R S +L
Sbjct: 320 VEIVYQNPLAVLFHDIMSDEESRIIEMLAVPKLDRATVHNVETGNLETASYRISKSAWLR 379
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE--FNTKNGGQR 194
+++ I +R+ T + E LQV +Y G YEPH D DE F G R
Sbjct: 380 STEHEVVNRINRRLDLATNLEIATAEELQVQNYGIGGHYEPHLDCSRDEDAFERTGTGNR 439
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW-SMK 253
+AT+L+Y+++ E GG TVF N + ++ P +A LFW ++
Sbjct: 440 IATILIYMTEPEIGGRTVFINLKASV--------------------PCTKNAALFWYNLM 479
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKW 280
++D S H CPV+ G KW++ KW
Sbjct: 480 RSGAVDMRSYHAACPVLTGTKWTANKW 506
>gi|226874885|ref|NP_001029465.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Bos
taurus]
gi|296485623|tpg|DAA27738.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Bos taurus]
Length = 533
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/201 (32%), Positives = 99/201 (49%), Gaps = 21/201 (10%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ R+ T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y
Sbjct: 396 ARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 MSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYR 496
Query: 262 SLHGGCPVIKGNKWSSTKWIR 282
+ H CPV+ G KW S KW
Sbjct: 497 TRHAACPVLVGCKWVSNKWFH 517
>gi|125772813|ref|XP_001357665.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
gi|54637397|gb|EAL26799.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
Length = 534
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 68/207 (32%), Positives = 101/207 (48%), Gaps = 20/207 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E S +P YH+ LS + L +A P M +STV G++K S R S +LA
Sbjct: 321 LEEHSLDPFVVTYHDMLSPRKIADLRLMAVPRMHRSTVNPLPGGQNKKSSFRVSKNAWLA 380
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD-EFNTKNGGQRM 195
+ + ++D T + E LQV +Y G YEPH+D+F D + G RM
Sbjct: 381 YDSHPTMGGMLSDLSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDPDHYPAEEGNRM 440
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
AT + YLSDVE+GG T FP ++KP++G+ L ++++
Sbjct: 441 ATAIFYLSDVEQGGATAFP-------------------FLNFAVKPQLGNVLFWYNVHRS 481
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+D + H GCPV+KG+KW WI
Sbjct: 482 LDVDYRTKHAGCPVLKGSKWIGNVWIH 508
>gi|335283456|ref|XP_003354320.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Sus scrofa]
Length = 535
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 98/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEQDAFKRLGTGNRVATFL 455
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWF 518
>gi|297675929|ref|XP_002815906.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pongo
abelii]
Length = 535
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 98/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFL 455
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWF 518
>gi|351706369|gb|EHB09288.1| Prolyl 4-hydroxylase subunit alpha-2 [Heterocephalus glaber]
Length = 535
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 98/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y+N +S EE + + LA P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYNVMSDEEIDRIKELAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 396 ARVNRRMQYITGLTVQTAELLQVANYGMGGQYEPHFDFSRNHERDAFKRLGTGNRVATFL 455
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G ++ PK G A+ ++++ D
Sbjct: 456 NYMSDVEAGGATVFPD-------------------LGAALWPKKGTAVFWYNLLRSGEGD 496
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWF 518
>gi|256083648|ref|XP_002578053.1| prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
gi|360044447|emb|CCD81995.1| putative prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
Length = 584
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 69/207 (33%), Positives = 103/207 (49%), Gaps = 24/207 (11%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
E ++ +PR ++++ + E E + LATP +R++TV + TG + + RTS +L
Sbjct: 373 ETLNPDPRIVMWYDLIFPSEIEKIKELATPRLRRATVKNPVTGILEIAFYRTSKSAWLPH 432
Query: 138 GRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQ 193
+I I +RI T LE E LQV +Y G Y PHFD+ D F KNG
Sbjct: 433 SMSEITDQISQRIRAVTGLSLETAEDLQVGNYGLGGHYAPHFDFGRKREKDAFEVKNGN- 491
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R+AT++ YLSDV+ GG TVF + G + PK G A ++++
Sbjct: 492 RIATIIFYLSDVQAGGATVF-------------------NRIGTRVVPKKGAAGFWFNLL 532
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKW 280
P+ D + H CPV+ G+KW W
Sbjct: 533 PNGEGDLRTRHAACPVLAGSKWVMNLW 559
>gi|157818741|ref|NP_001101745.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Rattus norvegicus]
gi|149052604|gb|EDM04421.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_a [Rattus norvegicus]
Length = 535
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 98/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDERDAFKRLGTGNRVATFL 455
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWF 518
>gi|77748579|ref|NP_641686.2| hypothetical protein XAC1351 [Xanthomonas axonopodis pv. citri str.
306]
Length = 418
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 72/212 (33%), Positives = 97/212 (45%), Gaps = 31/212 (14%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR Y LS +EC L+ LA PH+R S V+D + ++ + +RTS G L D II
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKVIDPNDASTQRAPIRTSRGATL----DPII 283
Query: 144 RDIEKRIADFTF-----FPLENGEGLQVLHYEAGQKYEPHFDYFMD---EFNTKNGGQRM 195
D R A PL + E L VL Y G++Y H DY + G R
Sbjct: 284 EDFAARAAQARLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQ 343
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
TV +YL+DV GG+T FP A G+ ++P+ G + F ++ D
Sbjct: 344 RTVCVYLNDVGAGGDTEFPIA-------------------GVRVRPRPGTLVCFDNLHAD 384
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SLH G PV G+KW T W R Y+
Sbjct: 385 GRPDADSLHAGLPVTAGSKWLGTLWFRQQRYR 416
>gi|4758868|ref|NP_004190.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
sapiens]
gi|217272863|ref|NP_001136071.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
sapiens]
gi|20455169|sp|O15460.1|P4HA2_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|2439985|gb|AAB71339.1| prolyl 4-hydroxylase alpha (II) subunit [Homo sapiens]
gi|18073926|emb|CAC85689.1| Prolyl 4-hydroxylase alpha IIb subunit [Homo sapiens]
gi|119582746|gb|EAW62342.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_b
[Homo sapiens]
gi|119582747|gb|EAW62343.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha polypeptide II, isoform CRA_b
[Homo sapiens]
Length = 535
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 98/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFL 455
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWF 518
>gi|350014318|dbj|GAA37183.1| prolyl 4-hydroxylase [Clonorchis sinensis]
Length = 595
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 67/208 (32%), Positives = 105/208 (50%), Gaps = 24/208 (11%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EV+ +PR ++++ + E + LA P +R++TV + TGK +++ RTS +L
Sbjct: 384 EVLYPDPRIVMWYDVIHPSEVGRIQELALPRLRRATVKNPVTGKLENAYYRTSKSAWLQD 443
Query: 138 GRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQ 193
G D++ + +RI T +E E LQV +Y G Y PHFD+ D F +N G
Sbjct: 444 GLDEVTHRLNQRIHALTGLAMETAEDLQVGNYGIGGYYAPHFDFGRKREKDAFEVEN-GN 502
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R+AT++ YL+DV+ GG TVF + G S+KP G A ++++
Sbjct: 503 RIATIIFYLTDVKAGGATVF-------------------NRFGASVKPVRGAAGFWYNLH 543
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWI 281
P D + H CPV+ G+KW W
Sbjct: 544 PSGEGDLRTRHVACPVLVGSKWVMNVWF 571
>gi|381173085|ref|ZP_09882194.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|380686458|emb|CCG38681.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
Length = 418
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 72/212 (33%), Positives = 97/212 (45%), Gaps = 31/212 (14%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR Y LS +EC L+ LA PH+R S V+D + ++ + +RTS G L D II
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKVIDPNDASTQRAPIRTSRGATL----DPII 283
Query: 144 RDIEKRIADFTF-----FPLENGEGLQVLHYEAGQKYEPHFDYFMD---EFNTKNGGQRM 195
D R A PL + E L VL Y G++Y H DY + G R
Sbjct: 284 EDFAARAAQARLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQ 343
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
TV +YL+DV GG+T FP A G+ ++P+ G + F ++ D
Sbjct: 344 RTVCVYLNDVGAGGDTEFPIA-------------------GVRVRPRPGTLVCFDNLHAD 384
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SLH G PV G+KW T W R Y+
Sbjct: 385 GRPDADSLHAGLPVTAGSKWLGTLWFRQQRYR 416
>gi|387016442|gb|AFJ50340.1| Prolyl 4-hydroxylase subunit alpha-2-like [Crotalus adamanteus]
Length = 533
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 71/207 (34%), Positives = 100/207 (48%), Gaps = 23/207 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y+ LS EE E + LA P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYEVLSDEEIEKIKELAKPKLARATVRDPKTGVLTVANYRVSKSSWLEEEDDLVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMY 201
+ R+ T + E LQV +Y G +YEPHFD+ F+ K G R+AT L Y
Sbjct: 396 ARVNHRMEQITGLTTKTAELLQVANYGMGGQYEPHFDFSRRPFDITLKTEGNRLATFLNY 455
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 MSDVEAGGATVFPDF-------------------GAAIWPKKGTAVFWYNLFRSGEGDYR 496
Query: 262 SLHGGCPVIKGNKWSSTKWI--RVNEY 286
+ H CPV+ G KW S KW R NE+
Sbjct: 497 TRHAACPVLVGCKWVSNKWFHERGNEF 523
>gi|2498741|sp|Q60716.1|P4HA2_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
alpha-2; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-2; Flags: Precursor
gi|836900|gb|AAC52198.1| prolyl 4-hydroxylase alpha(II)-subunit [Mus musculus]
gi|18073923|emb|CAC85691.1| Prolyl 4-hydroxylase alpha IIb subunit [Mus musculus]
gi|1096888|prf||2112362B Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=II
Length = 537
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 98/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 397
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 398 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDDEDAFKRLGTGNRVATFL 457
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 458 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 498
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 499 YRTRHAACPVLVGCKWVSNKWF 520
>gi|418521653|ref|ZP_13087695.1| hypothetical protein WS7_11622 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410702188|gb|EKQ60697.1| hypothetical protein WS7_11622 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
Length = 418
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 72/212 (33%), Positives = 97/212 (45%), Gaps = 31/212 (14%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR Y LS +EC L+ LA PH+R S V+D + ++ + +RTS G L D II
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKVIDPNDASTQRAPIRTSRGATL----DPII 283
Query: 144 RDIEKRIADFTF-----FPLENGEGLQVLHYEAGQKYEPHFDYFMD---EFNTKNGGQRM 195
D R A PL + E L VL Y G++Y H DY + G R
Sbjct: 284 EDFAARAAQARLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQ 343
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
TV +YL+DV GG+T FP A G+ ++P+ G + F ++ D
Sbjct: 344 RTVCVYLNDVGAGGDTEFPIA-------------------GVRVRPRPGTLVCFDNLHAD 384
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SLH G PV G+KW T W R Y+
Sbjct: 385 GRPDADSLHAGLPVTAGSKWLGTLWFRQQRYR 416
>gi|397643670|gb|EJK76008.1| hypothetical protein THAOC_02250 [Thalassiosira oceanica]
Length = 480
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 80/249 (32%), Positives = 111/249 (44%), Gaps = 50/249 (20%)
Query: 81 SWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSK------DSRV---RTSS 131
S EPR F HNFLS E + + +T + S G K D V RTS
Sbjct: 202 SSEPRVFYVHNFLSAAEADEFVKFSTAPENPYKMAPSTGGTHKAWNQGGDGAVLTTRTSE 261
Query: 132 GTFLARGRDKIIRDIEKR---IADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MD 184
F + D++KR + + +G+Q+L Y+ GQ Y H DYF
Sbjct: 262 NAFDITTKQSF--DVKKRAFRLLRMNGYQENMADGIQILRYKVGQAYVAHHDYFPTHQSK 319
Query: 185 EFN---TKNGGQRMATVLMYLSDVEEGGETVFPNAQG-----------NISAVPWWNELS 230
+FN G R AT+ +YLSDV GG+TVFPN + + P +EL
Sbjct: 320 DFNWDPLSGGSNRFATIFLYLSDVSYGGQTVFPNCEKLSAEKSPELVERLGESPSASELK 379
Query: 231 E-CGKTGL-----------------SIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKG 272
E GL ++ P+ GDA+LF+S +PD LD +SLHG CP++ G
Sbjct: 380 EFVSNAGLMEGSWEDNLIHKCYEKFAVPPRRGDAILFYSQRPDGLLDTNSLHGACPILNG 439
Query: 273 NKWSSTKWI 281
KW + W+
Sbjct: 440 TKWGANLWV 448
>gi|325915856|ref|ZP_08178155.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
[Xanthomonas vesicatoria ATCC 35937]
gi|325537977|gb|EGD09674.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
[Xanthomonas vesicatoria ATCC 35937]
Length = 418
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 72/204 (35%), Positives = 94/204 (46%), Gaps = 31/204 (15%)
Query: 92 FLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIA 151
LS +EC LI LA PH+R S VVD D S+ + +RTS G L D I+ D R A
Sbjct: 236 VLSADECRLLILLARPHLRASQVVDPDDASSQRTPIRTSRGATL----DPILEDFAARAA 291
Query: 152 DFTF-----FPLENGEGLQVLHYEAGQKYEPHFDYFMDE---FNTKNGGQRMATVLMYLS 203
PL + E L VL Y G++Y H DY + G TV +YL+
Sbjct: 292 QARLAACARLPLTHAEPLSVLCYAPGEQYRAHRDYLPASRIAADRPAAGNHQRTVCVYLN 351
Query: 204 DVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSL 263
V+ GG+T FP A G+S++P G + F ++ D DP SL
Sbjct: 352 AVQAGGDTEFPVA-------------------GVSVQPCAGAVVCFDNLHADGRPDPESL 392
Query: 264 HGGCPVIKGNKWSSTKWIRVNEYK 287
H G PV G KW +T W R Y+
Sbjct: 393 HAGLPVTAGTKWLATLWFRQQCYR 416
>gi|344199983|ref|YP_004784309.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrivorans SS3]
gi|343775427|gb|AEM47983.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrivorans SS3]
Length = 212
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 70/199 (35%), Positives = 98/199 (49%), Gaps = 20/199 (10%)
Query: 89 YHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEK 148
+ LS EEC LI H + S V+ + S ++ R S+ + + II+ + +
Sbjct: 17 FSGLLSPEECTELIAAGGSHAKPSEVIYGVSDVSHETSGRRSTVASPSADKYPIIKAVRR 76
Query: 149 RIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD-EFNTKNGGQRMATVLMYLSDVEE 207
RI+ F EN E LQVLHY G +Y+ H+D F++ +NGG RM TVL+YL+DVE+
Sbjct: 77 RISLFIGVAEENQEPLQVLHYTRGGRYDIHYDSFLEGSPQLENGGNRMLTVLLYLNDVEQ 136
Query: 208 GGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGC 267
GG T FP+ NI P +G +LF + SLH G
Sbjct: 137 GGWTQFPHIMANIV-------------------PNVGTGILFRNTDAQNLQLRESLHAGL 177
Query: 268 PVIKGNKWSSTKWIRVNEY 286
PVI G KW ++ WIR Y
Sbjct: 178 PVIDGEKWIASIWIREKSY 196
>gi|149052606|gb|EDM04423.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_c [Rattus norvegicus]
Length = 506
Score = 115 bits (288), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 98/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 307 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 366
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 367 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDERDAFKRLGTGNRVATFL 426
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 427 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 467
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 468 YRTRHAACPVLVGCKWVSNKWF 489
>gi|405964867|gb|EKC30309.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
Length = 591
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 72/227 (31%), Positives = 109/227 (48%), Gaps = 41/227 (18%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKS------KDSRVRTSS 131
EV+++EPR ++H+ +S E+L ++A+ +STV +TG K VR S
Sbjct: 361 EVVNYEPRIAIFHDVISPTSIEHLKSVASKGFTRSTVFLENTGPDGHVTYGKLDNVRVSQ 420
Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLE------NGEGLQVLHYEAGQKYEPHFDYF--- 182
++L + +E RI T E + E QVL+Y G Y H+DY
Sbjct: 421 TSWLGTDEYPELSRLENRIKLTTGLSAEYKSVRSHSEKFQVLNYGVGGMYTVHYDYTGYM 480
Query: 183 -------MDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKT 235
+D + + G+RMAT + YL+DV+ GG TVFP + I
Sbjct: 481 LGIPSNPLDSDDIRTSGERMATWMFYLNDVKAGGATVFPEVKTRIPVAK----------- 529
Query: 236 GLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
G A +++++P + DP +LHGGCPV+ G+KW S KWIR
Sbjct: 530 --------GGAAFWYNVRPSGATDPRTLHGGCPVLVGSKWVSNKWIR 568
>gi|198429625|ref|XP_002128613.1| PREDICTED: similar to procollagen-proline, 2-oxoglutarate
4-dioxygenase (proline 4-hydroxylase), alpha 1
polypeptide [Ciona intestinalis]
Length = 195
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 65/197 (32%), Positives = 102/197 (51%), Gaps = 32/197 (16%)
Query: 93 LSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIAD 152
+S +E + +LA P +R++TV + TG + + R S +L +I+ + +RI+D
Sbjct: 1 MSDKEMAMIKSLAKPRLRRATVQNPVTGVLEFAHYRVSKSAWLKDEDHPVIKRVCQRISD 60
Query: 153 FTFFPLENGEGLQVLHYEAGQKYEPHFDY--------FMDEFNTKNGGQRMATVLMYLSD 204
T +E E LQ+ +Y G +YEPHFDY F DE G R+AT L Y+S+
Sbjct: 61 VTGLSMETAEELQIANYGVGGQYEPHFDYSRKSDFGKFDDEV-----GNRIATFLTYMSN 115
Query: 205 VEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLH 264
VE+GG TVF + G++++P G A+ ++++ P + D + H
Sbjct: 116 VEQGGSTVFLHP-------------------GIAVRPIKGSAVFWYNLLPSGAGDERTRH 156
Query: 265 GGCPVIKGNKWSSTKWI 281
CPV+ G KW S KWI
Sbjct: 157 AACPVLTGVKWVSNKWI 173
>gi|355691582|gb|EHH26767.1| hypothetical protein EGK_16829 [Macaca mulatta]
gi|355750162|gb|EHH54500.1| hypothetical protein EGM_15360 [Macaca fascicularis]
gi|384939464|gb|AFI33337.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Macaca
mulatta]
Length = 535
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 65/202 (32%), Positives = 99/202 (49%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ ++ F G R+AT L
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERHTFKHLGTGNRVATFL 455
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWF 518
>gi|289526401|gb|ADD01323.1| FI13021p [Drosophila melanogaster]
gi|373432715|gb|AEY70761.1| FI17809p1 [Drosophila melanogaster]
Length = 193
Score = 115 bits (287), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 68/193 (35%), Positives = 100/193 (51%), Gaps = 33/193 (17%)
Query: 101 LINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLEN 160
LI A +M K+T + + K +R RT+ G +L + +++ + I +RI D T F L +
Sbjct: 2 LIGKAAQNM-KNTKIHKERAVPKKNRGRTAKGFWLKKESNELTKRITRRIMDMTGFDLAD 60
Query: 161 GEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG-----------GQRMATVLMYLSDVEEGG 209
EG QV++Y G Y H DYF +F + N G R+ATVL YL+DVE+GG
Sbjct: 61 SEGFQVINYGIGGHYFLHMDYF--DFASSNHTDTRSRYSIDLGDRIATVLFYLTDVEQGG 118
Query: 210 ETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPV 269
TVF G G + P+ G A+ ++++ D + DP + H CPV
Sbjct: 119 ATVF-------------------GDVGYYVSPQAGTAIFWYNLDTDGNGDPRTRHAACPV 159
Query: 270 IKGNKWSSTKWIR 282
I G+KW T+WIR
Sbjct: 160 IVGSKWVMTEWIR 172
>gi|390989336|ref|ZP_10259634.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
pv. punicae str. LMG 859]
gi|372555840|emb|CCF66609.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
pv. punicae str. LMG 859]
Length = 228
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 72/212 (33%), Positives = 98/212 (46%), Gaps = 31/212 (14%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR Y LS +EC L+ LA PH+R S V+D + ++ + +RTS G L D II
Sbjct: 38 PRIEEYAAVLSADECRLLMLLARPHLRASKVIDPNDASTQRAPIRTSRGATL----DPII 93
Query: 144 RDIEKRIADFTF-----FPLENGEGLQVLHYEAGQKYEPHFDYFMD---EFNTKNGGQRM 195
D R A PL + E L VL Y G++Y H DY + + G R
Sbjct: 94 EDFAARAAQARLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRRTAGNRQ 153
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
TV +YL+DV GG+T FP A G+ ++P+ G + F ++ D
Sbjct: 154 RTVCVYLNDVGAGGDTEFPIA-------------------GVRVRPRPGTLVCFDNLHAD 194
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SLH G PV G+KW T W R Y+
Sbjct: 195 GRPDADSLHAGLPVTAGSKWLGTLWFRQQRYR 226
>gi|196011912|ref|XP_002115819.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
gi|190581595|gb|EDV21671.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
Length = 300
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 62/210 (29%), Positives = 105/210 (50%), Gaps = 23/210 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD--SRVRTSSGTF 134
+E +S +P +YHN S E E L LA ++ + V + + +++ R + F
Sbjct: 97 IEEMSRDPLIILYHNLTSNAEMESLKALAAKQLQPAGVYHTTSADNRNLEGYTRIAKMAF 156
Query: 135 LARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKN--GG 192
+ + I +R+ D T + E LQV++Y +Y PH+D F + ++
Sbjct: 157 ILDEESAVASAITQRLQDVTGLNMNFSEPLQVINYGIAGQYTPHYDTFPAKSGDRSHPSH 216
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT ++YLSDVE GG TVF N + + P+ G+ +++++
Sbjct: 217 DRLATAILYLSDVERGGATVFTN-------------------INVRVLPRKGNVIIWYNY 257
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
PD +L P +LH GCPV+ G+KW + KWI+
Sbjct: 258 LPDGNLHPGTLHAGCPVLVGSKWIANKWIQ 287
>gi|395817620|ref|XP_003782263.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Otolemur
garnettii]
Length = 540
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 97/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 341 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 400
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
+ R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 401 ARVNHRMQHITGLSVKTAELLQVANYGVGGQYEPHFDFSRNHERDAFKRLGTGNRVATFL 460
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 461 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 501
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 502 YRTRHAACPVLVGCKWVSNKWF 523
>gi|348557542|ref|XP_003464578.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Cavia porcellus]
Length = 535
Score = 114 bits (286), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 65/202 (32%), Positives = 98/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEEDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
+ +R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 396 ARVNRRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRSHERDAFKRLGTGNRVATFL 455
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G ++ PK G A+ ++++ D
Sbjct: 456 NYMSDVEAGGATVFPD-------------------LGAALWPKKGTAVFWYNLLRSGEGD 496
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWF 518
>gi|240974259|ref|XP_002401836.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215491070|gb|EEC00711.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 490
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 66/208 (31%), Positives = 104/208 (50%), Gaps = 23/208 (11%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EV+ +PR +YH+ +SK E + + LA P ++++TV + +G+ + + R S +L
Sbjct: 285 EVMFPKPRIVIYHDVMSKHEMDVVKLLAQPRLKRATVQNYKSGELEVANYRISKSAWLRN 344
Query: 138 GRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQ 193
+I + +RI T + E LQV++Y G YEPHFD+ E F + G
Sbjct: 345 EEHGVIARVTRRIEHITGLSADTAEELQVVNYGIGGHYEPHFDFARREEKNAFQSLGTGN 404
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R+AT L Y+SDV GG TVFP + L++ P+ G A ++++
Sbjct: 405 RIATWLNYMSDVPAGGATVFPQLR-------------------LTLWPEKGAAAFWYNLH 445
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWI 281
D + H CPV+ G+KW S KW
Sbjct: 446 RSGEGDMLTRHAACPVLAGSKWVSNKWF 473
>gi|170064956|ref|XP_001867741.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
gi|167882144|gb|EDS45527.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
Length = 520
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 68/208 (32%), Positives = 107/208 (51%), Gaps = 24/208 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+EV++ EP VYH +S E LI LA P +++S V DT + S++R S +
Sbjct: 317 LEVVNLEPLIVVYHEAVSDREIAKLIELARPLIKRSAV--GDTRSEQISKIRISQNAWFE 374
Query: 137 RGRDKIIRDIEKRIADFTFFPLE-NGEGLQVLHYEAGQKYEPHFDYFM--DEFNTKNGGQ 193
D I+ + +R D E + E LQV +Y G Y H+D+ + F K G
Sbjct: 375 NEHDPIVETLNQRARDMAGGLNEPSYELLQVNNYGLGGFYSIHYDWSTSANPFPNKGMGN 434
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R+AT++ YLSDV+EGG TVFP + L+++P+ G A+ ++++
Sbjct: 435 RIATLMFYLSDVQEGGSTVFP-------------------RLNLAVRPRKGTAIFWYNLH 475
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ + +LH CPV+ G+KW + KWI
Sbjct: 476 RNGKGNKKTLHAACPVLIGSKWVANKWI 503
>gi|195055767|ref|XP_001994784.1| GH14132 [Drosophila grimshawi]
gi|193892547|gb|EDV91413.1| GH14132 [Drosophila grimshawi]
Length = 537
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 72/210 (34%), Positives = 106/210 (50%), Gaps = 25/210 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE ++ P +YH+ + + E + L L ++ VV + T S S+ RTS F+A
Sbjct: 328 VEELNRNPLLVLYHDVIYQSEIDVLNKLNRKRYERAGVVINST--STVSKKRTSQHIFIA 385
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF--MDEFNTK--NGG 192
R K++R I++R+AD T ++ E Q+ Y G Y HFD+F D N+K G
Sbjct: 386 ATRHKVLRTIDQRVADMTNLNMQYAEDHQLADYGIGGHYSQHFDWFGNSDLANSKCDEMG 445
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+ATVL YLSDV +GG T FP + +KPK A ++++
Sbjct: 446 NRIATVLFYLSDVAQGGGTAFPILKQ-------------------LLKPKKYAAAFWYNL 486
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D +LHGGCP+I G+KW +WIR
Sbjct: 487 HASGKGDWRNLHGGCPIIVGSKWVLNRWIR 516
>gi|30686940|ref|NP_194290.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
gi|26451153|dbj|BAC42680.1| unknown protein [Arabidopsis thaliana]
gi|29893542|gb|AAP06823.1| unknown protein [Arabidopsis thaliana]
gi|332659681|gb|AEE85081.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
Length = 291
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 84/259 (32%), Positives = 133/259 (51%), Gaps = 38/259 (14%)
Query: 31 ILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSE----GDEGRAEQWVEVISWEPRA 86
+++++ S P SG SRK I KS + G + V +SW PR
Sbjct: 9 LILMITMSSSSPPFCSGGSRKELRDKEITSKSDDTQASYVLGSKFVDPTRVLQLSWLPRV 68
Query: 87 FVYHNFLSKEECEYLINLATPHMRKSTV----VDSDTGKSKDSRVRTSSGTFLARGRDKI 142
F+Y FLS+EEC++LI+L RK T VD+D GK++ D +
Sbjct: 69 FLYRGFLSEEECDHLISL-----RKETTEVYSVDAD-GKTQ---------------LDPV 107
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
+ IE++++ +TF P ENG ++V Y + +K DYF +E ++ +ATV++YL
Sbjct: 108 VAGIEEKVSAWTFLPGENGGSIKVRSYTS-EKSGKKLDYFGEEPSSVLHESLLATVVLYL 166
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
S+ +GGE +FPN++ + C + G ++P G+A+LF++ +ASLD S
Sbjct: 167 SNTTQGGELLFPNSE--------MKPKNSCLEGGNILRPVKGNAILFFTRLLNASLDGKS 218
Query: 263 LHGGCPVIKGNKWSSTKWI 281
H CPV+KG +TK I
Sbjct: 219 THLRCPVVKGELLVATKLI 237
>gi|301754231|ref|XP_002912939.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Ailuropoda
melanoleuca]
Length = 535
Score = 114 bits (286), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 66/202 (32%), Positives = 97/202 (48%), Gaps = 23/202 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
+ R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 396 ARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKNEQDAFKRLGTGNRVATFL 455
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KW
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWF 518
>gi|268572523|ref|XP_002641343.1| C. briggsae CBR-DPY-18 protein [Caenorhabditis briggsae]
gi|94442971|emb|CAJ98658.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
Length = 559
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 68/217 (31%), Positives = 105/217 (48%), Gaps = 25/217 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE+ + P A ++ + +S EE + LA P + ++TV DS TGK + R S +L
Sbjct: 321 VEIKRFNPLAVLFKDVISDEEVATIQELAKPKLARATVHDSVTGKLVTATYRISKSAWLK 380
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 192
+++ + KRI T +E E LQ+ +Y G Y+PHFD+ E F + G
Sbjct: 381 AWEHEVVERVNKRIDLMTNLEMETAEELQIANYGIGGHYDPHFDHAKKEESKSFESLGTG 440
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+ATVL Y+S GG TVF + ++ P DAL ++++
Sbjct: 441 NRIATVLFYMSQPSHGGGTVFTEVKS-------------------TVLPTKNDALFWYNL 481
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI--RVNEYK 287
+P + H CPV+ G KW S KWI + NE++
Sbjct: 482 YKQGDGNPDTRHAACPVLVGIKWVSNKWIHEKGNEFR 518
>gi|37912909|gb|AAR05245.1| conserved hypothetical protein [uncultured marine proteobacterium
ANT32C12]
Length = 186
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 69/201 (34%), Positives = 100/201 (49%), Gaps = 27/201 (13%)
Query: 88 VYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIE 147
+Y F L+ L + ++TV+ + DSR T+S ++ +II ++
Sbjct: 2 LYQIFYPLMSARPLLRLDQARVERATVITDSEHQFHDSR--TNSYAWIQHDASEIIHEVS 59
Query: 148 KRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF--MDEFNTKN---GGQRMATVLMYL 202
KR + P+ N E Q++HY G +Y+PHFD F E N GGQRM T L YL
Sbjct: 60 KRFSILVKMPINNAEQFQLVHYGPGTEYKPHFDAFDKSTEEGRNNWFPGGQRMVTALAYL 119
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS-LDPS 261
+DVE+GG T FP+ +S+KP GD ++F + K S ++P+
Sbjct: 120 NDVEDGGATDFPDIH-------------------VSVKPNKGDVVVFHNCKDGTSDINPN 160
Query: 262 SLHGGCPVIKGNKWSSTKWIR 282
SLHGG PVI G KW+ W R
Sbjct: 161 SLHGGSPVISGEKWAVNLWFR 181
>gi|412986224|emb|CCO17424.1| predicted protein [Bathycoccus prasinos]
Length = 557
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 78/233 (33%), Positives = 113/233 (48%), Gaps = 37/233 (15%)
Query: 79 VISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL--A 136
+S P FV+ NFL + ECE+L LA +++S V D K S RTSS FL A
Sbjct: 318 CVSLSPLLFVFENFLHESECEFLRTLADKDLKRSRVTD-----GKLSNGRTSSSCFLIGA 372
Query: 137 RGRDKIIRDIEKRIAD------------FTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD 184
+G++ +++ IE+R+ D F L+ E +Q++ Y +KY HFD
Sbjct: 373 KGKEDVVKTIERRMLDAIRSTPVLTTRRFDTLKLKGSEPMQIVRYGKNEKYTSHFD---- 428
Query: 185 EFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQ-----------GNISAVPWWNELSECG 233
N +R+AT + YLSD EGG T FP A+ G +
Sbjct: 429 --NKAGSFRRVATFMCYLSDQCEGGCTNFPKAEPLFLEPSFDEHGAFKPFGRKKKTVASE 486
Query: 234 KTGLSIKPKMGDALLFWSMKPDA-SLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
+ G+ I PK+G A+LF+S+ + +P SLH G V KG K+ TKW+ E
Sbjct: 487 QHGVKIHPKLGRAILFFSISEEPFRENPLSLHEGQTVRKGEKFICTKWLTRTE 539
>gi|195172672|ref|XP_002027120.1| GL20071 [Drosophila persimilis]
gi|194112933|gb|EDW34976.1| GL20071 [Drosophila persimilis]
Length = 455
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 73/209 (34%), Positives = 107/209 (51%), Gaps = 24/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE +S +P +YH+ + E E L + A P M +S V SK++ RTS F
Sbjct: 252 VEPLSQDPYIAMYHDVIYDSEIEELKDNAFPDMERSKVYTYSDEDSKNTG-RTSMSAFQT 310
Query: 137 RGRDKIIRDIEKRIADFTFFP-LENG--EGLQVLHYEAGQKYEPHFDYFMDEFNTK-NGG 192
+ K + + +R+ T F L +G + L VL+Y +Y H DYF ++ G
Sbjct: 311 DHQYKAVTKVNRRVMHMTGFEVLADGSSDELLVLNYATAAQYLTHSDYFGPAYSEYIQRG 370
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+ATVL YL+DVE+GG+TVFP + G+ P G A++F++M
Sbjct: 371 DRIATVLFYLNDVEQGGKTVFP-------------------RLGIFRSPMKGSAVVFYNM 411
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
DP + HGGCPV+ G KW++TKWI
Sbjct: 412 NSSLQGDPRTEHGGCPVLVGTKWAATKWI 440
>gi|56118630|ref|NP_001007975.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
(Silurana) tropicalis]
gi|51513259|gb|AAH80485.1| p4ha2 protein [Xenopus (Silurana) tropicalis]
Length = 527
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 70/205 (34%), Positives = 98/205 (47%), Gaps = 23/205 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR Y N LS EE + LA P + ++TV D TG + R S +L D +I
Sbjct: 338 PRIVRYLNALSDEEIAKIKELAKPKLARATVRDPKTGVLSVANYRVSKSAWLEENDDPVI 397
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEF--NTKNGGQRMATVLMY 201
+ R+ T ++ E LQV +Y G +YEPHFD+ F N K G R+AT L Y
Sbjct: 398 ARVNLRMQAITGLTVDTAELLQVANYGMGGQYEPHFDFSRRPFDSNLKTDGNRLATFLNY 457
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPS 261
+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 458 MSDVEAGGATVFPD-------------------FGAAIWPKKGTAVFWYNLFRSGEGDYR 498
Query: 262 SLHGGCPVIKGNKWSSTKWIRVNEY 286
+ H CPV+ G+KW KW ++
Sbjct: 499 TRHAACPVLVGSKWG--KWTHTQDH 521
>gi|21107513|gb|AAM36222.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 273
Score = 114 bits (285), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 72/212 (33%), Positives = 97/212 (45%), Gaps = 31/212 (14%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR Y LS +EC L+ LA PH+R S V+D + ++ + +RTS G L D II
Sbjct: 83 PRIEEYAAVLSADECRLLMLLARPHLRASKVIDPNDASTQRAPIRTSRGATL----DPII 138
Query: 144 RDIEKRIADFTF-----FPLENGEGLQVLHYEAGQKYEPHFDYFMD---EFNTKNGGQRM 195
D R A PL + E L VL Y G++Y H DY + G R
Sbjct: 139 EDFAARAAQARLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQ 198
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
TV +YL+DV GG+T FP A G+ ++P+ G + F ++ D
Sbjct: 199 RTVCVYLNDVGAGGDTEFPIA-------------------GVRVRPRPGTLVCFDNLHAD 239
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SLH G PV G+KW T W R Y+
Sbjct: 240 GRPDADSLHAGLPVTAGSKWLGTLWFRQQRYR 271
>gi|226874889|ref|NP_001152881.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Bos
taurus]
gi|296485624|tpg|DAA27739.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Bos taurus]
Length = 535
Score = 114 bits (285), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 66/203 (32%), Positives = 97/203 (47%), Gaps = 23/203 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
+ R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 396 ARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEQDAFKRLGTGNRVATFL 455
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 260 PSSLHGGCPVIKGNKWSSTKWIR 282
+ H CPV+ G KW S KW
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFH 519
>gi|426229219|ref|XP_004008688.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
[Ovis aries]
Length = 535
Score = 114 bits (284), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 66/203 (32%), Positives = 97/203 (47%), Gaps = 23/203 (11%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
+ R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 396 ARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEQDAFKRLGTGNRVATFL 455
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 456 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 496
Query: 260 PSSLHGGCPVIKGNKWSSTKWIR 282
+ H CPV+ G KW S KW
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFH 519
>gi|47218149|emb|CAG10069.1| unnamed protein product [Tetraodon nigroviridis]
Length = 595
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 71/224 (31%), Positives = 101/224 (45%), Gaps = 45/224 (20%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKST----------------------VVDSDTGK 121
P Y + +S +E E + LA P +R++T V D TGK
Sbjct: 374 PYIVRYLDIISDKEIELVKQLAKPRLRRATISNPITGVLETASYRISKRRATVHDPQTGK 433
Query: 122 SKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDY 181
++ R S +L +I I +RI D T ++ E LQV +Y G +YEPHFD+
Sbjct: 434 LTTAQYRVSKSAWLTGYEHPVIETINQRIEDLTGLEVDTAEELQVANYGVGGQYEPHFDF 493
Query: 182 FM----DEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGL 237
D F G R+AT L Y+SDV GG TVFP+ G
Sbjct: 494 GRKDEPDAFKELGTGNRIATWLFYMSDVAAGGATVFPD-------------------VGA 534
Query: 238 SIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
++ P+ G A+ ++++ D S+ H CPV+ GNKW S KWI
Sbjct: 535 AVWPQKGSAVFWYNLFTSGEGDYSTRHAACPVLVGNKWVSNKWI 578
>gi|381200649|ref|ZP_09907785.1| Prolyl 4-hydroxylase alpha subunit [Sphingobium yanoikuyae XLDN2-5]
Length = 305
Score = 114 bits (284), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 74/244 (30%), Positives = 112/244 (45%), Gaps = 27/244 (11%)
Query: 44 SSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLIN 103
++ GD L+ I R ++ E W+ R F FL+ +EC ++I+
Sbjct: 83 AAQGDPVAQQQLALINRMALAPDGAPVAVPEGEALAGGWDVRLF--RQFLTGDECHHVIS 140
Query: 104 LATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR-DKIIRDIEKRIADFTFFPLENGE 162
+ + V+D +G+ VRTS G R D +I+ I +RIA + L GE
Sbjct: 141 EGQALLEPAMVIDPRSGRPMPHPVRTSDGGIFGPAREDLVIQAINRRIAAASGTMLSGGE 200
Query: 163 GLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISA 222
L +L Y GQ+Y H D N QR T+L+YL++ GGET+FP
Sbjct: 201 PLTLLRYAVGQQYRQHHDCLPHVRN-----QRAWTMLIYLNEGYAGGETIFP-------- 247
Query: 223 VPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ GLS+K + GDALLF + +++H G PV+ G KW T+WIR
Sbjct: 248 -----------RLGLSVKGRKGDALLFRNTDAQGQAAEAAVHLGAPVMAGQKWLCTRWIR 296
Query: 283 VNEY 286
+ +
Sbjct: 297 HDRH 300
>gi|308497208|ref|XP_003110791.1| CRE-DPY-18 protein [Caenorhabditis remanei]
gi|308242671|gb|EFO86623.1| CRE-DPY-18 protein [Caenorhabditis remanei]
Length = 559
Score = 113 bits (283), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 67/217 (30%), Positives = 105/217 (48%), Gaps = 25/217 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE+ + P A ++ + +S +E + LA P + ++TV DS TGK + R S +L
Sbjct: 321 VEIKRFNPLAVLFKDVISDDEVATIQELAKPKLARATVHDSATGKLVTATYRISKSAWLK 380
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 192
+++ + KRI T +E E LQ+ +Y G Y+PHFD+ E F + G
Sbjct: 381 EWEHEVVERVNKRIELMTNLEMETAEELQIANYGIGGHYDPHFDHAKKEESKSFESLGTG 440
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+ATVL Y+S GG TVF + ++ P DAL ++++
Sbjct: 441 NRIATVLFYMSQPSHGGGTVFTEVKS-------------------TVLPTKNDALFWYNL 481
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI--RVNEYK 287
+P + H CPV+ G KW S KWI + NE++
Sbjct: 482 FKQGDGNPDTRHAACPVLVGIKWVSNKWIHEKGNEFR 518
>gi|325929527|ref|ZP_08190641.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
[Xanthomonas perforans 91-118]
gi|325540037|gb|EGD11665.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
[Xanthomonas perforans 91-118]
Length = 418
Score = 113 bits (283), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 72/212 (33%), Positives = 95/212 (44%), Gaps = 31/212 (14%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
PR Y LS +EC L+ LA PH+R S V+D + + + +RTS G L D II
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKVIDPNDASTGRAPIRTSHGATL----DPII 283
Query: 144 RDIEKRIADFTF-----FPLENGEGLQVLHYEAGQKYEPHFDYFMD---EFNTKNGGQRM 195
D R A PL + E L VL Y G++Y H DY + G R
Sbjct: 284 EDFAARAAQARLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQ 343
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
TV +YL+DV GET FP A G+ ++P+ G + F ++ D
Sbjct: 344 RTVCVYLNDVGAAGETEFPVA-------------------GVRVRPRPGTLVCFDNLHAD 384
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287
D SLH G PV G+KW T W R Y+
Sbjct: 385 GRPDADSLHAGLPVTAGSKWLGTLWFRQQRYR 416
>gi|147791524|emb|CAN70717.1| hypothetical protein VITISV_029140 [Vitis vinifera]
Length = 173
Score = 113 bits (283), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 82/143 (57%), Gaps = 32/143 (22%)
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
++ IEKRI+ ++ P+ENGE +Q FN K GGQR+AT+L+YL
Sbjct: 55 LQAIEKRISVYSQVPVENGELIQ--------------------FNLKRGGQRVATMLIYL 94
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKT---GLSIKPKMGDALLFWSMKPDASLD 259
SD EGGET FP A + CG GLS+ P G+A+LFWSM D D
Sbjct: 95 SDNVEGGETYFPMAG---------SGFCRCGGKSVRGLSVAPVKGNAVLFWSMGLDGQSD 145
Query: 260 PSSLHGGCPVIKGNKWSSTKWIR 282
P+S+HGGC V+ G KWS+TKW+R
Sbjct: 146 PNSIHGGCEVLAGEKWSATKWMR 168
>gi|255607134|ref|XP_002538686.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
gi|223510975|gb|EEF23697.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
Length = 318
Score = 113 bits (283), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 68/211 (32%), Positives = 107/211 (50%), Gaps = 25/211 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V ++ PR ++ + LS EC+ LI + +++S VV + RTS G +
Sbjct: 119 VVMVCTAPRIALFDDVLSDAECDALIAASRSRLQRSKVVANRGSGEFVDDTRTSYGAYFN 178
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE-----FNTKNG 191
+G + ++ I++RIA+ T +PL + E LQ+L+Y G +Y PHFDYF + ++G
Sbjct: 179 KGENSLVATIQRRIAELTRWPLTHAEPLQILNYGLGGEYLPHFDYFEPQQPGLPSPLESG 238
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
GQR+ATV+MYL+DVE GG T+FP+ L +P+ G A+ F S
Sbjct: 239 GQRIATVVMYLNDVEAGGGTIFPH-------------------LNLETRPRKGGAIYF-S 278
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ + S I KW +T+W R
Sbjct: 279 YQLAVARSIRSRCMAARRIARRKWIATQWFR 309
>gi|324511726|gb|ADY44875.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
Length = 550
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 65/209 (31%), Positives = 105/209 (50%), Gaps = 23/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE++ + P A ++ + +S EE + + +ATP ++++TV +S TG+ + + R S +L
Sbjct: 322 VEILRFNPLAVLFVDIISDEEAKMIQQIATPRLKRATVQNSKTGELETAAYRISKSAWLK 381
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 192
G ++I I +RI T E E LQ+ +Y G Y+PHFD+ E F + G
Sbjct: 382 GGDHELIDRINRRIELMTNLIQETSEELQIANYGVGGHYDPHFDFARKEEPKAFESLGTG 441
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+ATVL YL++ E GG TVF + ++ P AL ++++
Sbjct: 442 NRLATVLFYLTEPEIGGGTVFTELRT-------------------AVMPSKNGALFWYNL 482
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
D + H CPV+ G KW + KWI
Sbjct: 483 YRSGEGDLRTRHAACPVLVGIKWVANKWI 511
>gi|405965633|gb|EKC30995.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
Length = 617
Score = 113 bits (282), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 64/208 (30%), Positives = 102/208 (49%), Gaps = 23/208 (11%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
E + P +YH+ +S +E + + +ATP + ++TV + TGK + + R S +L
Sbjct: 412 EEVYLNPWIVIYHDVVSDKEIDTIKRIATPLLSRATVHNPRTGKLETAEYRVSKSAWLKD 471
Query: 138 GRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGGQ 193
G D +I ++ RI+D T + E LQ+ +Y G +YEPHFD+ E F G
Sbjct: 472 GDDPVIHNVNNRISDITGLSMATAEELQIANYGLGGQYEPHFDFARREETEAFRDLGSGN 531
Query: 194 RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMK 253
R+AT L Y+++V+ GG TVF + G+ + P G A ++++
Sbjct: 532 RIATWLTYMTNVDAGGATVFTH-------------------IGVKLFPIKGAAAFWYNLY 572
Query: 254 PDASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW S KWI
Sbjct: 573 RSGDGIFDTRHAACPVLVGQKWVSNKWI 600
>gi|299115443|emb|CBN75608.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 548
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 80/243 (32%), Positives = 118/243 (48%), Gaps = 39/243 (16%)
Query: 73 AEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD---SRVRT 129
AE +E +S PR F +NF+ EE + +I A +++ + + +K S+ RT
Sbjct: 203 AEVVLETLSHSPRVFSLYNFMDMEEADSIIEDALGMTQEAYRLKRSSTGTKGKAISKTRT 262
Query: 130 SSGTFLARGRDKIIRDIEKRIADFTFFPLEN-----GEGLQVLHYEAGQKYEPHFDYFMD 184
S F+ + +++RI F +E +GLQVL Y Q Y HFDY
Sbjct: 263 SDNAFVTH--TNTAQALKRRI--FQLLGIEEYHETWADGLQVLRYNESQAYVAHFDYLES 318
Query: 185 ----EFNTKN-GGQRMATVLMYLSDVEEGGETVFPNAQG-NISAVP-------------- 224
+F ++ G R ATV++Y +DV EGGETVF +A G + VP
Sbjct: 319 AEGHDFKSEGLGTNRFATVVLYFNDVREGGETVFTHAPGIDHHLVPDTKVPVREVLENLD 378
Query: 225 ----WWNE--LSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSST 278
W E L +C + + + PK G A+LF++ PD D SS HG CPVI G KW++
Sbjct: 379 LPRSGWEEKLLLQC-RRHMVVAPKRGQAVLFYNQHPDGRKDLSSEHGACPVIDGQKWAAN 437
Query: 279 KWI 281
W+
Sbjct: 438 LWV 440
>gi|326435474|gb|EGD81044.1| hypothetical protein PTSG_10986 [Salpingoeca sp. ATCC 50818]
Length = 264
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 71/229 (31%), Positives = 122/229 (53%), Gaps = 33/229 (14%)
Query: 66 SEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD- 124
++ D R + ++S +P ++NF+S+E + +++ A P +ST +G ++
Sbjct: 51 ADADWLRQHYNITMLSEDPPVIQFNNFISQERIDAILHFAKPKFARST-----SGIEREV 105
Query: 125 SRVRTSSGTFL---ARGRDKI---IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPH 178
S RTSS ++ G D + ++D+E+ IA P+EN E QVL Y+ Q Y+ H
Sbjct: 106 SNYRTSSTAWMLPDVLGNDPMQAHLKDMEEEIARIVRLPVENQEHFQVLQYQKNQYYKVH 165
Query: 179 FDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLS 238
DY ++E + G R+AT +YL+DVEEGG T FPN L+
Sbjct: 166 SDY-IEEQRQQPCGIRVATFFLYLNDVEEGGGTRFPN-------------------LNLT 205
Query: 239 IKPKMGDALLFWSMKPDAS-LDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
++P G+A+L++S P+ + +D + H PV KG K+ + KWI ++++
Sbjct: 206 VQPAKGNAVLWYSAYPNTTRMDSRTDHEAMPVAKGMKYGANKWIHIHDF 254
>gi|195159148|ref|XP_002020444.1| GL13996 [Drosophila persimilis]
gi|194117213|gb|EDW39256.1| GL13996 [Drosophila persimilis]
Length = 559
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 74/213 (34%), Positives = 105/213 (49%), Gaps = 28/213 (13%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS----- 131
+E +S +P +YHN LS EE L N++TP + ++ + D +T K K S VR++
Sbjct: 343 MEELSLDPYIVLYHNVLSDEEMARLENMSTPLLHRARIFDKETKKPKISPVRSADEVGIP 402
Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNTKN 190
L G +++ I+KRI D T L + +Q L Y G Y PH D+F + ++
Sbjct: 403 NPKLVTGDIQLVECIQKRITDLTGLMLTSMRRIQFLKYGFGGIYVPHHDFFSVHTPTSRL 462
Query: 191 GGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
G R+ATV+ YL+DVE GG T FPN + P A+LFW
Sbjct: 463 HGDRIATVIFYLNDVEHGGATAFPNLD--------------------LVVPTERGAVLFW 502
Query: 251 -SMKPDA-SLDPSSLHGGCPVIKGNKWSSTKWI 281
+M + LD +LHG CPVI G K WI
Sbjct: 503 HNMDGETYDLDYRTLHGACPVIVGTKMVMAGWI 535
>gi|148701598|gb|EDL33545.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide, isoform CRA_c [Mus
musculus]
gi|149052607|gb|EDM04424.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
4-hydroxylase), alpha II polypeptide (predicted),
isoform CRA_d [Rattus norvegicus]
Length = 189
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 64/191 (33%), Positives = 96/191 (50%), Gaps = 21/191 (10%)
Query: 93 LSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIAD 152
+S EE E + +A P + ++TV D TG + R S ++L D ++ + +R+
Sbjct: 1 MSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQH 60
Query: 153 FTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMYLSDVEEGGE 210
T ++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y+SDVE GG
Sbjct: 61 ITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGA 120
Query: 211 TVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVI 270
TVFP+ G +I PK G A+ ++++ D + H CPV+
Sbjct: 121 TVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVL 161
Query: 271 KGNKWSSTKWI 281
G KW S KW
Sbjct: 162 VGCKWVSNKWF 172
>gi|195505202|ref|XP_002099402.1| GE23382 [Drosophila yakuba]
gi|194185503|gb|EDW99114.1| GE23382 [Drosophila yakuba]
Length = 537
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 66/207 (31%), Positives = 100/207 (48%), Gaps = 20/207 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E S +P YH+ LS + L +A P MR+STV G+ K S R S +LA
Sbjct: 324 LEEHSLDPYVATYHDMLSPRKISQLREMAVPRMRRSTVNPLPGGQHKKSAFRVSKNAWLA 383
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD-EFNTKNGGQRM 195
+ + + + + T E LQV +Y G YEPH+D+F D + G R+
Sbjct: 384 YESHPTMVGMLRDLKEATGLDTTYCEQLQVANYGVGGHYEPHWDFFRDPNHYPEEEGNRI 443
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
AT + YLS+VE+GG T FP +++KP++G+ L ++++
Sbjct: 444 ATAIFYLSEVEQGGATAFPFLD-------------------IAVKPQLGNVLFWYNLHRS 484
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D + H GCPV+KG+KW WI
Sbjct: 485 LDKDYRTKHAGCPVLKGSKWIGNVWIH 511
>gi|391342914|ref|XP_003745760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Metaseiulus
occidentalis]
Length = 525
Score = 112 bits (280), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 62/209 (29%), Positives = 103/209 (49%), Gaps = 23/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+EVI P ++H+ +S +E + +I L+ P ++++TV ++ +G+ + + R S +L
Sbjct: 319 LEVIHERPYLALFHDIMSDDEIQTVIELSAPRLKRATVQNAKSGELEVANYRISKSAWLK 378
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM----DEFNTKNGG 192
+++ + R T E LQV++Y G YE HFD+ D F G
Sbjct: 379 NHDHEVVERLSFRFEYLTGLTHLTAEELQVVNYGIGGHYEAHFDFARRDEKDAFKQLGTG 438
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT + Y+SDV+ GG TVFP + GL++ P+ G A +W++
Sbjct: 439 NRIATWINYMSDVKAGGATVFP-------------------RLGLTVWPEKGSAAFWWNL 479
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
D + H CPV+ G+KW S KW
Sbjct: 480 HRSGEGDILTRHAACPVLAGSKWVSNKWF 508
>gi|223993535|ref|XP_002286451.1| predicted protein [Thalassiosira pseudonana CCMP1335]
gi|220977766|gb|EED96092.1| predicted protein [Thalassiosira pseudonana CCMP1335]
Length = 679
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 80/250 (32%), Positives = 124/250 (49%), Gaps = 48/250 (19%)
Query: 76 WVEVISWEPRAFVYHNFLSKEECEYLINLA------TPHMRKSTVVDSDTGKSKDSRVRT 129
W+EVIS +PR F NF KEE + +++ A T M++S+ +G + +S+ RT
Sbjct: 331 WLEVISLKPRVFDIFNFFDKEESKAIVDKAIAETSETHRMKRSST--GASGYNVNSQ-RT 387
Query: 130 SSGTFLARGRD-KIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT 188
S F G++ + ++ I F + +GLQVL Y Y PH D+ +D+++
Sbjct: 388 SENGFDTHGKEAQAVKHRCMEILGFDEYIESFTDGLQVLRYNKTTAYIPHLDW-IDDYHK 446
Query: 189 KN---------GGQRMATVLMYLSDVEEG--GETVF----PNAQ---------------- 217
K G R AT+L+Y+SD+ EG GETVF P Q
Sbjct: 447 KEEHNYDSAGIGSNRFATILLYMSDLGEGDGGETVFVKGWPPGQSEEERVQLKDALASLR 506
Query: 218 --GNISAV----PWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIK 271
G+++ + W ++ ++ L+++P A LF+S PD S D SLHGGCPVI
Sbjct: 507 ESGDVTGLLKEGSWEEKMVANCRSRLAVRPHSSRAALFYSQNPDGSPDEDSLHGGCPVIN 566
Query: 272 GNKWSSTKWI 281
G KW++ W+
Sbjct: 567 GEKWAANLWV 576
>gi|195452746|ref|XP_002073482.1| GK14141 [Drosophila willistoni]
gi|194169567|gb|EDW84468.1| GK14141 [Drosophila willistoni]
Length = 541
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 66/212 (31%), Positives = 109/212 (51%), Gaps = 27/212 (12%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE ++ P +YH+ + + E + + NL + ++TV+ + S+ S+VRTS TF+
Sbjct: 323 VEELNHNPLLVLYHDVIYQSEIDVIRNLTENEISRATVIGAKG--SEVSKVRTSQFTFIP 380
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNTK-----N 190
+ R K+++ I++R+AD + ++ E Q +Y G Y H D+F D F+ +
Sbjct: 381 KTRHKVLQTIDQRVADMSNLNMDYAELHQFANYGIGGHYAQHNDWFGQDAFDNELVSSPE 440
Query: 191 GGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
G R+ATVL YLSDV +GG T FP+ + ++PK A +
Sbjct: 441 MGNRIATVLFYLSDVAQGGGTAFPHLKQ-------------------LLQPKKYAAAFWH 481
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
++ D +LHG CP+I G+KW +WIR
Sbjct: 482 NLHASGVGDLRTLHGACPIIAGSKWVQNRWIR 513
>gi|326914688|ref|XP_003203656.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Meleagris
gallopavo]
Length = 539
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 68/209 (32%), Positives = 101/209 (48%), Gaps = 26/209 (12%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
E + +P +YH+F+S E E + LA P +++S V + K + R S +L
Sbjct: 335 ETLRLQPYIVLYHDFVSDAEAETIKGLAGPWLQRSVVASGE--KQQKVEYRISKSAWLKD 392
Query: 138 GRDKIIRDIEKRIADFTFFPLEN--GEGLQVLHYEAGQKYEPHFDYFMDE---FNTKNGG 192
D ++R +E R+A T L E LQV++Y G YEPHFD+ G
Sbjct: 393 TADPVVRALELRMAAITGLDLRPPYAEYLQVVNYGLGGHYEPHFDHATSRKSPLYRMKSG 452
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+ATV++YLS VE GG T F A ++ V AL +W++
Sbjct: 453 NRIATVMIYLSAVEAGGSTAFIYANFSVPVV-------------------KNAALFWWNL 493
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ + D +LH GCPV+ G+KW + KWI
Sbjct: 494 RRNGDGDGDTLHAGCPVLAGDKWVANKWI 522
>gi|339236271|ref|XP_003379690.1| prolyl 4-hydroxylase subunit alpha-1 [Trichinella spiralis]
gi|316977627|gb|EFV60702.1| prolyl 4-hydroxylase subunit alpha-1 [Trichinella spiralis]
Length = 558
Score = 112 bits (279), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 65/231 (28%), Positives = 110/231 (47%), Gaps = 44/231 (19%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VEV+ W+P+ + +S EE + LA+P ++++TV ++DTG+ + + R S +L
Sbjct: 329 VEVMHWKPKIVYFRGVISDEEIAVIKQLASPLLKRATVHNADTGQLETASYRISKSAWLK 388
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFD---------------- 180
++++ I RI T +E E LQ+ +Y G Y+PHFD
Sbjct: 389 DTEHEVVKRISDRIDMMTDLTMETAELLQIANYGIGGHYDPHFDMSTRGESDPYEEGTGN 448
Query: 181 ------YFMDE---FNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSE 231
++ ++ F + N G R+ATVL Y+S E GG TVF + +
Sbjct: 449 RIATVLFYTNDPYSFESLNAGNRIATVLFYISQPEAGGGTVFTSHK-------------- 494
Query: 232 CGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
++++P DA ++++ D S+ H CPV+ G KW + KWI
Sbjct: 495 -----ITVEPSKYDAAFWFNVLQGGEPDMSTRHAACPVLAGTKWVANKWIH 540
>gi|195505209|ref|XP_002099405.1| GE10885 [Drosophila yakuba]
gi|194185506|gb|EDW99117.1| GE10885 [Drosophila yakuba]
Length = 473
Score = 112 bits (279), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 66/209 (31%), Positives = 105/209 (50%), Gaps = 22/209 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E++S +P ++H+ +S ++ + NLA + ++ V D G ++ RT+ GT+L
Sbjct: 276 MELLSLDPYMVLFHDVVSDKDITSIRNLAKGGLVRAVTVTKD-GSYEEDPARTTKGTWLV 334
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
K+I+ + + D T + + + QVL+Y G Y HFD+ D N R+A
Sbjct: 335 EN-SKLIQRLSQLAQDMTNLDIRDADPFQVLNYGIGGYYGTHFDFLADT-EMGNFSNRIA 392
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
T + YLSDV +GG T+FP K GLS+ PK G ALL++++
Sbjct: 393 TAVFYLSDVPQGGATIFP-------------------KLGLSVFPKKGSALLWYNLDHKG 433
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
D + H CP I G++W TKWI E
Sbjct: 434 DGDNRTAHSACPTIVGSRWVMTKWINERE 462
>gi|194905419|ref|XP_001981192.1| GG11932 [Drosophila erecta]
gi|190655830|gb|EDV53062.1| GG11932 [Drosophila erecta]
Length = 535
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 66/210 (31%), Positives = 100/210 (47%), Gaps = 23/210 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVD-SDTGKSKDSRVRTSSGTFL 135
+E + +P H + ++ E L A P +++STV + G S + RTS G
Sbjct: 319 LEELHLDPPVVQLHQVIGSKDAESLQRTARPRIKRSTVYSLAGNGDSTAAAFRTSQGASF 378
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG---G 192
R+ + + + DF+ +E E LQV +Y G YEPH+D F D + G G
Sbjct: 379 NYSRNAATKLLSHHVGDFSGLNMEYAEDLQVANYGIGGHYEPHWDSFPDNHVYQEGDLHG 438
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT + YLSDVE GG T FP +P L + P+ G L ++++
Sbjct: 439 NRIATAIYYLSDVEAGGGTAFP-------FLP------------LLVTPERGSLLFWYNL 479
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
P D + H CPV++G+KW + WIR
Sbjct: 480 HPSGDQDFRTKHAACPVLQGSKWIANVWIR 509
>gi|449467908|ref|XP_004151664.1| PREDICTED: uncharacterized protein LOC101218099, partial [Cucumis
sativus]
Length = 122
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 66/128 (51%), Positives = 83/128 (64%), Gaps = 11/128 (8%)
Query: 1 MAKPRYSRFPTRKSSS---STLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSS 57
++K +Y + RK S+ S +I+ L++ F +LI L F LS P +S + S
Sbjct: 3 ISKGKYIKLQGRKWSTFQLSKMIMALVLALGFFMLIALRF--LSPPETS------HHRFS 54
Query: 58 IVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
VR + S+G R +QWVE ISWEPRAFVYHNFLSKEEC YLI+LA PHM KSTVVDS
Sbjct: 55 SVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDS 114
Query: 118 DTGKSKDS 125
TG+S DS
Sbjct: 115 KTGESVDS 122
>gi|395521232|ref|XP_003764722.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Sarcophilus
harrisii]
Length = 521
Score = 111 bits (278), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 67/210 (31%), Positives = 100/210 (47%), Gaps = 26/210 (12%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EV+ EP +YH+F+S E + + A P +++S V + K + R S +L
Sbjct: 317 EVLHLEPYIVLYHDFVSDSEAQKIRGFAAPWLQRSVVASGE--KQQQVEYRISKSAWLKD 374
Query: 138 GRDKIIRDIEKRIADFTFFPLE--NGEGLQVLHYEAGQKYEPHFDYFMDE---FNTKNGG 192
D I+ +++RIA T ++ E LQV++Y G YEPHFD+ N G
Sbjct: 375 TVDPILVSLDRRIAALTGLNVQPPYAEHLQVVNYGIGGHYEPHFDHATSPSSPLYRMNSG 434
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT ++YLS VE GG T F A ++ V AL +W++
Sbjct: 435 NRVATFMIYLSSVEAGGSTAFIYANFSVPVV-------------------KNAALFWWNL 475
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D +LH GCPV+ G+KW + KWI
Sbjct: 476 HRSGQGDGDTLHAGCPVLVGDKWVANKWIH 505
>gi|390176836|ref|XP_003736216.1| GA26872, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388858809|gb|EIM52289.1| GA26872, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 567
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 73/213 (34%), Positives = 105/213 (49%), Gaps = 28/213 (13%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E +S +P +YHN LS EE L N++TP + ++ V DS K K S RT+ +
Sbjct: 351 MEELSLDPYIVLYHNVLSDEEMARLENMSTPLLHRARVFDSGIRKPKISPARTADEVQIP 410
Query: 137 RGR-----DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNTKN 190
+ +++ I+KR+ D T L + +Q L Y G Y PH D+F + ++
Sbjct: 411 NPKLVAEDIQLVERIQKRMTDLTGLVLTSMRRIQFLKYGFGGIYVPHHDFFSVHTPTSRL 470
Query: 191 GGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
G R+ATV+ YL+DVE GG T FPN + P A+LFW
Sbjct: 471 HGDRIATVIFYLNDVEHGGATAFPNLD--------------------LVVPTERGAVLFW 510
Query: 251 -SMKPDA-SLDPSSLHGGCPVIKGNKWSSTKWI 281
+M + LD +LHG CPVI G K T+WI
Sbjct: 511 HNMDGETYDLDYRTLHGACPVIVGTKMVMTRWI 543
>gi|51490656|emb|CAF31507.1| prolyl 4-hydroxylase 2 precursor [Brugia malayi]
Length = 551
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 63/206 (30%), Positives = 104/206 (50%), Gaps = 21/206 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE++ P ++ + +S EE + LA P + ++TV + TG + + RTS ++L
Sbjct: 321 VEIVHQNPLVVLFRDIVSDEEMRIIEMLAVPKLARATVHNVVTGNIETAFYRTSQSSWLG 380
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE--FNTKNGGQR 194
++++ I KR+ T E E LQV +Y G YEPH+D E F G R
Sbjct: 381 STEHEVVKRINKRLDLATNLETETAEELQVQNYGIGGHYEPHYDCSRRENVFEKTKNGNR 440
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+AT+L+Y+++ E GG TVF + + ++S C K AL ++++
Sbjct: 441 IATILIYMTEPEIGGGTVFIDLKTSVS----------CTKNA---------ALFWYNLMR 481
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKW 280
++D S H CPV+ G KW++ KW
Sbjct: 482 SGAVDMRSYHAACPVLTGTKWTANKW 507
>gi|198459366|ref|XP_002138685.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
gi|198136669|gb|EDY69243.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
Length = 448
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 71/209 (33%), Positives = 105/209 (50%), Gaps = 24/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE +S +P +YH+ + E E L + A P M +S V KD+ RTS F
Sbjct: 245 VEPLSQDPYIAMYHDVIYDSEIEELKDNAFPDMERSKVYTYSDKDGKDTG-RTSMSAFQT 303
Query: 137 RGRDKIIRDIEKRIADFTFFP-LENG--EGLQVLHYEAGQKYEPHFDYFMDEFNTK-NGG 192
+ + + +R+ T F L +G + L VL+Y +Y H DYF ++ G
Sbjct: 304 DHQYTAVTKVNRRVMHMTGFEVLADGSSDELLVLNYATAAQYLTHSDYFGPAYSEYIQRG 363
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+ATVL YL+DVE+GG+TVFP + G+ P G A++F+++
Sbjct: 364 DRIATVLFYLNDVEQGGKTVFP-------------------RLGIFRSPMKGSAVVFYNL 404
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
DP + HGGCPV+ G KW++TKWI
Sbjct: 405 NSSLQGDPRTEHGGCPVLVGTKWAATKWI 433
>gi|198449518|ref|XP_002136915.1| GA26872, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198130643|gb|EDY67473.1| GA26872, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 543
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 73/213 (34%), Positives = 105/213 (49%), Gaps = 28/213 (13%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E +S +P +YHN LS EE L N++TP + ++ V DS K K S RT+ +
Sbjct: 327 MEELSLDPYIVLYHNVLSDEEMARLENMSTPLLHRARVFDSGIRKPKISPARTADEVQIP 386
Query: 137 RGR-----DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNTKN 190
+ +++ I+KR+ D T L + +Q L Y G Y PH D+F + ++
Sbjct: 387 NPKLVAEDIQLVERIQKRMTDLTGLVLTSMRRIQFLKYGFGGIYVPHHDFFSVHTPTSRL 446
Query: 191 GGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
G R+ATV+ YL+DVE GG T FPN + P A+LFW
Sbjct: 447 HGDRIATVIFYLNDVEHGGATAFPNLD--------------------LVVPTERGAVLFW 486
Query: 251 -SMKPDA-SLDPSSLHGGCPVIKGNKWSSTKWI 281
+M + LD +LHG CPVI G K T+WI
Sbjct: 487 HNMDGETYDLDYRTLHGACPVIVGTKMVMTRWI 519
>gi|195159313|ref|XP_002020526.1| GL14040 [Drosophila persimilis]
gi|194117295|gb|EDW39338.1| GL14040 [Drosophila persimilis]
Length = 549
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 68/208 (32%), Positives = 103/208 (49%), Gaps = 24/208 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE +S +P +H+ LS +E E +I + +S + TG S S +RTS T+L
Sbjct: 339 VEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQVTRSEI--GQTGNSTVSEIRTSQNTWLW 396
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG--GQR 194
+ + DI++R+ D T + E LQ+++Y G +YEPHFD FMD+ G G R
Sbjct: 397 YENNPWLADIKQRLEDITGLSTDTAEPLQLVNYGIGGQYEPHFD-FMDDAEKNFGWKGNR 455
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+ T L YL+DV GG T FP L++ P G L+++++
Sbjct: 456 LLTALFYLNDVPLGGATAFPFLH-------------------LAVPPVKGSLLVWYNLHR 496
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D + H GCPV+KG+KW +W
Sbjct: 497 SLHKDFRTKHAGCPVLKGSKWICNEWFH 524
>gi|310831339|ref|YP_003969982.1| putative prolyl 4-hydroxylase alpha subunit [Cafeteria
roenbergensis virus BV-PW1]
gi|309386523|gb|ADO67383.1| putative prolyl 4-hydroxylase alpha subunit [Cafeteria
roenbergensis virus BV-PW1]
Length = 210
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 69/212 (32%), Positives = 106/212 (50%), Gaps = 28/212 (13%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
++S +P + N L+K+EC ++I + + ++ + V S + S RT + +L+
Sbjct: 4 HILSQDPLIYYVDNVLNKQECYHIIKITSNKLKPALV--SGNSRGFLSTGRTGTNCWLSH 61
Query: 138 GRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNT-----KNG 191
D+I +I +I + PLEN E QVLHY QKYE H+D F +D K G
Sbjct: 62 KNDEITFNIALKITNLVNKPLENAENFQVLHYSTNQKYEYHYDAFPIDNSEKAKRCLKKG 121
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW- 250
GQR+ T L+YL++V +GGET F N + I PK+G L+F
Sbjct: 122 GQRLLTALIYLNNVTKGGETEFKNL-------------------NIKITPKIGRILVFEN 162
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+++ + P SLH G VI+G K+ W R
Sbjct: 163 TLQNSLNKHPDSLHSGKQVIEGEKYVINLWFR 194
>gi|198449635|ref|XP_001357660.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
gi|198130694|gb|EAL26794.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 70/217 (32%), Positives = 107/217 (49%), Gaps = 37/217 (17%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
E +S +P +YH+ + + E + + L T M ++ V + T +S S VRTS TF+A
Sbjct: 321 AEELSHDPLLVLYHDVIYQSEIDVIRQLTTNRMARAMV--TLTNQSTVSNVRTSQITFIA 378
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG----- 191
+ ++++ I++R+AD T ++ E Q +Y G Y H D+F E NG
Sbjct: 379 KTEHEVLQTIDRRVADMTNLNMDYAEDHQFANYGIGGHYGQHMDWFT-ETTFDNGLVSST 437
Query: 192 --GQRMATVLMYLSDVEEGGETVFPNAQGNIS----AVPWWNELSECGKTGLSIKPKMGD 245
G R+ATVL YLSDV +GG T FP + ++ A +W+ L G+ GD
Sbjct: 438 EMGNRIATVLFYLSDVAQGGGTAFPYLKQHLRPKKYAAAFWHNLHAAGR---------GD 488
Query: 246 ALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
A + HG CP+I G+KW +WIR
Sbjct: 489 A--------------RTQHGACPIIAGSKWVLNRWIR 511
>gi|156352054|ref|XP_001622587.1| predicted protein [Nematostella vectensis]
gi|156209158|gb|EDO30487.1| predicted protein [Nematostella vectensis]
Length = 531
Score = 111 bits (277), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 69/214 (32%), Positives = 109/214 (50%), Gaps = 28/214 (13%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE + +P ++ + + E EY+ ATP +R++TV + TG+ + + R S +L
Sbjct: 320 VEELHSDPPIWMLRDVMYDSEIEYIKRTATPKLRRATVTNLKTGELEFADYRISKSGWLE 379
Query: 137 RGRD----KIIRDIEKRIADFTFFPL--ENGEGLQVLHYEAGQKYEPHFDYFMDEFNT-- 188
RD KI+ + +R + T + E LQ+++Y A YEPHFD+ + ++
Sbjct: 380 DPRDDNEEKILNRVNRRTSIITGLDTTPRSAEALQIVNYGAAGHYEPHFDHATEAVSSIL 439
Query: 189 KNG-GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDAL 247
K G G R+ATVL Y+SDVE GG TVF +A+ +KP GDA
Sbjct: 440 KLGIGNRIATVLYYMSDVEAGGATVFVDAEA-------------------IVKPSKGDAA 480
Query: 248 LFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
++++ + D + H CP+I G+KW KWI
Sbjct: 481 FWYNLHKNGKGDERTRHAACPIIVGSKWVCNKWI 514
>gi|313229039|emb|CBY18191.1| unnamed protein product [Oikopleura dioica]
Length = 522
Score = 110 bits (276), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 67/205 (32%), Positives = 102/205 (49%), Gaps = 20/205 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE I+ +P + + L+ E L L + ++TV D T K ++ R S +L
Sbjct: 316 VEEIAKQPYVVRFFDILNDNEINSLERLGEEKLARATVFDPATHKLVNADYRVSKSAWLK 375
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +RI+ T LE E LQ+ +Y G +YEPH+DY E++ N +R+A
Sbjct: 376 DEDSDTVEKYNRRISRLTGLDLEYAEQLQMSNYGIGGQYEPHYDYSRREWDIYN-NRRIA 434
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
T L YL+ VE+GG TVF + GL I+ G A+ ++++ P+
Sbjct: 435 TWLSYLTTVEQGGGTVFT-------------------ELGLHIRSIKGSAVFWYNLLPNG 475
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWI 281
S D + H CPV++GNKW S KWI
Sbjct: 476 SGDERTRHAACPVLRGNKWVSNKWI 500
>gi|47227817|emb|CAG08980.1| unnamed protein product [Tetraodon nigroviridis]
Length = 285
Score = 110 bits (276), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 106/211 (50%), Gaps = 30/211 (14%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
E +S +P +YH+F+S E E + + A +R+S V D K + R S +L
Sbjct: 81 ETLSLQPYVVLYHDFISDTEAEEIKHHAQLGLRRSVVATRD--KQVTAEYRISKSAWLKG 138
Query: 138 GRDKIIRDIEKRIADFTFFPLEN--GEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNG 191
+ +++RI+ T +++ GE LQV++Y G YEPHFD+ F K G
Sbjct: 139 SAQSAVSRLDQRISMLTGLNVQHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKLKTG 198
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW- 250
R+ATV++YLS VE GG T F A ++ P M +A +FW
Sbjct: 199 -NRVATVMIYLSSVEAGGSTAFIYANFSV--------------------PVMKNAAIFWW 237
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
++ + DP +LH GCPV+ G+KW + KWI
Sbjct: 238 NLHRNGRGDPDTLHAGCPVLIGDKWVANKWI 268
>gi|198449643|ref|XP_001357664.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
gi|198130698|gb|EAL26798.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
Length = 549
Score = 110 bits (276), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 68/208 (32%), Positives = 103/208 (49%), Gaps = 24/208 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE +S +P +H+ LS +E E +I + +S + TG S S +RTS T+L
Sbjct: 339 VEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQVTRSEI--GQTGNSTVSDIRTSQNTWLW 396
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG--GQR 194
+ + DI++R+ D T + E LQ+++Y G +YEPHFD FMD+ G G R
Sbjct: 397 YENNPWLADIKQRLEDITGLSTDTAEPLQLVNYGIGGQYEPHFD-FMDDAEKNFGWKGNR 455
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+ T L YL+DV GG T FP L++ P G L+++++
Sbjct: 456 LLTALFYLNDVPLGGATAFPFLH-------------------LAVPPVKGSLLVWYNLHR 496
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D + H GCPV+KG+KW +W
Sbjct: 497 SLHKDFRTKHAGCPVLKGSKWICNQWFH 524
>gi|195159164|ref|XP_002020452.1| GL13506 [Drosophila persimilis]
gi|194117221|gb|EDW39264.1| GL13506 [Drosophila persimilis]
Length = 536
Score = 110 bits (276), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 74/213 (34%), Positives = 107/213 (50%), Gaps = 28/213 (13%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E +S +P VYHN LS E + +A P ++ V + D SK S+VRT+ G ++
Sbjct: 325 MEELSLDPYIVVYHNVLSDAEIAKVERVAEPLLKSIGVGEMDN--SKKSKVRTALGAWIP 382
Query: 137 RGRDKI-----IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFN-TKN 190
I I+ I +RI D T ++ G+ +Q++ Y G Y+ HFDY D T+
Sbjct: 383 DENMHISGWPVIQRIVRRIHDMTGLIIKRGQVVQLIKYGYGGHYDTHFDYLNDSLPITQA 442
Query: 191 GGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
G RMATVL YL+DV+ GG TVFP Q L + + G L+++
Sbjct: 443 LGDRMATVLFYLNDVKHGGSTVFPVLQ-------------------LKVPSERGKVLVWY 483
Query: 251 SMKPDA-SLDPSSLHGGCPVIKGNKWSSTKWIR 282
+M + LD +LHG CPVI G K + WI
Sbjct: 484 NMHGETHDLDSRTLHGSCPVIDGAKTVLSCWIH 516
>gi|116008432|ref|NP_651804.2| CG15539, isoform A [Drosophila melanogaster]
gi|66772391|gb|AAY55507.1| IP10910p [Drosophila melanogaster]
gi|66772535|gb|AAY55579.1| IP10810p [Drosophila melanogaster]
gi|113194858|gb|AAF57060.2| CG15539, isoform A [Drosophila melanogaster]
Length = 386
Score = 110 bits (276), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 64/209 (30%), Positives = 107/209 (51%), Gaps = 22/209 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E++S +P ++H+ +S ++ + NL + ++ V D ++D RT+ GT+L
Sbjct: 189 MELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLARTVTVSKDGNYTEDPD-RTTKGTWLV 247
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +I+ + + D T F + + + QVL+Y G Y HFD F+++ N R+A
Sbjct: 248 EN-NALIQRLSQLTQDMTNFDIHDADPFQVLNYGIGGFYGIHFD-FLEDAELDNFSDRIA 305
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
T + YLSDV +GG T+FP K GLS+ PK G ALL++++
Sbjct: 306 TAVFYLSDVPQGGATIFP-------------------KLGLSVFPKKGSALLWYNLDHKG 346
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
D + H CP + G++W TKWI E
Sbjct: 347 DGDNRTAHSACPTVVGSRWVMTKWINERE 375
>gi|427410797|ref|ZP_18900999.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
51230]
gi|425710785|gb|EKU73805.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
51230]
Length = 322
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 66/207 (31%), Positives = 101/207 (48%), Gaps = 27/207 (13%)
Query: 81 SWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR- 139
W+ R F FL+ +EC ++I+ + + V+D +G+ +RTS G R
Sbjct: 137 GWDVRLF--RQFLTGDECHHVISEGQALLEPAMVIDPRSGRPMPHPIRTSDGGIFGPARE 194
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
D +I+ I +RIA + L GE L +L Y GQ+Y H D N QR T+L
Sbjct: 195 DLVIQAINRRIAAASGTMLSGGEPLTLLRYAVGQQYRQHHDCLPHVRN-----QRAWTML 249
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
+YL++ GGET+FP + GLS+K + G+ALLF +
Sbjct: 250 IYLNEGYAGGETIFP-------------------RLGLSVKGRKGNALLFRNTDAQGQAA 290
Query: 260 PSSLHGGCPVIKGNKWSSTKWIRVNEY 286
+++H G PV+ G KW T+WIR + +
Sbjct: 291 EAAVHLGAPVMAGQKWLCTRWIRHDRH 317
>gi|363729586|ref|XP_417248.3| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Gallus gallus]
Length = 542
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 67/209 (32%), Positives = 101/209 (48%), Gaps = 26/209 (12%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
E + +P +YH+F+S E E + LA P +++S V + K + R S +L
Sbjct: 338 ETLRLQPYIVLYHDFVSDAEAETIKGLAGPWLQRSVVASGE--KQQKVEYRISKSAWLKD 395
Query: 138 GRDKIIRDIEKRIADFTFFPLE--NGEGLQVLHYEAGQKYEPHFDYFMDE---FNTKNGG 192
D +++ +E R+A T L E LQV++Y G YEPHFD+ G
Sbjct: 396 TADPVVQALELRMAAITGLDLRPPYAEYLQVVNYGLGGHYEPHFDHATSRKSPLYRMKSG 455
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+ATV++YLS VE GG T F A ++ V AL +W++
Sbjct: 456 NRIATVMIYLSAVEAGGSTAFIYANFSVPVV-------------------KNAALFWWNL 496
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ + D +LH GCPV+ G+KW + KWI
Sbjct: 497 RRNGDGDGDTLHAGCPVLAGDKWVANKWI 525
>gi|116008128|ref|NP_001036776.1| CG15539, isoform B [Drosophila melanogaster]
gi|113194857|gb|ABI31220.1| CG15539, isoform B [Drosophila melanogaster]
Length = 509
Score = 110 bits (276), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 64/209 (30%), Positives = 107/209 (51%), Gaps = 22/209 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E++S +P ++H+ +S ++ + NL + ++ V D ++D RT+ GT+L
Sbjct: 312 MELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLARTVTVSKDGNYTEDPD-RTTKGTWLV 370
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ +I+ + + D T F + + + QVL+Y G Y HFD F+++ N R+A
Sbjct: 371 EN-NALIQRLSQLTQDMTNFDIHDADPFQVLNYGIGGFYGIHFD-FLEDAELDNFSDRIA 428
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
T + YLSDV +GG T+FP K GLS+ PK G ALL++++
Sbjct: 429 TAVFYLSDVPQGGATIFP-------------------KLGLSVFPKKGSALLWYNLDHKG 469
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
D + H CP + G++W TKWI E
Sbjct: 470 DGDNRTAHSACPTVVGSRWVMTKWINERE 498
>gi|74216495|dbj|BAE25162.1| unnamed protein product [Mus musculus]
Length = 187
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 63/188 (33%), Positives = 94/188 (50%), Gaps = 21/188 (11%)
Query: 96 EECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTF 155
EE E + +A P + ++TV D TG + R S ++L D ++ + +R+ T
Sbjct: 2 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITG 61
Query: 156 FPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT--KNGGQRMATVLMYLSDVEEGGETVF 213
++ E LQV +Y G +YEPHFD+ F++ K G R+AT L Y+SDVE GG TVF
Sbjct: 62 LTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVF 121
Query: 214 PNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGN 273
P+ G +I PK G A+ ++++ D + H CPV+ G
Sbjct: 122 PD-------------------LGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGC 162
Query: 274 KWSSTKWI 281
KW S KW
Sbjct: 163 KWVSNKWF 170
>gi|348505573|ref|XP_003440335.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oreochromis
niloticus]
Length = 517
Score = 110 bits (275), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 67/211 (31%), Positives = 107/211 (50%), Gaps = 28/211 (13%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
E++S +P +YH+F++ E E + +LA P +R+S V + + D R+ S +L
Sbjct: 313 ELVSLQPYVVLYHDFVTDTEAEDIKSLAHPGLRRSVVAAGEKQATADYRI--SKSAWLKG 370
Query: 138 GRDKIIRDIEKRIADFTFFPLEN--GEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNG 191
I+ +++RI+ T +++ GE LQV++Y G YEPHFD+ F K G
Sbjct: 371 SAQSIVGKLDQRISLLTGLNVKHPYGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKLKTG 430
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
R+AT ++YLS VE GG T F A ++ V A+ +W+
Sbjct: 431 N-RVATFMIYLSPVEAGGSTAFIYANFSVPVVE-------------------KAAIFWWN 470
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ + D +LH GCPV+ G+KW + KWI
Sbjct: 471 LHRNGEGDDDTLHAGCPVLIGDKWVANKWIH 501
>gi|397620233|gb|EJK65613.1| hypothetical protein THAOC_13503 [Thalassiosira oceanica]
Length = 643
Score = 110 bits (275), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 82/250 (32%), Positives = 120/250 (48%), Gaps = 48/250 (19%)
Query: 76 WVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVD-SDTGKS--KDSRVRTSSG 132
W+EVIS EPR F NF ++E +++ A +S + S TG S + RTS
Sbjct: 291 WLEVISLEPRVFDVFNFFDRDESAAIVDKALKETSESHRIKRSSTGASGYNVNSQRTSEN 350
Query: 133 TFLARGRDKIIRDIEKR---IADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTK 189
F G K+ + +++R + F + +GLQVL Y Y PH D+ +D++ K
Sbjct: 351 GFDTHG--KVSQAVKRRCMNVLGFDEYEESLTDGLQVLRYNKTTAYIPHLDW-IDDYGKK 407
Query: 190 N---------GGQRMATVLMYLSD--VEEGGETVF----PNAQG---------------- 218
G R AT+L+Y+SD V +GGETVF P Q
Sbjct: 408 QEHNFDSAGLGSNRFATILLYMSDLGVGDGGETVFTSGWPVGQAEEDHVQTNEAIDALRE 467
Query: 219 -----NISAVPWWNE--LSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIK 271
NI W E ++ C ++ L+++P A+LF+S PD + D SS HGGCPVI
Sbjct: 468 SGDVENILTRDSWEEKMVANC-RSRLAVRPHSSRAVLFYSQNPDGTPDRSSKHGGCPVIN 526
Query: 272 GNKWSSTKWI 281
G KW++ W+
Sbjct: 527 GEKWAANLWV 536
>gi|443712762|gb|ELU05926.1| hypothetical protein CAPTEDRAFT_153364 [Capitella teleta]
Length = 491
Score = 110 bits (274), Expect = 8e-22, Method: Compositional matrix adjust.
Identities = 68/218 (31%), Positives = 107/218 (49%), Gaps = 34/218 (15%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EV + +P ++++ +S E +++I A P M + V +S +S D R+ S +L
Sbjct: 276 EVHNVDPHVAIFYDVISDAEADHIIRHAFPGMFRGLVGNSTLRQSSDQRI--SKVGWLFD 333
Query: 138 GRDKIIRDIEKRIADFT-----FFPLENG-EGLQVLHYEAGQKYEPHFDYFMDEFNTKN- 190
D +I+ + RI D T + P+ + E +QV++Y G +YEPH D++ D KN
Sbjct: 334 NVDTLIKKLSARIGDVTGLNTVYTPVRSPVEAMQVVNYGIGGQYEPHLDFYEDPEMLKNV 393
Query: 191 ------GGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMG 244
G R++T L YLS V GG TVFP K + + P
Sbjct: 394 NPSLQDTGDRISTFLFYLSRVHLGGATVFP-------------------KLNVRVPPVKN 434
Query: 245 DALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
A +++ +P+ D +LH GCPV+ G KW + KWIR
Sbjct: 435 GAAFWYNARPNGEHDKRTLHAGCPVVLGEKWVANKWIR 472
>gi|297803562|ref|XP_002869665.1| ShTK domain-containing protein [Arabidopsis lyrata subsp. lyrata]
gi|297315501|gb|EFH45924.1| ShTK domain-containing protein [Arabidopsis lyrata subsp. lyrata]
Length = 290
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 69/202 (34%), Positives = 108/202 (53%), Gaps = 26/202 (12%)
Query: 80 ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
+SW+PR F+Y FLS+EE ++LI+L KD+ TS
Sbjct: 61 LSWQPRVFLYRGFLSEEESDHLISL-----------------RKDTSEVTSGDADGKTQL 103
Query: 140 DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVL 199
D ++ IE++I+ +TF P ENG ++V Y + +K DYF +E ++ +ATV+
Sbjct: 104 DPVVAGIEEKISAWTFLPRENGGSIKVRSYTS-EKSGKKLDYFGEEPSSVLRESLLATVV 162
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
+YLS+ +GGE +FPN++ C + G ++P G+A+LF+S +ASLD
Sbjct: 163 LYLSNTTQGGELLFPNSE--------VKPKKSCSEDGNILRPVKGNAVLFFSRLLNASLD 214
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+S H CPV+KG +TK I
Sbjct: 215 ETSTHLICPVVKGELLVATKLI 236
>gi|297301157|ref|XP_001103971.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Macaca
mulatta]
Length = 512
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 65/200 (32%), Positives = 96/200 (48%), Gaps = 37/200 (18%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
+ I RI D T + E LQV +Y G +YEPHFD+ RM
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDF-----------ARM------- 435
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
SDV GG TVFP + G S+ PK G A+ ++++ D S+
Sbjct: 436 SDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEGDYST 476
Query: 263 LHGGCPVIKGNKWSSTKWIR 282
H CPV+ GNKW S KW+
Sbjct: 477 RHAACPVLVGNKWVSNKWLH 496
>gi|194905410|ref|XP_001981191.1| GG11931 [Drosophila erecta]
gi|190655829|gb|EDV53061.1| GG11931 [Drosophila erecta]
Length = 537
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 65/206 (31%), Positives = 99/206 (48%), Gaps = 20/206 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E S +P +H+ LS + L +A P M++STV G+ K S R S +LA
Sbjct: 324 LEEHSLDPYVASFHDMLSPRKISQLREMAVPRMQRSTVNPRPGGQHKKSAFRVSKNAWLA 383
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG-GQRM 195
+ + + + D T E LQV +Y G YEPH+D+F D + G R+
Sbjct: 384 YEAHPTMAGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPSHYPAAEGNRI 443
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
AT + YLS+VE+GG T FP ++KP++G+ L ++++
Sbjct: 444 ATAIFYLSEVEQGGATAFPF-------------------LDFAVKPQLGNVLFWYNLHRS 484
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWI 281
D + H GCPV+KG+KW WI
Sbjct: 485 LDKDYRTKHAGCPVLKGSKWIGNVWI 510
>gi|198449506|ref|XP_002136910.1| GA26925 [Drosophila pseudoobscura pseudoobscura]
gi|198130637|gb|EDY67468.1| GA26925 [Drosophila pseudoobscura pseudoobscura]
Length = 543
Score = 110 bits (274), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 72/213 (33%), Positives = 105/213 (49%), Gaps = 28/213 (13%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E +S +P +YHN LS EE L N++TP + ++ + D +T K K S VR++ +
Sbjct: 327 MEELSLDPYIVLYHNVLSDEEMARLENMSTPLLHRARIFDKETKKPKISPVRSADEVGIP 386
Query: 137 RGR-----DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNTKN 190
+ +++ I+KRI D T L + +Q L Y G Y PH D+F + ++
Sbjct: 387 NPKLVTEDIQLVECIQKRITDLTGLMLTSMRRIQFLKYGFGGIYVPHHDFFSVHTPTSRL 446
Query: 191 GGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
G R+ATV+ YL+DVE GG T FPN + P A+LFW
Sbjct: 447 HGDRIATVIFYLNDVEHGGATAFPNLD--------------------LVVPTERGAVLFW 486
Query: 251 -SMKPDA-SLDPSSLHGGCPVIKGNKWSSTKWI 281
+M + LD +LHG CPVI G K WI
Sbjct: 487 HNMDGETYDLDYRTLHGACPVIVGTKMVMAGWI 519
>gi|421871431|ref|ZP_16303052.1| 2OG-Fe(II) oxygenase superfamily protein [Brevibacillus
laterosporus GI-9]
gi|372459315|emb|CCF12601.1| 2OG-Fe(II) oxygenase superfamily protein [Brevibacillus
laterosporus GI-9]
Length = 201
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 69/210 (32%), Positives = 106/210 (50%), Gaps = 30/210 (14%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKS--KDSRVRTSSGTFL 135
++++ +P Y + +S E C+ LINLA + +TVV G+S + S VR S +
Sbjct: 6 QLLNQQPFIGCYPSLISSEACQSLINLARGQLTPATVV----GQSGLEVSHVRISELAWF 61
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTK----NG 191
+++++ I K+IA+ P+ E LQV HY AG K+E H D + + K +
Sbjct: 62 CHNYNEVVQSICKQIAEIVEQPIHYAEKLQVAHYGAGGKFEAHLDCYDSQEANKTFLEHS 121
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
GQR+ T ++YL+DV GGET FPN + + + P G L+F +
Sbjct: 122 GQRLYTAILYLNDVVSGGETYFPNLK-------------------IEVSPTTGTLLVFEN 162
Query: 252 MKPDASL-DPSSLHGGCPVIKGNKWSSTKW 280
+PD S+ D SLHG + G KW T W
Sbjct: 163 CQPDTSIPDLRSLHGSKILQSGEKWIGTLW 192
>gi|198449524|ref|XP_002136918.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
gi|198130646|gb|EDY67476.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
Length = 530
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 73/213 (34%), Positives = 108/213 (50%), Gaps = 28/213 (13%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E +S +P VYHN LS E + +A P ++ V + D SK S+VRT+ G ++
Sbjct: 319 MEELSLDPYIVVYHNVLSDAEIAKVERVAEPLLKSIGVGEMDN--SKKSKVRTALGAWIP 376
Query: 137 RGRDKI-----IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFN-TKN 190
I I+ I +RI D T +++G+ +Q++ Y G Y+ HFDY D T+
Sbjct: 377 DKNMHISGWPVIQRIVRRIHDMTGLIIKHGQVVQLIKYGYGGHYDTHFDYLNDSLPITQA 436
Query: 191 GGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
G RMATVL YL+DV+ GG TVFP + L + + G L+++
Sbjct: 437 LGDRMATVLFYLNDVKHGGSTVFPVLK-------------------LKVPSERGKVLVWY 477
Query: 251 SMKPDA-SLDPSSLHGGCPVIKGNKWSSTKWIR 282
+M + LD +LHG CPVI G K + WI
Sbjct: 478 NMHGETHDLDSRTLHGSCPVIDGAKTVLSCWIH 510
>gi|195159160|ref|XP_002020450.1| GL13507 [Drosophila persimilis]
gi|194117219|gb|EDW39262.1| GL13507 [Drosophila persimilis]
Length = 543
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 73/213 (34%), Positives = 105/213 (49%), Gaps = 28/213 (13%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E +S +P +YH+ LS EE L N++TP + ++ V DS K K S RT+ +
Sbjct: 327 MEELSLDPYIVLYHSVLSDEEMARLENMSTPLLHRARVFDSGIRKPKISPARTADEVQIP 386
Query: 137 RGR-----DKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNTKN 190
+ +++ I+KRI D T L + +Q L Y G Y PH D+F + ++
Sbjct: 387 NPKLVAEDIQLVECIQKRITDLTGLMLTSMRRIQFLKYGFGGIYVPHHDFFSVHTPTSRL 446
Query: 191 GGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW 250
G R+ATV+ YL+DVE GG T FPN + P A+LFW
Sbjct: 447 HGDRIATVIFYLNDVEHGGATAFPNLD--------------------LVVPTERGAVLFW 486
Query: 251 -SMKPDA-SLDPSSLHGGCPVIKGNKWSSTKWI 281
+M + LD +LHG CPVI G K T+WI
Sbjct: 487 HNMDGETYDLDYRTLHGACPVIVGTKMVMTRWI 519
>gi|189241578|ref|XP_969458.2| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
putative [Tribolium castaneum]
Length = 515
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 66/206 (32%), Positives = 100/206 (48%), Gaps = 21/206 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE +P ++HN LS E E + LA + + + + + + R S +L
Sbjct: 320 VEQAHLDPDILIFHNVLSDCEIETMKQLAQSRLVTAVFENPHSKQLELFPFRISKVAWLE 379
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ + + +R+A T L E QV++Y G YEPHFD F + G R+
Sbjct: 380 DQEHQHLAVVAQRVAHMTGLTLSTAEEFQVVNYGIGGHYEPHFD-FQSTVDPAIGS-RIE 437
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVL YLSDVE+GG TVFP Q +S+ P+ G A++++++ P
Sbjct: 438 TVLFYLSDVEQGGATVFPEIQ-------------------VSVWPQKGSAVVWFNLHPSG 478
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIR 282
D + H GCPV+ G+KW +TKWI
Sbjct: 479 DGDQRTKHAGCPVLIGSKWIATKWIH 504
>gi|339009924|ref|ZP_08642495.1| 2OG-Fe(II) oxygenase [Brevibacillus laterosporus LMG 15441]
gi|338773194|gb|EGP32726.1| 2OG-Fe(II) oxygenase [Brevibacillus laterosporus LMG 15441]
Length = 201
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 69/210 (32%), Positives = 106/210 (50%), Gaps = 30/210 (14%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKS--KDSRVRTSSGTFL 135
++++ +P Y + +S E C+ LINLA + +TVV G+S + S VR S +
Sbjct: 6 QLLNQQPFIGCYPSLISSEACQSLINLARGQLTPATVV----GQSGLEVSHVRISELAWF 61
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTK----NG 191
+++++ I K+IA+ P+ E LQV HY AG K+E H D + + K +
Sbjct: 62 CHNYNEVVQSICKQIAEIVEQPIHYAEKLQVAHYGAGGKFEAHLDCYDSQEANKPFLEHS 121
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
GQR+ T ++YL+DV GGET FPN + + + P G L+F +
Sbjct: 122 GQRLYTAILYLNDVVSGGETYFPNLK-------------------IEVSPTTGTLLVFEN 162
Query: 252 MKPDASL-DPSSLHGGCPVIKGNKWSSTKW 280
+PD S+ D SLHG + G KW T W
Sbjct: 163 CQPDTSIPDLRSLHGSKILQSGEKWIGTLW 192
>gi|390989473|ref|ZP_10259770.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
pv. punicae str. LMG 859]
gi|372555742|emb|CCF66745.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
pv. punicae str. LMG 859]
Length = 152
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 63/161 (39%), Positives = 88/161 (54%), Gaps = 26/161 (16%)
Query: 127 VRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDE 185
RTS L G+D + + IE RIA +P+++GEGLQVL Y G +Y PH+DYF D
Sbjct: 5 ARTSDSMCLRVGQDALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDA 64
Query: 186 FNT----KNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKP 241
T + GGQR+A+++MYL+ E GG T FP+A +++AV
Sbjct: 65 AGTPILLQAGGQRVASLVMYLNTPERGGATRFPDAHLDVAAV------------------ 106
Query: 242 KMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
G+A+ F +P SLH G PV+ G KW +TKW+R
Sbjct: 107 -KGNAVFFSYDRPHPMT--RSLHAGAPVLTGEKWVATKWLR 144
>gi|198284815|ref|YP_002221136.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 53993]
gi|218668131|ref|YP_002427500.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
gi|198249336|gb|ACH84929.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 53993]
gi|218520344|gb|ACK80930.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
ferrooxidans ATCC 23270]
Length = 213
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 70/200 (35%), Positives = 96/200 (48%), Gaps = 21/200 (10%)
Query: 89 YHNFLSKEECEYLINLAT-PHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIE 147
+ LS +EC LI + + + S VVD + + ++ R S+ + II +I
Sbjct: 17 FKGLLSLDECAELIAIGSVSDAKPSVVVDGASDAAYETPGRCSTVVAPSVDAYPIILEIR 76
Query: 148 KRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD-EFNTKNGGQRMATVLMYLSDVE 206
+RI F+ EN E LQ+LHY G KY+ H+D F D +NGG R+ TVL+YL+DVE
Sbjct: 77 RRIELFSGISQENQEPLQILHYTRGGKYDIHYDAFSDGSPQLRNGGNRLLTVLLYLNDVE 136
Query: 207 EGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGG 266
GG T FP+ NI P G +LF + SLH G
Sbjct: 137 YGGWTQFPHIMANIV-------------------PNAGSGILFRNTDAQNRQLRESLHAG 177
Query: 267 CPVIKGNKWSSTKWIRVNEY 286
PV G KW ++ WIR N Y
Sbjct: 178 LPVTHGEKWIASIWIRENPY 197
>gi|270001038|gb|EEZ97485.1| hypothetical protein TcasGA2_TC011322 [Tribolium castaneum]
Length = 509
Score = 109 bits (273), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 66/206 (32%), Positives = 100/206 (48%), Gaps = 21/206 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE +P ++HN LS E E + LA + + + + + + R S +L
Sbjct: 314 VEQAHLDPDILIFHNVLSDCEIETMKQLAQSRLVTAVFENPHSKQLELFPFRISKVAWLE 373
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
+ + + +R+A T L E QV++Y G YEPHFD F + G R+
Sbjct: 374 DQEHQHLAVVAQRVAHMTGLTLSTAEEFQVVNYGIGGHYEPHFD-FQSTVDPAIGS-RIE 431
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVL YLSDVE+GG TVFP Q +S+ P+ G A++++++ P
Sbjct: 432 TVLFYLSDVEQGGATVFPEIQ-------------------VSVWPQKGSAVVWFNLHPSG 472
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIR 282
D + H GCPV+ G+KW +TKWI
Sbjct: 473 DGDQRTKHAGCPVLIGSKWIATKWIH 498
>gi|415977972|ref|ZP_11559036.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
gi|339834153|gb|EGQ61937.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
[Acidithiobacillus sp. GGI-221]
Length = 215
Score = 109 bits (272), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 70/200 (35%), Positives = 96/200 (48%), Gaps = 21/200 (10%)
Query: 89 YHNFLSKEECEYLINLAT-PHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIE 147
+ LS +EC LI + + + S VVD + + ++ R S+ + II +I
Sbjct: 19 FKGLLSLDECAELIAIGSVSDAKPSVVVDGASDAAYETPGRCSTVVAPSVDAYPIILEIR 78
Query: 148 KRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD-EFNTKNGGQRMATVLMYLSDVE 206
+RI F+ EN E LQ+LHY G KY+ H+D F D +NGG R+ TVL+YL+DVE
Sbjct: 79 RRIELFSGISQENQEPLQILHYTRGGKYDIHYDAFSDGSPQLRNGGNRLLTVLLYLNDVE 138
Query: 207 EGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGG 266
GG T FP+ NI P G +LF + SLH G
Sbjct: 139 YGGWTQFPHIMANIV-------------------PNAGSGILFRNTDAQNRQLRESLHAG 179
Query: 267 CPVIKGNKWSSTKWIRVNEY 286
PV G KW ++ WIR N Y
Sbjct: 180 LPVTHGEKWIASIWIRENPY 199
>gi|194905392|ref|XP_001981188.1| GG11756 [Drosophila erecta]
gi|190655826|gb|EDV53058.1| GG11756 [Drosophila erecta]
Length = 509
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 65/209 (31%), Positives = 107/209 (51%), Gaps = 22/209 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E++S +P ++H+ +S ++ + NLA + ++ V D G K+ RT+ GT+L
Sbjct: 312 MELLSLDPYVVLFHDVVSDQDILSIRNLAKGGLARAVTVTQD-GNDKEDPARTTKGTWLV 370
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMA 196
K+I+ + + D T F + + + QVL+Y G Y HFD F+++ + R+A
Sbjct: 371 EN-SKLIQRLSQLSQDMTNFDVRDADPFQVLNYGIGGFYGTHFD-FLEDTEMGHFSDRIA 428
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
T + YLSDV +GG T FP+ GLS+ P+ G ALL++++
Sbjct: 429 TAVFYLSDVPQGGATTFPD-------------------LGLSVFPEKGAALLWYNLDHKG 469
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
D + H CP I G++W TKWI E
Sbjct: 470 VGDNRTAHSACPTIVGSRWVMTKWINERE 498
>gi|255085194|ref|XP_002505028.1| predicted protein [Micromonas sp. RCC299]
gi|226520297|gb|ACO66286.1| predicted protein [Micromonas sp. RCC299]
Length = 439
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 73/225 (32%), Positives = 103/225 (45%), Gaps = 39/225 (17%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRV--------- 127
V+ +S PR V HNF+SKEE ++++A P + S VV T K D+
Sbjct: 202 VKQVSRHPRLAVIHNFISKEEAAAIVDVAAPELHPSLVVRHQTAKRGDTAGGDTAVHGEA 261
Query: 128 ---RTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-- 182
RTS ++ I+R +R A + E QV+ Y Q+Y+PH D+F
Sbjct: 262 TAGRTSHNCRVSSSH-PIVRAAIQRAAYLCGLEPSHAEPAQVVRYLPSQEYKPHHDWFDR 320
Query: 183 --MDEFNTKN---GGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGL 237
+ F K GGQR T L YL + E GG T FP K
Sbjct: 321 AHPESFRAKTEGRGGQRAVTCLAYLVEPERGGRTYFP-------------------KLRA 361
Query: 238 SIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+PK+GDALL+W++ + + D +LH G PV G KW+ W+R
Sbjct: 362 GFEPKVGDALLWWNVDENGAEDFKTLHAGEPVEAGAKWALNLWLR 406
>gi|390459659|ref|XP_002806656.2| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-2 [Callithrix jacchus]
Length = 579
Score = 109 bits (272), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/222 (30%), Positives = 100/222 (45%), Gaps = 43/222 (19%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 360 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 419
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDY------------------FM-- 183
+ +R+ T ++ E LQV +Y G +YEPHFD+ F+
Sbjct: 420 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 479
Query: 184 ----DEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSI 239
D F G R+AT L Y+SDVE GG TVFP+ G +I
Sbjct: 480 NDERDAFKHLGTGNRVATFLNYMSDVEAGGATVFPD-------------------LGAAI 520
Query: 240 KPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
PK G A+ ++++ D + H CPV+ G KW S KW
Sbjct: 521 WPKKGTAVFWYNLLRSGXGDYRTRHAACPVLVGCKWVSNKWF 562
>gi|291387302|ref|XP_002710242.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
precursor (predicted)-like isoform 2 [Oryctolagus
cuniculus]
gi|217273039|gb|ACK28132.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Oryctolagus cuniculus]
Length = 555
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 68/222 (30%), Positives = 100/222 (45%), Gaps = 43/222 (19%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDY------------------FM-- 183
I +R+ T ++ E LQV +Y G +YEPHFD+ F+
Sbjct: 396 ARINRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455
Query: 184 ----DEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSI 239
D F G R+AT L Y+SDVE GG TVFP+ G +I
Sbjct: 456 NNERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPD-------------------LGAAI 496
Query: 240 KPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
PK G A+ ++++ D + H CPV+ G KW S KW
Sbjct: 497 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWF 538
>gi|167045848|gb|ABZ10515.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Callithrix jacchus]
Length = 555
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/222 (30%), Positives = 100/222 (45%), Gaps = 43/222 (19%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDY------------------FM-- 183
+ +R+ T ++ E LQV +Y G +YEPHFD+ F+
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455
Query: 184 ----DEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSI 239
D F G R+AT L Y+SDVE GG TVFP+ G +I
Sbjct: 456 NDERDAFKHLGTGNRVATFLNYMSDVEAGGATVFPD-------------------LGAAI 496
Query: 240 KPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
PK G A+ ++++ D + H CPV+ G KW S KW
Sbjct: 497 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWF 538
>gi|170649696|gb|ACB21278.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Callicebus moloch]
Length = 555
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/222 (30%), Positives = 100/222 (45%), Gaps = 43/222 (19%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDY------------------FM-- 183
+ +R+ T ++ E LQV +Y G +YEPHFD+ F+
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455
Query: 184 ----DEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSI 239
D F G R+AT L Y+SDVE GG TVFP+ G +I
Sbjct: 456 NDERDAFKHLGTGNRVATFLNYMSDVEAGGATVFPD-------------------LGAAI 496
Query: 240 KPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
PK G A+ ++++ D + H CPV+ G KW S KW
Sbjct: 497 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWF 538
>gi|281348666|gb|EFB24250.1| hypothetical protein PANDA_000722 [Ailuropoda melanoleuca]
Length = 505
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 65/202 (32%), Positives = 97/202 (48%), Gaps = 25/202 (12%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 325 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 384
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF----MDEFNTKNGGQRMATVL 199
+ R+ T ++ E LQV +Y G +YEPHFD+ D F G R+AT L
Sbjct: 385 ARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKNEQDAFKRLGTGNRVATFL 444
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
Y+SDVE GG TVFP+ G +I PK G A+ ++++ D
Sbjct: 445 NYMSDVEAGGATVFPD-------------------LGAAIWPKKGTAVFWYNLLRSGEGD 485
Query: 260 PSSLHGGCPVIKGNKWSSTKWI 281
+ H CPV+ G KW KW+
Sbjct: 486 YRTRHAACPVLVGCKWG--KWL 505
>gi|219124513|ref|XP_002182546.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|217405892|gb|EEC45833.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 193
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 69/210 (32%), Positives = 101/210 (48%), Gaps = 22/210 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V+ +S PRAF NFL+ E ++++ L ++K + + S RTSS T+LA
Sbjct: 1 VKALSCAPRAFQVENFLTDVEADHIVGL----VQKKNDMQRSSTNGHISETRTSSTTWLA 56
Query: 137 RGRDKIIRDIEKRIADF-----TFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG 191
R D +I I +R+AD E LQ++HY GQ+Y H D+ +
Sbjct: 57 RHSDPVIDSIFRRVADTLKMDEAMLHRRINEDLQIVHYGVGQQYTAHHDFGYPK-GDPGS 115
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
R MYL+DV GG+T FP W G L++ PK G A++F+
Sbjct: 116 PSRSINFCMYLNDVPAGGQTSFPR----------WRNAETNG--ALNVVPKKGTAMIFYM 163
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ PD +LD + H PVI+G K+ S WI
Sbjct: 164 VNPDGNLDDLTHHAALPVIEGEKFFSNLWI 193
>gi|195575099|ref|XP_002105517.1| GD17024 [Drosophila simulans]
gi|194201444|gb|EDX15020.1| GD17024 [Drosophila simulans]
Length = 537
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 65/207 (31%), Positives = 98/207 (47%), Gaps = 20/207 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E S +P +H+ LS + L +A P M +STV G+ K S R S +LA
Sbjct: 324 LEEHSLDPYVATFHDMLSPRKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWLA 383
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD-EFNTKNGGQRM 195
+ + + + D T E LQV +Y G YEPH+D+F D G R+
Sbjct: 384 YESHPTMVGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRI 443
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
AT + YLS+VE+GG T FP +++KP++G+ L ++++
Sbjct: 444 ATAIFYLSEVEQGGATAFPFLD-------------------IAVKPQLGNVLFWYNLHRS 484
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D + H GCPV+KG+KW WI
Sbjct: 485 LDKDYRTKHAGCPVLKGSKWIGNVWIH 511
>gi|410632646|ref|ZP_11343301.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
gi|410147883|dbj|GAC20168.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
Length = 480
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/195 (34%), Positives = 101/195 (51%), Gaps = 26/195 (13%)
Query: 91 NFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRI 150
+FL +EC+ LI L + ST+ T ++ D + RTSS L +D +IR I+ +I
Sbjct: 103 DFLLPQECQALIELIEQAKQPSTI----TSENPDQQFRTSSTCHLGNMQDPVIRKIDLQI 158
Query: 151 ADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD-EFNTKNG--GQRMATVLMYLSDVEE 207
+ E +Q HY+ GQ+++PH DYF E G GQR T ++YL++VE+
Sbjct: 159 CQYLGIDPSYSEVIQGQHYQLGQQFKPHTDYFEPYELAHYGGIQGQRTYTFMIYLNEVEQ 218
Query: 208 GGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGC 267
GG+TVFP + + K K G A+++ ++ PD S++ +LH G
Sbjct: 219 GGDTVFP-------------------ELAIGFKAKKGMAVIWNNINPDGSVNYQTLHQGM 259
Query: 268 PVIKGNKWSSTKWIR 282
PV KG K TKW R
Sbjct: 260 PVQKGEKLIITKWFR 274
>gi|126327904|ref|XP_001367838.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Monodelphis
domestica]
Length = 559
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 66/210 (31%), Positives = 99/210 (47%), Gaps = 26/210 (12%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EV+ EP +YH+F+S E + + A P +++S V + K + R S +L
Sbjct: 355 EVLHLEPYIVLYHDFVSDSEAQKIRGFAAPWLQRSVVASGE--KQQQVEYRISKSAWLKD 412
Query: 138 GRDKIIRDIEKRIADFTFFPLE--NGEGLQVLHYEAGQKYEPHFDYFMDE---FNTKNGG 192
D ++ ++ RIA T ++ E LQV++Y G YEPHFD+ N G
Sbjct: 413 TVDPMLVSLDHRIAALTGLNVQPPYAEHLQVVNYGIGGHYEPHFDHATSPSSPLYRMNSG 472
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT ++YLS VE GG T F A ++ V AL +W++
Sbjct: 473 NRVATFMIYLSSVEAGGSTAFIYANFSVPVV-------------------KNAALFWWNL 513
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D +LH GCPV+ G+KW + KWI
Sbjct: 514 HRSGEGDGDTLHAGCPVLVGDKWVANKWIH 543
>gi|405964866|gb|EKC30308.1| KRR1 small subunit processome component-like protein [Crassostrea
gigas]
Length = 885
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/229 (29%), Positives = 106/229 (46%), Gaps = 43/229 (18%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKS--------KDSRVRT 129
EV+++EPR ++H+ +S E+L ++A+ + +STV +TG + K +R
Sbjct: 653 EVVNYEPRIAIFHDVISSTSIEHLKSIASKGLTRSTVFLENTGPNGQVTITYGKQDNIRV 712
Query: 130 SSGTFLARGRDKIIRDIEKRIADFTFFPLE------NGEGLQVLHYEAGQKYEPHFDYF- 182
S ++ + +E RI T E + E QV++Y G Y H DY
Sbjct: 713 SQTCWIRTDEYPELLRLENRIQLITGLSAEYKPVRSHSEKFQVVNYGVGGMYTAHHDYTG 772
Query: 183 ---------MDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECG 233
MD + G RMAT + Y++D + GG TVFP + I
Sbjct: 773 YKLGIISNPMDSEDISTSGDRMATWMFYMNDAKAGGATVFPEVRTRIPVA---------- 822
Query: 234 KTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
G A +++++P + DP +LHGGCPV+ G+KW + KWIR
Sbjct: 823 ---------KGGAAFWFNLRPSGATDPRTLHGGCPVLVGSKWVTNKWIR 862
>gi|198417610|ref|XP_002125349.1| PREDICTED: similar to Prolyl 4-hydroxylase subunit alpha-1
precursor (4-PH alpha-1)
(Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-1) [Ciona intestinalis]
Length = 527
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 69/211 (32%), Positives = 101/211 (47%), Gaps = 25/211 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE +S P +H+ LS + LA P + ++ SD + + R + +L
Sbjct: 312 VEELSKSPDIVQFHDVLSDTVINEIKKLAKPQLFRAIHAGSDDTDLQKAPYRITKLAWLL 371
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-----DEFNTKNG 191
+ I +RI+D T L E +QV +Y G +Y PHFD D+ +++G
Sbjct: 372 DDDGPEVAKITERISDITGLTLNTSEEIQVANYGVGGEYPPHFDIPTTDEERDDLKSQDG 431
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
+R+AT L+YLSDVE GG T F NA G+S KP G A+ +++
Sbjct: 432 -ERIATFLIYLSDVEVGGRTAFVNA-------------------GVSAKPIKGSAVFWYN 471
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ P D + HG CPV GNKW+ KWIR
Sbjct: 472 VFPSGEPDLRTYHGACPVAFGNKWAGNKWIR 502
>gi|38454288|ref|NP_942070.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Rattus norvegicus]
gi|81870816|sp|Q6W3E9.1|P4HA3_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|36962768|gb|AAQ87605.1| collagen prolyl 4-hydroxylase alpha III subunit [Rattus norvegicus]
Length = 544
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 70/210 (33%), Positives = 101/210 (48%), Gaps = 28/210 (13%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EVI P +YH+F+S EE + + LA P +++S V + K R S +L
Sbjct: 340 EVIHLRPLVALYHDFVSDEEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 397
Query: 138 GRDKIIRDIEKRIADFTFFPLE--NGEGLQVLHYEAGQKYEPHFDYFMDE---FNTKNGG 192
D ++ +++RIA T ++ E LQV++Y G YEPHFD+ G
Sbjct: 398 TVDPVLVTLDRRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYKMKSG 457
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW-S 251
R AT+++YLS VE GG T F GN S P + +A LFW +
Sbjct: 458 NRAATLMIYLSSVEAGGATAF--IYGNFSV------------------PVVKNAALFWWN 497
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
+ D +LH GCPV+ G+KW + KWI
Sbjct: 498 LHRSGEGDDDTLHAGCPVLVGDKWVANKWI 527
>gi|195392288|ref|XP_002054791.1| GJ24631 [Drosophila virilis]
gi|194152877|gb|EDW68311.1| GJ24631 [Drosophila virilis]
Length = 499
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 63/214 (29%), Positives = 107/214 (50%), Gaps = 34/214 (15%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E +S +P +YH+ + E E+++ LA PH+R++ V ++ R ++G
Sbjct: 298 LEQLSLDPYMVLYHDVVQANEREHIMQLAKPHLRRALV---GAARAHSQRFAMNAGFSYN 354
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQ--- 193
R + + +R+ D + F L N L VL+Y G +Y H+D + F+ + Q
Sbjct: 355 DSRQG--QRLRQRLEDMSGFDLTNSGQLAVLNYGIGGQYYMHYDCW---FSQDDAAQVAS 409
Query: 194 ----RMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLF 249
R+AT+L+YL+DV+ GG T FP GL+++P G AL++
Sbjct: 410 IKDNRIATILLYLTDVQLGGLTSFP-------------------ALGLAVQPSPGSALIW 450
Query: 250 WSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRV 283
+M A D +LH CP++ G +W +T+WI V
Sbjct: 451 HNMNNAAECDRRTLHAACPLLLGTRWVATQWIDV 484
>gi|145344669|ref|XP_001416850.1| predicted protein [Ostreococcus lucimarinus CCE9901]
gi|144577076|gb|ABO95143.1| predicted protein [Ostreococcus lucimarinus CCE9901]
Length = 225
Score = 108 bits (271), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 77/224 (34%), Positives = 109/224 (48%), Gaps = 36/224 (16%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL- 135
V V+S + FV +FLS+EE + LI +A P M++S V D K S RTS+ TFL
Sbjct: 21 VRVLSNDCLLFVLEDFLSEEEGDQLIEIARPSMQRSRVTDG-----KLSEGRTSTSTFLT 75
Query: 136 -ARGRDKIIRDIEKRIADFTFFPL-----------ENGEGLQVLHYEAGQKYEPHFDYFM 183
AR D ++ +IE+RI PL E +Q++ Y ++Y H+D
Sbjct: 76 GARAHDDLVLEIERRIQAAIRLPLIVERRKNVKVMYQHEPMQIVQYGPTERYTAHYD--- 132
Query: 184 DEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIK--P 241
N +R T + YL + EEGG T FP VP L C T L I+ P
Sbjct: 133 ---NRAGSLKRSMTFMCYLQEPEEGGATFFPKC------VP----LCGCDSTTLGIRVFP 179
Query: 242 KMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
K G A+LFW++ + SLH PV+ G K T+W+ ++E
Sbjct: 180 KRGRAILFWNVGENGQEAMRSLHEAQPVVSGKKAIFTQWLSISE 223
>gi|229368743|gb|ACQ63024.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Dasypus novemcinctus]
Length = 556
Score = 108 bits (270), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 67/222 (30%), Positives = 100/222 (45%), Gaps = 43/222 (19%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 337 PHIVRYYDIMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEENDDPVV 396
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDY------------------FM-- 183
+ +R+ T ++ E LQV +Y G +YEPHFD+ F+
Sbjct: 397 AQVNRRMEHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 456
Query: 184 ----DEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSI 239
D F G R+AT L Y+SDVE GG TVFP+ G +I
Sbjct: 457 NHEQDVFKHLGTGNRVATFLNYMSDVEAGGATVFPD-------------------LGAAI 497
Query: 240 KPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
PK G A+ ++++ D + H CPV+ G KW S KW
Sbjct: 498 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWF 539
>gi|195159150|ref|XP_002020445.1| GL13509 [Drosophila persimilis]
gi|194117214|gb|EDW39257.1| GL13509 [Drosophila persimilis]
Length = 554
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 71/213 (33%), Positives = 105/213 (49%), Gaps = 28/213 (13%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL- 135
+E +S +P VYHN LS E + + P +++S V D K S+ RT+ G +L
Sbjct: 343 MEELSLDPYIVVYHNVLSDAEIAEVERVTEPLLKRSVVFDGKENKMSTSKKRTALGAWLP 402
Query: 136 -----ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNTK 189
GR +I+ I +RI + T + + + +Q++ Y G Y+ HFDYF TK
Sbjct: 403 DDNMDVSGR-AVIQRILRRIHELTGLIMNDRQDMQLIKYGYGGHYDIHFDYFNTSSPITK 461
Query: 190 NGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLF 249
G RMATVL YL+DV+ GG T F + Q L + + G L +
Sbjct: 462 ARGDRMATVLFYLNDVKHGGSTAFTDLQ-------------------LKVPSERGKVLFW 502
Query: 250 WSMKPDA-SLDPSSLHGGCPVIKGNKWSSTKWI 281
++M+ + LD +LHG CPVI G K + WI
Sbjct: 503 YNMRGETHDLDSRTLHGACPVIDGTKSILSCWI 535
>gi|303273602|ref|XP_003056161.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226462245|gb|EEH59537.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 750
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 82/234 (35%), Positives = 113/234 (48%), Gaps = 55/234 (23%)
Query: 88 VYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL--ARGRDKIIRD 145
V+ +FLS EC+ L+ +A P +R+S V D K S RTSS TFL + + ++R
Sbjct: 533 VFDHFLSAVECDDLVAIAAPDLRRSRVTDG-----KLSEGRTSSSTFLTGCKQEEPLVRA 587
Query: 146 IEKRI-------------------------------ADFTFFP--LENGEGLQVLHYEAG 172
IE+R+ + F+ P L+ E +QV+ Y G
Sbjct: 588 IEQRLLRAVQSATLIAAQPNVYDSNERHGQPYRGSTSRFSQRPNLLQGAEPMQVVRYTEG 647
Query: 173 QKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSEC 232
Q Y H+D N + +R AT +MYL+DV GG T FP A VP + C
Sbjct: 648 QMYTAHYD------NKQGCLRRTATFMMYLTDVHSGGATHFPRA------VPV-SMRDGC 694
Query: 233 G-KTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNE 285
G G+ I PK G AL+FWS+ D SLH PVI+G KW +TKW+R +E
Sbjct: 695 GDAAGIRIWPKRGRALVFWSVSGGIE-DVRSLHEAEPVIEGEKWIATKWLREDE 747
>gi|260806889|ref|XP_002598316.1| hypothetical protein BRAFLDRAFT_261183 [Branchiostoma floridae]
gi|229283588|gb|EEN54328.1| hypothetical protein BRAFLDRAFT_261183 [Branchiostoma floridae]
Length = 531
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 69/210 (32%), Positives = 104/210 (49%), Gaps = 28/210 (13%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P ++H+ +S+ E + +A P +S VV D G + R S + D ++
Sbjct: 339 PVIHLFHDIVSESEAARMREMAIPKFHRSVVVGDDGGDAIILN-RVSETAWHFDYDDPVV 397
Query: 144 RDIEKRIADFTFFPLENG--EGLQVLHYEAGQKYEPHFDYFMDEFNTKN--GGQRMATVL 199
+ +R+ T G E QV++Y G +Y PH DYF + T++ G R+ T L
Sbjct: 398 AKLSRRVDYATGLSTAEGTAEAFQVVNYGLGGQYIPHTDYFEGDHVTRHIQNGNRVVTFL 457
Query: 200 MYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLD 259
+YLSDV+ GG TVFP AVP + A +FWSM+ ++
Sbjct: 458 LYLSDVDAGGATVFPIVD---VAVP------------------INSAAVFWSMERSGAVV 496
Query: 260 PSSLHGGCPVIKGNKWSSTKWIRV--NEYK 287
P+SLH GCPV+ G+KW + KWIR NE++
Sbjct: 497 PNSLHAGCPVLIGSKWIANKWIREHGNEFR 526
>gi|281183175|ref|NP_001162504.1| prolyl 4-hydroxylase subunit alpha-2 [Papio anubis]
gi|159461520|gb|ABW96795.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase, alpha
polypeptide II, isoform 1 (predicted) [Papio anubis]
Length = 578
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/222 (30%), Positives = 100/222 (45%), Gaps = 43/222 (19%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 359 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 418
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDY---------------------F 182
+ +R+ T ++ E LQV +Y G +YEPHFD+ +
Sbjct: 419 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 478
Query: 183 MDE---FNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSI 239
DE F G R+AT L Y+SDVE GG TVFP+ G +I
Sbjct: 479 NDERHTFKHLGTGNRVATFLNYMSDVEAGGATVFPD-------------------LGAAI 519
Query: 240 KPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
PK G A+ ++++ D + H CPV+ G KW S KW
Sbjct: 520 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWF 561
>gi|194751829|ref|XP_001958226.1| GF23628 [Drosophila ananassae]
gi|190625508|gb|EDV41032.1| GF23628 [Drosophila ananassae]
Length = 484
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 63/210 (30%), Positives = 109/210 (51%), Gaps = 34/210 (16%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKS--KDSRVRTSSGTF 134
+E +S +P +YHN +S E E + K + + D G + +SR S +
Sbjct: 287 LEEVSLDPYIVLYHNVISDREIEEM---------KGLIDEMDNGWTDLNESREIVSRLVW 337
Query: 135 LARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE---FNTKNG 191
L + + + + RI D T F ++ GLQ+ ++ G +++PH+DYF + N
Sbjct: 338 LTK-ESRFRKRLNLRIRDITGFNVDEIRGLQIANFGVGGQFKPHYDYFTERILRLNNTIL 396
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
G R+A+++ Y+ DV GG+TVFP+ Q +++KP+ G +L +++
Sbjct: 397 GDRIASIIFYVGDVVHGGQTVFPDIQ-------------------IAVKPQKGSSLFWFN 437
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
DA+ DP SLH CPV+ G++W+ TKW+
Sbjct: 438 TFDDATPDPRSLHSVCPVLIGDRWTITKWL 467
>gi|90023340|ref|YP_529167.1| response regulator receiver domain-containing protein
[Saccharophagus degradans 2-40]
gi|89952940|gb|ABD82955.1| 2OG-Fe(II) oxygenase [Saccharophagus degradans 2-40]
Length = 269
Score = 108 bits (270), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 102/211 (48%), Gaps = 31/211 (14%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P + +FL++ E +I A M+++ V G +S RT S ++A +K+
Sbjct: 63 PSVTICEDFLTQAEVFQIIKAAGDKMQRARVSSGKEGI--ESAGRTGSNCWVAHDHNKVT 120
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTK-------NGGQRMA 196
+ KRI+ L+N E QV+HY Q+Y HFD + EFNT+ GGQR+
Sbjct: 121 HALAKRISKLVGISLQNAESFQVIHYGVSQEYSSHFDAW--EFNTERGERCMARGGQRLV 178
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
T L+YL+DV GG T FP EL L ++ K G ++F + P
Sbjct: 179 TCLIYLNDVPAGGGTGFP-------------ELD------LEVQAKKGRMVIFHNCYPGT 219
Query: 257 SL-DPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
+ P SLHGG PV +G KW+ W R +Y
Sbjct: 220 NYRHPHSLHGGLPVEEGEKWAVNLWFREADY 250
>gi|395814850|ref|XP_003780953.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Otolemur
garnettii]
Length = 544
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/210 (31%), Positives = 100/210 (47%), Gaps = 26/210 (12%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EVI EP +YH+F+S E + + LA P +++S V + D R+ S +L
Sbjct: 340 EVIHLEPFVALYHDFVSDSEAQKIRELAEPWLQRSVVASGEKQLQVDYRI--SKSAWLKD 397
Query: 138 GRDKIIRDIEKRIADFTFFPLE--NGEGLQVLHYEAGQKYEPHFDYFMDE---FNTKNGG 192
D ++ ++ RIA T ++ E LQV++Y G YEPHFD+ G
Sbjct: 398 TVDPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 457
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT ++YLS VE GG T F A ++ V AL +W++
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYANFSVPVV-------------------KNAALFWWNL 498
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ D +LH GCPV+ G+KW + KWI
Sbjct: 499 HRNGEGDSDTLHAGCPVLVGDKWVANKWIH 528
>gi|351696981|gb|EHA99899.1| Prolyl 4-hydroxylase subunit alpha-3 [Heterocephalus glaber]
Length = 572
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 67/210 (31%), Positives = 99/210 (47%), Gaps = 26/210 (12%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EVI EP +YH+F+S E + + LA P +++S V + K R S +L
Sbjct: 368 EVIHLEPYVALYHDFVSDPEAQKIRKLAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 425
Query: 138 GRDKIIRDIEKRIADFTFFPLEN--GEGLQVLHYEAGQKYEPHFDYFMDE---FNTKNGG 192
D ++ ++ RIA T +++ E LQV++Y G YEPHFD+ G
Sbjct: 426 TADPVLVTLDHRIAALTGLDVQHPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 485
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT ++YLS VE GG T F A ++ V AL +W++
Sbjct: 486 NRVATFMIYLSSVEAGGATAFIYANFSVPVV-------------------KNAALFWWNL 526
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D +LH GCPV+ G+KW + KWI
Sbjct: 527 HRSGEGDGDTLHAGCPVLVGDKWVANKWIH 556
>gi|432891690|ref|XP_004075614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oryzias
latipes]
Length = 517
Score = 108 bits (269), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 69/216 (31%), Positives = 106/216 (49%), Gaps = 32/216 (14%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EV+S +P +YHNF++ E E + A P +R+S V + + + R+ S +L
Sbjct: 313 EVLSLQPYVVIYHNFITDREAEEIKGFAQPALRRSVVASGENQATVEYRI--SKSAWLKG 370
Query: 138 GRDKIIRDIEKRIADFTFFPLE--NGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNG 191
I+ +++RI+ T + E LQV++Y G YEPHFD+ F K G
Sbjct: 371 SESCIVGKLDQRISMLTGLNVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPVFKLKTG 430
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW- 250
R+AT ++YLS VE GG T F A ++ P + A +FW
Sbjct: 431 N-RVATFMIYLSSVEAGGSTAFIYANFSV--------------------PVLKKAAIFWW 469
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
++ + D +LH GCPV+ G+KW + KW V+EY
Sbjct: 470 NLHRNGRGDAETLHAGCPVLIGDKWVANKW--VHEY 503
>gi|24651420|ref|NP_733374.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
gi|7301952|gb|AAF57058.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
gi|363987308|gb|AEW43896.1| FI16820p1 [Drosophila melanogaster]
Length = 537
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 65/207 (31%), Positives = 98/207 (47%), Gaps = 20/207 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E S +P +H+ LS + L +A P M +STV G+ K S R S +LA
Sbjct: 324 LEEHSLDPYVATFHDILSPGKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWLA 383
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD-EFNTKNGGQRM 195
+ + + + D T E LQV +Y G YEPH+D+F D G R+
Sbjct: 384 YESHPTMVGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRI 443
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
AT + YLS+VE+GG T FP +++KP++G+ L ++++
Sbjct: 444 ATAIFYLSEVEQGGATAFPFLD-------------------IAVKPQLGNVLFWYNLHRS 484
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D + H GCPV+KG+KW WI
Sbjct: 485 LDKDYRTKHAGCPVLKGSKWIGNVWIH 511
>gi|219113719|ref|XP_002186443.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|209583293|gb|ACI65913.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 230
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 75/233 (32%), Positives = 109/233 (46%), Gaps = 43/233 (18%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG--------KSKDSRVR 128
++V+S PRAF NFLS++E E+++ LA+ K + TG ++ R R
Sbjct: 5 LKVLSCAPRAFEIENFLSRQEVEHIVQLASGVDLKLSSTGDITGHKETPKELQTDSRRTR 64
Query: 129 TSSGTFLARGRDKIIRDIEKRIADFTFF---------------------PLENGEGLQVL 167
TS +++ R + II I +R AD PL E LQ++
Sbjct: 65 TSYNSWVPREKSPIIDAIYRRAADVMRIDEALLRHRSDHTEWTNLTSTKPL--AEQLQLV 122
Query: 168 HYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWN 227
HY GQ+Y H D+ + + G R T+L+YL++ GGET FP W+
Sbjct: 123 HYGPGQEYTAHHDFGFSRIDDQFQGARFGTLLLYLNEGMTGGETSFPR----------WS 172
Query: 228 ELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKW 280
+ LSIKP++G A+LF+S PD +LD S H PV G KW W
Sbjct: 173 NAETFHE--LSIKPEVGKAVLFYSQLPDGNLDDLSHHAAKPVTDGEKWLINLW 223
>gi|195055775|ref|XP_001994788.1| GH17428 [Drosophila grimshawi]
gi|193892551|gb|EDV91417.1| GH17428 [Drosophila grimshawi]
Length = 540
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 63/211 (29%), Positives = 100/211 (47%), Gaps = 23/211 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E + +P H+ +S EE L LA P +++S V + + R S GTF
Sbjct: 324 LEELHLDPYVIQVHDIISAEETIVLQQLARPELQRSMVYSLSNSEHISTNFRISQGTFFE 383
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 192
I++ + + + + + + + E LQV +Y G YEPH D F + NT
Sbjct: 384 YHEHPIMQRMSQHLENISGLDMRSAEQLQVANYGIGGHYEPHMDSFSENHNYGINTYMST 443
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT + YLS+VE GG T FP +P L ++P+ G L ++++
Sbjct: 444 NRVATGIYYLSNVEAGGGTAFP-------FLP------------LLVEPERGSLLFWYNL 484
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIRV 283
LD + H GCPV+ G+KW + WIR+
Sbjct: 485 HRSGDLDYRTKHAGCPVLMGSKWIANVWIRL 515
>gi|323455897|gb|EGB11765.1| hypothetical protein AURANDRAFT_52419 [Aureococcus anophagefferens]
Length = 478
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 78/243 (32%), Positives = 112/243 (46%), Gaps = 45/243 (18%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
V +S P+ F F+ E E LI P ++ S V G+S D + RTS+ +
Sbjct: 150 VTTLSMRPQVFRISQFMMGHETEKLIERNKPRIKPSEV--GLVGRSGD-KTRTSTNAWDT 206
Query: 137 RGRDKIIRDIEKRIADFTFFPLENG----EGLQVLHYEAGQKYEPHFDYFM--------- 183
+ RD+ R F ++ +GLQVLHYE Q Y+PH DYF
Sbjct: 207 A--SPVARDVIGRA--FRLLKIDAHRKLEDGLQVLHYERPQWYKPHVDYFTSRNAGGGGA 262
Query: 184 --DEFN-----TKNGGQRMATVLMYLSDVEEGGETVFPNA------------QGNISAVP 224
D F+ NG R ATV +YL++ GGETVFP + Q + P
Sbjct: 263 SEDAFSNAIPTANNGTNRFATVFLYLNNAGSGGETVFPLSTTHEIYQGGRLTQAGTNRTP 322
Query: 225 WWNE------LSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSST 278
+ + + L + P+ GD++LF+S + DASLD SLHG CP+ G KW++
Sbjct: 323 GFIRDADAAWVCDTKSEALRVTPRTGDSVLFYSQRGDASLDGYSLHGSCPMGDGEKWAAN 382
Query: 279 KWI 281
W+
Sbjct: 383 LWV 385
>gi|227553849|gb|ACP40552.1| IP22178p [Drosophila melanogaster]
Length = 467
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 65/207 (31%), Positives = 98/207 (47%), Gaps = 20/207 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E S +P +H+ LS + L +A P M +STV G+ K S R S +LA
Sbjct: 254 LEEHSLDPYVATFHDILSPGKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWLA 313
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD-EFNTKNGGQRM 195
+ + + + D T E LQV +Y G YEPH+D+F D G R+
Sbjct: 314 YESHPTMVGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRI 373
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
AT + YLS+VE+GG T FP +++KP++G+ L ++++
Sbjct: 374 ATAIFYLSEVEQGGATAFPFLD-------------------IAVKPQLGNVLFWYNLHRS 414
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D + H GCPV+KG+KW WI
Sbjct: 415 LDKDYRTKHAGCPVLKGSKWIGNVWIH 441
>gi|298712929|emb|CBJ26831.1| conserved unknown protein [Ectocarpus siliculosus]
Length = 294
Score = 108 bits (269), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 75/210 (35%), Positives = 103/210 (49%), Gaps = 34/210 (16%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P +V +F S EC+ LI LA +M S VV + G+ +SR TSS FLAR + +
Sbjct: 101 PPLYVVDDFFSGPECDALIALAGNYMIVSPVVGAGAGEVSESR--TSSSCFLAR---EDL 155
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNT-------KNGGQRMA 196
+ ++ T P+E+ E QV Y QKY H+D F + NT +NGGQR+
Sbjct: 156 PTVCHKVMALTGKPIEHLELPQVGRYYTSQKYANHWDAF--DLNTEDGRRFAQNGGQRVC 213
Query: 197 TVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDA 256
TVL+YL+DV GG T FP + G+ ++P+ G A++F+ D
Sbjct: 214 TVLVYLNDVPSGGCTAFP-------------------QLGMKVQPRKGMAVVFFPATLDG 254
Query: 257 SLDPSSLHGGCPVIKGNKWSSTKWIRVNEY 286
LD LH P I KW S WIR Y
Sbjct: 255 VLDSRLLHAAEPAID-TKWVSQIWIRQGAY 283
>gi|197215651|gb|ACH53042.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Otolemur garnettii]
Length = 555
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 67/222 (30%), Positives = 99/222 (44%), Gaps = 43/222 (19%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDY------------------FM-- 183
+ R+ T ++ E LQV +Y G +YEPHFD+ F+
Sbjct: 396 ARVNHRMQHITGLSVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRVATFLNY 455
Query: 184 ----DEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSI 239
D F G R+AT L Y+SDVE GG TVFP+ G +I
Sbjct: 456 NHERDAFKRLGTGNRVATFLNYMSDVEAGGATVFPD-------------------LGAAI 496
Query: 240 KPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
PK G A+ ++++ D + H CPV+ G KW S KW
Sbjct: 497 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWF 538
>gi|440899661|gb|ELR50930.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Bos grunniens mutus]
Length = 478
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 100/211 (47%), Gaps = 28/211 (13%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EVI EP +YH+F+S E + + LA P +++S V + K R S +L
Sbjct: 274 EVIHLEPYVVLYHDFVSDAEAQTIRGLAEPWLQRSVVASGE--KQLPVEYRISKSAWLKD 331
Query: 138 GRDKIIRDIEKRIADFTFFPLE--NGEGLQVLHYEAGQKYEPHFDYFMDE---FNTKNGG 192
D ++ ++ RIA T ++ E LQV++Y G YEPHFD+ N G
Sbjct: 332 TVDPVLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMNSG 391
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW-S 251
R+AT ++YLS VE GG T F GN S P + +A LFW +
Sbjct: 392 NRVATFMIYLSSVEAGGATAF--IYGNFSV------------------PVVKNAALFWWN 431
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ D +LH CPV+ G+KW + KWI
Sbjct: 432 LHRSGEGDGDTLHAACPVLVGDKWVANKWIH 462
>gi|195341544|ref|XP_002037366.1| GM12151 [Drosophila sechellia]
gi|194131482|gb|EDW53525.1| GM12151 [Drosophila sechellia]
Length = 537
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 64/207 (30%), Positives = 98/207 (47%), Gaps = 20/207 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E S +P +H+ L+ + L +A P M +STV G+ K S R S +LA
Sbjct: 324 LEEHSLDPYVATFHDMLNPRKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWLA 383
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD-EFNTKNGGQRM 195
+ + + + D T E LQV +Y G YEPH+D+F D G R+
Sbjct: 384 YESHPTMVGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRI 443
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
AT + YLS+VE+GG T FP +++KP++G+ L ++++
Sbjct: 444 ATAIFYLSEVEQGGATAFPFLD-------------------IAVKPQLGNVLFWYNLHRS 484
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D + H GCPV+KG+KW WI
Sbjct: 485 LDKDYRTKHAGCPVLKGSKWIGNVWIH 511
>gi|48675383|ref|NP_001001598.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
gi|75053350|sp|Q75UG4.1|P4HA3_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
alpha-3; AltName:
Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
subunit alpha-3; Flags: Precursor
gi|47115494|dbj|BAD18888.1| Collagen prolyl 4-hydroxylase alpha III subunit [Bos taurus]
gi|296479828|tpg|DAA21943.1| TPA: prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
Length = 544
Score = 107 bits (268), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 100/211 (47%), Gaps = 28/211 (13%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EVI EP +YH+F+S E + + LA P +++S V + K R S +L
Sbjct: 340 EVIHLEPYVVLYHDFVSDAEAQTIRGLAEPWLQRSVVASGE--KQLPVEYRISKSAWLKD 397
Query: 138 GRDKIIRDIEKRIADFTFFPLE--NGEGLQVLHYEAGQKYEPHFDYFMDE---FNTKNGG 192
D ++ ++ RIA T ++ E LQV++Y G YEPHFD+ N G
Sbjct: 398 TVDPVLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMNSG 457
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW-S 251
R+AT ++YLS VE GG T F GN S P + +A LFW +
Sbjct: 458 NRVATFMIYLSSVEAGGATAF--IYGNFSV------------------PVVKNAALFWWN 497
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ D +LH CPV+ G+KW + KWI
Sbjct: 498 LHRSGEGDGDTLHAACPVLVGDKWVANKWIH 528
>gi|451927223|gb|AGF85101.1| 4-hydroxylase [Moumouvirus goulette]
Length = 239
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 65/202 (32%), Positives = 102/202 (50%), Gaps = 31/202 (15%)
Query: 87 FVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDI 146
F+ NF++KE+C ++N +S + DS+ K+ +R S ++++ D +++ +
Sbjct: 57 FIIKNFINKEKCGEIMNNT-----QSKLFDSEVISGKNKAIRNSQQCWVSK-YDPMVKSM 110
Query: 147 EKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKN-----GGQRMATVLMY 201
++I+ P++N E LQV+ Y GQ Y H D D + N GGQR TVL+Y
Sbjct: 111 FQKISQQFNIPIQNAEDLQVVRYLPGQYYNEHHDACCDNNDKCNEFISRGGQRCLTVLIY 170
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS-LDP 260
L++ EGG T F N GL +KP+ GDA++F+ + + S P
Sbjct: 171 LNNEFEGGHTFFKN-------------------LGLKVKPETGDAIVFYPLAKNTSKCHP 211
Query: 261 SSLHGGCPVIKGNKWSSTKWIR 282
SLH G PV G KW + W R
Sbjct: 212 LSLHAGMPVTNGEKWIANLWFR 233
>gi|4336512|gb|AAD17844.1| prolyl 4-hydroxylase alpha subunit [Drosophila melanogaster]
Length = 535
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 64/210 (30%), Positives = 100/210 (47%), Gaps = 23/210 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVD-SDTGKSKDSRVRTSSGTFL 135
+E + +P H + ++ + L A P +++STV G S + RTS G
Sbjct: 319 LEELHLDPLVVQLHQVIGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQGASF 378
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG---G 192
R+ + + + + DF+ ++ E LQV +Y G YEPH+D F + + G G
Sbjct: 379 NYSRNAATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSFPENHIYQEGDLHG 438
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
RMAT + YLSDVE GG T FP +P L + P+ G L ++++
Sbjct: 439 NRMATGIYYLSDVEAGGGTAFP-------FLP------------LLVTPERGSLLFWYNL 479
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
P D + H CPV++G+KW + WIR
Sbjct: 480 HPSGDQDFRTKHAACPVLQGSKWIANVWIR 509
>gi|426245942|ref|XP_004016760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Ovis
aries]
Length = 514
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 100/211 (47%), Gaps = 28/211 (13%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EVI EP +YH+F+S E + + LA P +++S V + K R S +L
Sbjct: 310 EVIHLEPYVVLYHDFVSDAEAQKIRGLAEPWLQRSVVASGE--KQLPVEYRISKSAWLKD 367
Query: 138 GRDKIIRDIEKRIADFTFFPLE--NGEGLQVLHYEAGQKYEPHFDYFMDE---FNTKNGG 192
D ++ ++ RIA T ++ E LQV++Y G YEPHFD+ N G
Sbjct: 368 TVDPVLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMNSG 427
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW-S 251
R+AT ++YLS VE GG T F GN S P + +A LFW +
Sbjct: 428 NRVATFMIYLSSVEAGGATAF--IYGNFSV------------------PVVKNAALFWWN 467
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ D +LH CPV+ G+KW + KWI
Sbjct: 468 LHRSGEGDGDTLHAACPVLVGDKWVANKWIH 498
>gi|195069801|ref|XP_001997031.1| GH12975 [Drosophila grimshawi]
gi|193891500|gb|EDV90366.1| GH12975 [Drosophila grimshawi]
Length = 242
Score = 107 bits (268), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 66/230 (28%), Positives = 107/230 (46%), Gaps = 24/230 (10%)
Query: 59 VRKSMERSEGDEGRAEQW-VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
+R + R G + + +E + +P H+ +S EE L LA P +++S V
Sbjct: 7 LRCRLHRGNGLRSSYQPYRLEELHLDPYVIQVHDIISAEETIVLQQLARPELQRSMVYSL 66
Query: 118 DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEP 177
+ + R S GTF I++ + + + + + + + E LQV +Y G YEP
Sbjct: 67 SNSEHISTNFRISQGTFFEYHEHPIMQRMSQHLENISGLDMRSAEQLQVANYGIGGHYEP 126
Query: 178 HFDYFMDE----FNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECG 233
H D F + NT R+AT + YLS+VE GG T FP +P
Sbjct: 127 HMDSFSENHNYGINTYMSTNRVATGIYYLSNVEAGGGTAFP-------FLP--------- 170
Query: 234 KTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRV 283
L ++P+ G L ++++ LD + H GCPV+ G+KW + WIR+
Sbjct: 171 ---LLVEPERGSLLFWYNLHRSGDLDYRTKHAGCPVLMGSKWIANVWIRL 217
>gi|344274276|ref|XP_003408943.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3
[Loxodonta africana]
Length = 516
Score = 107 bits (267), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 67/200 (33%), Positives = 96/200 (48%), Gaps = 33/200 (16%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 334 KPRIVRFHDIISDAEIEVVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
+ I RI D T GL V E QK EP D F G R+AT L Y+
Sbjct: 394 VSRINMRIQDLT--------GLDVSTAEELQKDEP------DAFKELGTGNRIATWLFYM 439
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
SDV GG TVFP+ G S+ PK G A+ ++++ D S+
Sbjct: 440 SDVSAGGATVFPDV-------------------GASVWPKKGTAVFWYNLFASGEGDYST 480
Query: 263 LHGGCPVIKGNKWSSTKWIR 282
H CPV+ GNKW S KW+
Sbjct: 481 RHAACPVLVGNKWVSNKWLH 500
>gi|20269818|gb|AAM18064.1| prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE1
[Drosophila melanogaster]
Length = 286
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 65/207 (31%), Positives = 98/207 (47%), Gaps = 20/207 (9%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E S +P +H+ LS + L +A P M +STV G+ K S R S +LA
Sbjct: 73 LEEHSLDPYVATFHDILSPGKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWLA 132
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMD-EFNTKNGGQRM 195
+ + + + D T E LQV +Y G YEPH+D+F D G R+
Sbjct: 133 YESHPTMVGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRI 192
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
AT + YLS+VE+GG T FP +++KP++G+ L ++++
Sbjct: 193 ATAIFYLSEVEQGGATAFPFLD-------------------IAVKPQLGNVLFWYNLHRS 233
Query: 256 ASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D + H GCPV+KG+KW WI
Sbjct: 234 LDKDYRTKHAGCPVLKGSKWIGNVWIH 260
>gi|215697788|dbj|BAG91981.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 225
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 64/191 (33%), Positives = 105/191 (54%), Gaps = 21/191 (10%)
Query: 97 ECEYLINLATPHMRKS-TVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTF 155
EC++L+++ +M S D D R SS + D ++ IE RI+ ++F
Sbjct: 2 ECDHLVSMGRGNMESSLAFTDGD---------RNSSYNNI---EDIVVSKIEDRISLWSF 49
Query: 156 FPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPN 215
P ENGE +QVL Y + +E + +G R+AT+LMYLSDV++GGETVFP
Sbjct: 50 LPKENGESIQVLKYGVNRS-----GSIKEEPKSSSGAHRLATILMYLSDVKQGGETVFPR 104
Query: 216 AQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKW 275
++ A S+C +G +++P G+A+L ++++PD D S + CPV++G KW
Sbjct: 105 SEMK-DAQAKEGAPSQC--SGYAVRPAKGNAILLFNLRPDGETDKDSQYEECPVLEGEKW 161
Query: 276 SSTKWIRVNEY 286
+ K I + ++
Sbjct: 162 LAIKHINLRKF 172
>gi|195159146|ref|XP_002020443.1| GL13510 [Drosophila persimilis]
gi|194117212|gb|EDW39255.1| GL13510 [Drosophila persimilis]
Length = 527
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 70/214 (32%), Positives = 105/214 (49%), Gaps = 28/214 (13%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL- 135
+E +S +P VYHN LS E + + P +++S V D K S+ RT+ G +L
Sbjct: 316 MEELSLDPYIVVYHNVLSDAEIAEVERVTEPLLKRSVVFDGKENKMSTSKKRTALGAWLP 375
Query: 136 -----ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFN-TK 189
GR +I+ I +RI + T + + + +Q++ Y G Y+ HFDYF TK
Sbjct: 376 DDNMDVSGR-AVIQRIFRRIHELTGLIINDRQDMQLIKYGYGGHYDIHFDYFNTSTPITK 434
Query: 190 NGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLF 249
G RMATVL YL+D++ GG T F + Q L + + G L +
Sbjct: 435 ARGDRMATVLFYLNDMKHGGSTAFTDLQ-------------------LKVPSERGKVLFW 475
Query: 250 WSMKPDA-SLDPSSLHGGCPVIKGNKWSSTKWIR 282
++M+ + LD +LHG CPVI G K + WI
Sbjct: 476 YNMRGETHDLDSRTLHGACPVINGTKTILSCWIH 509
>gi|348555277|ref|XP_003463450.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cavia porcellus]
Length = 584
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 70/211 (33%), Positives = 100/211 (47%), Gaps = 28/211 (13%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EVI EP +YH+F+S E + + LA P +++S V + GK R S +L
Sbjct: 380 EVIHLEPYVALYHDFVSDPEAQKIRELAEPWLQRSVV--ASGGKQLQVEYRISKSAWLKD 437
Query: 138 GRDKIIRDIEKRIADFTFFPLE--NGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNG 191
D ++ + RIA T + E LQV++Y G YEPHFD+ F K+G
Sbjct: 438 TVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLFRMKSG 497
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
R+AT ++YLS VE GG T F A ++ V AL +W+
Sbjct: 498 -NRVATFMIYLSSVEAGGATAFIYANFSVPVV-------------------KNAALFWWN 537
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ D +LH GCPV+ G+KW + KWI
Sbjct: 538 LHRSGEGDGDTLHAGCPVLVGDKWVANKWIH 568
>gi|291404186|ref|XP_002718473.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 3
[Oryctolagus cuniculus]
Length = 516
Score = 107 bits (267), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 96/199 (48%), Gaps = 33/199 (16%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
+ I RI D T GL V E QK EP D F G R+AT L Y+
Sbjct: 394 VSRINMRIQDLT--------GLDVSTAEELQKDEP------DAFKELGTGNRIATWLFYM 439
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
SDV GG TVFP + G S+ PK G A+ ++++ D S+
Sbjct: 440 SDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEGDYST 480
Query: 263 LHGGCPVIKGNKWSSTKWI 281
H CPV+ GNKW S KW+
Sbjct: 481 RHAACPVLVGNKWVSNKWL 499
>gi|395820528|ref|XP_003783616.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Otolemur
garnettii]
Length = 516
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 67/200 (33%), Positives = 96/200 (48%), Gaps = 33/200 (16%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
+ I RI D T GL V E QK EP D F G R+AT L Y+
Sbjct: 394 VSRINMRIQDLT--------GLDVSTAEELQKDEP------DAFKELGTGNRIATWLFYM 439
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
SDV GG TVFP + G S+ PK G A+ ++++ D S+
Sbjct: 440 SDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEGDYST 480
Query: 263 LHGGCPVIKGNKWSSTKWIR 282
H CPV+ GNKW S KW+
Sbjct: 481 RHAACPVLVGNKWVSNKWLH 500
>gi|217272851|ref|NP_001136068.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Homo
sapiens]
gi|114631189|ref|XP_001140871.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 10 [Pan
troglodytes]
Length = 516
Score = 107 bits (267), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 67/200 (33%), Positives = 96/200 (48%), Gaps = 33/200 (16%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
+ I RI D T GL V E QK EP D F G R+AT L Y+
Sbjct: 394 VSRINMRIQDLT--------GLDVSTAEELQKDEP------DAFKELGTGNRIATWLFYM 439
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
SDV GG TVFP + G S+ PK G A+ ++++ D S+
Sbjct: 440 SDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEGDYST 480
Query: 263 LHGGCPVIKGNKWSSTKWIR 282
H CPV+ GNKW S KW+
Sbjct: 481 RHAACPVLVGNKWVSNKWLH 500
>gi|412986386|emb|CCO14812.1| predicted protein [Bathycoccus prasinos]
Length = 337
Score = 107 bits (266), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 71/220 (32%), Positives = 120/220 (54%), Gaps = 34/220 (15%)
Query: 65 RSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD 124
R++ +G W+E +SWEPR FVYHNFLS++E +YL + H + S +D ++ K
Sbjct: 91 RTDASDGNT-PWIEHVSWEPRVFVYHNFLSEKEAKYLRD---AHKKASKAMDDESMK--- 143
Query: 125 SRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAG--QKYE-PHFDY 181
T RG+D I+ IE+R++ F P +GE + + + G ++ E +FD
Sbjct: 144 --------TTFKRGQDPIVNVIEQRLSAFVMLPETHGENMFIEKIKKGYPKRLELLNFDD 195
Query: 182 FMDEFNTKNGGQRMATVLMYLSDVEE--GGETVFP--------NAQGNISAVPWWNELSE 231
D+ + KNGGQR AT ++L+ + E GGE VFP ++ + ++ P S
Sbjct: 196 EKDKEDLKNGGQRFATTALFLNTISEGKGGELVFPLGTERLYDDSNDSYTSTP-----SA 250
Query: 232 C-GKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVI 270
C GK L+++P++GDA++++S + + D +S C +
Sbjct: 251 CAGKYTLAVEPRVGDAVVWFSTHHNGNDDLNSASMRCDAV 290
>gi|324510827|gb|ADY44523.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
Length = 551
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 62/209 (29%), Positives = 101/209 (48%), Gaps = 23/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE++ P A ++H +S EE + LA P + ++TV ++ TG + + R S +L
Sbjct: 322 VEIMRLNPLAVLFHQIMSDEEAHIIEMLAIPKLNRATVQNAMTGGLETASYRISKSAWLK 381
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNGG 192
+++ KR+ T +E E LQ+ +Y G Y+PHFD E F G
Sbjct: 382 PHEHEVVDRFNKRLDMATNLEMETAEELQIQNYGVGGHYDPHFDCARKEEKNAFKELGTG 441
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT+L+Y+++ E GG TVF + +++ C K AL ++++
Sbjct: 442 NRVATILVYMTEPEIGGGTVFTEVKTSVA----------CTKNA---------ALFWYNL 482
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
+D S H CPV+ G KW + KWI
Sbjct: 483 LRSGEVDMRSRHAACPVLTGVKWVTNKWI 511
>gi|195441323|ref|XP_002068462.1| GK20483 [Drosophila willistoni]
gi|194164547|gb|EDW79448.1| GK20483 [Drosophila willistoni]
Length = 550
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 70/215 (32%), Positives = 101/215 (46%), Gaps = 32/215 (14%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E IS +P YH+ LS E E L K T+++ T + S T +A
Sbjct: 341 LEEISLDPYIVQYHDVLSDNEIEDLKREGI----KGTMINGWTSLKSSNATENESRTIVA 396
Query: 137 R-----GRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEF----N 187
R +I++ I +RI D T F +E + +Q+ + G + PH+DY D
Sbjct: 397 RVAIMSPSLEIVQRINRRIIDMTGFNIEESKTIQLAAFSVGGFFMPHYDYLYDRLLDTDV 456
Query: 188 TKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDAL 247
K G R+A+V+ Y DV EGG T FP Q L ++PK G AL
Sbjct: 457 LKKLGDRVASVIFYAGDVTEGGATNFPRNQ-------------------LVVQPKKGSAL 497
Query: 248 LFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+++ D S DP SLH CPV+ G++W+ TKWI
Sbjct: 498 FWYNKFDDGSPDPRSLHSICPVVVGSRWTITKWIH 532
>gi|312599252|gb|ADQ91275.1| hypothetical protein BpV2_108c [Bathycoccus sp. RCC1105 virus BpV2]
Length = 197
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 68/204 (33%), Positives = 101/204 (49%), Gaps = 29/204 (14%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR V N LS++EC+++ N+A+ ++ STV S + D +R S +L D +
Sbjct: 23 KPR--VLKNVLSEDECKHIQNIASKKLQTSTVSKS---RDIDESIRKSETAWLKASEDPV 77
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
+ + ++ T PL N E LQVL Y+ G Y+PH D F D+ N +RM T ++ L
Sbjct: 78 VDKLIRKCVSMTDRPLRNCEDLQVLKYKPGGFYKPHQDTFPDDKN-----KRMYTFIIAL 132
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
+D EGGET FPN + S + + GDAL F ++ + +
Sbjct: 133 NDEYEGGETEFPNIKK-------------------SYRLEKGDALFFNTLNNYECITKKA 173
Query: 263 LHGGCPVIKGNKWSSTKWIRVNEY 286
LHGG PV G KW W+R Y
Sbjct: 174 LHGGTPVKSGEKWVCNLWVRKYHY 197
>gi|194213450|ref|XP_001495951.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Equus
caballus]
Length = 548
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 67/210 (31%), Positives = 98/210 (46%), Gaps = 26/210 (12%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EVI EP +YH+F+S E + + LA P +++S V + K R S +L
Sbjct: 344 EVIHLEPYVVLYHDFVSDSEAQKIRGLAEPWLQRSVVASGE--KQLPVEYRISKSAWLKD 401
Query: 138 GRDKIIRDIEKRIADFTFFPLE--NGEGLQVLHYEAGQKYEPHFDYFMDE---FNTKNGG 192
D ++ ++ RIA T ++ E LQV++Y G YEPHFD+ G
Sbjct: 402 TVDPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPTSPLYRMKSG 461
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT ++YLS VE GG T F A ++ V AL +W++
Sbjct: 462 NRVATFMIYLSSVEAGGATAFIYANFSVPVV-------------------KNAALFWWNL 502
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D +LH GCPV+ G+KW + KWI
Sbjct: 503 HRSGEGDSDTLHAGCPVLVGDKWVANKWIH 532
>gi|195505199|ref|XP_002099401.1| GE23383 [Drosophila yakuba]
gi|194185502|gb|EDW99113.1| GE23383 [Drosophila yakuba]
Length = 535
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 65/210 (30%), Positives = 99/210 (47%), Gaps = 23/210 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVD-SDTGKSKDSRVRTSSGTFL 135
+E + +P H + ++ E L A P +++STV + G S + RTS G
Sbjct: 319 LEELHLDPLLVQLHQVIGAKDSESLQRTARPRIKRSTVYSLAGNGGSTAAAFRTSQGASF 378
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG---G 192
R + + + DF+ +E E LQV +Y G YEPH+D F + + G G
Sbjct: 379 NYSRSAATKLLSHHVGDFSGLNMEYAEDLQVANYGIGGHYEPHWDSFPENHVYQEGDLHG 438
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT + YLSDVE GG T FP +P L + P+ G L ++++
Sbjct: 439 NRIATGIYYLSDVEAGGGTAFP-------FLP------------LLVTPEKGSLLFWYNL 479
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
P D + H CPV++G+KW + WIR
Sbjct: 480 HPSGDQDFRTKHAACPVLQGSKWIANVWIR 509
>gi|441432545|ref|YP_007354587.1| Prolyl 4-hydroxylase [Acanthamoeba polyphaga moumouvirus]
gi|371944705|gb|AEX62527.1| putative prolyl4-hydroxylase [Moumouvirus Monve]
gi|440383625|gb|AGC02151.1| Prolyl 4-hydroxylase [Acanthamoeba polyphaga moumouvirus]
Length = 239
Score = 107 bits (266), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 65/202 (32%), Positives = 102/202 (50%), Gaps = 31/202 (15%)
Query: 87 FVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDI 146
F+ NF++KE+C+ ++N ++ + DS+ K+ +R S ++++ D +++ +
Sbjct: 57 FIIKNFINKEKCKEIMNNT-----QNKLFDSEVISGKNKAIRNSQQCWVSK-YDPMVKSM 110
Query: 147 EKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKN-----GGQRMATVLMY 201
++I+ PLEN E LQV+ Y GQ Y H D D + N GGQR TVL+Y
Sbjct: 111 FQKISQQFNIPLENAEDLQVVRYLPGQYYNEHHDACCDNNDKCNEFISRGGQRCLTVLVY 170
Query: 202 LSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDAS-LDP 260
L++ EGG T F N L +KP+ GDA++F+ + + S P
Sbjct: 171 LNNEFEGGHTFFKN-------------------LNLKVKPETGDAIVFYPLAKNTSKCHP 211
Query: 261 SSLHGGCPVIKGNKWSSTKWIR 282
SLH G PV G KW + W R
Sbjct: 212 LSLHAGMPVTSGEKWIANLWFR 233
>gi|195452742|ref|XP_002073480.1| GK13123 [Drosophila willistoni]
gi|194169565|gb|EDW84466.1| GK13123 [Drosophila willistoni]
Length = 540
Score = 106 bits (265), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 65/209 (31%), Positives = 107/209 (51%), Gaps = 26/209 (12%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
VE ++ +P +HN +S +E + LI +++S V G S S VRTS T+L
Sbjct: 325 VEQLNLDPYVAYFHNVISDDETDDLIEHGMGQVKRSRV--GTVGNSTVSEVRTSQNTWLW 382
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKN-GGQRM 195
+ +++++ R+ D T +E+ E LQ+++Y G YEPH+D+ D+ T G R+
Sbjct: 383 YEQQPWLKNLKLRLEDITGLGMESAEPLQLVNYGIGGHYEPHYDFVEDKVTTFGWKGNRL 442
Query: 196 ATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPD 255
T L+YL++V GG T FP + L++ P G L+++++
Sbjct: 443 LTALLYLNEVPMGGATAFPYLK-------------------LAVPPVKGSLLVWYNLH-- 481
Query: 256 ASLDPS--SLHGGCPVIKGNKWSSTKWIR 282
SLDP + H GCPV+ G+KW +W
Sbjct: 482 RSLDPDFRTKHAGCPVLMGSKWVCNEWFH 510
>gi|24651418|ref|NP_524594.2| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
gi|7301951|gb|AAF57057.1| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
gi|359807686|gb|AEV66559.1| FI17802p1 [Drosophila melanogaster]
Length = 535
Score = 106 bits (265), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 63/210 (30%), Positives = 100/210 (47%), Gaps = 23/210 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVD-SDTGKSKDSRVRTSSGTFL 135
+E + +P H + ++ + L A P +++STV G S + RTS G
Sbjct: 319 LEELHLDPLVVQLHQVIGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQGASF 378
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG---G 192
R+ + + + + DF+ ++ E LQV +Y G YEPH+D F + + G G
Sbjct: 379 NYSRNAATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSFPENHIYQEGDLHG 438
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
RMAT + YL+DVE GG T FP +P L + P+ G L ++++
Sbjct: 439 NRMATGIYYLADVEAGGGTAFP-------FLP------------LLVTPERGSLLFWYNL 479
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
P D + H CPV++G+KW + WIR
Sbjct: 480 HPSGDQDFRTKHAACPVLQGSKWIANVWIR 509
>gi|198449508|ref|XP_002136911.1| GA26875 [Drosophila pseudoobscura pseudoobscura]
gi|198130638|gb|EDY67469.1| GA26875 [Drosophila pseudoobscura pseudoobscura]
Length = 516
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 70/213 (32%), Positives = 104/213 (48%), Gaps = 28/213 (13%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL- 135
+E +S +P VYHN L E + + P +++S V D K S+ RT+ G +L
Sbjct: 305 MEELSLDPYIVVYHNVLCDAEIAEVERVTEPLLKRSVVFDGKENKMSTSKKRTALGAWLP 364
Query: 136 -----ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYF-MDEFNTK 189
GR +I+ I +RI + T + + + +Q++ Y G Y+ HFDYF TK
Sbjct: 365 DDNMDVSGR-AVIQRIFRRIHELTGLIINDRQDMQLIKYGYGGHYDIHFDYFNTSSPITK 423
Query: 190 NGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLF 249
G RMATVL YL+DV+ GG T F + Q L + + G L +
Sbjct: 424 ARGDRMATVLFYLNDVKHGGSTAFTDLQ-------------------LKVPSERGKVLFW 464
Query: 250 WSMKPDA-SLDPSSLHGGCPVIKGNKWSSTKWI 281
++M+ + LD +LHG CPVI G K + WI
Sbjct: 465 YNMRGETHDLDSRTLHGACPVIDGTKTILSCWI 497
>gi|241999340|ref|XP_002434313.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
gi|215496072|gb|EEC05713.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
Length = 267
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 70/229 (30%), Positives = 112/229 (48%), Gaps = 36/229 (15%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+EV+S +PR V+ +FL+ ECE +++ + ++ V + S RT+ +++
Sbjct: 51 IEVLSEDPRIVVFPDFLNPRECEIFRSISQEKLSRAKVYLGGPPEGGFSLRRTNKVAWMS 110
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDY--FMDEFNT--KNGG 192
++ + +RIA T L + E QV +Y G Y PH DY F + K+ G
Sbjct: 111 DDLHPLLGKVSRRIALATGLTLTSAEMYQVANYGLGGHYIPHPDYAGFGEAQGDIYKSSG 170
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT+L+YL+DV GG T F N + L++KP +G AL ++++
Sbjct: 171 NRLATMLIYLADVAGGGATAFINMR-------------------LAVKPTLGTALFWYNL 211
Query: 253 KP-DASL------------DPSSLHGGCPVIKGNKWSSTKWIRVNEYKV 288
KP D + DP + H GCPV+ G+KW TKWI E +
Sbjct: 212 KPYDGPIVNESFWNQRRFGDPRTFHMGCPVLTGSKWIVTKWIHEREQGI 260
>gi|66772633|gb|AAY55628.1| IP02961p [Drosophila melanogaster]
Length = 409
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 63/210 (30%), Positives = 100/210 (47%), Gaps = 23/210 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVD-SDTGKSKDSRVRTSSGTFL 135
+E + +P H + ++ + L A P +++STV G S + RTS G
Sbjct: 193 LEELHLDPLVVQLHQVIGSKDSDSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQGASF 252
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG---G 192
R+ + + + + DF+ ++ E LQV +Y G YEPH+D F + + G G
Sbjct: 253 NYSRNAATKLLSRHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSFPENHIYQEGDLHG 312
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
RMAT + YL+DVE GG T FP +P L + P+ G L ++++
Sbjct: 313 NRMATGIYYLADVEAGGGTAFP-------FLP------------LLVTPERGSLLFWYNL 353
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
P D + H CPV++G+KW + WIR
Sbjct: 354 HPSGDQDFRTKHAACPVLQGSKWIANVWIR 383
>gi|432109537|gb|ELK33711.1| Prolyl 4-hydroxylase subunit alpha-2 [Myotis davidii]
Length = 555
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 66/222 (29%), Positives = 100/222 (45%), Gaps = 43/222 (19%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE + + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIQRIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDY------------------FM-- 183
+ +R+ T ++ E LQV +Y G +YEPHFD+ F+
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455
Query: 184 ----DEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSI 239
D F G R+AT L Y+SDVE GG TVFP+ G +I
Sbjct: 456 NDEQDVFKHLGTGNRVATFLNYMSDVEAGGATVFPD-------------------LGAAI 496
Query: 240 KPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
PK G A+ ++++ D + H CPV+ G KW S KW
Sbjct: 497 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWF 538
>gi|410910256|ref|XP_003968606.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Takifugu
rubripes]
Length = 540
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 68/212 (32%), Positives = 103/212 (48%), Gaps = 30/212 (14%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EV+S P +YH+F+S E E + A +R+S V D K + R S +L
Sbjct: 336 EVLSLRPYVVLYHDFISDSESEEIKQHAQLGLRRSVVATGD--KQATAEYRISKSAWLKG 393
Query: 138 GRDKIIRDIEKRIADFTFFPLEN--GEGLQVLHYEAGQKYEPHFDYFMDE----FNTKNG 191
+ ++++I+ T +++ GE LQV++Y G YEPHFD+ F K G
Sbjct: 394 SAHSTVSRLDQKISMLTGLNVQHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKLKTG 453
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW- 250
R+AT ++YLS VE GG T F A ++ P M +A +FW
Sbjct: 454 -NRVATFMIYLSSVEAGGSTAFIYANFSV--------------------PVMKNAAIFWW 492
Query: 251 SMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
++ + D +LH GCPV+ G+KW + KWI
Sbjct: 493 NLHRNGEGDADTLHAGCPVLIGDKWVANKWIH 524
>gi|403263105|ref|XP_003923900.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
alpha-3, partial [Saimiri boliviensis boliviensis]
Length = 534
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 68/210 (32%), Positives = 96/210 (45%), Gaps = 26/210 (12%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EV+ EP +YH+F+S E + + LA P +++S V + K R S +L
Sbjct: 330 EVLHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 387
Query: 138 GRDKIIRDIEKRIADFTFFPLE--NGEGLQVLHYEAGQKYEPHFDYFMDE---FNTKNGG 192
D ++ + RIA T + E LQV++Y G YEPHFD+ G
Sbjct: 388 TVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 447
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT ++YLS VE GG T F A LS+ AL +W++
Sbjct: 448 NRVATFMIYLSSVEAGGATAFIYA-------------------NLSVPVVKNAALFWWNL 488
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D +LH GCPV+ GNKW + KWI
Sbjct: 489 HRSGEGDSDTLHAGCPVLVGNKWVANKWIH 518
>gi|194765180|ref|XP_001964705.1| GF23331 [Drosophila ananassae]
gi|190614977|gb|EDV30501.1| GF23331 [Drosophila ananassae]
Length = 535
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 67/216 (31%), Positives = 103/216 (47%), Gaps = 23/216 (10%)
Query: 71 GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVD-SDTGKSKDSRVRT 129
G A +E +S EP F H +S + E++ +A P +++STV G S+ + RT
Sbjct: 313 GYAPFKLEELSHEPLVFQVHQVVSSKSAEFIKKMARPKIKRSTVYSIGGGGGSQAAAFRT 372
Query: 130 SSGTFLARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTK 189
S G R+ + + + + D + + E LQV +Y G YEPH+D F +
Sbjct: 373 SQGASFNYSRNAATKILSRHVGDLSSLDMNFAEELQVANYGIGGHYEPHWDSFPENHIYD 432
Query: 190 NG---GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDA 246
G G R+AT + YLSDVE GG T FP +P L + P+ G
Sbjct: 433 EGDDRGNRIATGIYYLSDVEAGGGTAFP-------FLP------------LLVTPEKGSL 473
Query: 247 LLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
L ++++ D + H CPV++G+KW + WIR
Sbjct: 474 LFWYNLHESGDQDYRTKHAACPVLQGSKWIANVWIR 509
>gi|326923465|ref|XP_003207956.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 3
[Meleagris gallopavo]
Length = 518
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 68/199 (34%), Positives = 94/199 (47%), Gaps = 33/199 (16%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR + + +S EE E + LA P + ++TV D +TGK + R S +L+ +
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPV 395
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
+ I RI D T GL V E QK EP D F G R+AT L Y+
Sbjct: 396 VSRINTRIQDLT--------GLDVSTAEELQKDEP------DAFKELGTGNRIATWLFYM 441
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
SDV GG TVFP + G S+ PK G A+ ++++ P D S+
Sbjct: 442 SDVSAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFPSGEGDYST 482
Query: 263 LHGGCPVIKGNKWSSTKWI 281
H CPV+ GNKW S KW+
Sbjct: 483 RHAACPVLVGNKWVSNKWL 501
>gi|219113023|ref|XP_002186095.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
gi|209582945|gb|ACI65565.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
Length = 508
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 71/218 (32%), Positives = 108/218 (49%), Gaps = 33/218 (15%)
Query: 79 VISWEPRAFVYHNFLSKEECEYLINLATPH-MRKSTV-VDSDTGKSKDSRVRTSSGTFLA 136
V+S PR F +FLS E E+L+N+A+ +++ST+ + + + RTS+ ++
Sbjct: 283 VLSCVPRVFEVKDFLSDMEVEHLLNIASKRKLKRSTMHAGGSSEATTNDDTRTSTNDWIP 342
Query: 137 RGRDKIIRDIEKRIAD-------------------FTFFPLENGEGLQVLHYEAGQKYEP 177
R +D I I +R AD FT + E LQ+++Y+ GQ+Y P
Sbjct: 343 RHQDLITDTIYRRAADLLQMDEALLRWRRKSEIPEFTESHISISERLQLVNYQVGQQYTP 402
Query: 178 HFDYFMDEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGL 237
H D+ M R AT+L YL+D +GGET FP W + E G L
Sbjct: 403 HHDFTMPGL-VNMQPSRFATLLFYLNDDMDGGETAFPR---------WLHADEEGG--SL 450
Query: 238 SIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKW 275
+KP+ G A+LF+++ PD + D S H PV +G KW
Sbjct: 451 KVKPEKGKAILFYNLLPDGNYDERSEHAALPVRRGEKW 488
>gi|156398644|ref|XP_001638298.1| predicted protein [Nematostella vectensis]
gi|156225417|gb|EDO46235.1| predicted protein [Nematostella vectensis]
Length = 495
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 63/180 (35%), Positives = 92/180 (51%), Gaps = 24/180 (13%)
Query: 107 PHMRKSTVVDSDTGKSKDSRVRTSSGTFLA-RGRDKIIRDIEKRIADFTFFPLENGEGLQ 165
P + ++TV + TG + + R S +L+ R ++I +E+RIA T LE EG Q
Sbjct: 318 PTLNRATVHNPITGHLETAHYRISKNCWLSGREHGEVIDRVERRIAAMTRLNLETAEGFQ 377
Query: 166 VLHYEAGQKYEPHFDYFMDEFNTKNG----GQRMATVLMYLSDVEEGGETVFPNAQGNIS 221
V +Y +Y+PHFD+ D N+ G G R+ATVL+++S VE GG TVFP
Sbjct: 378 VQNYGLAGQYDPHFDFSRDLANSSLGSLGTGNRIATVLVWMSQVESGGATVFP------- 430
Query: 222 AVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
G I P+ GDA+ + ++ D + H GCPV+ G KW + KWI
Sbjct: 431 ------------YVGARILPQKGDAVFWHNLLRSGDGDFRTRHAGCPVLSGIKWVANKWI 478
>gi|195505207|ref|XP_002099404.1| GE23380 [Drosophila yakuba]
gi|194185505|gb|EDW99116.1| GE23380 [Drosophila yakuba]
Length = 540
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 62/209 (29%), Positives = 103/209 (49%), Gaps = 23/209 (11%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
+E ++ +P H L E + ++ +M++S V +G S + +RTS T+L
Sbjct: 329 IEQLNLDPYVAYVHEVLWDSEIDMIMEHGKGNMKRSMV--GQSGNSTTTEIRTSQNTWLW 386
Query: 137 RGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG--GQR 194
+ + I++R+ D T E+ E LQ+++Y G +YEPHFD+ D+ G G R
Sbjct: 387 YDANPWLAKIKQRLEDVTGLSTESAEPLQLVNYGIGGQYEPHFDFMEDDGQKVFGWKGNR 446
Query: 195 MATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKP 254
+AT L YL+DV GG T FP + L++ P G L+++++
Sbjct: 447 LATALFYLNDVALGGATAFPFLR-------------------LAVPPVKGSLLIWYNLHS 487
Query: 255 DASLDPSSLHGGCPVIKGNKWSSTKWIRV 283
D + H GCPV++G+KW +W V
Sbjct: 488 STHKDFRTKHAGCPVLQGSKWICNEWFHV 516
>gi|426255748|ref|XP_004021510.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Ovis
aries]
Length = 516
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 96/199 (48%), Gaps = 33/199 (16%)
Query: 83 EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
+PR +H+ +S E E + +LA P + ++TV D +TGK ++ R S +L+ + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393
Query: 143 IRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNGGQRMATVLMYL 202
+ I RI D T GL V E QK EP D F G R+AT L Y+
Sbjct: 394 VSRINMRIQDLT--------GLDVSTAEELQKDEP------DAFKELGTGNRIATWLFYM 439
Query: 203 SDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSMKPDASLDPSS 262
SDV GG TVFP + G S+ PK G A+ ++++ D S+
Sbjct: 440 SDVLAGGATVFP-------------------EVGASVWPKKGTAVFWYNLFASGEGDYST 480
Query: 263 LHGGCPVIKGNKWSSTKWI 281
H CPV+ GNKW S KW+
Sbjct: 481 RHAACPVLVGNKWVSNKWL 499
>gi|195575097|ref|XP_002105516.1| GD17035 [Drosophila simulans]
gi|194201443|gb|EDX15019.1| GD17035 [Drosophila simulans]
Length = 535
Score = 106 bits (264), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 64/210 (30%), Positives = 98/210 (46%), Gaps = 23/210 (10%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVD-SDTGKSKDSRVRTSSGTFL 135
+E + +P H + + E L A P +++STV G S + RTS G
Sbjct: 319 LEELHLDPLVVQLHQVIGSNDSESLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQGASF 378
Query: 136 ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFNTKNG---G 192
R+ + + + DF+ ++ E LQV +Y G YEPH+D F + + G G
Sbjct: 379 NYSRNAATKLLSHHVGDFSGLNMDYAEDLQVANYGIGGHYEPHWDSFPENHIYQEGDLHG 438
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT + YLSDVE GG T FP +P L + P+ G L ++++
Sbjct: 439 NRIATGIYYLSDVEAGGGTAFP-------FLP------------LLVTPEKGSLLFWYNL 479
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
P D + H CPV++G+KW + WIR
Sbjct: 480 HPSGDQDFRTKHAACPVLQGSKWIANVWIR 509
>gi|386368303|gb|AFJ06910.1| procollagen-proline dioxygenase [Mytilus galloprovincialis]
Length = 535
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 64/212 (30%), Positives = 108/212 (50%), Gaps = 26/212 (12%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
E + + ++H+ S +E + + +LA P + ++TV D TGK ++ R + +L
Sbjct: 329 EEVYRDANMVLFHDIASDKEMKIIKSLAIPKLFRATVHDPTTGKLIHAKYRITKTAWLD- 387
Query: 138 GRDKIIRD-IEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFM-DEFNT----KNG 191
RD ++ D ++ RI T L++ + LQV +Y G Y+PH+D+ D+ +T K
Sbjct: 388 DRDHLVVDRVQNRIKAVTGLDLDSADALQVANYGIGGHYDPHYDFSTRDDDDTSETEKRD 447
Query: 192 GQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWS 251
G R+AT L+Y++DV+ GG TVFP + + PK G A+ +++
Sbjct: 448 GNRIATFLLYMTDVDAGGATVFP-------------------IIDVRVLPKKGTAVFWYN 488
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIRV 283
++ + H CPV+ G KW S KWIR
Sbjct: 489 LRRSGKGIMETRHAACPVLVGTKWVSNKWIRT 520
>gi|198449504|ref|XP_002136909.1| GA26876 [Drosophila pseudoobscura pseudoobscura]
gi|198130636|gb|EDY67467.1| GA26876 [Drosophila pseudoobscura pseudoobscura]
Length = 527
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 69/213 (32%), Positives = 105/213 (49%), Gaps = 28/213 (13%)
Query: 77 VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL- 135
+E +S +P VYHN LS E + + P +++S V D K S+ RT+ G +L
Sbjct: 316 MEELSLDPYIVVYHNVLSDAEIAEVERVTEPLLKRSVVFDGKGNKMSTSKRRTALGAWLP 375
Query: 136 -----ARGRDKIIRDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDYFMDEFN-TK 189
GR +I+ I +RI + T + + + +Q++ Y G Y+ HFDYF TK
Sbjct: 376 DDNMDVSGR-AVIQRIFRRIHELTGLIINDRQDMQLIKYGYGGHYDIHFDYFNTSTPITK 434
Query: 190 NGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLF 249
G RMATVL YL+D++ GG T F + Q L + + G L +
Sbjct: 435 ARGDRMATVLFYLNDMKHGGSTAFTDLQ-------------------LKVPSERGKVLFW 475
Query: 250 WSMKPDA-SLDPSSLHGGCPVIKGNKWSSTKWI 281
++M+ + +D +LHG CPVI G K + WI
Sbjct: 476 YNMRGETHDVDSRTLHGACPVINGTKTILSCWI 508
>gi|443705944|gb|ELU02240.1| hypothetical protein CAPTEDRAFT_227850 [Capitella teleta]
Length = 475
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 65/208 (31%), Positives = 101/208 (48%), Gaps = 30/208 (14%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
E++ P +++H+F+S E + L ++A P + S V+D G+S R SS F+
Sbjct: 178 ELLHANPEIYLFHDFISDSEIQRLKDMAEPQFQSSAVLDDTGGESFFDVSRLSSTAFVND 237
Query: 138 GRDKIIRDIEKRIADFTFFPLE------NGEGLQVLHYEAGQKYEPHFDYFMDEFN---- 187
D ++ + +R++ T E E LQVL Y G Y PH+D E +
Sbjct: 238 SND-LVASLNRRVSKLTGLQTEVLDSFSESESLQVLRYGPGGLYTPHYDTLGSEADLPPY 296
Query: 188 TKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDAL 247
++ G R+AT ++YL GG TVFP +P +SI + G A
Sbjct: 297 IQHTGDRIATFILYLDIATAGGATVFP-------LLP------------MSIPIQKGAAA 337
Query: 248 LFWSMKPDASLDPSSLHGGCPVIKGNKW 275
++++ PD SLD +LH CPVI+G KW
Sbjct: 338 FWFNLHPDGSLDRRTLHAACPVIRGTKW 365
>gi|332211329|ref|XP_003254773.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Nomascus
leucogenys]
Length = 544
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 68/210 (32%), Positives = 96/210 (45%), Gaps = 26/210 (12%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EVI EP +YH+F+S E + + LA P +++S V + K R S +L
Sbjct: 340 EVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 397
Query: 138 GRDKIIRDIEKRIADFTFFPLE--NGEGLQVLHYEAGQKYEPHFDYFMDE---FNTKNGG 192
D ++ + RIA T + E LQV++Y G YEPHFD+ G
Sbjct: 398 TVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 457
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT ++YLS VE GG T F A LS+ AL +W++
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYAN-------------------LSVPVVRNAALFWWNL 498
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D +LH GCPV+ G+KW + KWI
Sbjct: 499 HRSGEGDSDTLHAGCPVLVGDKWVANKWIH 528
>gi|184185444|gb|ACC68850.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
(predicted) [Rhinolophus ferrumequinum]
Length = 555
Score = 105 bits (263), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 67/222 (30%), Positives = 99/222 (44%), Gaps = 43/222 (19%)
Query: 84 PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
P Y++ +S EE E + +A P + ++TV D TG + R S ++L D ++
Sbjct: 336 PHIVRYYDVMSDEEIEKIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEETEDPVV 395
Query: 144 RDIEKRIADFTFFPLENGEGLQVLHYEAGQKYEPHFDY------------------FM-- 183
+ R+ T ++ E LQV +Y G +YEPHFD+ F+
Sbjct: 396 ARLNLRMQHITGLSVKTAELLQVANYGMGGQYEPHFDFSRRPFDNGLKTEGNRLATFLNY 455
Query: 184 ----DEFNTKNGGQRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSI 239
D F G R+AT L Y+SDVE GG TVFP+ G +I
Sbjct: 456 NDEHDVFKHLGTGNRVATFLNYMSDVEAGGATVFPD-------------------LGAAI 496
Query: 240 KPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWI 281
PK G A+ ++++ D + H CPV+ G KW S KW
Sbjct: 497 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWF 538
>gi|297689698|ref|XP_002822285.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pongo abelii]
Length = 544
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 68/210 (32%), Positives = 96/210 (45%), Gaps = 26/210 (12%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EVI EP +YH+F+S E + + LA P +++S V + K R S +L
Sbjct: 340 EVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 397
Query: 138 GRDKIIRDIEKRIADFTFFPLE--NGEGLQVLHYEAGQKYEPHFDYFMDE---FNTKNGG 192
D ++ + RIA T + E LQV++Y G YEPHFD+ G
Sbjct: 398 TVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 457
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT ++YLS VE GG T F A LS+ AL +W++
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYAN-------------------LSVPVVRNAALFWWNL 498
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D +LH GCPV+ G+KW + KWI
Sbjct: 499 HRSGEGDSDTLHAGCPVLVGDKWVANKWIH 528
>gi|355709028|gb|AES03457.1| prolyl 4-hydroxylase, alpha polypeptide III [Mustela putorius furo]
Length = 477
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 66/210 (31%), Positives = 98/210 (46%), Gaps = 26/210 (12%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EVI EP +YH+F+S E + + LA P +++S V + K R S +L
Sbjct: 274 EVIHLEPYVVLYHDFVSDMEAQKIRGLAEPWLQRSVVASGE--KQLPVEYRISKSAWLKD 331
Query: 138 GRDKIIRDIEKRIADFTFFPLE--NGEGLQVLHYEAGQKYEPHFDYFMDE---FNTKNGG 192
D ++ +++ RI T ++ E LQV++Y G YEPHFD+ G
Sbjct: 332 TVDPLLVNLDHRIGALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPTSPLYRMKSG 391
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFWSM 252
R+AT ++YLS VE GG T F A ++ V AL +W++
Sbjct: 392 NRVATFMIYLSSVEAGGATAFIYANFSVPVV-------------------KNAALFWWNL 432
Query: 253 KPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
D +LH GCPV+ G+KW + KWI
Sbjct: 433 HRSGEGDGDTLHAGCPVLVGDKWVANKWIH 462
>gi|52139015|gb|AAH82538.1| P4ha3 protein [Mus musculus]
Length = 404
Score = 105 bits (262), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 69/211 (32%), Positives = 100/211 (47%), Gaps = 28/211 (13%)
Query: 78 EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
EV+ P +YH+F+S EE + + LA P +++S V + K R S +L
Sbjct: 200 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 257
Query: 138 GRDKIIRDIEKRIADFTFFPLE--NGEGLQVLHYEAGQKYEPHFDYFMDE---FNTKNGG 192
D ++ ++ RIA T ++ E LQV++Y G YEPHFD+ G
Sbjct: 258 TVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 317
Query: 193 QRMATVLMYLSDVEEGGETVFPNAQGNISAVPWWNELSECGKTGLSIKPKMGDALLFW-S 251
R+AT ++YLS VE GG T F GN S P + +A LFW +
Sbjct: 318 NRVATFMIYLSSVEAGGATAF--IYGNFSV------------------PVVKNAALFWWN 357
Query: 252 MKPDASLDPSSLHGGCPVIKGNKWSSTKWIR 282
+ D +LH GCPV+ G+KW + KWI
Sbjct: 358 LHRSGEGDGDTLHAGCPVLVGDKWVANKWIH 388
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.133 0.401
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,771,129,137
Number of Sequences: 23463169
Number of extensions: 200434051
Number of successful extensions: 461228
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1527
Number of HSP's successfully gapped in prelim test: 507
Number of HSP's that attempted gapping in prelim test: 455725
Number of HSP's gapped (non-prelim): 2220
length of query: 288
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 147
effective length of database: 9,050,888,538
effective search space: 1330480615086
effective search space used: 1330480615086
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 76 (33.9 bits)