BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 030651
         (174 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|357483927|ref|XP_003612250.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355513585|gb|AES95208.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 204

 Score =  243 bits (620), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 118/175 (67%), Positives = 143/175 (81%), Gaps = 1/175 (0%)

Query: 1   MAKPRYSRFPTRKSSSS-TLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIV 59
           MAK RYSR P+RKSSS  TLI +L I FTF ILILL FGILS+PSS+ +  K NDL+SIV
Sbjct: 1   MAKSRYSRLPSRKSSSPYTLIFSLFIAFTFLILILLVFGILSIPSSNQNLPKPNDLTSIV 60

Query: 60  RKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDT 119
             +++R++ +EG+ EQWVEV+SWEPRAFVYHNFL+KEECEYLI++A P M KSTVVDS+T
Sbjct: 61  HNTVDRNDDEEGKGEQWVEVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVVDSET 120

Query: 120 GKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           GKSKDSRVRTSSGTFLARGRDKI+R+IEK+IADFTF P+       +  + V +K
Sbjct: 121 GKSKDSRVRTSSGTFLARGRDKIVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQK 175


>gi|357483925|ref|XP_003612249.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355513584|gb|AES95207.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 289

 Score =  242 bits (618), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 118/175 (67%), Positives = 143/175 (81%), Gaps = 1/175 (0%)

Query: 1   MAKPRYSRFPTRKSSSS-TLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIV 59
           MAK RYSR P+RKSSS  TLI +L I FTF ILILL FGILS+PSS+ +  K NDL+SIV
Sbjct: 1   MAKSRYSRLPSRKSSSPYTLIFSLFIAFTFLILILLVFGILSIPSSNQNLPKPNDLTSIV 60

Query: 60  RKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDT 119
             +++R++ +EG+ EQWVEV+SWEPRAFVYHNFL+KEECEYLI++A P M KSTVVDS+T
Sbjct: 61  HNTVDRNDDEEGKGEQWVEVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVVDSET 120

Query: 120 GKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           GKSKDSRVRTSSGTFLARGRDKI+R+IEK+IADFTF P+       +  + V +K
Sbjct: 121 GKSKDSRVRTSSGTFLARGRDKIVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQK 175


>gi|225468574|ref|XP_002263060.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296084059|emb|CBI24447.3| unnamed protein product [Vitis vinifera]
          Length = 288

 Score =  239 bits (609), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 119/174 (68%), Positives = 147/174 (84%)

Query: 1   MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
           MAKPRY RFP RKSSSST++L++L+MF+F +L+LLA G+LS+PS SGDS +ANDLS+IV 
Sbjct: 1   MAKPRYPRFPPRKSSSSTVVLSMLLMFSFVVLVLLALGLLSIPSHSGDSPRANDLSTIVH 60

Query: 61  KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
           +++ERS+G++G+ E W EVISWEPRAFVYHNFLSK+ECEYLI LA PHM+KSTVVDS TG
Sbjct: 61  RTVERSDGNDGKGEPWSEVISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTVVDSSTG 120

Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           KSKDSRVRTSSGTFL RG+DKIIR IEKR++DFTF P+       +  + V +K
Sbjct: 121 KSKDSRVRTSSGTFLTRGQDKIIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQK 174


>gi|147800995|emb|CAN64470.1| hypothetical protein VITISV_014644 [Vitis vinifera]
          Length = 288

 Score =  239 bits (609), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 119/174 (68%), Positives = 147/174 (84%)

Query: 1   MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
           MAKPRY RFP RKSSSST++L++L+MF+F +L+LLA G+LS+PS SGDS +ANDLS+IV 
Sbjct: 1   MAKPRYPRFPPRKSSSSTVVLSMLLMFSFVVLVLLALGLLSIPSHSGDSPRANDLSTIVH 60

Query: 61  KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
           +++ERS+G++G+ E W EVISWEPRAFVYHNFLSK+ECEYLI LA PHM+KSTVVDS TG
Sbjct: 61  RTVERSDGNDGKGEPWSEVISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTVVDSSTG 120

Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           KSKDSRVRTSSGTFL RG+DKIIR IEKR++DFTF P+       +  + V +K
Sbjct: 121 KSKDSRVRTSSGTFLTRGQDKIIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQK 174


>gi|255633460|gb|ACU17088.1| unknown [Glycine max]
          Length = 207

 Score =  236 bits (602), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 120/176 (68%), Positives = 144/176 (81%), Gaps = 2/176 (1%)

Query: 1   MAKPRYSRFPTRKSSSS-TLILTLLIMFTFAILILLAFGILSMPSSS-GDSRKANDLSSI 58
           MAKPRYSR P RKSSSS TLILTL ++FTF +LILLA GILS+PSSS G+  K NDL+SI
Sbjct: 1   MAKPRYSRLPPRKSSSSSTLILTLFLVFTFLVLILLALGILSIPSSSRGNLPKPNDLASI 60

Query: 59  VRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSD 118
            R ++  S+ D+ R EQWVEV+SWEPRAFVYHNFL+KEECEYLI++A P+M KS+VVDS+
Sbjct: 61  ARNTIHTSDDDDVRGEQWVEVVSWEPRAFVYHNFLTKEECEYLIDIAKPNMHKSSVVDSE 120

Query: 119 TGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           TGKSKDSRVRTSSGTFLARGRDKI+RDIEKRIA ++F P+       +  + V +K
Sbjct: 121 TGKSKDSRVRTSSGTFLARGRDKIVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQK 176


>gi|363806698|ref|NP_001242522.1| uncharacterized protein LOC100806046 [Glycine max]
 gi|255647110|gb|ACU24023.1| unknown [Glycine max]
          Length = 289

 Score =  236 bits (601), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 123/176 (69%), Positives = 145/176 (82%), Gaps = 3/176 (1%)

Query: 1   MAKPRYSRFPTRKSSSS-TLILTLLIMFTFAILILLAFGILSMPSSS-GDSRKANDLSSI 58
           MAKPRYSR P RKSSSS TLILTL ++FTF +LILLA GILS+PSSS G+  K NDL+SI
Sbjct: 1   MAKPRYSRLPPRKSSSSSTLILTLFLVFTFLVLILLALGILSIPSSSRGNLPKPNDLASI 60

Query: 59  VRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSD 118
            R ++E S+ DE R EQWVEV+SWEPRAFVYHNFL+KEECEYLI++A P M KSTVVDS+
Sbjct: 61  ARNTIETSDSDE-RGEQWVEVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVVDSE 119

Query: 119 TGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           TGKSKDSRVRTSSGTFLARGRDKI+R+IEK+I+DFTF P+       +  + V +K
Sbjct: 120 TGKSKDSRVRTSSGTFLARGRDKIVRNIEKKISDFTFIPVEHGEGLQVLHYEVGQK 175


>gi|356540840|ref|XP_003538892.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Glycine max]
          Length = 290

 Score =  235 bits (599), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 120/176 (68%), Positives = 144/176 (81%), Gaps = 2/176 (1%)

Query: 1   MAKPRYSRFPTRKSSSS-TLILTLLIMFTFAILILLAFGILSMPSSS-GDSRKANDLSSI 58
           MAKPRYSR P RKSSSS TLILTL ++FTF +LILLA GILS+PSSS G+  K NDL+SI
Sbjct: 1   MAKPRYSRLPPRKSSSSSTLILTLFLVFTFLVLILLALGILSIPSSSRGNLPKPNDLASI 60

Query: 59  VRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSD 118
            R ++  S+ D+ R EQWVEV+SWEPRAFVYHNFL+KEECEYLI++A P+M KS+VVDS+
Sbjct: 61  ARNTIHTSDDDDVRGEQWVEVVSWEPRAFVYHNFLTKEECEYLIDIAKPNMHKSSVVDSE 120

Query: 119 TGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           TGKSKDSRVRTSSGTFLARGRDKI+RDIEKRIA ++F P+       +  + V +K
Sbjct: 121 TGKSKDSRVRTSSGTFLARGRDKIVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQK 176


>gi|240256489|ref|NP_201407.4| iron ion binding / oxidoreductase/ oxidoreductase protein
           [Arabidopsis thaliana]
 gi|332010770|gb|AED98153.1| iron ion binding / oxidoreductase/ oxidoreductase protein
           [Arabidopsis thaliana]
          Length = 289

 Score =  235 bits (599), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 114/174 (65%), Positives = 137/174 (78%)

Query: 1   MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
           MA+PR  R   RKSS STL+  +LIM TF ILILLAFGILS+PS++  S KANDL+SIVR
Sbjct: 2   MARPRNHRPSARKSSHSTLVFAVLIMSTFVILILLAFGILSVPSNNAGSSKANDLTSIVR 61

Query: 61  KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
           K+++RS  D+ + E+WVE+ISWEPRA VYHNFL+KEEC+YLI LA PHM KSTVVD  TG
Sbjct: 62  KTLQRSGEDDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTG 121

Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           KS DSRVRTSSGTFLARGRDK IR+IEKRI+DFTF P+       +  + + +K
Sbjct: 122 KSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQK 175


>gi|10177121|dbj|BAB10411.1| prolyl 4-hydroxylase, alpha subunit-like protein [Arabidopsis
           thaliana]
          Length = 267

 Score =  235 bits (599), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 114/174 (65%), Positives = 137/174 (78%)

Query: 1   MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
           MA+PR  R   RKSS STL+  +LIM TF ILILLAFGILS+PS++  S KANDL+SIVR
Sbjct: 2   MARPRNHRPSARKSSHSTLVFAVLIMSTFVILILLAFGILSVPSNNAGSSKANDLTSIVR 61

Query: 61  KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
           K+++RS  D+ + E+WVE+ISWEPRA VYHNFL+KEEC+YLI LA PHM KSTVVD  TG
Sbjct: 62  KTLQRSGEDDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTG 121

Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           KS DSRVRTSSGTFLARGRDK IR+IEKRI+DFTF P+       +  + + +K
Sbjct: 122 KSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQK 175


>gi|297797785|ref|XP_002866777.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297312612|gb|EFH43036.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 266

 Score =  233 bits (593), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 113/174 (64%), Positives = 136/174 (78%)

Query: 1   MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
           MA+PR  R   RKSS STL+  +LIM TF ILILLAFGILS+PS++  S KANDL+SIVR
Sbjct: 1   MARPRSHRPSARKSSRSTLVFAVLIMSTFVILILLAFGILSVPSNNAGSSKANDLTSIVR 60

Query: 61  KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
           K+++R   D+ + E+WVE+ISWEPRA VYHNFL+KEEC+YLI LA PHM KSTVVD  TG
Sbjct: 61  KTLQRGVEDDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTG 120

Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           KS DSRVRTSSGTFLARGRDK IR+IEKRI+DFTF P+       +  + + +K
Sbjct: 121 KSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQK 174


>gi|334188665|ref|NP_001190630.1| iron ion binding / oxidoreductase/ oxidoreductase protein
           [Arabidopsis thaliana]
 gi|332010771|gb|AED98154.1| iron ion binding / oxidoreductase/ oxidoreductase protein
           [Arabidopsis thaliana]
          Length = 243

 Score =  228 bits (580), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 113/174 (64%), Positives = 135/174 (77%), Gaps = 2/174 (1%)

Query: 1   MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
           MA+PR  R   RKSS STL+  +LIM TF ILILLAFGILS+PS++  S KANDL+SIVR
Sbjct: 2   MARPRNHRPSARKSSHSTLVFAVLIMSTFVILILLAFGILSVPSNNAGSSKANDLTSIVR 61

Query: 61  KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
           K+++RS  D+ + E+WVE+ISWEPRA VYHNFL  EEC+YLI LA PHM KSTVVD  TG
Sbjct: 62  KTLQRSGEDDSKNERWVEIISWEPRASVYHNFL--EECKYLIELAKPHMEKSTVVDEKTG 119

Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           KS DSRVRTSSGTFLARGRDK IR+IEKRI+DFTF P+       +  + + +K
Sbjct: 120 KSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQK 173


>gi|255579590|ref|XP_002530636.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223529809|gb|EEF31744.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 287

 Score =  223 bits (569), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 120/175 (68%), Positives = 145/175 (82%), Gaps = 3/175 (1%)

Query: 1   MAKPRYSRFPTRKSSS-STLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIV 59
           MAK RYSR P RKSSS +T+ILT+L+MFTF ILILLA GILS+PS+SGD  KA+DL++IV
Sbjct: 1   MAKARYSRLPARKSSSPTTMILTMLLMFTFVILILLALGILSVPSNSGD--KAHDLTTIV 58

Query: 60  RKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDT 119
               +  +GD+G+ E+W EVISWEPRAFVYHNFL+KEECEYLINLA P+M+KSTVVDS+T
Sbjct: 59  HNKEQSFDGDDGKGERWAEVISWEPRAFVYHNFLTKEECEYLINLAKPNMQKSTVVDSET 118

Query: 120 GKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           G+SKDSRVRTSSGTFL+RGRDK IRDIEKRIADF+F P+       +  + V +K
Sbjct: 119 GRSKDSRVRTSSGTFLSRGRDKKIRDIEKRIADFSFIPVEHGEGLQVLHYEVGQK 173


>gi|449491267|ref|XP_004158845.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 287

 Score =  218 bits (556), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 122/175 (69%), Positives = 143/175 (81%), Gaps = 3/175 (1%)

Query: 1   MAKPRYSRFPTRKSSSS-TLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIV 59
           MAK R SRFPTRKSSSS TL+ TLLIMFTF ILILLA GILS+P +SG S K +DLSSIV
Sbjct: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60

Query: 60  RKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDT 119
           RK+ +  + DE + EQWVEVISWEPRAFVYHNFL+KEECEYLI+LA PHM+KSTVVDS+T
Sbjct: 61  RKTSD--DVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSET 118

Query: 120 GKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           G+SKDSRVRTSSGTFL RGRDK +R IEKR++DF+F P+       +  + V +K
Sbjct: 119 GQSKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQK 173


>gi|449434114|ref|XP_004134841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 287

 Score =  218 bits (556), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 122/175 (69%), Positives = 143/175 (81%), Gaps = 3/175 (1%)

Query: 1   MAKPRYSRFPTRKSSSS-TLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIV 59
           MAK R SRFPTRKSSSS TL+ TLLIMFTF ILILLA GILS+P +SG S K +DLSSIV
Sbjct: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60

Query: 60  RKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDT 119
           RK+ +  + DE + EQWVEVISWEPRAFVYHNFL+KEECEYLI+LA PHM+KSTVVDS+T
Sbjct: 61  RKTSD--DVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSET 118

Query: 120 GKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           G+SKDSRVRTSSGTFL RGRDK +R IEKR++DF+F P+       +  + V +K
Sbjct: 119 GQSKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQK 173


>gi|18394842|ref|NP_564109.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|9558598|gb|AAF88161.1|AC026234_12 Contains similarity to a prolyl 4-hydroxylase alpha subunit protein
           from Gallus gallus gi|212530 [Arabidopsis thaliana]
 gi|90962978|gb|ABE02413.1| At1g20270 [Arabidopsis thaliana]
 gi|332191835|gb|AEE29956.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 287

 Score =  211 bits (536), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 106/157 (67%), Positives = 124/157 (78%), Gaps = 1/157 (0%)

Query: 1   MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
           MAK R+SRF  RK S+  L+L +L M T  +L+LLAFG+ S+P ++ +S    DLS   R
Sbjct: 1   MAKLRHSRFQARKWSTLMLVLFMLFMLTIVLLMLLAFGVFSLPINNDESSPI-DLSYFRR 59

Query: 61  KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
            + ERSEG   R +QW EV+SWEPRAFVYHNFLSKEECEYLI+LA PHM KSTVVDS+TG
Sbjct: 60  AATERSEGLGKRGDQWTEVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETG 119

Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFP 157
           KSKDSRVRTSSGTFL RGRDKII+ IEKRIAD+TF P
Sbjct: 120 KSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIP 156


>gi|21537370|gb|AAM61711.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
          Length = 287

 Score =  211 bits (536), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 106/157 (67%), Positives = 124/157 (78%), Gaps = 1/157 (0%)

Query: 1   MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
           MAK R+SRF  RK S+  L+L +L M T  +L+LLAFG+ S+P ++ +S    DLS   R
Sbjct: 1   MAKLRHSRFQARKWSTLMLVLFMLFMLTIVLLMLLAFGVFSLPINNDESSPI-DLSYFRR 59

Query: 61  KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
            + ERSEG   R +QW EV+SWEPRAFVYHNFLSKEECEYLI+LA PHM KSTVVDS+TG
Sbjct: 60  AATERSEGLGKRGDQWTEVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETG 119

Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFP 157
           KSKDSRVRTSSGTFL RGRDKII+ IEKRIAD+TF P
Sbjct: 120 KSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIP 156


>gi|297850430|ref|XP_002893096.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297338938|gb|EFH69355.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 287

 Score =  211 bits (536), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 106/157 (67%), Positives = 124/157 (78%), Gaps = 1/157 (0%)

Query: 1   MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
           MAK R+SRF  RK S+  L+L +L M T  +L+LLAFG+ S+P ++ +S    DLS   R
Sbjct: 1   MAKLRHSRFQARKWSTLMLVLFMLFMLTIVLLMLLAFGVFSLPINNDESSPI-DLSYFRR 59

Query: 61  KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
            + ERSEG   R +QW EV+SWEPRAFVYHNFLSKEECEYLI+LA PHM KSTVVDS+TG
Sbjct: 60  AATERSEGLGKRGDQWTEVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETG 119

Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFP 157
           KSKDSRVRTSSGTFL RGRDKII+ IEKRIAD+TF P
Sbjct: 120 KSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIP 156


>gi|224085946|ref|XP_002307750.1| predicted protein [Populus trichocarpa]
 gi|222857199|gb|EEE94746.1| predicted protein [Populus trichocarpa]
          Length = 288

 Score =  202 bits (513), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 105/176 (59%), Positives = 128/176 (72%), Gaps = 4/176 (2%)

Query: 1   MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
           M K R+SR  T+K S+ TL+L++L M T  + ILL  G  S+P SS DS   NDL+S  R
Sbjct: 1   MVKVRHSRLHTKKWSTFTLVLSMLFMLTVVLFILLGLGAFSLPVSSEDS-SPNDLNSYRR 59

Query: 61  KSMERSEGDE--GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSD 118
            + E S+GD    R EQW E++SWEPRAF+YHNFLSKEECEYLINLA PHM KSTVVDS 
Sbjct: 60  IASE-SDGDGMGKREEQWTEILSWEPRAFLYHNFLSKEECEYLINLAKPHMMKSTVVDSK 118

Query: 119 TGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           TG+SKDSRVRTSSG FL RGRD++IR+IEKRIADF+F P+       +  + V +K
Sbjct: 119 TGRSKDSRVRTSSGMFLRRGRDRVIREIEKRIADFSFIPVEHGEGLQVLHYEVGQK 174


>gi|224133600|ref|XP_002327635.1| predicted protein [Populus trichocarpa]
 gi|222836720|gb|EEE75113.1| predicted protein [Populus trichocarpa]
          Length = 291

 Score =  192 bits (489), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 122/177 (68%), Positives = 141/177 (79%), Gaps = 3/177 (1%)

Query: 1   MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDS--RKANDLSSI 58
           MAK RYSR P+RKS SSTLILTLL+MFTF ILILL  GILS+PS+S     R+ANDLSSI
Sbjct: 1   MAKARYSRIPSRKSPSSTLILTLLLMFTFVILILLGLGILSIPSTSSSDSSRQANDLSSI 60

Query: 59  VRKS-MERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
              S ++ S  DEG+AEQW EVISW+PRAFVYHNFL+K ECEYLINLA P M+KSTVVDS
Sbjct: 61  AHHSRIDGSGDDEGKAEQWAEVISWKPRAFVYHNFLTKAECEYLINLAKPRMQKSTVVDS 120

Query: 118 DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
            TGKSKDS+VRTSSGTFL RGRDKI+RDIEKRIADF+F P+       +  + V ++
Sbjct: 121 STGKSKDSKVRTSSGTFLPRGRDKIVRDIEKRIADFSFIPVEHGEGLQILHYEVGQR 177


>gi|225459748|ref|XP_002285898.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Vitis vinifera]
 gi|302141716|emb|CBI18919.3| unnamed protein product [Vitis vinifera]
          Length = 288

 Score =  190 bits (483), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 107/174 (61%), Positives = 133/174 (76%)

Query: 1   MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
           MAK RYSR   ++ S+  L+L+LL+M T  +L+LLA GI+S+P  + DS  ANDLSS  R
Sbjct: 1   MAKGRYSRGHGKRWSTLALVLSLLLMLTVVLLMLLALGIVSLPIGTVDSDAANDLSSFRR 60

Query: 61  KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
           K+ +  EG   R EQW E++SWEPRAF+YHNFLSKEECEY+I+LA P+M+KSTVVDS+TG
Sbjct: 61  KTFDGGEGLGKRGEQWTEIVSWEPRAFIYHNFLSKEECEYMISLAKPYMKKSTVVDSETG 120

Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +SKDSRVRTSSG FL RGRDKIIRDIEKRIADFTF P+       +  + V +K
Sbjct: 121 RSKDSRVRTSSGMFLRRGRDKIIRDIEKRIADFTFIPVEHGEGLQVLHYEVGQK 174


>gi|255539064|ref|XP_002510597.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223551298|gb|EEF52784.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 289

 Score =  189 bits (479), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 107/176 (60%), Positives = 129/176 (73%), Gaps = 3/176 (1%)

Query: 1   MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
           MAK R+SR   RK S+ TL+ ++L M T  +L+LLA GI S+P S+ DS   NDL++  R
Sbjct: 1   MAKMRHSRLQARKMSTLTLVFSMLFMLTVVLLMLLALGIFSLPMSTDDS-PPNDLAASYR 59

Query: 61  K-SMERS-EGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSD 118
           + + ER  +G   R EQW E+ISWEPRAFVYHNFLSKEECEYLI LA PHM KSTVVDS 
Sbjct: 60  RMAAERDYDGLGKRVEQWTEIISWEPRAFVYHNFLSKEECEYLIALAKPHMVKSTVVDSK 119

Query: 119 TGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           TG+SKDSRVRTSSG FL RGRDKIIR+IEKRIADF+F P+       +  + V +K
Sbjct: 120 TGRSKDSRVRTSSGMFLRRGRDKIIRNIEKRIADFSFIPIEHGEGLQVLHYEVGQK 175


>gi|297802350|ref|XP_002869059.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297314895|gb|EFH45318.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 290

 Score =  180 bits (457), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 96/175 (54%), Positives = 123/175 (70%), Gaps = 4/175 (2%)

Query: 3   KPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKS 62
           KP++ R   RKS S T   T+LI+  F ILIL+  GILS+P+++  S +  DL++IV+  
Sbjct: 4   KPKHLRNQPRKSFS-TQAFTVLILGLFVILILVGLGILSLPNTNKSSSRPMDLTTIVQTI 62

Query: 63  MER-SEGDE--GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDT 119
            ER S GDE  G  ++W+EVISWEPRAFVYHNFL+ EECE+LI+LA P M KS VVD  T
Sbjct: 63  EERESYGDEEDGNGDRWLEVISWEPRAFVYHNFLTNEECEHLISLAKPSMVKSKVVDVKT 122

Query: 120 GKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           GKS DSRVRTSSGTFL RG D+I+ +IE RI+DFTF P+       +  + V +K
Sbjct: 123 GKSIDSRVRTSSGTFLKRGHDEIVEEIENRISDFTFIPIENGEGLQVLHYEVGQK 177


>gi|357467085|ref|XP_003603827.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492875|gb|AES74078.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 280

 Score =  175 bits (443), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 88/158 (55%), Positives = 115/158 (72%), Gaps = 8/158 (5%)

Query: 1   MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
           MAK +++   ++    STLIL  L M T  I++LLA GIL +P+++ DS   +      R
Sbjct: 1   MAKGKHTHPRSQVKKLSTLILLTLFMLTLVIIVLLALGILYLPNTTDDSLITD------R 54

Query: 61  KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
           + +  S  +  + EQW E++SWEPRAFVYHNFLSKEECE+LINLA P + KS+VVDS TG
Sbjct: 55  RKIYESLAE--KKEQWTEILSWEPRAFVYHNFLSKEECEHLINLAKPFLAKSSVVDSKTG 112

Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPL 158
           KS +SRVRTSSG FL RG+DKII++IE+RIADFTF P+
Sbjct: 113 KSTESRVRTSSGMFLKRGKDKIIQNIERRIADFTFIPV 150


>gi|222623961|gb|EEE58093.1| hypothetical protein OsJ_08962 [Oryza sativa Japonica Group]
          Length = 387

 Score =  171 bits (432), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 97/180 (53%), Positives = 117/180 (65%), Gaps = 14/180 (7%)

Query: 9   FPTR--KSSSSTLILTLLIMFTFAILILLAFGILSMPSSS------------GDSRKAND 54
           FPTR  ++S   L L  L++ +  +L L+AFG+ S+P S+            GD+  A+ 
Sbjct: 17  FPTRGGRTSPLALALAALLLASALLLALIAFGVFSLPVSAPNAATTDSAAAGGDAEPADP 76

Query: 55  LSSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTV 114
                R   + SEG   R  QW EVISWEPRAFVYHNFLSKEEC+YLI LA PHM KSTV
Sbjct: 77  RPPRTRARRDLSEGLGERGAQWTEVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV 136

Query: 115 VDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           VDS TGKSKDSRVRTSSG FL RGRDK+IR IEKRIAD+TF P+       +  + V +K
Sbjct: 137 VDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQK 196


>gi|224117220|ref|XP_002331751.1| predicted protein [Populus trichocarpa]
 gi|222874448|gb|EEF11579.1| predicted protein [Populus trichocarpa]
          Length = 266

 Score =  170 bits (430), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 103/176 (58%), Positives = 116/176 (65%), Gaps = 26/176 (14%)

Query: 1   MAKPRYSRFPTRKSSSSTLI-LTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIV 59
           MAK RYSR  TRKS SSTLI  +L + F                         NDLSSI 
Sbjct: 1   MAKARYSRISTRKSPSSTLIRKSLNVHF------------------------PNDLSSIA 36

Query: 60  RKS-MERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSD 118
             S +  S  DEG+AEQWVE ISWEPRAF+YHNFL+K EC+YLINLA PHM+KS VVDS 
Sbjct: 37  HNSKIHESGDDEGKAEQWVEAISWEPRAFIYHNFLTKAECDYLINLAKPHMQKSMVVDSS 96

Query: 119 TGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +GKSKDSRVRTSSGTFL RGRDKIIRDIEKRIADF+F P        +  + V +K
Sbjct: 97  SGKSKDSRVRTSSGTFLPRGRDKIIRDIEKRIADFSFIPSEHGEGLQILHYEVGQK 152


>gi|297832394|ref|XP_002884079.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297329919|gb|EFH60338.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 291

 Score =  169 bits (429), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 103/172 (59%), Positives = 125/172 (72%), Gaps = 3/172 (1%)

Query: 5   RYSRFPTRKS-SSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSM 63
           ++ R+  RKS S ST   T+LI+    ILILL  GILS+P+++ +S K NDL++IVRKS 
Sbjct: 7   QHLRYQPRKSVSRSTQAFTVLILLLVVILILLGLGILSLPNANRNSSKTNDLTNIVRKS- 65

Query: 64  ERSEGDE-GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKS 122
           E S GDE G  E+WVEVISWEPRA VYHNFLS EECE+LINLA P M KSTVVD  TG S
Sbjct: 66  ETSYGDEDGNGERWVEVISWEPRAVVYHNFLSNEECEHLINLAKPSMVKSTVVDEKTGGS 125

Query: 123 KDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           KDSRVRTSSGTFL RG D+++  IEKRI+DFTF P+       +  + V +K
Sbjct: 126 KDSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQK 177


>gi|48716447|dbj|BAD23054.1| putative prolyl 4-hydroxylase [Oryza sativa Japonica Group]
          Length = 310

 Score =  169 bits (429), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 97/180 (53%), Positives = 117/180 (65%), Gaps = 14/180 (7%)

Query: 9   FPTR--KSSSSTLILTLLIMFTFAILILLAFGILSMPSSS------------GDSRKAND 54
           FPTR  ++S   L L  L++ +  +L L+AFG+ S+P S+            GD+  A+ 
Sbjct: 17  FPTRGGRTSPLALALAALLLASALLLALIAFGVFSLPVSAPNAATTDSAAAGGDAEPADP 76

Query: 55  LSSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTV 114
                R   + SEG   R  QW EVISWEPRAFVYHNFLSKEEC+YLI LA PHM KSTV
Sbjct: 77  RPPRTRARRDLSEGLGERGAQWTEVISWEPRAFVYHNFLSKEECDYLIGLAKPHMVKSTV 136

Query: 115 VDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           VDS TGKSKDSRVRTSSG FL RGRDK+IR IEKRIAD+TF P+       +  + V +K
Sbjct: 137 VDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPMEHGEGLQVLHYEVGQK 196


>gi|21593091|gb|AAM65040.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
          Length = 291

 Score =  168 bits (426), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 98/171 (57%), Positives = 123/171 (71%), Gaps = 1/171 (0%)

Query: 5   RYSRFPTRKS-SSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSM 63
           ++ R+  RKS S ST   T+LI+    ILILL  GILS+P+++ +S K NDL++IVRKS 
Sbjct: 7   QHLRYQPRKSVSRSTQAFTVLILLLVVILILLGLGILSLPNANRNSSKTNDLTNIVRKSE 66

Query: 64  ERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSK 123
             S  +EG  E+WVEVISWEPRA VYHNFL+ EECE+LI+LA P M KSTVVD  TG SK
Sbjct: 67  TSSGDEEGNGERWVEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSK 126

Query: 124 DSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           DSRVRTSSGTFL RG D+++  IEKRI+DFTF P+       +  + V +K
Sbjct: 127 DSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQK 177


>gi|15227885|ref|NP_179363.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|25411813|pir||F84555 similar to prolyl 4-hydroxylase alpha subunit [imported] -
           Arabidopsis thaliana
 gi|89274129|gb|ABD65585.1| At2g17720 [Arabidopsis thaliana]
 gi|110738861|dbj|BAF01353.1| similar to prolyl 4-hydroxylase alpha subunit [Arabidopsis
           thaliana]
 gi|330251579|gb|AEC06673.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 291

 Score =  168 bits (425), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 98/171 (57%), Positives = 123/171 (71%), Gaps = 1/171 (0%)

Query: 5   RYSRFPTRKS-SSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSM 63
           ++ R+  RKS S ST   T+LI+    ILILL  GILS+P+++ +S K NDL++IVRKS 
Sbjct: 7   QHLRYQPRKSVSRSTQAFTVLILLLVVILILLGLGILSLPNANRNSSKTNDLTNIVRKSE 66

Query: 64  ERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSK 123
             S  +EG  E+WVEVISWEPRA VYHNFL+ EECE+LI+LA P M KSTVVD  TG SK
Sbjct: 67  TSSGDEEGNGERWVEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSK 126

Query: 124 DSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           DSRVRTSSGTFL RG D+++  IEKRI+DFTF P+       +  + V +K
Sbjct: 127 DSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQVLHYQVGQK 177


>gi|357137804|ref|XP_003570489.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
           distachyon]
          Length = 318

 Score =  168 bits (425), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 95/180 (52%), Positives = 111/180 (61%), Gaps = 18/180 (10%)

Query: 12  RKSSSSTLILTLLIMFTFAILILLAFGILSMPSS--------------SGDSRKANDLSS 57
           R S  +  +  LLI   F +L L+AFG+ S+P S              SG + ++    S
Sbjct: 26  RVSPYAVALGALLIASAF-LLALIAFGVFSLPVSAPNLATTAGGGETESGSTEESGGSES 84

Query: 58  IVRKSMER---SEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTV 114
              +S  R   SEG   R  QW EVISWEPRAFVYHNFLSKEECEYLI LA P M KSTV
Sbjct: 85  HSARSRARRDLSEGLGERGAQWTEVISWEPRAFVYHNFLSKEECEYLIGLAKPRMEKSTV 144

Query: 115 VDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           VDS TGKSKDSRVRTSSG FL RGRDK+IR IE+RIAD+TF P        +  + V +K
Sbjct: 145 VDSTTGKSKDSRVRTSSGMFLRRGRDKVIRAIERRIADYTFIPAEHGEGLQVLHYEVGQK 204


>gi|218191856|gb|EEC74283.1| hypothetical protein OsI_09531 [Oryza sativa Indica Group]
          Length = 376

 Score =  168 bits (425), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 86/150 (57%), Positives = 99/150 (66%), Gaps = 12/150 (8%)

Query: 37  FGILSMPSSS------------GDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEP 84
           FG+ S+P S+            GD+  A+      R   + SEG   R  QW EVISWEP
Sbjct: 47  FGVFSLPVSAPNAATTDSAAAGGDAEPADPRPPRTRARRDLSEGLGERGAQWTEVISWEP 106

Query: 85  RAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIR 144
           RAFVYHNFLSKEEC+YLI LA PHM KSTVVDS TGKSKDSRVRTSSG FL RGRDK+IR
Sbjct: 107 RAFVYHNFLSKEECDYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKVIR 166

Query: 145 DIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
            IEKRIAD+TF P+       +  + V +K
Sbjct: 167 AIEKRIADYTFIPMEHGEGLQVLHYEVGQK 196


>gi|449529555|ref|XP_004171765.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 284

 Score =  166 bits (420), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 94/177 (53%), Positives = 117/177 (66%), Gaps = 11/177 (6%)

Query: 1   MAKPRYSRFPTRKSSS---STLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSS 57
           ++K +Y +   RK S+   S +I+ L++   F +LI L F   S P +S      +   S
Sbjct: 3   ISKGKYIKLQGRKWSTFQLSKMIMALVLALGFFMLIALRF--FSPPETS------HHRFS 54

Query: 58  IVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
            VR +   S+G   R +QWVE ISWEPRAFVYHNFLSKEEC YLI+LA PHM KSTVVDS
Sbjct: 55  SVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDS 114

Query: 118 DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
            TG+S DSRVRTSSG FL RG+DKIIR+IEKRIADFTF P+       +  + V +K
Sbjct: 115 KTGESVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQK 171


>gi|356502598|ref|XP_003520105.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 296

 Score =  165 bits (417), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 99/183 (54%), Positives = 128/183 (69%), Gaps = 10/183 (5%)

Query: 1   MAKPRYSRFPTRK---SSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSR---KAND 54
           M K R SR   RK    SS  ++LTLL   +F ILILLA  ILS  +++   R   K ND
Sbjct: 1   MVKGRQSRLGHRKPSRGSSWPVMLTLLATCSFLILILLALPILSNSNANSSGRLIIKPND 60

Query: 55  LSSIVRKS---MERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRK 111
           L+SI   +   +  +E D+   E+WVE+ISWEPR F+YHNFL+KEECE+LIN+A P+MRK
Sbjct: 61  LNSIALNTTTHISEAEYDQ-LGERWVEIISWEPRIFLYHNFLTKEECEHLINIAKPNMRK 119

Query: 112 STVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYV 171
           STV++S+TG S +SRVRTSSGTFLARGRDKI+R+IE RIADFTF P+    +  +  + V
Sbjct: 120 STVIESETGMSIESRVRTSSGTFLARGRDKIVRNIENRIADFTFIPVDNGEELQVLHYQV 179

Query: 172 LKK 174
            +K
Sbjct: 180 GEK 182


>gi|356517655|ref|XP_003527502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
          Length = 290

 Score =  164 bits (415), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 86/157 (54%), Positives = 112/157 (71%), Gaps = 2/157 (1%)

Query: 2   AKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRK 61
            K  ++R   +K S+ +L+L  L   T  +++LLA GI+ +  ++ D     DLS+  RK
Sbjct: 4   GKHTHTRAQGKKWSTFSLVLWALFFLTLILVVLLALGIVYL-PTTDDDFPTTDLSAFRRK 62

Query: 62  SMERSEG-DEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
           + + +E   E   EQW E++SWEPRAF+YHNFLSKEECEYLI LA P M KS+VVDS TG
Sbjct: 63  TSQSAESLVENPPEQWTEILSWEPRAFIYHNFLSKEECEYLIELAKPQMVKSSVVDSKTG 122

Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFP 157
           KS +SRVRTSSG FL RG+DKI+++IEKRIADFTF P
Sbjct: 123 KSTESRVRTSSGMFLKRGKDKIVQNIEKRIADFTFIP 159


>gi|363543371|ref|NP_001241695.1| prolyl 4-hydroxylase 8-5 [Zea mays]
 gi|347978840|gb|AEP37762.1| prolyl 4-hydroxylase 8-5 [Zea mays]
          Length = 307

 Score =  163 bits (412), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 98/177 (55%), Positives = 119/177 (67%), Gaps = 11/177 (6%)

Query: 9   FPTR--KSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDL-------SSIV 59
           FPTR  ++S  T+ LT L++ + A+L L+AFG+ S+P S+ ++             S+ V
Sbjct: 17  FPTRGGRASPYTVALTALLLVSAALLALIAFGVFSLPVSAPNAAATTGTAAGGETESADV 76

Query: 60  RKSMER--SEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
           R    R   EG   R  QW EVISWEPRAFVYHNFLSK+ECEYLI LA PHM KSTVVDS
Sbjct: 77  RPRARRDLGEGLGERGAQWTEVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDS 136

Query: 118 DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
            TGKSKDSRVRTSSG FL RGRDK+IR IEKRIAD+TF P+       +  + V +K
Sbjct: 137 TTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQK 193


>gi|259490206|ref|NP_001159002.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
 gi|195626402|gb|ACG35031.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
 gi|347978830|gb|AEP37757.1| prolyl 4-hydroxylase 8 [Zea mays]
 gi|347978832|gb|AEP37758.1| prolyl 4-hydroxylase 8-1 [Zea mays]
 gi|413939569|gb|AFW74120.1| prolyl 4-hydroxylase alpha-2 subunit isoform 1 [Zea mays]
 gi|413939570|gb|AFW74121.1| prolyl 4-hydroxylase alpha-2 subunit isoform 2 [Zea mays]
 gi|413939571|gb|AFW74122.1| prolyl 4-hydroxylase alpha-2 subunit isoform 3 [Zea mays]
 gi|413939572|gb|AFW74123.1| prolyl 4-hydroxylase alpha-2 subunit isoform 4 [Zea mays]
          Length = 307

 Score =  163 bits (412), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 98/177 (55%), Positives = 119/177 (67%), Gaps = 11/177 (6%)

Query: 9   FPTR--KSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDL-------SSIV 59
           FPTR  ++S  T+ LT L++ + A+L L+AFG+ S+P S+ ++             S+ V
Sbjct: 17  FPTRGGRASPYTVALTALLLVSAALLALIAFGVFSLPVSAPNAAATTGTAAGGETESADV 76

Query: 60  RKSMER--SEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
           R    R   EG   R  QW EVISWEPRAFVYHNFLSK+ECEYLI LA PHM KSTVVDS
Sbjct: 77  RPRARRDLGEGLGERGAQWTEVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDS 136

Query: 118 DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
            TGKSKDSRVRTSSG FL RGRDK+IR IEKRIAD+TF P+       +  + V +K
Sbjct: 137 TTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQK 193


>gi|326489721|dbj|BAK01841.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 315

 Score =  162 bits (409), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 87/154 (56%), Positives = 98/154 (63%), Gaps = 16/154 (10%)

Query: 37  FGILSMP--------------SSSGDSRKANDLSSIVRKSMER--SEGDEGRAEQWVEVI 80
           FG+ S+P              SS G    A++ S   R    R  SEG   R  QW EVI
Sbjct: 48  FGVFSLPVSSPTVPTTGAETESSGGGGEAASESSRPARNRGRRDLSEGLGERGAQWTEVI 107

Query: 81  SWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           SWEPRAFVYHNFLSKEECEYLI LA P M KSTVVDS+TGKSKDSRVRTSSG FL RGRD
Sbjct: 108 SWEPRAFVYHNFLSKEECEYLIELAKPRMVKSTVVDSETGKSKDSRVRTSSGMFLQRGRD 167

Query: 141 KIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           K+IR IE+RIAD+TF P        +  + V +K
Sbjct: 168 KVIRAIERRIADYTFIPAEHGEGLQVLHYEVGQK 201


>gi|242063586|ref|XP_002453082.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
 gi|241932913|gb|EES06058.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
          Length = 307

 Score =  161 bits (408), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 97/177 (54%), Positives = 118/177 (66%), Gaps = 11/177 (6%)

Query: 9   FPTR--KSSSSTLILTLLIMFTFAILILLAFGILSMPSSS---------GDSRKANDLSS 57
           FPTR  ++S   + LT L++ + A+L L+AFG+ S+P S+         G++  A+   +
Sbjct: 17  FPTRGGRASPYAVALTALLLVSAALLALIAFGVFSLPVSAPNAATTGAGGETESADARPA 76

Query: 58  IVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
             R   +  EG   R  QW EVISWEPRAFVYHNFLSKEECEYLI LA PHM KSTVVDS
Sbjct: 77  RPRARRDLGEGLGERGAQWTEVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTVVDS 136

Query: 118 DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
            TGKSKDSRVRTSSG FL RGRDK+IR IEKRIAD+TF P        +  + V +K
Sbjct: 137 TTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPADHGEGLQVLHYEVGQK 193


>gi|363543369|ref|NP_001241694.1| prolyl 4-hydroxylase 8-4 [Zea mays]
 gi|347978838|gb|AEP37761.1| prolyl 4-hydroxylase 8-4 [Zea mays]
          Length = 307

 Score =  161 bits (408), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 97/177 (54%), Positives = 119/177 (67%), Gaps = 11/177 (6%)

Query: 9   FPTR--KSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDL-------SSIV 59
           FPTR  ++S  T+ LT L++ + A+L L+AFG+ S+P S+ ++             S+ V
Sbjct: 17  FPTRGGRASPYTVALTALLLVSAALLALIAFGVFSLPVSAPNAAATTGTAAGGETESADV 76

Query: 60  RKSMER--SEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
           R    R   EG   R  QW EVISWEPRAFVYHNFLSK+ECEYLI LA PHM KSTVVDS
Sbjct: 77  RPRARRDLGEGLGERGAQWTEVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDS 136

Query: 118 DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
            TGKSKDSRVRTSSG FL RGR+K+IR IEKRIAD+TF P+       +  + V +K
Sbjct: 137 TTGKSKDSRVRTSSGMFLQRGRNKVIRAIEKRIADYTFIPVDHGEGLQVLHYEVGQK 193


>gi|413923982|gb|AFW63914.1| hypothetical protein ZEAMMB73_179176 [Zea mays]
          Length = 222

 Score =  161 bits (407), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 96/178 (53%), Positives = 118/178 (66%), Gaps = 12/178 (6%)

Query: 9   FPTR--KSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSR----------KANDLS 56
           FPTR  ++S   + LT L++ + A+L L+AFG+ S+P S+ ++            A+   
Sbjct: 17  FPTRGGRASPYAVALTALLLVSAALLALIAFGVFSLPVSAPNAAATTTAGGETESADTRP 76

Query: 57  SIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVD 116
           +  R   +  EG   R  QW EVISWEPRAFVYHNFLSKEECEYLI LA PHM KSTVVD
Sbjct: 77  ARPRARRDLGEGLGERGAQWTEVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTVVD 136

Query: 117 SDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           S TGKSKDSRVRTSSG FL RGRDK+IR IEKRIAD+TF P+       +  + V +K
Sbjct: 137 STTGKSKDSRVRTSSGMFLQRGRDKVIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQK 194


>gi|226529219|ref|NP_001151238.1| LOC100284871 [Zea mays]
 gi|195645242|gb|ACG42089.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
 gi|347978812|gb|AEP37748.1| prolyl 4-hydroxylase 5 [Zea mays]
 gi|413923983|gb|AFW63915.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
          Length = 308

 Score =  160 bits (406), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 96/178 (53%), Positives = 118/178 (66%), Gaps = 12/178 (6%)

Query: 9   FPTR--KSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSR----------KANDLS 56
           FPTR  ++S   + LT L++ + A+L L+AFG+ S+P S+ ++            A+   
Sbjct: 17  FPTRGGRASPYAVALTALLLVSAALLALIAFGVFSLPVSAPNAAATTTAGGETESADTRP 76

Query: 57  SIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVD 116
           +  R   +  EG   R  QW EVISWEPRAFVYHNFLSKEECEYLI LA PHM KSTVVD
Sbjct: 77  ARPRARRDLGEGLGERGAQWTEVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTVVD 136

Query: 117 SDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           S TGKSKDSRVRTSSG FL RGRDK+IR IEKRIAD+TF P+       +  + V +K
Sbjct: 137 STTGKSKDSRVRTSSGMFLQRGRDKVIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQK 194


>gi|357517881|ref|XP_003629229.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523251|gb|AET03705.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 278

 Score =  157 bits (396), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 92/177 (51%), Positives = 118/177 (66%), Gaps = 16/177 (9%)

Query: 1   MAKPRYSRFPTRKSS---SSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSS 57
           M K R+SR   RK S   S TLI TL + FT  ILILL   I           K N ++S
Sbjct: 1   MVKFRHSRLGPRKPSLTASQTLIFTLFVTFTLLILILLTLRI----------PKLNHINS 50

Query: 58  IVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
           I   ++   + D  R   WV+++SWEPRAF+YHNFL+K+ECE+LIN A P M+KS+VVD+
Sbjct: 51  ISHNALRSEDNDNKR---WVQIVSWEPRAFLYHNFLTKKECEHLINTAKPSMQKSSVVDN 107

Query: 118 DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +TGKSKDS VRTSSGTFL RG D+I+R+IEKRIADFTF P+     F++  + V +K
Sbjct: 108 ETGKSKDSSVRTSSGTFLDRGGDEIVRNIEKRIADFTFIPVENGESFNVLRYEVGQK 164


>gi|449443243|ref|XP_004139389.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 284

 Score =  156 bits (395), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 87/173 (50%), Positives = 113/173 (65%), Gaps = 11/173 (6%)

Query: 5   RYSRFPTRKSSS---STLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRK 61
           +Y +   +K S+   S +I+ L++   F +LI L F  LS P +S      +   S VR 
Sbjct: 7   KYIKLQGKKWSTFQLSKMIMALVLALGFFMLIALRF--LSPPETS------HHRFSSVRH 58

Query: 62  SMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGK 121
           +   S+G   R +QWVE ISWEPRAFVYHNFLSKEEC YLI+LA PHM KSTVVD++TGK
Sbjct: 59  TAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDNETGK 118

Query: 122 SKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           + +  VRTSSG FL RG+DKI+ +IEKRIADFTF P+       +  + V +K
Sbjct: 119 NVEDSVRTSSGMFLNRGQDKIVSNIEKRIADFTFIPIEHGEGLQILHYEVGQK 171


>gi|388515007|gb|AFK45565.1| unknown [Lotus japonicus]
          Length = 154

 Score =  155 bits (391), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 108/152 (71%), Positives = 128/152 (84%), Gaps = 3/152 (1%)

Query: 1   MAKPRYSRFPTRKSSSS-TLILTLLIMFTFAILILLAFGILSMPSSSGDSR--KANDLSS 57
           MAKPRYSR P+RKSSSS TLI  L + FTF +LIL A GILS+PSSS   +  K NDL+S
Sbjct: 1   MAKPRYSRLPSRKSSSSSTLIFALFLAFTFLLLILFALGILSIPSSSSRDKFPKPNDLTS 60

Query: 58  IVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
           I   +++R++ D+GR EQWVEVISWEPRAFVYHNFL+KEECEYLI++A P+M KSTVVDS
Sbjct: 61  IAHNTLDRTDDDDGRGEQWVEVISWEPRAFVYHNFLTKEECEYLIDIAKPNMHKSTVVDS 120

Query: 118 DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKR 149
           +TGKSKDSRVRTSSGTFL RGR KI+R+IEK+
Sbjct: 121 ETGKSKDSRVRTSSGTFLPRGRGKIVRNIEKK 152


>gi|42567428|ref|NP_195306.2| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|332661174|gb|AEE86574.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 290

 Score =  154 bits (389), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 91/158 (57%), Positives = 113/158 (71%), Gaps = 4/158 (2%)

Query: 3   KPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKS 62
           KP+  R   RKS S T   T++++  F ILIL+  GI S+PS++  S    DL++IV+  
Sbjct: 4   KPKQLRNKPRKSFS-TQTFTVVVLVLFVILILVGLGIFSLPSTNKTSSMPMDLTTIVQTI 62

Query: 63  MERSE-GDE--GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDT 119
            ER   GDE  G  ++W+EVISWEPRAFVYHNFL+ EECE+LI+LA P M KS VVD  T
Sbjct: 63  QERESFGDEEDGNGDRWLEVISWEPRAFVYHNFLTNEECEHLISLAKPSMMKSKVVDVKT 122

Query: 120 GKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFP 157
           GKS DSRVRTSSGTFL RG D+I+ +IE RI+DFTF P
Sbjct: 123 GKSIDSRVRTSSGTFLNRGHDEIVEEIENRISDFTFIP 160


>gi|326495334|dbj|BAJ85763.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 300

 Score =  154 bits (388), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 71/103 (68%), Positives = 83/103 (80%)

Query: 72  RAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
           + E W EV+SWEPRAF+YHNFLSKEECEYLI+LA PHM+KSTVVDS TG SKDSRVRTSS
Sbjct: 84  KGEPWTEVLSWEPRAFIYHNFLSKEECEYLISLAKPHMKKSTVVDSATGGSKDSRVRTSS 143

Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           GTFL RG+DKI+R IEKRI+DFTF P+       +  + V +K
Sbjct: 144 GTFLRRGQDKIVRTIEKRISDFTFIPVENGEGLQVLHYEVGQK 186


>gi|116788056|gb|ABK24739.1| unknown [Picea sitchensis]
          Length = 303

 Score =  153 bits (387), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 85/166 (51%), Positives = 103/166 (62%), Gaps = 22/166 (13%)

Query: 28  TFAILILLAFGILSMPSSSGDSRKANDLS--------SIVRKSMERSEGDEGRAEQWVEV 79
           +  +L+LLA GI+S+P    +SR  +++S        S  +K  E         +QW EV
Sbjct: 27  SIVLLMLLALGIVSLPV---NSRAPDEISNGGVYSEHSGGKKLQETYSNGMDEPKQWAEV 83

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSK-----------DSRVR 128
           +SWEPRA +YHNFL+KEECEYLINLA PHM KSTVVDS TGKSK           DSRVR
Sbjct: 84  LSWEPRAILYHNFLNKEECEYLINLAKPHMAKSTVVDSATGKSKDSRFVHRWKSNDSRVR 143

Query: 129 TSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           TSSG FL RG+DK IR IEKRIADFTF P        +  + V +K
Sbjct: 144 TSSGMFLNRGQDKTIRSIEKRIADFTFIPAEHGEGLQVLHYEVGQK 189


>gi|357146834|ref|XP_003574128.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
           distachyon]
          Length = 306

 Score =  152 bits (385), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 71/103 (68%), Positives = 82/103 (79%)

Query: 72  RAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
           + E W EV+SWEPRAF+YHNFLSKEECEYLI+LA PHM+KSTVVDS TG SKDSRVRTSS
Sbjct: 90  KGEPWTEVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDSATGGSKDSRVRTSS 149

Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           GTFL RG+DK+IR IEKRI+DFTF P        +  + V +K
Sbjct: 150 GTFLRRGQDKVIRTIEKRISDFTFIPAENGEGLQVLHYEVGQK 192


>gi|363807814|ref|NP_001242181.1| uncharacterized protein LOC100782154 [Glycine max]
 gi|255644463|gb|ACU22735.1| unknown [Glycine max]
          Length = 285

 Score =  152 bits (385), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 75/133 (56%), Positives = 100/133 (75%), Gaps = 3/133 (2%)

Query: 28  TFAILILLAFGILSMP--SSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPR 85
           +F ILI LA  ILS P  +SS    K NDL+S+ R +   SEG+  R ++WVEV+SWEPR
Sbjct: 33  SFLILIPLALRILSTPHVNSSSALSKPNDLNSVPRNT-HVSEGENNRVKRWVEVMSWEPR 91

Query: 86  AFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRD 145
           AF+YHNFL+KEECEYLIN ATP+M KS V+D+++G+  ++  RTS+   + RG+DKI+R+
Sbjct: 92  AFLYHNFLTKEECEYLINTATPNMLKSLVIDNESGEGIETSYRTSTEYVVERGKDKIVRN 151

Query: 146 IEKRIADFTFFPL 158
           IEKRIAD TF P+
Sbjct: 152 IEKRIADVTFIPI 164


>gi|115482738|ref|NP_001064962.1| Os10g0497800 [Oryza sativa Japonica Group]
 gi|78708853|gb|ABB47828.1| prolyl 4-hydroxylase alpha subunit, putative, expressed [Oryza
           sativa Japonica Group]
 gi|113639571|dbj|BAF26876.1| Os10g0497800 [Oryza sativa Japonica Group]
 gi|215767852|dbj|BAH00081.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218184821|gb|EEC67248.1| hypothetical protein OsI_34188 [Oryza sativa Indica Group]
          Length = 321

 Score =  152 bits (385), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 73/119 (61%), Positives = 89/119 (74%), Gaps = 2/119 (1%)

Query: 56  SSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV 115
           S+     +E   G++G  E W EV+SWEPRAF+YHNFLSKEECEYLI+LA PHM+KSTVV
Sbjct: 91  SAAFESGLEMRGGEKG--EPWTEVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVV 148

Query: 116 DSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           D+ TG SKDSRVRTSSG FL RG+DKIIR IEKRI+D+TF P+       +  + V +K
Sbjct: 149 DASTGGSKDSRVRTSSGMFLGRGQDKIIRTIEKRISDYTFIPVENGEGLQVLHYEVGQK 207


>gi|18071415|gb|AAL58274.1|AC068923_16 putative prolyl 4-hydroxylase, alpha subunit [Oryza sativa Japonica
           Group]
          Length = 343

 Score =  152 bits (384), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 71/103 (68%), Positives = 84/103 (81%), Gaps = 2/103 (1%)

Query: 56  SSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV 115
           S+     +E   G++G  E W EV+SWEPRAF+YHNFLSKEECEYLI+LA PHM+KSTVV
Sbjct: 91  SAAFESGLEMRGGEKG--EPWTEVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVV 148

Query: 116 DSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPL 158
           D+ TG SKDSRVRTSSG FL RG+DKIIR IEKRI+D+TF P+
Sbjct: 149 DASTGGSKDSRVRTSSGMFLGRGQDKIIRTIEKRISDYTFIPV 191


>gi|414870897|tpg|DAA49454.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
          Length = 222

 Score =  152 bits (383), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 70/103 (67%), Positives = 82/103 (79%)

Query: 72  RAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
           + E W EV+SWEPRAFVYHNFLSKEEC++LI+LA PHM+KSTVVDS TG SKDSRVRTSS
Sbjct: 91  KGEPWTEVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGGSKDSRVRTSS 150

Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           G FL RG+DKIIR IEKRIAD+TF P+       +  + V +K
Sbjct: 151 GMFLRRGQDKIIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQK 193


>gi|414870899|tpg|DAA49456.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
          Length = 364

 Score =  152 bits (383), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 70/103 (67%), Positives = 82/103 (79%)

Query: 72  RAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
           + E W EV+SWEPRAFVYHNFLSKEEC++LI+LA PHM+KSTVVDS TG SKDSRVRTSS
Sbjct: 148 KGEPWTEVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGGSKDSRVRTSS 207

Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           G FL RG+DKIIR IEKRIAD+TF P+       +  + V +K
Sbjct: 208 GMFLRRGQDKIIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQK 250


>gi|212720775|ref|NP_001131953.1| uncharacterized protein LOC100193348 [Zea mays]
 gi|194693016|gb|ACF80592.1| unknown [Zea mays]
 gi|347978798|gb|AEP37741.1| prolyl 4-hydroxylase 1 [Zea mays]
 gi|414870898|tpg|DAA49455.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
          Length = 307

 Score =  151 bits (382), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 70/103 (67%), Positives = 82/103 (79%)

Query: 72  RAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
           + E W EV+SWEPRAFVYHNFLSKEEC++LI+LA PHM+KSTVVDS TG SKDSRVRTSS
Sbjct: 91  KGEPWTEVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGGSKDSRVRTSS 150

Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           G FL RG+DKIIR IEKRIAD+TF P+       +  + V +K
Sbjct: 151 GMFLRRGQDKIIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQK 193


>gi|222613083|gb|EEE51215.1| hypothetical protein OsJ_32038 [Oryza sativa Japonica Group]
          Length = 222

 Score =  150 bits (380), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 69/103 (66%), Positives = 82/103 (79%)

Query: 72  RAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
           + E W EV+SWEPRAF+YHNFLSKEECEYLI+LA PHM+KSTVVD+ TG SKDSRVRTSS
Sbjct: 6   KGEPWTEVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSRVRTSS 65

Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           G FL RG+DKIIR IEKRI+D+TF P+       +  + V +K
Sbjct: 66  GMFLGRGQDKIIRTIEKRISDYTFIPVENGEGLQVLHYEVGQK 108


>gi|242039227|ref|XP_002467008.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
 gi|241920862|gb|EER94006.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
          Length = 307

 Score =  150 bits (379), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 70/103 (67%), Positives = 82/103 (79%)

Query: 72  RAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
           + E W EV+SWEPRAFVYHNFLSKEEC++LI+LA PHM+KSTVVDS TG SKDSRVRTSS
Sbjct: 91  KGEPWTEVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGASKDSRVRTSS 150

Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           G FL RG+DKII+ IEKRIADFTF P+       +  + V +K
Sbjct: 151 GMFLRRGQDKIIQTIEKRIADFTFIPVEHGEGLQVLHYEVGQK 193


>gi|449520146|ref|XP_004167095.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 249

 Score =  149 bits (375), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 78/142 (54%), Positives = 97/142 (68%), Gaps = 6/142 (4%)

Query: 33  ILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNF 92
           +L+A   LS P +S      +   S VR +   S+G   R +QWVE ISWEPRAFVYHNF
Sbjct: 1   MLIALRFLSPPETS------HHRFSSVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNF 54

Query: 93  LSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIAD 152
           LSKEEC YLI+LA PHM KSTVVD++TGK+ +  VRTSSG FL RG+DKI+ +IEKRIAD
Sbjct: 55  LSKEECLYLISLAKPHMEKSTVVDNETGKNVEDSVRTSSGMFLNRGQDKIVSNIEKRIAD 114

Query: 153 FTFFPLGMQFKFSLFFFYVLKK 174
           FTF P+       +  + V +K
Sbjct: 115 FTFIPIEHGEGLQILHYEVGQK 136


>gi|357517895|ref|XP_003629236.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523258|gb|AET03712.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 326

 Score =  146 bits (369), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 72/153 (47%), Positives = 104/153 (67%), Gaps = 2/153 (1%)

Query: 22  TLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVIS 81
           +L+I +T  + + + F  L +   +    K N L+SI   +  R+E D+ +  +WV++IS
Sbjct: 62  SLIICWTLFLTLFVTFTFLILIILTLRIPKPNHLNSITHSNTLRNEDDDNK--RWVQIIS 119

Query: 82  WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDK 141
           WEPRAF+YHNFL+KEECE+LIN+A P M KS V+D +TG   DSR RTSSG FL RG D+
Sbjct: 120 WEPRAFLYHNFLTKEECEHLINIAKPSMHKSAVIDEETGNGVDSRERTSSGAFLKRGSDR 179

Query: 142 IIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           I+++IE+RIADFTF P+     F++  + V +K
Sbjct: 180 IVKNIERRIADFTFIPVEHGENFNVLHYEVGQK 212


>gi|363543363|ref|NP_001241692.1| prolyl 4-hydroxylase 8-2 [Zea mays]
 gi|347978834|gb|AEP37759.1| prolyl 4-hydroxylase 8-2 [Zea mays]
          Length = 184

 Score =  145 bits (367), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 91/158 (57%), Positives = 108/158 (68%), Gaps = 11/158 (6%)

Query: 9   FPTR--KSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDL-------SSIV 59
           FPTR  ++S  T+ LT L++ + A+L L+AFG+ S+P S+ ++             S+ V
Sbjct: 17  FPTRGGRASPYTVALTALLLVSAALLALIAFGVFSLPVSAPNAAATTGTAAGGETESADV 76

Query: 60  RKSMER--SEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
           R    R   EG   R  QW EVISWEPRAFVYHNFLSK+ECEYLI LA PHM KSTVVDS
Sbjct: 77  RPRARRDLGEGLGERGAQWTEVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDS 136

Query: 118 DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTF 155
            TGKSKDSRVRTSSG FL RGRDK+IR IE+ I   TF
Sbjct: 137 TTGKSKDSRVRTSSGMFLQRGRDKVIRAIEELIKRSTF 174


>gi|90704797|dbj|BAE92293.1| putative prolyl 4-hydroxylase, alpha subunit [Cryptomeria japonica]
          Length = 302

 Score =  145 bits (366), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 81/154 (52%), Positives = 103/154 (66%), Gaps = 5/154 (3%)

Query: 22  TLLIMFTFAILILLAFGILSMPSSSGDSRKANDLS-SIVRKSMERSEGDEGRAEQWVEVI 80
           TLL +F   +L +LA  + S+P      + +  LS SI     +RS+G +   E+ VEV+
Sbjct: 39  TLLSVF---VLFVLAIWVFSVPVKRTPYQISRQLSESIAADYAKRSDGKDEPKER-VEVL 94

Query: 81  SWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           SWEPRAF+YHNFL+K+ECEYLIN+A PHM KS VVDS TG S DS VRTSSG FL RG+D
Sbjct: 95  SWEPRAFLYHNFLAKDECEYLINIAKPHMVKSMVVDSKTGGSMDSNVRTSSGWFLNRGQD 154

Query: 141 KIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           KIIR IEKRIADF+  P+       +  + V +K
Sbjct: 155 KIIRRIEKRIADFSHIPVEHGEGLHVLHYEVEQK 188


>gi|357517885|ref|XP_003629231.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523253|gb|AET03707.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 279

 Score =  144 bits (362), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 85/177 (48%), Positives = 114/177 (64%), Gaps = 13/177 (7%)

Query: 1   MAKPRYSRFPTRKS---SSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSS 57
           M K ++SR   RK    +S TLI TL + F F I+ L+    L +P       K   L+S
Sbjct: 1   MGKLKHSRVGPRKPLLPTSRTLIFTLFVTFIFLIIFLILLS-LRIP-------KPKHLNS 52

Query: 58  IVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
           I   +  R + D+ +  +WVE++SWEPR F+YHNFL+KEECE+LIN+A P ++KSTVVD 
Sbjct: 53  ITHINNLRRDDDDNK--RWVEIVSWEPRVFLYHNFLAKEECEHLINIAKPDVQKSTVVDD 110

Query: 118 DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
            TGKS +S  RTSSGTF+ RG DKI+ DIEKRIADFTF P+      ++  + V +K
Sbjct: 111 TTGKSVNSSARTSSGTFIDRGYDKILSDIEKRIADFTFIPVEHGEDVNILHYEVGQK 167


>gi|356502610|ref|XP_003520111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 286

 Score =  142 bits (359), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 72/126 (57%), Positives = 92/126 (73%), Gaps = 8/126 (6%)

Query: 33  ILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNF 92
           ILLA  ILS P        AN  SS+ R +   +E D+  A + +EVISW+PRAF+YHNF
Sbjct: 38  ILLALHILSTP-------HANANSSVSRNTHIEAEEDDQVALR-MEVISWQPRAFLYHNF 89

Query: 93  LSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIAD 152
           L+KEECEYLIN+ATPHM+KSTV D+ +G+S    VR S+G FL RG+D+I+R+IEKRIAD
Sbjct: 90  LTKEECEYLINIATPHMQKSTVADNQSGQSVVHDVRKSTGAFLDRGQDEIVRNIEKRIAD 149

Query: 153 FTFFPL 158
            TF P+
Sbjct: 150 VTFIPI 155


>gi|357517897|ref|XP_003629237.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523259|gb|AET03713.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|388513409|gb|AFK44766.1| unknown [Medicago truncatula]
 gi|388516345|gb|AFK46234.1| unknown [Medicago truncatula]
          Length = 275

 Score =  140 bits (352), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 78/174 (44%), Positives = 108/174 (62%), Gaps = 11/174 (6%)

Query: 1   MAKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVR 60
           M K ++S    RK S  T     L +F     ++L    L +P       K N L+SI  
Sbjct: 1   MVKFKHSNVGLRKPSLITCWTLFLTLFVTFTFLILIILTLRIP-------KLNHLNSITH 53

Query: 61  KSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
            +  R++ ++    +WV++ISWEPRAF+YHNFL+KEECE+LIN+A P M KS V+D  TG
Sbjct: 54  SNTLRNDDNK----RWVQIISWEPRAFLYHNFLTKEECEHLINIAKPSMHKSEVIDEKTG 109

Query: 121 KSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           KS +S +RTSSGTFL R  D+I+ +IEKRIADFTF P+     F++  + V +K
Sbjct: 110 KSLNSSIRTSSGTFLDREGDEIVSNIEKRIADFTFIPVEHGESFNVLHYEVGQK 163


>gi|3805847|emb|CAA21467.1| putative protein [Arabidopsis thaliana]
 gi|7270533|emb|CAB81490.1| putative protein [Arabidopsis thaliana]
          Length = 307

 Score =  139 bits (349), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 78/156 (50%), Positives = 94/156 (60%), Gaps = 34/156 (21%)

Query: 36  AFGILSMPSSSGDSRKANDLSSIVRKSMER-SEGDE--GRAEQWVEVISWEPRAFVYHNF 92
             GI S+PS++  S    DL++IV+   ER S GDE  G  ++W+EVISWEPRAFVYHNF
Sbjct: 36  GLGIFSLPSTNKTSSMPMDLTTIVQTIQERESFGDEEDGNGDRWLEVISWEPRAFVYHNF 95

Query: 93  LSKEECEYLINLATPHMRKSTVVDSDTGKSKDS--------------------------- 125
           L+ EECE+LI+LA P M KS VVD  TGKS DS                           
Sbjct: 96  LTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSRFCTLTSVVVFTFQLNLERFENSKFAN 155

Query: 126 ----RVRTSSGTFLARGRDKIIRDIEKRIADFTFFP 157
               RVRTSSGTFL RG D+I+ +IE RI+DFTF P
Sbjct: 156 PSLCRVRTSSGTFLNRGHDEIVEEIENRISDFTFIP 191


>gi|168002780|ref|XP_001754091.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694645|gb|EDQ80992.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 214

 Score =  136 bits (342), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 61/83 (73%), Positives = 74/83 (89%)

Query: 75  QWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTF 134
           +WVEV+SWEPRAF+YH+FL++EEC +LI +A P + KSTVVDSDTGKSKDSR+RTSSGTF
Sbjct: 1   RWVEVLSWEPRAFLYHHFLTEEECNHLIEVARPSLVKSTVVDSDTGKSKDSRLRTSSGTF 60

Query: 135 LARGRDKIIRDIEKRIADFTFFP 157
           L RG+D +I+ IEKRIADFTF P
Sbjct: 61  LMRGQDPVIKRIEKRIADFTFIP 83


>gi|168060785|ref|XP_001782374.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666166|gb|EDQ52828.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 211

 Score =  132 bits (333), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 60/83 (72%), Positives = 72/83 (86%)

Query: 76  WVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
           WVEV+SWEPRAF+YH+FL++ EC +LI +A P + KSTV+DS TGKSKDSRVRTSSGTFL
Sbjct: 1   WVEVLSWEPRAFLYHHFLTQVECNHLIEVAKPSLVKSTVIDSATGKSKDSRVRTSSGTFL 60

Query: 136 ARGRDKIIRDIEKRIADFTFFPL 158
            RG+D II+ IEKRIADFTF P+
Sbjct: 61  VRGQDHIIKRIEKRIADFTFIPV 83


>gi|302773668|ref|XP_002970251.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
 gi|300161767|gb|EFJ28381.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
          Length = 256

 Score =  128 bits (321), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 60/99 (60%), Positives = 74/99 (74%)

Query: 76  WVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
           W E ISW+PRA V+HNFLS EEC++LI LA P+M++S VVD+ TGKSKDSRVRTSSGTFL
Sbjct: 45  WTETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSRVRTSSGTFL 104

Query: 136 ARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
            RG+D+II  IE+RIA FTF P        +  + V +K
Sbjct: 105 RRGQDEIISRIEERIAKFTFIPKEHGEGLQVLHYEVGQK 143


>gi|302793288|ref|XP_002978409.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
 gi|300153758|gb|EFJ20395.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
          Length = 256

 Score =  127 bits (320), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 60/99 (60%), Positives = 74/99 (74%)

Query: 76  WVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
           W E ISW+PRA V+HNFLS EEC++LI LA P+M++S VVD+ TGKSKDSRVRTSSGTFL
Sbjct: 45  WTETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSRVRTSSGTFL 104

Query: 136 ARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
            RG+D+II  IE+RIA FTF P        +  + V +K
Sbjct: 105 RRGQDEIISRIEERIAKFTFIPKEHGEGLQVLHYEVGQK 143


>gi|302762452|ref|XP_002964648.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
 gi|300168377|gb|EFJ34981.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
          Length = 225

 Score =  122 bits (306), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 56/93 (60%), Positives = 71/93 (76%)

Query: 65  RSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD 124
           R E  EG+ E W E+ISW PRA + HNFL+ +EC++LI +A P M+KSTVVDS TG S+D
Sbjct: 2   REEVGEGKHEPWSEIISWTPRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRD 61

Query: 125 SRVRTSSGTFLARGRDKIIRDIEKRIADFTFFP 157
           SRVRTSSG FL RG+D++I +IE +IA  TF P
Sbjct: 62  SRVRTSSGMFLNRGQDRVISEIEDKIAKLTFIP 94


>gi|357467077|ref|XP_003603823.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492871|gb|AES74074.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 291

 Score =  122 bits (305), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 57/103 (55%), Positives = 75/103 (72%)

Query: 72  RAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
           R +QW EV+S EPRA +YHNFLSKEECE+LINLA P M++S VVD  TG+   + VRTSS
Sbjct: 80  RKDQWTEVLSSEPRASMYHNFLSKEECEHLINLAKPFMQRSLVVDGVTGQGILNSVRTSS 139

Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           GTFL RG+DKI++++E+RIAD T  P+       +  + V +K
Sbjct: 140 GTFLERGKDKIVQNVERRIADITSIPIENGEGLQIIHYEVGQK 182


>gi|357467075|ref|XP_003603822.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492870|gb|AES74073.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 683

 Score =  117 bits (294), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 56/103 (54%), Positives = 73/103 (70%)

Query: 72  RAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
           R +QW E++S  PRA +YHNFLSKEECE+LINLA P M +S VVD  TG+ K+S  RTSS
Sbjct: 107 RKDQWTEILSSVPRASMYHNFLSKEECEHLINLAKPFMARSLVVDGVTGEVKESSSRTSS 166

Query: 132 GTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           G FL RG+DKI+++IE+RIAD T  P+       +  + V +K
Sbjct: 167 GMFLDRGKDKIVQNIERRIADITSVPIENGEGLHVIHYGVGQK 209



 Score = 89.4 bits (220), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 46/76 (60%), Positives = 55/76 (72%), Gaps = 1/76 (1%)

Query: 92  FLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIA 151
           F SKEECE+LINLA P M +S VVD  TGK ++S  RTSSG FL RG+DKI+++IE+RIA
Sbjct: 372 FGSKEECEHLINLAKPFMTRSLVVDGLTGKGRESSARTSSGRFLERGKDKIVQNIEQRIA 431

Query: 152 DFTFFPLGMQFKFSLF 167
           D T  P  M   F LF
Sbjct: 432 DITSIPR-MARDFMLF 446


>gi|307106819|gb|EFN55064.1| hypothetical protein CHLNCDRAFT_35843 [Chlorella variabilis]
          Length = 287

 Score =  114 bits (286), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 55/81 (67%), Positives = 63/81 (77%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE +SW PRAFVYHNFLS EECE+L  LA   + KSTVVD+ TGKS DS VRTSSGTFLA
Sbjct: 38  VEQVSWRPRAFVYHNFLSDEECEHLKELARKRLTKSTVVDNKTGKSMDSTVRTSSGTFLA 97

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           RG D+++R IEKRI+  T  P
Sbjct: 98  RGEDEVVRAIEKRISLVTMIP 118


>gi|302815629|ref|XP_002989495.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
 gi|300142673|gb|EFJ09371.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
          Length = 213

 Score =  114 bits (285), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 51/82 (62%), Positives = 65/82 (79%)

Query: 76  WVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
           W E+ISW PRA + HNFL+ +EC++LI +A P M+KSTVVDS TG S+DSRVRTSSG FL
Sbjct: 1   WSEIISWTPRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRDSRVRTSSGMFL 60

Query: 136 ARGRDKIIRDIEKRIADFTFFP 157
            RG+D++I +IE +IA  TF P
Sbjct: 61  NRGQDRVISEIEDKIAKLTFIP 82


>gi|449467908|ref|XP_004151664.1| PREDICTED: uncharacterized protein LOC101218099, partial [Cucumis
           sativus]
          Length = 122

 Score =  113 bits (282), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 66/128 (51%), Positives = 83/128 (64%), Gaps = 11/128 (8%)

Query: 1   MAKPRYSRFPTRKSSS---STLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSS 57
           ++K +Y +   RK S+   S +I+ L++   F +LI L F  LS P +S      +   S
Sbjct: 3   ISKGKYIKLQGRKWSTFQLSKMIMALVLALGFFMLIALRF--LSPPETS------HHRFS 54

Query: 58  IVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
            VR +   S+G   R +QWVE ISWEPRAFVYHNFLSKEEC YLI+LA PHM KSTVVDS
Sbjct: 55  SVRHTAFLSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDS 114

Query: 118 DTGKSKDS 125
            TG+S DS
Sbjct: 115 KTGESVDS 122


>gi|384251901|gb|EIE25378.1| hypothetical protein COCSUDRAFT_35772 [Coccomyxa subellipsoidea
           C-169]
          Length = 222

 Score =  109 bits (272), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 61/81 (75%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +EV+SWEPRA++YHNFL++ E +YL+    PHM KS VVD++TGKS  S+VRTSSG FL 
Sbjct: 1   MEVLSWEPRAYLYHNFLTEAEADYLVQKGKPHMEKSEVVDNETGKSAPSKVRTSSGMFLN 60

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           RG D +I  IE RIA +T  P
Sbjct: 61  RGEDDVIERIEARIAKYTAIP 81


>gi|297802348|ref|XP_002869058.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314894|gb|EFH45317.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 245

 Score =  108 bits (271), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 52/93 (55%), Positives = 70/93 (75%), Gaps = 8/93 (8%)

Query: 73  AEQWVEVISWEPRAFVYHNFLS--------KEECEYLINLATPHMRKSTVVDSDTGKSKD 124
           +E+W+EVI+ EPRAFVYHNFL+         EECE+LI+LA P M +S V ++ TG  ++
Sbjct: 51  SERWLEVIAKEPRAFVYHNFLALFFKFCKTNEECEHLISLAKPSMARSKVRNAITGLGEE 110

Query: 125 SRVRTSSGTFLARGRDKIIRDIEKRIADFTFFP 157
           S  RTSSGTFL +G DKI+++IEKRI++FTF P
Sbjct: 111 SSSRTSSGTFLRKGHDKIVKEIEKRISEFTFIP 143


>gi|15233345|ref|NP_195307.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|3805848|emb|CAA21468.1| putative protein [Arabidopsis thaliana]
 gi|7270534|emb|CAB81491.1| putative protein [Arabidopsis thaliana]
 gi|332661175|gb|AEE86575.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 272

 Score =  105 bits (262), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 50/92 (54%), Positives = 68/92 (73%), Gaps = 8/92 (8%)

Query: 74  EQWVEVISWEPRAFVYHNFL--------SKEECEYLINLATPHMRKSTVVDSDTGKSKDS 125
           E+W+EVI+ EPRAFVYHNFL        + EEC++LI+LA P M +S V ++ TG  ++S
Sbjct: 85  ERWLEVITKEPRAFVYHNFLALFFKICKTNEECDHLISLAKPSMARSKVRNALTGLGEES 144

Query: 126 RVRTSSGTFLARGRDKIIRDIEKRIADFTFFP 157
             RTSSGTF+  G DKI+++IEKRI++FTF P
Sbjct: 145 SSRTSSGTFIRSGHDKIVKEIEKRISEFTFIP 176


>gi|302831512|ref|XP_002947321.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
           nagariensis]
 gi|300267185|gb|EFJ51369.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
           nagariensis]
          Length = 797

 Score =  104 bits (259), Expect = 2e-20,   Method: Composition-based stats.
 Identities = 44/84 (52%), Positives = 59/84 (70%)

Query: 74  EQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGT 133
           E W+E ISW PRAFVYHNFL+  EC++L+ + T  + +S VVDS TG+SK   +RTS G 
Sbjct: 490 EPWIETISWSPRAFVYHNFLTSAECDHLVQIGTQRVSRSLVVDSQTGQSKLDDIRTSYGA 549

Query: 134 FLARGRDKIIRDIEKRIADFTFFP 157
              RG D +I +IE+RIA++T  P
Sbjct: 550 AFGRGEDPVIAEIEERIAEWTHLP 573


>gi|241913390|pdb|3GZE|A Chain A, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
           (Ser-Pro)5 Peptide Substrate
 gi|241913391|pdb|3GZE|B Chain B, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
           (Ser-Pro)5 Peptide Substrate
 gi|241913392|pdb|3GZE|C Chain C, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
           (Ser-Pro)5 Peptide Substrate
 gi|241913393|pdb|3GZE|D Chain D, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
           (Ser-Pro)5 Peptide Substrate
          Length = 225

 Score =  102 bits (255), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 48/104 (46%), Positives = 68/104 (65%), Gaps = 3/104 (2%)

Query: 74  EQW---VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTS 130
           E+W   V  +SW PRAF+  NFLS EEC+Y++  A P M KS+VVD+++GKS DS +RTS
Sbjct: 8   EEWRGEVVHLSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTS 67

Query: 131 SGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +GT+ A+G D +I  IEKR+A  T  PL       +  ++  +K
Sbjct: 68  TGTWFAKGEDSVISKIEKRVAQVTMIPLENHEGLQVLHYHDGQK 111


>gi|159478673|ref|XP_001697425.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158274304|gb|EDP00087.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 297

 Score =  102 bits (255), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 48/104 (46%), Positives = 68/104 (65%), Gaps = 3/104 (2%)

Query: 74  EQW---VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTS 130
           E+W   V  +SW PRAF+  NFLS EEC+Y++  A P M KS+VVD+++GKS DS +RTS
Sbjct: 36  EEWRGEVVHLSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTS 95

Query: 131 SGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +GT+ A+G D +I  IEKR+A  T  PL       +  ++  +K
Sbjct: 96  TGTWFAKGEDSVISKIEKRVAQVTMIPLENHEGLQVLHYHDGQK 139


>gi|159794881|pdb|2JIJ|A Chain A, Crystal Structure Of The Apo Form Of Chlamydomonas
           Reinhardtii Prolyl-4 Hydroxylase Type I
 gi|159794882|pdb|2JIJ|B Chain B, Crystal Structure Of The Apo Form Of Chlamydomonas
           Reinhardtii Prolyl-4 Hydroxylase Type I
 gi|159794883|pdb|2JIJ|C Chain C, Crystal Structure Of The Apo Form Of Chlamydomonas
           Reinhardtii Prolyl-4 Hydroxylase Type I
          Length = 233

 Score =  102 bits (255), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 48/104 (46%), Positives = 68/104 (65%), Gaps = 3/104 (2%)

Query: 74  EQW---VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTS 130
           E+W   V  +SW PRAF+  NFLS EEC+Y++  A P M KS+VVD+++GKS DS +RTS
Sbjct: 16  EEWRGEVVHLSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTS 75

Query: 131 SGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +GT+ A+G D +I  IEKR+A  T  PL       +  ++  +K
Sbjct: 76  TGTWFAKGEDSVISKIEKRVAQVTMIPLENHEGLQVLHYHDGQK 119


>gi|159794879|pdb|2JIG|A Chain A, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
           Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
           Dicarboxylate
 gi|159794880|pdb|2JIG|B Chain B, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
           Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
           Dicarboxylate
          Length = 224

 Score =  102 bits (254), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 48/104 (46%), Positives = 68/104 (65%), Gaps = 3/104 (2%)

Query: 74  EQW---VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTS 130
           E+W   V  +SW PRAF+  NFLS EEC+Y++  A P M KS+VVD+++GKS DS +RTS
Sbjct: 7   EEWRGEVVHLSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTS 66

Query: 131 SGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +GT+ A+G D +I  IEKR+A  T  PL       +  ++  +K
Sbjct: 67  TGTWFAKGEDSVISKIEKRVAQVTMIPLENHEGLQVLHYHDGQK 110


>gi|50845214|gb|AAT84604.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
          Length = 316

 Score =  102 bits (254), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 55/136 (40%), Positives = 80/136 (58%), Gaps = 1/136 (0%)

Query: 24  LIMFTFAILILLAFGILS-MPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISW 82
           ++ F   + I L+F +LS  PSS G         S++R   E      G     V  +SW
Sbjct: 1   MVKFDVFLTIFLSFFLLSPHPSSCGWLNNVKKGKSVLRLKSENVPSSVGVDPSHVTQLSW 60

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PRAF+Y  FL+ EEC++LI++A   + KS V D+++GKS  S VRTSSG FL + +D +
Sbjct: 61  KPRAFLYEGFLTHEECDHLIDMAKDKLEKSMVADNESGKSIPSEVRTSSGMFLQKAQDDV 120

Query: 143 IRDIEKRIADFTFFPL 158
           +  IE RIA +TF P+
Sbjct: 121 VAAIEARIAAWTFLPI 136


>gi|302845234|ref|XP_002954156.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
           nagariensis]
 gi|300260655|gb|EFJ44873.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
           nagariensis]
          Length = 309

 Score =  101 bits (251), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 60/150 (40%), Positives = 84/150 (56%), Gaps = 6/150 (4%)

Query: 27  FTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVI--SWEP 84
             F +  +L   ILS+  +S    +A   S ++  S       E + E   EVI  SW P
Sbjct: 6   MAFQVSAVLLLTILSLAVAS----EAASTSHVITGSGHTVGFGELKEEWRGEVIHLSWSP 61

Query: 85  RAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIR 144
           RAF+   FLS EECE++I  A P M KS+VVD+ +GKS DS +RTS+G +LA+G D+II 
Sbjct: 62  RAFLLKGFLSDEECEHIIAKAKPRMVKSSVVDNASGKSVDSEIRTSTGAWLAKGEDEIIS 121

Query: 145 DIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
            IEKR+A  T  PL       +  ++  +K
Sbjct: 122 RIEKRVAQVTMIPLENHEGLQVLHYHDGQK 151


>gi|307111754|gb|EFN59988.1| hypothetical protein CHLNCDRAFT_49444 [Chlorella variabilis]
          Length = 344

 Score =  100 bits (249), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 46/83 (55%), Positives = 60/83 (72%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+V+  + R F+YHNFL+ EEC+++I LA P M +S VV++D+GKSK   VRTS GTFL 
Sbjct: 63  VQVLHEDARIFLYHNFLTDEECDHIIKLAEPTMARSGVVETDSGKSKIDNVRTSKGTFLN 122

Query: 137 RGRDKIIRDIEKRIADFTFFPLG 159
           RG D +I DIE RIA +T  P G
Sbjct: 123 RGHDSVIADIEARIAKWTLMPAG 145


>gi|359477455|ref|XP_002278454.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Vitis
           vinifera]
          Length = 296

 Score =  100 bits (248), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 53/131 (40%), Positives = 79/131 (60%), Gaps = 18/131 (13%)

Query: 27  FTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPRA 86
             F +L+ ++  IL   SS  D+  +N  ++ VR+                  ISW+PRA
Sbjct: 5   LQFLLLLWISSTILEFSSSYADAAGSNVSAAKVRQ------------------ISWKPRA 46

Query: 87  FVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDI 146
           FVY  FLS+EEC++LI+LA   +++S V D+ +GKS+ S VRTSSG F+ +G+D I+  I
Sbjct: 47  FVYEGFLSEEECDHLISLAKSELKRSAVADNVSGKSRLSEVRTSSGMFIGKGKDPIVAGI 106

Query: 147 EKRIADFTFFP 157
           E +IA +TF P
Sbjct: 107 EDKIAAWTFLP 117


>gi|359477453|ref|XP_003631980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Vitis
           vinifera]
 gi|297736941|emb|CBI26142.3| unnamed protein product [Vitis vinifera]
          Length = 298

 Score = 99.8 bits (247), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 53/131 (40%), Positives = 79/131 (60%), Gaps = 18/131 (13%)

Query: 27  FTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPRA 86
             F +L+ ++  IL   SS  D+  +N  ++ VR+                  ISW+PRA
Sbjct: 5   LQFLLLLWISSTILEFSSSYADAAGSNVSAAKVRQ------------------ISWKPRA 46

Query: 87  FVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDI 146
           FVY  FLS+EEC++LI+LA   +++S V D+ +GKS+ S VRTSSG F+ +G+D I+  I
Sbjct: 47  FVYEGFLSEEECDHLISLAKSELKRSAVADNVSGKSRLSEVRTSSGMFIGKGKDPIVAGI 106

Query: 147 EKRIADFTFFP 157
           E +IA +TF P
Sbjct: 107 EDKIAAWTFLP 117


>gi|125546091|gb|EAY92230.1| hypothetical protein OsI_13950 [Oryza sativa Indica Group]
          Length = 178

 Score = 99.8 bits (247), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 46/81 (56%), Positives = 60/81 (74%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +SW PRAF+Y  FLS +EC++L+NLA   M KS V D+D+GKS  S+VRTSSGTFL+
Sbjct: 37  VTQLSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQVRTSSGTFLS 96

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           +  D I+  IEKR+A +TF P
Sbjct: 97  KHEDDIVSGIEKRVAAWTFLP 117


>gi|218193936|gb|EEC76363.1| hypothetical protein OsI_13952 [Oryza sativa Indica Group]
          Length = 1062

 Score = 99.8 bits (247), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 46/81 (56%), Positives = 60/81 (74%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +SW PRAF+Y  FLS +EC++L+NLA   M KS V D+D+GKS  S+VRTSSGTFL+
Sbjct: 37  VTQLSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQVRTSSGTFLS 96

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           +  D I+  IEKR+A +TF P
Sbjct: 97  KHEDDIVSGIEKRVAAWTFLP 117


>gi|159795555|pdb|2V4A|A Chain A, Crystal Structure Of The Semet-Labeled Prolyl-4
           Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
           Reinhardtii.
 gi|159795556|pdb|2V4A|B Chain B, Crystal Structure Of The Semet-Labeled Prolyl-4
           Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
           Reinhardtii.
 gi|159795557|pdb|2V4A|C Chain C, Crystal Structure Of The Semet-Labeled Prolyl-4
           Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
           Reinhardtii.
 gi|159795558|pdb|2V4A|D Chain D, Crystal Structure Of The Semet-Labeled Prolyl-4
           Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
           Reinhardtii
          Length = 233

 Score = 99.4 bits (246), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 47/104 (45%), Positives = 67/104 (64%), Gaps = 3/104 (2%)

Query: 74  EQW---VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTS 130
           E+W   V  +SW PRAF+  NFLS EEC+Y++  A P   KS+VVD+++GKS DS +RTS
Sbjct: 16  EEWRGEVVHLSWSPRAFLLKNFLSDEECDYIVEKARPKXVKSSVVDNESGKSVDSEIRTS 75

Query: 131 SGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +GT+ A+G D +I  IEKR+A  T  PL       +  ++  +K
Sbjct: 76  TGTWFAKGEDSVISKIEKRVAQVTXIPLENHEGLQVLHYHDGQK 119


>gi|115456019|ref|NP_001051610.1| Os03g0803500 [Oryza sativa Japonica Group]
 gi|29150365|gb|AAO72374.1| putative oxidoreductase [Oryza sativa Japonica Group]
 gi|108711618|gb|ABF99413.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative,
           expressed [Oryza sativa Japonica Group]
 gi|113550081|dbj|BAF13524.1| Os03g0803500 [Oryza sativa Japonica Group]
 gi|215765410|dbj|BAG87107.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222625993|gb|EEE60125.1| hypothetical protein OsJ_13003 [Oryza sativa Japonica Group]
          Length = 299

 Score = 99.0 bits (245), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 46/81 (56%), Positives = 60/81 (74%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +SW PRAF+Y  FLS +EC++L+NLA   M KS V D+D+GKS  S+VRTSSGTFL+
Sbjct: 37  VTQLSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQVRTSSGTFLS 96

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           +  D I+  IEKR+A +TF P
Sbjct: 97  KHEDDIVSGIEKRVAAWTFLP 117


>gi|255552788|ref|XP_002517437.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223543448|gb|EEF44979.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 311

 Score = 98.6 bits (244), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 45/81 (55%), Positives = 61/81 (75%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +SW PRAF+Y  FLS EEC++LI+LA   + KS V D+++GKS +S VRTSSG F+A
Sbjct: 49  VTQLSWHPRAFLYKGFLSYEECDHLIDLARDKLEKSMVADNESGKSIESEVRTSSGMFIA 108

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D+I+ DIE RIA +TF P
Sbjct: 109 KAQDEIVADIEARIAAWTFLP 129


>gi|384246332|gb|EIE19822.1| hypothetical protein COCSUDRAFT_25518 [Coccomyxa subellipsoidea
           C-169]
          Length = 347

 Score = 98.6 bits (244), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 51/92 (55%), Positives = 64/92 (69%), Gaps = 2/92 (2%)

Query: 68  GDEGRAEQWVEVI--SWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDS 125
           G +G+ E   EVI  SW PRAF+   FL + ECE+LI+ A P M KSTVVD+DTGKS DS
Sbjct: 72  GADGKEEWRGEVIEVSWSPRAFLLKGFLKEAECEHLISKAKPSMVKSTVVDNDTGKSIDS 131

Query: 126 RVRTSSGTFLARGRDKIIRDIEKRIADFTFFP 157
            VRTS+GTF  R  D++I+ IE+RI+  T  P
Sbjct: 132 TVRTSTGTFFGREEDEVIQGIERRISMITHLP 163


>gi|242032633|ref|XP_002463711.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
 gi|241917565|gb|EER90709.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
          Length = 297

 Score = 98.6 bits (244), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 47/81 (58%), Positives = 59/81 (72%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +SW PRAF+Y  FLS  EC++LINLA   M KS V D+D+GKS  S+VRTSSG FLA
Sbjct: 35  VTQLSWRPRAFLYSGFLSDTECDHLINLAKGSMEKSMVADNDSGKSLMSQVRTSSGAFLA 94

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           +  D+I+  IEKR+A +TF P
Sbjct: 95  KHEDEIVSAIEKRVAAWTFLP 115


>gi|224141325|ref|XP_002324024.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
           trichocarpa]
 gi|222867026|gb|EEF04157.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
           trichocarpa]
          Length = 308

 Score = 98.2 bits (243), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 44/81 (54%), Positives = 60/81 (74%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +SW PRAF+Y  FLS EEC++L+NLA   + KS V D+++GKS +S VRTSSG F+ 
Sbjct: 46  VTQLSWNPRAFLYKGFLSDEECDHLMNLARDKLEKSMVADNESGKSIESEVRTSSGMFIG 105

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D+I+ DIE RIA +TF P
Sbjct: 106 KSQDEIVDDIEARIAAWTFLP 126


>gi|114796723|gb|ABI79328.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
          Length = 297

 Score = 96.3 bits (238), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 40/82 (48%), Positives = 64/82 (78%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  ISW+PRAFVY  FL+ EEC++LI++A   +++S V D+++GKS+ S VRTSSG F++
Sbjct: 37  VRQISWKPRAFVYEGFLTDEECDHLISIAKTELKRSAVADNESGKSQVSEVRTSSGAFIS 96

Query: 137 RGRDKIIRDIEKRIADFTFFPL 158
           + +D I++ IE+++A +TF P+
Sbjct: 97  KAKDAIVQRIEEKLATWTFLPI 118


>gi|255551575|ref|XP_002516833.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223543921|gb|EEF45447.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 297

 Score = 96.3 bits (238), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 43/81 (53%), Positives = 63/81 (77%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +SW+PRAFVY  FL+  EC++LI+LA   +++S V D+++GKSK S VRTSSG F+A
Sbjct: 36  VKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNESGKSKLSEVRTSSGMFIA 95

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           +G+D II  IE++I+ +TF P
Sbjct: 96  KGKDPIIAGIEEKISTWTFLP 116


>gi|357125236|ref|XP_003564301.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 293

 Score = 95.9 bits (237), Expect = 5e-18,   Method: Compositional matrix adjust.
 Identities = 49/106 (46%), Positives = 67/106 (63%), Gaps = 4/106 (3%)

Query: 69  DEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVR 128
           D+ R  Q    +SW PRAF+Y  FLS  EC++L+ LA   ++KS V D+D+GKS  S+VR
Sbjct: 27  DQARVTQ----LSWRPRAFLYSGFLSHAECDHLVKLAKGRLQKSMVADNDSGKSVMSQVR 82

Query: 129 TSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           TSSGTFL +  D+II  IEKR+A +TF P        +  + V +K
Sbjct: 83  TSSGTFLNKHEDEIISGIEKRVAAWTFLPEENAESIQVLHYEVGQK 128


>gi|412992163|emb|CCO19876.1| predicted protein [Bathycoccus prasinos]
          Length = 350

 Score = 95.5 bits (236), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 43/82 (52%), Positives = 60/82 (73%)

Query: 76  WVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
           W E ISW+PRAFV H+ LS+EECE ++ +A P M++STVVDS TG+ K   +RTS  TFL
Sbjct: 79  WTEPISWQPRAFVLHSILSEEECEEILRIAKPMMKRSTVVDSITGEIKTDPIRTSKQTFL 138

Query: 136 ARGRDKIIRDIEKRIADFTFFP 157
           ARG+  ++  +E+R++ FT  P
Sbjct: 139 ARGKYPVVTRVEERLSRFTMLP 160


>gi|225452614|ref|XP_002281420.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296087745|emb|CBI35001.3| unnamed protein product [Vitis vinifera]
          Length = 316

 Score = 95.5 bits (236), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 45/82 (54%), Positives = 60/82 (73%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +SW PRAF+Y  FLS+EEC++LI LA   + KS V D+++GKS  S VRTSSG FL 
Sbjct: 54  VTQLSWRPRAFLYKGFLSEEECDHLITLAKDKLEKSMVADNESGKSIMSEVRTSSGMFLL 113

Query: 137 RGRDKIIRDIEKRIADFTFFPL 158
           + +D+I+ DIE RIA +TF P+
Sbjct: 114 KAQDEIVADIEARIAAWTFLPV 135


>gi|357517893|ref|XP_003629235.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523257|gb|AET03711.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 196

 Score = 95.5 bits (236), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 46/81 (56%), Positives = 60/81 (74%), Gaps = 1/81 (1%)

Query: 94  SKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADF 153
           +KEECE+LIN+A P M KSTV D +TGKS D+  RTSSGTF+ RG DKI+R+IE+RIADF
Sbjct: 14  TKEECEHLINIAKPSMHKSTV-DDETGKSVDNSARTSSGTFINRGHDKILRNIEQRIADF 72

Query: 154 TFFPLGMQFKFSLFFFYVLKK 174
           TF P+      ++  + V +K
Sbjct: 73  TFIPVENGESVNILHYEVGQK 93


>gi|215490181|dbj|BAG86624.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
          Length = 294

 Score = 95.1 bits (235), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 41/78 (52%), Positives = 58/78 (74%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           ISW+PRAFVY  FL+ EEC +LI+LA   +++S V D+++G SK S VRTSSG F+ + +
Sbjct: 36  ISWKPRAFVYEGFLTDEECNHLISLAKSELKRSAVADNESGNSKTSEVRTSSGMFIPKAK 95

Query: 140 DKIIRDIEKRIADFTFFP 157
           D I+  IE++IA +TF P
Sbjct: 96  DPIVSGIEEKIATWTFLP 113


>gi|168046048|ref|XP_001775487.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673157|gb|EDQ59684.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 263

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 44/81 (54%), Positives = 60/81 (74%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +SW+PRAF+Y NFLS  EC+++I+LA   + KS V D+++GKS  S +RTSSG FL 
Sbjct: 6   VKQLSWKPRAFLYSNFLSDAECDHMISLAKDKLEKSMVADNESGKSVKSEIRTSSGMFLM 65

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           +G+D II  IE RIA +TF P
Sbjct: 66  KGQDDIISRIEDRIAAWTFLP 86


>gi|412988743|emb|CCO15334.1| predicted protein [Bathycoccus prasinos]
          Length = 352

 Score = 94.7 bits (234), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 42/94 (44%), Positives = 63/94 (67%), Gaps = 2/94 (2%)

Query: 76  WVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
           W+E +SW+PRAF+YHNFLSKEE ++L++L  P + +STVV   TG+  D  +RTS GTF+
Sbjct: 67  WIEALSWDPRAFLYHNFLSKEEAKHLVDLGEPRVTRSTVVGGQTGRVSD--IRTSFGTFI 124

Query: 136 ARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFF 169
            +  D+++  IE R A F+  P+  Q +  L  +
Sbjct: 125 PKKYDEVLEKIEDRCAVFSGIPVAHQEQMQLLRY 158


>gi|363543367|ref|NP_001241693.1| prolyl 4-hydroxylase 8-3 [Zea mays]
 gi|347978836|gb|AEP37760.1| prolyl 4-hydroxylase 8-3 [Zea mays]
          Length = 188

 Score = 94.4 bits (233), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 64/126 (50%), Positives = 79/126 (62%), Gaps = 11/126 (8%)

Query: 9   FPTR--KSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDL-------SSIV 59
           FPTR  ++S  T+ LT L++ + A+L L+AFG+ S+P S+ ++             S+ V
Sbjct: 17  FPTRGGRASPYTVALTALLLVSAALLALIAFGVFSLPVSAPNAAATTGTAAGGETESADV 76

Query: 60  RKSMER--SEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
           R    R   EG   R  QW EVISWEPRAFVYHNFL K+ECEYLI  A P M KSTVVDS
Sbjct: 77  RPRARRDLGEGLGERGAQWTEVISWEPRAFVYHNFLPKDECEYLIGSAKPLMVKSTVVDS 136

Query: 118 DTGKSK 123
            TGK K
Sbjct: 137 TTGKPK 142


>gi|215490183|dbj|BAG86625.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
          Length = 318

 Score = 94.4 bits (233), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 42/81 (51%), Positives = 58/81 (71%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  ISW PRAFVY NFL+ EEC++ I LA   + KS V D+++GKS +S VRTSSG F  
Sbjct: 62  VTQISWRPRAFVYRNFLTDEECDHFITLAKHKLEKSMVADNESGKSVESEVRTSSGMFFR 121

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D+++ ++E RIA +TF P
Sbjct: 122 KAQDQVVANVEARIAAWTFLP 142


>gi|168001068|ref|XP_001753237.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695523|gb|EDQ81866.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 284

 Score = 94.4 bits (233), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 46/84 (54%), Positives = 61/84 (72%), Gaps = 4/84 (4%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVISW+PR  + HNFLS +EC++LINLA P + KSTVVD+ TGK  +S+VRTS+G FL  
Sbjct: 79  EVISWQPRIILLHNFLSADECDHLINLARPRLVKSTVVDATTGKGIESKVRTSTGMFL-N 137

Query: 138 GRDK---IIRDIEKRIADFTFFPL 158
           G D+    I+ IE RIA ++  P+
Sbjct: 138 GNDRRHHTIQAIETRIAAYSMVPV 161


>gi|357496283|ref|XP_003618430.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
 gi|217073992|gb|ACJ85356.1| unknown [Medicago truncatula]
 gi|355493445|gb|AES74648.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
 gi|388494436|gb|AFK35284.1| unknown [Medicago truncatula]
          Length = 313

 Score = 94.0 bits (232), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 43/82 (52%), Positives = 59/82 (71%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +SW PRAF+Y NFL+ EEC++LI L+   + KS V D+++GKS  S VRTSSG FL 
Sbjct: 51  VTQLSWSPRAFLYKNFLTDEECDHLIELSKDKLEKSMVADNESGKSIQSEVRTSSGMFLN 110

Query: 137 RGRDKIIRDIEKRIADFTFFPL 158
           + +D+I+  IE RIA +TF P+
Sbjct: 111 KQQDEIVSGIEARIAAWTFLPV 132


>gi|255637501|gb|ACU19077.1| unknown [Glycine max]
          Length = 318

 Score = 93.6 bits (231), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 58/82 (70%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +SW PRAF+Y  FLS EEC++LI LA   + KS V D+++GKS  S VRTSSG FL 
Sbjct: 56  VTQLSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVADNESGKSIMSEVRTSSGMFLN 115

Query: 137 RGRDKIIRDIEKRIADFTFFPL 158
           + +D+I+  IE RIA +TF P+
Sbjct: 116 KAQDEIVAGIEARIAAWTFLPI 137


>gi|356550516|ref|XP_003543632.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 318

 Score = 93.6 bits (231), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 58/82 (70%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +SW PRAF+Y  FLS EEC++LI LA   + KS V D+++GKS  S VRTSSG FL 
Sbjct: 56  VTQLSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVADNESGKSIMSEVRTSSGMFLN 115

Query: 137 RGRDKIIRDIEKRIADFTFFPL 158
           + +D+I+  IE RIA +TF P+
Sbjct: 116 KAQDEIVAGIEARIAAWTFLPI 137


>gi|356572148|ref|XP_003554232.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
          Length = 319

 Score = 93.6 bits (231), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 59/82 (71%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +SW PRAF+Y  FLS+EEC++LI LA   + KS V D+D+GKS  S +RTSSG FL 
Sbjct: 57  VTQLSWSPRAFLYKGFLSEEECDHLIVLAKDKLEKSMVADNDSGKSIMSDIRTSSGMFLN 116

Query: 137 RGRDKIIRDIEKRIADFTFFPL 158
           + +D+I+  IE RIA +TF P+
Sbjct: 117 KAQDEIVAGIEARIAAWTFLPV 138


>gi|159481038|ref|XP_001698589.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158282329|gb|EDP08082.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 258

 Score = 93.2 bits (230), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 40/81 (49%), Positives = 56/81 (69%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E ISW PRAF+YHNFLS+ EC++L ++    + +S VVDS TG+SK   +RTS G    
Sbjct: 8   IETISWSPRAFIYHNFLSEAECDHLTDIGNKRVSRSLVVDSKTGQSKLDDIRTSYGAAFG 67

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           RG D +I  +E+RIA++T  P
Sbjct: 68  RGEDPVIAAVEERIAEWTHLP 88


>gi|449443245|ref|XP_004139390.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 295

 Score = 92.8 bits (229), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 48/104 (46%), Positives = 67/104 (64%), Gaps = 8/104 (7%)

Query: 72  RAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG-KSKDSRVRTS 130
           R ++W       PRAF+YHNFLS++EC  LINLA P M +S V   +T  +   S  RTS
Sbjct: 78  RGDEW-------PRAFIYHNFLSEKECSQLINLAKPRMERSLVSAQNTNWEGVVSSRRTS 130

Query: 131 SGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           SG FLA+G+++++R IEKRIA+FTF P+      S+  + V +K
Sbjct: 131 SGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSILHYEVGQK 174


>gi|224102545|ref|XP_002312720.1| predicted protein [Populus trichocarpa]
 gi|222852540|gb|EEE90087.1| predicted protein [Populus trichocarpa]
          Length = 300

 Score = 92.8 bits (229), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 41/81 (50%), Positives = 60/81 (74%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +SW+PRAFVY  FL+  EC++LI+LA   +++S V D+++GKSK S VRTSSG F+ 
Sbjct: 39  VKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNESGKSKLSEVRTSSGMFIT 98

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D I+  IE +IA +TF P
Sbjct: 99  KAKDPIVAGIEDKIATWTFLP 119


>gi|363543307|ref|NP_001241869.1| prolyl 4-hydroxylase 6-4 precursor [Zea mays]
 gi|347978826|gb|AEP37755.1| prolyl 4-hydroxylase 6-4 [Zea mays]
          Length = 145

 Score = 92.4 bits (228), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 43/81 (53%), Positives = 58/81 (71%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +S  PRAF+Y  FLS  EC+++++LA   M KS V D+D+GKS  S+ RTSSGTFLA
Sbjct: 35  VTQLSSRPRAFLYSGFLSDTECDHIVSLAKGSMEKSMVADNDSGKSVASQARTSSGTFLA 94

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           +  D+I+  IEKR+A +TF P
Sbjct: 95  KREDEIVSAIEKRVAAWTFLP 115


>gi|145345764|ref|XP_001417370.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577597|gb|ABO95663.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 328

 Score = 92.0 bits (227), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 43/93 (46%), Positives = 59/93 (63%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E +SW P A VY  FL++EEC++L  LATP + +STVVD+  G S  S +RTSSG FL 
Sbjct: 56  IERVSWRPHAEVYRGFLTREECDHLKALATPSLGRSTVVDASNGGSVPSDIRTSSGMFLL 115

Query: 137 RGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFF 169
           RG D ++  IE+RIA +T  P      F +  +
Sbjct: 116 RGEDDVVASIERRIASWTHVPESHGEGFQVLRY 148


>gi|449461905|ref|XP_004148682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 295

 Score = 92.0 bits (227), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 41/81 (50%), Positives = 57/81 (70%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +SW+PRAF+Y  FLS  EC++LI+LA   + KS V D+D+GKS  S VRTSSG FL 
Sbjct: 32  VTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLR 91

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D+++  +E RIA +T  P
Sbjct: 92  KAQDEVVAGVEARIAAWTLLP 112


>gi|449522594|ref|XP_004168311.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Cucumis
           sativus]
          Length = 313

 Score = 92.0 bits (227), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 41/81 (50%), Positives = 57/81 (70%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +SW+PRAF+Y  FLS  EC++LI+LA   + KS V D+D+GKS  S VRTSSG FL 
Sbjct: 53  VTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLR 112

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D+++  +E RIA +T  P
Sbjct: 113 KAQDEVVAGVEARIAAWTLLP 133


>gi|224141327|ref|XP_002324025.1| predicted protein [Populus trichocarpa]
 gi|222867027|gb|EEF04158.1| predicted protein [Populus trichocarpa]
          Length = 239

 Score = 92.0 bits (227), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 46/89 (51%), Positives = 61/89 (68%), Gaps = 4/89 (4%)

Query: 69  DEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVR 128
           D  RA Q    +SW+PRAFVY  FLS EEC++LINLA   + KS V + +TG+S +S+ R
Sbjct: 14  DPTRAAQ----LSWQPRAFVYKGFLSDEECDHLINLAKGKLVKSMVANDETGESMESQER 69

Query: 129 TSSGTFLARGRDKIIRDIEKRIADFTFFP 157
           TSSG F+ +  D+I+  IE RIA +TF P
Sbjct: 70  TSSGMFIFKTEDEIVNGIEARIAAWTFLP 98


>gi|307110744|gb|EFN58979.1| hypothetical protein CHLNCDRAFT_137600 [Chlorella variabilis]
          Length = 327

 Score = 92.0 bits (227), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 39/99 (39%), Positives = 62/99 (62%), Gaps = 1/99 (1%)

Query: 76  WVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
           WVEV++W+PRA + H FLS  EC+++I +A P + +STVV  + G   D  +RTSSG F+
Sbjct: 41  WVEVVAWKPRALLLHGFLSHAECDHIIRVADPSLERSTVVSPEGGSMLD-EIRTSSGMFI 99

Query: 136 ARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
            +G D +I  +E+R+A  T  P+  Q    +  + + +K
Sbjct: 100 LKGHDAVISGLEERVAALTHLPVSHQEDLQVLRYELGQK 138


>gi|24417248|gb|AAN60234.1| unknown [Arabidopsis thaliana]
          Length = 190

 Score = 91.7 bits (226), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 53/136 (38%), Positives = 79/136 (58%), Gaps = 7/136 (5%)

Query: 25  IMFTFAILILLAFGILSMPSSSGDSRKAN--DLSSIVRKSMERSEG-DEGRAEQWVEVIS 81
           I   F++  L    ++S   +   +R +N  D S I  K+   S G D  R  Q    +S
Sbjct: 5   IFLAFSLCFLFTLPLISSAPNRFLTRSSNTRDGSVIKMKTSASSFGFDPTRVTQ----LS 60

Query: 82  WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDK 141
           W PR F+Y  FLS EEC++ I LA   + KS V D+D+G+S +S VRTSSG FL++ +D 
Sbjct: 61  WTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLSKRQDD 120

Query: 142 IIRDIEKRIADFTFFP 157
           I+ ++E ++A +TF P
Sbjct: 121 IVSNVEAKLAAWTFLP 136


>gi|18086437|gb|AAL57673.1| AT3g28480/MFJ20_16 [Arabidopsis thaliana]
 gi|24796986|gb|AAN64505.1| At3g28480/MFJ20_16 [Arabidopsis thaliana]
          Length = 316

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 53/136 (38%), Positives = 79/136 (58%), Gaps = 7/136 (5%)

Query: 25  IMFTFAILILLAFGILSMPSSSGDSRKAN--DLSSIVRKSMERSEG-DEGRAEQWVEVIS 81
           I   F++  L    ++S   +   +R +N  D S I  K+   S G D  R  Q    +S
Sbjct: 5   IFLAFSLCFLFTLPLISSAPNRFLTRSSNTRDGSVIKMKTSASSFGFDPTRVTQ----LS 60

Query: 82  WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDK 141
           W PR F+Y  FLS EEC++ I LA   + KS V D+D+G+S +S VRTSSG FL++ +D 
Sbjct: 61  WTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLSKRQDD 120

Query: 142 IIRDIEKRIADFTFFP 157
           I+ ++E ++A +TF P
Sbjct: 121 IVNNVEAKLAAWTFLP 136


>gi|363543309|ref|NP_001241870.1| prolyl 4-hydroxylase 6-3 precursor [Zea mays]
 gi|347978824|gb|AEP37754.1| prolyl 4-hydroxylase 6-3 [Zea mays]
          Length = 208

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 44/81 (54%), Positives = 58/81 (71%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +S  PRAF+Y  FLS  EC++L++LA   M KS V D+D+GKS  S+ RTSSGTFLA
Sbjct: 35  VTQLSSRPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVADNDSGKSVASQARTSSGTFLA 94

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           +  D+I+  IEKR+A +TF P
Sbjct: 95  KREDEIVSAIEKRVAAWTFLP 115


>gi|449432777|ref|XP_004134175.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 303

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 42/81 (51%), Positives = 59/81 (72%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ ISW PRAFVY  FL+  EC++LI+LA   +++S+V D+ +GKSK S VRTSSG F+ 
Sbjct: 41  VKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSEVRTSSGAFIH 100

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D I+  IE +IA +TF P
Sbjct: 101 KAKDPIVSGIEDKIAAWTFLP 121


>gi|449495423|ref|XP_004159836.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 304

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 42/81 (51%), Positives = 59/81 (72%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ ISW PRAFVY  FL+  EC++LI+LA   +++S+V D+ +GKSK S VRTSSG F+ 
Sbjct: 41  VKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSEVRTSSGAFIH 100

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D I+  IE +IA +TF P
Sbjct: 101 KAKDPIVSGIEDKIAAWTFLP 121


>gi|449454448|ref|XP_004144967.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
 gi|449474082|ref|XP_004154068.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
 gi|449515181|ref|XP_004164628.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 300

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 40/81 (49%), Positives = 61/81 (75%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ ISW+PRAFVY  FL+  EC++L+++A   +++S V D+D+GKSK S VRTSSG F++
Sbjct: 39  VKQISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFIS 98

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D I+  IE +I+ +TF P
Sbjct: 99  KNKDPIVSGIEDKISAWTFLP 119


>gi|363543389|ref|NP_001241704.1| prolyl 4-hydroxylase 6-2 precursor [Zea mays]
 gi|347978822|gb|AEP37753.1| prolyl 4-hydroxylase 6-2 [Zea mays]
          Length = 162

 Score = 91.3 bits (225), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 45/90 (50%), Positives = 62/90 (68%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +S  PRAF+Y  FLS  EC+++++LA   M KS V D+D+GKS  S+ RTSSGTFLA
Sbjct: 35  VTQLSSRPRAFLYSGFLSDTECDHIVSLAKGSMEKSMVADNDSGKSVASQARTSSGTFLA 94

Query: 137 RGRDKIIRDIEKRIADFTFFPLGMQFKFSL 166
           +  D+I+  IEKR+A +TF P   + + SL
Sbjct: 95  KREDEIVSAIEKRVAAWTFPPAVSELRPSL 124


>gi|18405808|ref|NP_566838.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
 gi|21617881|gb|AAM66931.1| prolyl 4-hydroxylase, putative [Arabidopsis thaliana]
 gi|332643929|gb|AEE77450.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
          Length = 316

 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 53/136 (38%), Positives = 79/136 (58%), Gaps = 7/136 (5%)

Query: 25  IMFTFAILILLAFGILSMPSSSGDSRKAN--DLSSIVRKSMERSEG-DEGRAEQWVEVIS 81
           I   F++  L    ++S   +   +R +N  D S I  K+   S G D  R  Q    +S
Sbjct: 5   IFLAFSLCFLFTLPLISSAPNRFLTRSSNTRDGSVIKMKTSASSFGFDPTRVTQ----LS 60

Query: 82  WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDK 141
           W PR F+Y  FLS EEC++ I LA   + KS V D+D+G+S +S VRTSSG FL++ +D 
Sbjct: 61  WTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLSKRQDD 120

Query: 142 IIRDIEKRIADFTFFP 157
           I+ ++E ++A +TF P
Sbjct: 121 IVSNVEAKLAAWTFLP 136


>gi|449520144|ref|XP_004167094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 323

 Score = 90.9 bits (224), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 46/92 (50%), Positives = 63/92 (68%), Gaps = 1/92 (1%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG-KSKDSRVRTSSGTFLARGRDKI 142
           PRAF+YHNFLS++EC  LINLA P M +S V   +T  +   S  RTSSG FLA+G++++
Sbjct: 74  PRAFIYHNFLSEKECSQLINLAKPRMERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQL 133

Query: 143 IRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +R IEKRIA+FTF P+      S+  + V +K
Sbjct: 134 VRRIEKRIAEFTFIPVENGEGLSILHYEVGQK 165


>gi|108706360|gb|ABF94155.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative [Oryza
           sativa Japonica Group]
 gi|125585047|gb|EAZ25711.1| hypothetical protein OsJ_09544 [Oryza sativa Japonica Group]
          Length = 277

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 45/95 (47%), Positives = 63/95 (66%), Gaps = 1/95 (1%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLA-TPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
           +SW PRAF+Y  FLS  EC++LI+LA    M KSTVVD ++G+S  S+VRTSSG FL + 
Sbjct: 45  VSWRPRAFLYEGFLSDAECDHLISLAKQGKMEKSTVVDGESGESVTSKVRTSSGMFLDKK 104

Query: 139 RDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLK 173
           +D+++  IE+RIA +T  P      +    F +LK
Sbjct: 105 QDEVVARIEERIAAWTMLPTECIIFYCFANFAILK 139


>gi|413932756|gb|AFW67307.1| oxidoreductase [Zea mays]
          Length = 297

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 44/81 (54%), Positives = 58/81 (71%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +S  PRAF+Y  FLS  EC++L++LA   M KS V D+D+GKS  S+ RTSSGTFLA
Sbjct: 35  VTQLSSRPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVADNDSGKSVASQARTSSGTFLA 94

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           +  D+I+  IEKR+A +TF P
Sbjct: 95  KREDEIVSAIEKRVAAWTFLP 115


>gi|307102963|gb|EFN51228.1| hypothetical protein CHLNCDRAFT_141231 [Chlorella variabilis]
          Length = 313

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 41/99 (41%), Positives = 64/99 (64%), Gaps = 2/99 (2%)

Query: 76  WVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
           W++V+  E R F+  NFL++EEC++++ LA PH+ +S VVD+ TG S+ S +RTS G FL
Sbjct: 33  WMQVLDAEARIFI--NFLTEEECDHIVALAKPHLERSGVVDTATGGSEISDIRTSKGMFL 90

Query: 136 ARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
            RG D  +  IE+RIA +T  P+G      +  ++  +K
Sbjct: 91  ERGHDDTVAAIEERIARWTLLPVGNGEGLQVLNYHPGEK 129


>gi|326526235|dbj|BAJ97134.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 308

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 40/78 (51%), Positives = 58/78 (74%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           ISW PRAF+Y +FLS +E  +L++LA   +++S V D  +GKS+ S VRTSSGTF+++G+
Sbjct: 54  ISWHPRAFLYPHFLSDDEANHLVSLARAELKRSAVADETSGKSQLSEVRTSSGTFISKGK 113

Query: 140 DKIIRDIEKRIADFTFFP 157
           D I+  IE +IA +TF P
Sbjct: 114 DPIVAGIEDKIAAWTFLP 131


>gi|297818456|ref|XP_002877111.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297322949|gb|EFH53370.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 316

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 40/81 (49%), Positives = 58/81 (71%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +SW PRAF+Y  FLS EEC++ I LA   + KS V D+D+G+S +S VRTSSG FL+
Sbjct: 56  VTQLSWTPRAFLYKGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLS 115

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D I+ ++E ++A +TF P
Sbjct: 116 KRQDDIVANVEAKLAAWTFIP 136


>gi|30689216|ref|NP_189490.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
 gi|332643931|gb|AEE77452.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
          Length = 288

 Score = 90.1 bits (222), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 42/79 (53%), Positives = 58/79 (73%), Gaps = 1/79 (1%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV-DSDTGKSKDSRVRTSSGTFLARG 138
           +SW PRAF+Y  FLS EEC++LI LA   + KS VV D D+G+S+DS VRTSSG FL + 
Sbjct: 35  LSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSEVRTSSGMFLTKR 94

Query: 139 RDKIIRDIEKRIADFTFFP 157
           +D I+ ++E ++A +TF P
Sbjct: 95  QDDIVANVEAKLAAWTFLP 113


>gi|125542543|gb|EAY88682.1| hypothetical protein OsI_10157 [Oryza sativa Indica Group]
          Length = 321

 Score = 89.7 bits (221), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 45/95 (47%), Positives = 63/95 (66%), Gaps = 1/95 (1%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLA-TPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
           +SW PRAF+Y  FLS  EC++LI+LA    M KSTVVD ++G+S  S+VRTSSG FL + 
Sbjct: 45  VSWRPRAFLYEGFLSDAECDHLISLAKQGKMEKSTVVDGESGESVTSKVRTSSGMFLDKK 104

Query: 139 RDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLK 173
           +D+++  IE+RIA +T  P      +    F +LK
Sbjct: 105 QDEVVARIEERIAAWTMLPTECIIFYCFANFAILK 139


>gi|388520325|gb|AFK48224.1| unknown [Lotus japonicus]
          Length = 188

 Score = 89.7 bits (221), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 44/74 (59%), Positives = 53/74 (71%)

Query: 101 LINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGM 160
           +INLA PHM KS+VVDS TGKS  SRVRTSSG FL RG+DK+I+ IEKRIADF F P+  
Sbjct: 1   MINLAKPHMAKSSVVDSQTGKSVGSRVRTSSGMFLKRGKDKVIQTIEKRIADFAFIPVEN 60

Query: 161 QFKFSLFFFYVLKK 174
                +  + V +K
Sbjct: 61  GEGLQVLHYEVGQK 74


>gi|302786814|ref|XP_002975178.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
 gi|300157337|gb|EFJ23963.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
          Length = 283

 Score = 89.7 bits (221), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 38/78 (48%), Positives = 60/78 (76%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           +SW+PRAF+Y  F+S  EC++++ +A   ++KS V D+++GKS  S +RTSSG FL++G+
Sbjct: 31  LSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVLSNIRTSSGMFLSKGQ 90

Query: 140 DKIIRDIEKRIADFTFFP 157
           D++I  IE+RIA +TF P
Sbjct: 91  DEVINRIEERIAAWTFLP 108


>gi|302791635|ref|XP_002977584.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
 gi|300154954|gb|EFJ21588.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
          Length = 296

 Score = 89.7 bits (221), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 38/78 (48%), Positives = 60/78 (76%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           +SW+PRAF+Y  F+S  EC++++ +A   ++KS V D+++GKS  S +RTSSG FL++G+
Sbjct: 45  LSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVLSNIRTSSGMFLSKGQ 104

Query: 140 DKIIRDIEKRIADFTFFP 157
           D++I  IE+RIA +TF P
Sbjct: 105 DEVINRIEERIAAWTFLP 122


>gi|242039723|ref|XP_002467256.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
 gi|241921110|gb|EER94254.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
          Length = 303

 Score = 89.7 bits (221), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 41/78 (52%), Positives = 57/78 (73%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           +SW PRAF++  FLS  EC++LI LA   + KS V D+++GKS  S VRTSSG FL + +
Sbjct: 43  LSWRPRAFLHKGFLSDAECDHLIVLAKDKLEKSMVADNESGKSVQSEVRTSSGMFLEKKQ 102

Query: 140 DKIIRDIEKRIADFTFFP 157
           D+++R IE+RIA +TF P
Sbjct: 103 DEVVRGIEERIAAWTFLP 120


>gi|363543301|ref|NP_001241866.1| prolyl 4-hydroxylase 6 precursor [Zea mays]
 gi|195624808|gb|ACG34234.1| oxidoreductase [Zea mays]
 gi|347978818|gb|AEP37751.1| prolyl 4-hydroxylase 6 [Zea mays]
          Length = 297

 Score = 89.4 bits (220), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 43/81 (53%), Positives = 58/81 (71%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +S  PRAF+Y  FLS  EC+++++LA   M KS V D+D+GKS  S+ RTSSGTFLA
Sbjct: 35  VTQLSSRPRAFLYSGFLSDTECDHIVSLAKGSMEKSMVADNDSGKSVASQARTSSGTFLA 94

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           +  D+I+  IEKR+A +TF P
Sbjct: 95  KREDEIVSAIEKRVAAWTFLP 115


>gi|297818458|ref|XP_002877112.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322950|gb|EFH53371.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 289

 Score = 89.4 bits (220), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 42/82 (51%), Positives = 59/82 (71%), Gaps = 1/82 (1%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV-DSDTGKSKDSRVRTSSGTFL 135
           V  +SW PRAF+Y+ FLS EEC++LINLA   + KS VV D ++G+S DS  RTSSG FL
Sbjct: 32  VTQLSWTPRAFLYNGFLSDEECDHLINLAKGKLEKSMVVADDNSGESIDSEERTSSGVFL 91

Query: 136 ARGRDKIIRDIEKRIADFTFFP 157
            + +D I+ ++E ++A +TF P
Sbjct: 92  TKRQDDIVANVEAKLATWTFLP 113


>gi|15239594|ref|NP_197391.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|21593296|gb|AAM65245.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
           thaliana]
 gi|332005243|gb|AED92626.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 298

 Score = 89.4 bits (220), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 63/81 (77%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +S +PRAFVY  FL++ EC+++++LA   +++S V D+D+G+SK S VRTSSGTF++
Sbjct: 37  VKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRTSSGTFIS 96

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           +G+D I+  IE +I+ +TF P
Sbjct: 97  KGKDPIVSGIEDKISTWTFLP 117


>gi|20260280|gb|AAM13038.1| unknown protein [Arabidopsis thaliana]
 gi|22136524|gb|AAM91340.1| unknown protein [Arabidopsis thaliana]
          Length = 298

 Score = 89.4 bits (220), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 63/81 (77%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +S +PRAFVY  FL++ EC+++++LA   +++S V D+D+G+SK S VRTSSGTF++
Sbjct: 37  VKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRTSSGTFIS 96

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           +G+D I+  IE +I+ +TF P
Sbjct: 97  KGKDPIVSGIEDKISTWTFLP 117


>gi|297829156|ref|XP_002882460.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328300|gb|EFH58719.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 299

 Score = 89.0 bits (219), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 62/81 (76%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +S +PRAFVY  FL+  EC++LI+LA  ++++S V D+D G+S+ S VRTSSGTF++
Sbjct: 38  VKQVSAKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFIS 97

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           +G+D I+  IE +++ +TF P
Sbjct: 98  KGKDPIVSGIEDKLSTWTFLP 118


>gi|28393447|gb|AAO42145.1| putative prolyl 4-hydroxylase [Arabidopsis thaliana]
          Length = 253

 Score = 89.0 bits (219), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 42/78 (53%), Positives = 57/78 (73%), Gaps = 1/78 (1%)

Query: 81  SWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV-DSDTGKSKDSRVRTSSGTFLARGR 139
           SW PRAF+Y  FLS EEC++LI LA   + KS VV D D+G+S+DS VRTSSG FL + +
Sbjct: 1   SWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSEVRTSSGMFLTKRQ 60

Query: 140 DKIIRDIEKRIADFTFFP 157
           D I+ ++E ++A +TF P
Sbjct: 61  DDIVANVEAKLAAWTFLP 78


>gi|18397528|ref|NP_566279.1| P4H isoform 2 [Arabidopsis thaliana]
 gi|332640849|gb|AEE74370.1| P4H isoform 2 [Arabidopsis thaliana]
          Length = 299

 Score = 89.0 bits (219), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 62/81 (76%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +S +PRAFVY  FL+  EC++LI+LA  ++++S V D+D G+S+ S VRTSSGTF++
Sbjct: 38  VKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFIS 97

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           +G+D I+  IE +++ +TF P
Sbjct: 98  KGKDPIVSGIEDKLSTWTFLP 118


>gi|6437556|gb|AAF08583.1|AC011623_16 unknown protein [Arabidopsis thaliana]
          Length = 278

 Score = 89.0 bits (219), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 62/81 (76%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +S +PRAFVY  FL+  EC++LI+LA  ++++S V D+D G+S+ S VRTSSGTF++
Sbjct: 38  VKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFIS 97

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           +G+D I+  IE +++ +TF P
Sbjct: 98  KGKDPIVSGIEDKLSTWTFLP 118


>gi|297812067|ref|XP_002873917.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297319754|gb|EFH50176.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 298

 Score = 89.0 bits (219), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 62/81 (76%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +S +PRAFVY  FL++ EC+++++LA   +++S V D+D+G+SK S VRTSSGTF+ 
Sbjct: 37  VKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRTSSGTFIP 96

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           +G+D I+  IE +I+ +TF P
Sbjct: 97  KGKDPIVSGIEDKISTWTFLP 117


>gi|21618073|gb|AAM67123.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
           thaliana]
          Length = 297

 Score = 89.0 bits (219), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 62/81 (76%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +S +PRAFVY  FL+  EC++LI+LA  ++++S V D+D G+S+ S VRTSSGTF++
Sbjct: 36  VKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFIS 95

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           +G+D I+  IE +++ +TF P
Sbjct: 96  KGKDPIVSGIEDKLSTWTFLP 116


>gi|222636605|gb|EEE66737.1| hypothetical protein OsJ_23428 [Oryza sativa Japonica Group]
          Length = 487

 Score = 88.6 bits (218), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 41/81 (50%), Positives = 55/81 (67%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +SW PR FVY  FLS +EC++L+ L    M++S V D+ +GKS  S VRTSSG FL 
Sbjct: 57  VRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSSGMFLD 116

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D ++  IEKRIA +TF P
Sbjct: 117 KRQDPVVSRIEKRIAAWTFLP 137


>gi|218199253|gb|EEC81680.1| hypothetical protein OsI_25242 [Oryza sativa Indica Group]
          Length = 487

 Score = 88.6 bits (218), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 41/81 (50%), Positives = 55/81 (67%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +SW PR FVY  FLS +EC++L+ L    M++S V D+ +GKS  S VRTSSG FL 
Sbjct: 57  VRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSSGMFLD 116

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D ++  IEKRIA +TF P
Sbjct: 117 KRQDPVVSRIEKRIAAWTFLP 137


>gi|9294583|dbj|BAB02864.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
           thaliana]
          Length = 332

 Score = 88.6 bits (218), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 57/81 (70%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +SW PR F+Y  FLS EEC++ I LA   + KS V D+D+G+S +S VRTSSG FL+
Sbjct: 72  VTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLS 131

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D I+ ++E ++A +TF P
Sbjct: 132 KRQDDIVSNVEAKLAAWTFLP 152


>gi|414591891|tpg|DAA42462.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
          Length = 207

 Score = 88.6 bits (218), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 40/81 (49%), Positives = 57/81 (70%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +SW PR FVY  FLS  EC++L+ LA   +++S V D+++GKS  S VRTSSG FL 
Sbjct: 45  VKAVSWHPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNESGKSVKSEVRTSSGMFLD 104

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D ++  IE+RIA +TF P
Sbjct: 105 KRQDPVVSRIEERIAAWTFLP 125


>gi|110738390|dbj|BAF01121.1| hypothetical protein [Arabidopsis thaliana]
          Length = 299

 Score = 88.6 bits (218), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 62/81 (76%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +S +PRAFVY  FL+  EC++LI+LA  ++++S V D+D G+S+ S VRTSSGTF++
Sbjct: 38  VKQVSSKPRAFVYGGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFIS 97

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           +G+D I+  IE +++ +TF P
Sbjct: 98  KGKDPIVSGIEDKLSTWTFLP 118


>gi|159487763|ref|XP_001701892.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158281111|gb|EDP06867.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 259

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 40/85 (47%), Positives = 60/85 (70%), Gaps = 1/85 (1%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E ++W+PR F+YHNF+++ E ++LI LA P M++STVV +  GKS +   RTS GTFL 
Sbjct: 1   IEHVAWKPRVFIYHNFITEVEAKHLIELAAPQMKRSTVVGAG-GKSVEDNYRTSYGTFLK 59

Query: 137 RGRDKIIRDIEKRIADFTFFPLGMQ 161
           R +D+I+  IE R+A +T  P+  Q
Sbjct: 60  RYQDEIVERIENRVAAWTQIPVAHQ 84


>gi|363543295|ref|NP_001241863.1| prolyl 4-hydroxylase 4 precursor [Zea mays]
 gi|347978806|gb|AEP37745.1| prolyl 4-hydroxylase 4 [Zea mays]
 gi|414591890|tpg|DAA42461.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
          Length = 274

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 40/81 (49%), Positives = 57/81 (70%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +SW PR FVY  FLS  EC++L+ LA   +++S V D+++GKS  S VRTSSG FL 
Sbjct: 45  VKAVSWHPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNESGKSVKSEVRTSSGMFLD 104

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D ++  IE+RIA +TF P
Sbjct: 105 KRQDPVVSRIEERIAAWTFLP 125


>gi|115471029|ref|NP_001059113.1| Os07g0194500 [Oryza sativa Japonica Group]
 gi|113610649|dbj|BAF21027.1| Os07g0194500 [Oryza sativa Japonica Group]
 gi|215768445|dbj|BAH00674.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 319

 Score = 88.2 bits (217), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 41/81 (50%), Positives = 55/81 (67%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +SW PR FVY  FLS +EC++L+ L    M++S V D+ +GKS  S VRTSSG FL 
Sbjct: 57  VRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSSGMFLD 116

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D ++  IEKRIA +TF P
Sbjct: 117 KRQDPVVSRIEKRIAAWTFLP 137


>gi|34393269|dbj|BAC83179.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
           sativa Japonica Group]
 gi|50509101|dbj|BAD30161.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
           sativa Japonica Group]
          Length = 313

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 41/81 (50%), Positives = 55/81 (67%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +SW PR FVY  FLS +EC++L+ L    M++S V D+ +GKS  S VRTSSG FL 
Sbjct: 51  VRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSSGMFLD 110

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D ++  IEKRIA +TF P
Sbjct: 111 KRQDPVVSRIEKRIAAWTFLP 131


>gi|224069056|ref|XP_002302889.1| predicted protein [Populus trichocarpa]
 gi|222844615|gb|EEE82162.1| predicted protein [Populus trichocarpa]
          Length = 287

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 43/83 (51%), Positives = 58/83 (69%), Gaps = 2/83 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           E+ISW PR  V H+FLS EEC+YL  LA P +R STVVD  TGK  +S+VRTSSG FL+ 
Sbjct: 82  EIISWSPRIIVLHDFLSSEECDYLRALAKPRLRISTVVDVKTGKGIESKVRTSSGMFLSS 141

Query: 138 GRD--KIIRDIEKRIADFTFFPL 158
                ++++ IEKRI+ ++  P+
Sbjct: 142 EEKTYQVVQAIEKRISVYSQVPI 164


>gi|363543297|ref|NP_001241864.1| prolyl 4-hydroxylase 4-2 precursor [Zea mays]
 gi|194704960|gb|ACF86564.1| unknown [Zea mays]
 gi|347978810|gb|AEP37747.1| prolyl 4-hydroxylase 4-2 [Zea mays]
          Length = 207

 Score = 87.8 bits (216), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 40/81 (49%), Positives = 56/81 (69%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +SW PR FVY  FLS  EC++L+ LA    ++S V D+++GKS  S VRTSSG FL 
Sbjct: 45  VKAVSWHPRIFVYKGFLSDAECDHLVTLAKKKTQRSMVADNESGKSVKSEVRTSSGMFLD 104

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D ++  IE+RIA +TF P
Sbjct: 105 KRQDPVVSRIEERIAAWTFLP 125


>gi|357140446|ref|XP_003571778.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 298

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 39/78 (50%), Positives = 57/78 (73%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           +SW PRAF++  FLS+ EC+++I LA   + KS V D+++GKS  S VRTSSG FL + +
Sbjct: 38  LSWRPRAFLHKGFLSEPECDHMIELAKDKLEKSMVADNESGKSVQSEVRTSSGMFLEKRQ 97

Query: 140 DKIIRDIEKRIADFTFFP 157
           D+++  IE+RIA +TF P
Sbjct: 98  DEVVARIEERIAAWTFLP 115


>gi|242088305|ref|XP_002439985.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
 gi|241945270|gb|EES18415.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
          Length = 308

 Score = 87.8 bits (216), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 41/78 (52%), Positives = 57/78 (73%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           ISW+PR F+Y +FLS +E  +LI+LA   +++S V D+ +GKS  S VRTSSGTFL +G+
Sbjct: 54  ISWKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSDVRTSSGTFLRKGQ 113

Query: 140 DKIIRDIEKRIADFTFFP 157
           D I+  IE +IA +TF P
Sbjct: 114 DPIVEGIEDKIAAWTFLP 131


>gi|363543293|ref|NP_001241862.1| prolyl 4-hydroxylase 2-1 precursor [Zea mays]
 gi|347978802|gb|AEP37743.1| prolyl 4-hydroxylase 2-1 [Zea mays]
          Length = 204

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 40/78 (51%), Positives = 56/78 (71%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           +SW PRAF++  FLS  EC++LI LA   + KS V D+++GKS  S VRTSSG FL R +
Sbjct: 39  LSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSEVRTSSGMFLERKQ 98

Query: 140 DKIIRDIEKRIADFTFFP 157
           D+++  IE+RI+ +TF P
Sbjct: 99  DEVVTRIEERISAWTFLP 116


>gi|359806348|ref|NP_001241485.1| uncharacterized protein LOC100783075 precursor [Glycine max]
 gi|255645457|gb|ACU23224.1| unknown [Glycine max]
          Length = 298

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 40/81 (49%), Positives = 60/81 (74%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ ISW+PRAFVY  FL+  EC++LI+LA   +++S V D+ +G+S+ S VRTSSG F++
Sbjct: 37  VKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLSDVRTSSGMFIS 96

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D II  IE +I+ +TF P
Sbjct: 97  KNKDPIISGIEDKISSWTFLP 117


>gi|242047772|ref|XP_002461632.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
 gi|241925009|gb|EER98153.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
          Length = 307

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 40/81 (49%), Positives = 57/81 (70%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +SW+PR FVY  FLS  EC++L+ LA   +++S V D+ +GKS  S VRTSSG FL 
Sbjct: 44  VKAVSWQPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNQSGKSVMSEVRTSSGMFLN 103

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D ++  IE+RIA +TF P
Sbjct: 104 KRQDPVVSRIEERIAAWTFLP 124


>gi|242075290|ref|XP_002447581.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
 gi|241938764|gb|EES11909.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
          Length = 263

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 56/82 (68%), Gaps = 2/82 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVISW PR  ++HNFLS EEC+YL+ +A P ++ STVVD  TGK   S VRTSSG F+  
Sbjct: 58  EVISWTPRIIIFHNFLSSEECDYLMAIARPRLQMSTVVDVATGKGVKSDVRTSSGMFVNS 117

Query: 138 GRDK--IIRDIEKRIADFTFFP 157
              K  +I+ IEKRI+ F+  P
Sbjct: 118 EERKSPVIQAIEKRISVFSQIP 139


>gi|414587755|tpg|DAA38326.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
          Length = 244

 Score = 87.4 bits (215), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 56/82 (68%), Gaps = 2/82 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVISW PR  V+HNFLS EEC+YL+ +A P ++ STVVD  TGK   S VRTSSG F+  
Sbjct: 58  EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSDVRTSSGMFVNS 117

Query: 138 GRDK--IIRDIEKRIADFTFFP 157
              K  +++ IEKRI+ F+  P
Sbjct: 118 EERKSPVVQAIEKRISVFSQIP 139


>gi|388496942|gb|AFK36537.1| unknown [Lotus japonicus]
          Length = 302

 Score = 87.0 bits (214), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 60/81 (74%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +SW+PRAFVY  FL++ EC++LI+LA   +++S V D+ +G SK S VRTSSG F++
Sbjct: 41  VKQVSWKPRAFVYKGFLTELECDHLISLAKSELKRSAVADNLSGDSKLSDVRTSSGMFIS 100

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D I+  IE +I+ +TF P
Sbjct: 101 KNKDPIVAGIEDKISSWTFLP 121


>gi|218184507|gb|EEC66934.1| hypothetical protein OsI_33548 [Oryza sativa Indica Group]
          Length = 308

 Score = 87.0 bits (214), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 40/78 (51%), Positives = 57/78 (73%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           +SW PRAF++  FL+  ECE+LI+LA   + KS V D+++GKS  S VRTSSG FL + +
Sbjct: 48  LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSEVRTSSGMFLEKKQ 107

Query: 140 DKIIRDIEKRIADFTFFP 157
           D+++  IE+RIA +TF P
Sbjct: 108 DEVVARIEERIAAWTFLP 125


>gi|116309432|emb|CAH66506.1| OSIGBa0111I14.1 [Oryza sativa Indica Group]
          Length = 267

 Score = 87.0 bits (214), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 57/82 (69%), Gaps = 2/82 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVISW PR  V+HNFLS EEC+YL ++A P ++ STVVD  TGK   S VRTSSG F++ 
Sbjct: 62  EVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVKSNVRTSSGMFVSS 121

Query: 138 GRDK--IIRDIEKRIADFTFFP 157
              K  +I+ IEKRI+ ++  P
Sbjct: 122 EERKLPVIQSIEKRISVYSQIP 143


>gi|115481998|ref|NP_001064592.1| Os10g0413500 [Oryza sativa Japonica Group]
 gi|110289075|gb|ABG66075.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|113639201|dbj|BAF26506.1| Os10g0413500 [Oryza sativa Japonica Group]
 gi|215692577|dbj|BAG87997.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222612821|gb|EEE50953.1| hypothetical protein OsJ_31503 [Oryza sativa Japonica Group]
          Length = 308

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 40/78 (51%), Positives = 57/78 (73%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           +SW PRAF++  FL+  ECE+LI+LA   + KS V D+++GKS  S VRTSSG FL + +
Sbjct: 48  LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSEVRTSSGMFLEKKQ 107

Query: 140 DKIIRDIEKRIADFTFFP 157
           D+++  IE+RIA +TF P
Sbjct: 108 DEVVARIEERIAAWTFLP 125


>gi|110289076|gb|ABB47602.2| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
          Length = 309

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 40/78 (51%), Positives = 57/78 (73%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           +SW PRAF++  FL+  ECE+LI+LA   + KS V D+++GKS  S VRTSSG FL + +
Sbjct: 48  LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSEVRTSSGMFLEKKQ 107

Query: 140 DKIIRDIEKRIADFTFFP 157
           D+++  IE+RIA +TF P
Sbjct: 108 DEVVARIEERIAAWTFLP 125


>gi|357447555|ref|XP_003594053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
 gi|355483101|gb|AES64304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
          Length = 303

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 38/81 (46%), Positives = 60/81 (74%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +SW+PRAFVY  FL+  EC++LI++A   +++S V D+ +G+SK S VRTSSG F++
Sbjct: 40  VKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFIS 99

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D I+  IE +I+ +TF P
Sbjct: 100 KNKDAIVSGIEDKISSWTFLP 120


>gi|357447553|ref|XP_003594052.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
 gi|355483100|gb|AES64303.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
          Length = 301

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 38/81 (46%), Positives = 60/81 (74%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +SW+PRAFVY  FL+  EC++LI++A   +++S V D+ +G+SK S VRTSSG F++
Sbjct: 40  VKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFIS 99

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D I+  IE +I+ +TF P
Sbjct: 100 KNKDAIVSGIEDKISSWTFLP 120


>gi|115457822|ref|NP_001052511.1| Os04g0346000 [Oryza sativa Japonica Group]
 gi|38346023|emb|CAE03962.2| OSJNBb0085H11.11 [Oryza sativa Japonica Group]
 gi|113564082|dbj|BAF14425.1| Os04g0346000 [Oryza sativa Japonica Group]
 gi|125547818|gb|EAY93640.1| hypothetical protein OsI_15426 [Oryza sativa Indica Group]
 gi|125589953|gb|EAZ30303.1| hypothetical protein OsJ_14349 [Oryza sativa Japonica Group]
 gi|215693934|dbj|BAG89133.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 267

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 57/82 (69%), Gaps = 2/82 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVISW PR  V+HNFLS EEC+YL ++A P ++ STVVD  TGK   S VRTSSG F++ 
Sbjct: 62  EVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVKSNVRTSSGMFVSS 121

Query: 138 GRDK--IIRDIEKRIADFTFFP 157
              K  +I+ IEKRI+ ++  P
Sbjct: 122 EERKLPVIQSIEKRISVYSQIP 143


>gi|293337056|ref|NP_001169835.1| uncharacterized protein LOC100383727 precursor [Zea mays]
 gi|224031897|gb|ACN35024.1| unknown [Zea mays]
 gi|347978800|gb|AEP37742.1| prolyl 4-hydroxylase 2 [Zea mays]
 gi|414871435|tpg|DAA49992.1| TPA: hypothetical protein ZEAMMB73_500506 [Zea mays]
          Length = 299

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 40/78 (51%), Positives = 56/78 (71%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           +SW PRAF++  FLS  EC++LI LA   + KS V D+++GKS  S VRTSSG FL R +
Sbjct: 39  LSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSEVRTSSGMFLERKQ 98

Query: 140 DKIIRDIEKRIADFTFFP 157
           D+++  IE+RI+ +TF P
Sbjct: 99  DEVVTRIEERISAWTFLP 116


>gi|356546462|ref|XP_003541645.1| PREDICTED: uncharacterized protein LOC100818794 [Glycine max]
          Length = 839

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 38/81 (46%), Positives = 60/81 (74%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +SW+PRAFVY  FL++ EC++LI++A   +++S V D+ +G+SK S VRTSSG F+ 
Sbjct: 578 VKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFIP 637

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D I+  IE +I+ +TF P
Sbjct: 638 KNKDLIVAGIEDKISSWTFLP 658


>gi|297824279|ref|XP_002880022.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
 gi|297325861|gb|EFH56281.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
          Length = 283

 Score = 87.0 bits (214), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 46/82 (56%), Positives = 54/82 (65%), Gaps = 2/82 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV+SW PR  V H+FLS EECEYL  +A P ++ STVVD  TGK   S VRTSSG FL  
Sbjct: 78  EVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVKSDVRTSSGMFLTH 137

Query: 138 GR--DKIIRDIEKRIADFTFFP 157
               + II+ IEKRIA F+  P
Sbjct: 138 VERSNPIIQAIEKRIAVFSQVP 159


>gi|414587754|tpg|DAA38325.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
          Length = 169

 Score = 86.7 bits (213), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 56/82 (68%), Gaps = 2/82 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVISW PR  V+HNFLS EEC+YL+ +A P ++ STVVD  TGK   S VRTSSG F+  
Sbjct: 58  EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSDVRTSSGMFVNS 117

Query: 138 GRDK--IIRDIEKRIADFTFFP 157
              K  +++ IEKRI+ F+  P
Sbjct: 118 EERKSPVVQAIEKRISVFSQIP 139


>gi|384250156|gb|EIE23636.1| hypothetical protein COCSUDRAFT_53414 [Coccomyxa subellipsoidea
           C-169]
          Length = 285

 Score = 86.7 bits (213), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 38/93 (40%), Positives = 58/93 (62%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE ISW PRAF+Y   LS++EC+Y+IN A P+M K+TV+D+ T K   +++R +   ++ 
Sbjct: 53  VERISWNPRAFLYRGLLSQDECDYIINAARPNMVKATVLDAKTKKQVPNKLRNNKEAYID 112

Query: 137 RGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFF 169
              D +I  IE+RIA +TF P      F +  +
Sbjct: 113 GSADDVIDQIERRIARYTFLPAAHGEPFHIMQY 145


>gi|363807286|ref|NP_001242363.1| uncharacterized protein LOC100796794 precursor [Glycine max]
 gi|255641119|gb|ACU20838.1| unknown [Glycine max]
          Length = 297

 Score = 86.7 bits (213), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 60/81 (74%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ ISW+PRAFVY  FL+  EC++LI+LA   +++S V D+ +G+S+ S VRTSSG F++
Sbjct: 36  VKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLSDVRTSSGMFIS 95

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D I+  IE +I+ +TF P
Sbjct: 96  KNKDPIVAGIEDKISSWTFLP 116


>gi|307110383|gb|EFN58619.1| hypothetical protein CHLNCDRAFT_19485 [Chlorella variabilis]
          Length = 328

 Score = 86.7 bits (213), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 40/98 (40%), Positives = 67/98 (68%), Gaps = 1/98 (1%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE +SW+PRAFV+HNF+++EE ++++ LA P M++STVV +  G S + ++RTS GTFL 
Sbjct: 32  VEPVSWKPRAFVFHNFMTEEEADHIVALAKPFMKRSTVVGAG-GASVEDQIRTSYGTFLK 90

Query: 137 RGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           R +D I+  +E+R+A +T   +  Q    +  + + +K
Sbjct: 91  RLQDPIVTAVEQRLATWTKLNVSHQEDMQILRYGIGQK 128


>gi|308799217|ref|XP_003074389.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
 gi|116000560|emb|CAL50240.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
          Length = 294

 Score = 86.7 bits (213), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 41/93 (44%), Positives = 55/93 (59%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E +SW P A VY  FL++ ECE++  LAT  ++ STVVD+ TG    S +RTSSG FL 
Sbjct: 26  IERLSWAPHAEVYRGFLTEAECEHIERLATAELKPSTVVDASTGGDASSEIRTSSGMFLG 85

Query: 137 RGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFF 169
           R  D +I  IE RIA +T  P      F +  +
Sbjct: 86  RAEDDVIEAIEARIAAWTHVPESHGEGFQVLRY 118


>gi|255641919|gb|ACU21228.1| unknown [Glycine max]
          Length = 301

 Score = 86.7 bits (213), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 38/81 (46%), Positives = 60/81 (74%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +SW+PRAFVY  FL++ EC++LI++A   +++S V D+ +G+SK S VRTSSG F+ 
Sbjct: 40  VKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFIP 99

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D I+  IE +I+ +TF P
Sbjct: 100 KNKDLIVAGIEDKISSWTFLP 120


>gi|326501992|dbj|BAK06488.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 306

 Score = 86.3 bits (212), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 42/79 (53%), Positives = 57/79 (72%), Gaps = 1/79 (1%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPH-MRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
           +SW PRAF+Y  FL++ EC++L+ LA    ++KS VVD  TGKS  S VRTSSGTFLA+ 
Sbjct: 41  VSWRPRAFLYKGFLTEAECDHLVALAEEGGLQKSMVVDRQTGKSVMSEVRTSSGTFLAKK 100

Query: 139 RDKIIRDIEKRIADFTFFP 157
           +D+++  IE RIA +T  P
Sbjct: 101 QDQVVATIEARIAAWTLLP 119


>gi|414587756|tpg|DAA38327.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
          Length = 263

 Score = 86.3 bits (212), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 44/82 (53%), Positives = 56/82 (68%), Gaps = 2/82 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVISW PR  V+HNFLS EEC+YL+ +A P ++ STVVD  TGK   S VRTSSG F+  
Sbjct: 58  EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSDVRTSSGMFVNS 117

Query: 138 GRDK--IIRDIEKRIADFTFFP 157
              K  +++ IEKRI+ F+  P
Sbjct: 118 EERKSPVVQAIEKRISVFSQIP 139


>gi|388492638|gb|AFK34385.1| unknown [Medicago truncatula]
          Length = 299

 Score = 86.3 bits (212), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 40/81 (49%), Positives = 58/81 (71%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ ISW PRAFVY  FL+  EC++LI+LA   +++S V D+ +G S+ S VRTSSG F++
Sbjct: 37  VKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSDVRTSSGMFIS 96

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D I+  IE RI+ +TF P
Sbjct: 97  KNKDPIVSGIEDRISAWTFLP 117


>gi|357478545|ref|XP_003609558.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355510613|gb|AES91755.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
          Length = 299

 Score = 85.9 bits (211), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 40/81 (49%), Positives = 58/81 (71%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ ISW PRAFVY  FL+  EC++LI+LA   +++S V D+ +G S+ S VRTSSG F++
Sbjct: 37  VKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSDVRTSSGMFIS 96

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D I+  IE RI+ +TF P
Sbjct: 97  KNKDPIVSGIEDRISAWTFLP 117


>gi|29150368|gb|AAO72377.1| putative oxidoreductase [Oryza sativa Japonica Group]
 gi|108711617|gb|ABF99412.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|125546090|gb|EAY92229.1| hypothetical protein OsI_13949 [Oryza sativa Indica Group]
 gi|125588294|gb|EAZ28958.1| hypothetical protein OsJ_13002 [Oryza sativa Japonica Group]
          Length = 310

 Score = 85.9 bits (211), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 38/81 (46%), Positives = 56/81 (69%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V +ISW+PR F Y  FLS +EC++L+ L    +++S V D+++GKS  S VRTSSG FL 
Sbjct: 48  VTIISWKPRIFFYKGFLSDDECDHLVKLGKEKLKRSMVADNESGKSVMSEVRTSSGMFLD 107

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D ++  IE+RIA +T  P
Sbjct: 108 KQQDPVVSGIEERIAAWTLLP 128


>gi|357445147|ref|XP_003592851.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355481899|gb|AES63102.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
          Length = 281

 Score = 85.9 bits (211), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 43/83 (51%), Positives = 57/83 (68%), Gaps = 2/83 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV+SW PR  + HNFLS EEC+YL  +A P ++ STVVD++TGK   S VRTSSG FL+ 
Sbjct: 76  EVLSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGKGIKSDVRTSSGMFLSH 135

Query: 138 GRDK--IIRDIEKRIADFTFFPL 158
              K  +I  IEKRI+ ++  P+
Sbjct: 136 EERKYPMIHAIEKRISVYSQIPI 158


>gi|307109700|gb|EFN57937.1| hypothetical protein CHLNCDRAFT_142031 [Chlorella variabilis]
          Length = 325

 Score = 85.9 bits (211), Expect = 6e-15,   Method: Compositional matrix adjust.
 Identities = 43/93 (46%), Positives = 59/93 (63%), Gaps = 1/93 (1%)

Query: 69  DEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVR 128
           D   A  W E +SW PRAFV HNF SKEE +++I LA P +R+STVV S  G+S     R
Sbjct: 24  DTAAAHPWFEPVSWYPRAFVAHNFASKEETDHMIKLAQPQLRRSTVVGS-RGESVVDNYR 82

Query: 129 TSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQ 161
           TS G F+ R  D+++  +EKR+A +T + +  Q
Sbjct: 83  TSYGMFIRRHHDEVVSTLEKRVATWTKYNVTHQ 115


>gi|326518408|dbj|BAJ88233.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 276

 Score = 85.5 bits (210), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 45/83 (54%), Positives = 55/83 (66%), Gaps = 2/83 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVISW PR  V+HNFLS EEC+YL  +A P +  STVVD  TGK   S VRTSSG F+  
Sbjct: 61  EVISWTPRIIVFHNFLSSEECDYLREIARPRLEISTVVDVATGKGVKSDVRTSSGMFVNS 120

Query: 138 GRDK--IIRDIEKRIADFTFFPL 158
              K  +I+ IEKRI+ F+  P+
Sbjct: 121 EERKLPVIKAIEKRISVFSQIPV 143


>gi|326503458|dbj|BAJ86235.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516134|dbj|BAJ88090.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 266

 Score = 85.5 bits (210), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 45/83 (54%), Positives = 55/83 (66%), Gaps = 2/83 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVISW PR  V+HNFLS EEC+YL  +A P +  STVVD  TGK   S VRTSSG F+  
Sbjct: 61  EVISWTPRIIVFHNFLSSEECDYLREIARPRLEISTVVDVATGKGVKSDVRTSSGMFVNS 120

Query: 138 GRDK--IIRDIEKRIADFTFFPL 158
              K  +I+ IEKRI+ F+  P+
Sbjct: 121 EERKLPVIKAIEKRISVFSQIPV 143


>gi|357128903|ref|XP_003566109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
           distachyon]
          Length = 313

 Score = 85.5 bits (210), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 38/78 (48%), Positives = 57/78 (73%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           ISW+PR F+Y +FLS +E  +L++LA   +++S V D+ +GKS  S VRTS GTF+++G+
Sbjct: 55  ISWKPRVFLYQHFLSDDEANHLLSLARAELKRSAVADNTSGKSTLSEVRTSYGTFISKGK 114

Query: 140 DKIIRDIEKRIADFTFFP 157
           D I+  IE +IA +TF P
Sbjct: 115 DPIVAGIEDKIAAWTFLP 132


>gi|15224220|ref|NP_181836.1| P4H isoform 1 [Arabidopsis thaliana]
 gi|3763917|gb|AAC64297.1| hypothetical protein [Arabidopsis thaliana]
 gi|20197628|gb|AAM15158.1| hypothetical protein [Arabidopsis thaliana]
 gi|26450452|dbj|BAC42340.1| unknown protein [Arabidopsis thaliana]
 gi|29824245|gb|AAP04083.1| unknown protein [Arabidopsis thaliana]
 gi|330255112|gb|AEC10206.1| P4H isoform 1 [Arabidopsis thaliana]
          Length = 283

 Score = 85.5 bits (210), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 46/82 (56%), Positives = 53/82 (64%), Gaps = 2/82 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV+SW PR  V H+FLS EECEYL  +A P ++ STVVD  TGK   S VRTSSG FL  
Sbjct: 78  EVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVKSDVRTSSGMFLTH 137

Query: 138 GRDK--IIRDIEKRIADFTFFP 157
                 II+ IEKRIA F+  P
Sbjct: 138 VERSYPIIQAIEKRIAVFSQVP 159


>gi|388495016|gb|AFK35574.1| unknown [Lotus japonicus]
          Length = 297

 Score = 85.5 bits (210), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 58/81 (71%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +SW+PRAFVY  FL+  EC++LI+LA   +++S V D+  G SK S VRTSSG F++
Sbjct: 36  VKQVSWKPRAFVYEGFLTGLECDHLISLAKSELKRSAVADNLPGDSKLSEVRTSSGMFIS 95

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D I+  IE +I+ +TF P
Sbjct: 96  KKKDPIVAGIEDKISAWTFLP 116


>gi|356555587|ref|XP_003546112.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
           [Glycine max]
          Length = 297

 Score = 85.5 bits (210), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 36/81 (44%), Positives = 59/81 (72%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +SW+PRAFVY  FL++ EC++LI++A   +++S V D+ +G+SK S VRTSSG F+ 
Sbjct: 40  VKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFIP 99

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D I+  +E +I+ +T  P
Sbjct: 100 KNKDPIVAGVEDKISSWTLLP 120


>gi|224033439|gb|ACN35795.1| unknown [Zea mays]
          Length = 180

 Score = 85.1 bits (209), Expect = 8e-15,   Method: Compositional matrix adjust.
 Identities = 43/66 (65%), Positives = 49/66 (74%)

Query: 109 MRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFF 168
           M KSTVVDS TGKSKDSRVRTSSG FL RGRDK+IR IEKRIAD+TF P+       +  
Sbjct: 1   MVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPVDHGEGLQVLH 60

Query: 169 FYVLKK 174
           + V +K
Sbjct: 61  YEVGQK 66


>gi|255083957|ref|XP_002508553.1| predicted protein [Micromonas sp. RCC299]
 gi|226523830|gb|ACO69811.1| predicted protein [Micromonas sp. RCC299]
          Length = 262

 Score = 85.1 bits (209), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 44/96 (45%), Positives = 62/96 (64%), Gaps = 4/96 (4%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS--DTGKSKDSRVRTSSGTF 134
           VE +S EP+AF+YH FLS EEC++LI + TPH+++STVV    DTG   D  VRTS GTF
Sbjct: 1   VEKLSDEPKAFLYHGFLSAEECDHLIKIGTPHLKRSTVVGGKDDTGVLDD--VRTSFGTF 58

Query: 135 LARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFY 170
           L +  D ++  IE+R+ DF+      Q +  L  ++
Sbjct: 59  LPKKYDDVLYGIERRVEDFSQISYENQEQLQLLKYH 94


>gi|356555585|ref|XP_003546111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
           [Glycine max]
          Length = 301

 Score = 85.1 bits (209), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 36/81 (44%), Positives = 59/81 (72%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +SW+PRAFVY  FL++ EC++LI++A   +++S V D+ +G+SK S VRTSSG F+ 
Sbjct: 40  VKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFIP 99

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D I+  +E +I+ +T  P
Sbjct: 100 KNKDPIVAGVEDKISSWTLLP 120


>gi|238007346|gb|ACR34708.1| unknown [Zea mays]
          Length = 180

 Score = 84.7 bits (208), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 43/66 (65%), Positives = 49/66 (74%)

Query: 109 MRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFF 168
           M KSTVVDS TGKSKDSRVRTSSG FL RGRDK+IR IEKRIAD+TF P+       +  
Sbjct: 1   MVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRVIEKRIADYTFIPVDHGEGLQVLH 60

Query: 169 FYVLKK 174
           + V +K
Sbjct: 61  YEVGQK 66


>gi|302823087|ref|XP_002993198.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
 gi|300138968|gb|EFJ05718.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
          Length = 269

 Score = 84.3 bits (207), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 42/86 (48%), Positives = 61/86 (70%), Gaps = 5/86 (5%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSK---DSRVRTSSGTF 134
           EV++W PR  + H FLS EEC+YLI +A P + KSTVVD+ TGK++   +S+VRTS+G F
Sbjct: 61  EVLNWSPRIILLHKFLSAEECDYLIAIAGPRLAKSTVVDTSTGKARHGIESKVRTSTGMF 120

Query: 135 LAR--GRDKIIRDIEKRIADFTFFPL 158
           L+    R  +I+ IE+RIA ++  P+
Sbjct: 121 LSNYDRRYPMIQAIERRIAVYSMIPV 146


>gi|388500582|gb|AFK38357.1| unknown [Medicago truncatula]
          Length = 299

 Score = 84.3 bits (207), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 57/81 (70%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ ISW PRAFVY  FL+  EC++LI+LA   +++S V D+ +G S+ S VRTSSG  ++
Sbjct: 37  VKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSDVRTSSGMLIS 96

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D I+  IE RI+ +TF P
Sbjct: 97  KNKDPIVSGIEDRISAWTFLP 117


>gi|413945803|gb|AFW78452.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
          Length = 239

 Score = 84.0 bits (206), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 40/78 (51%), Positives = 56/78 (71%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           IS +PR F+Y +FLS +E  +LI+LA   +++S V D+ +GKS  S VRTSSGTFL +G+
Sbjct: 54  ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSEVRTSSGTFLRKGQ 113

Query: 140 DKIIRDIEKRIADFTFFP 157
           D I+  IE +IA +TF P
Sbjct: 114 DPIVEGIEDKIAAWTFLP 131


>gi|218192156|gb|EEC74583.1| hypothetical protein OsI_10158 [Oryza sativa Indica Group]
          Length = 299

 Score = 84.0 bits (206), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 41/79 (51%), Positives = 55/79 (69%), Gaps = 1/79 (1%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLA-TPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
           +SW PR F+Y  FLS  ECE+LI LA    M +STVV+  +G+S  S+ RTSSG FL R 
Sbjct: 40  VSWSPRVFLYEGFLSDAECEHLIALAKQGRMERSTVVNGKSGESVMSKTRTSSGMFLIRK 99

Query: 139 RDKIIRDIEKRIADFTFFP 157
           +D+++  IE+RIA +T FP
Sbjct: 100 QDEVVARIEERIAAWTMFP 118


>gi|255083627|ref|XP_002508388.1| predicted protein [Micromonas sp. RCC299]
 gi|226523665|gb|ACO69646.1| predicted protein [Micromonas sp. RCC299]
          Length = 253

 Score = 84.0 bits (206), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 39/78 (50%), Positives = 54/78 (69%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           +SW PRAF  HNF+S EEC+ ++ +A P +R+STV+DS TG+SK   +RTS  TFL RG 
Sbjct: 1   VSWYPRAFHLHNFMSHEECDRILEIARPRVRRSTVIDSVTGQSKVDPIRTSEQTFLNRGT 60

Query: 140 DKIIRDIEKRIADFTFFP 157
             I+  +E+R+A  T  P
Sbjct: 61  WDIVTKVEERLAVVTQLP 78


>gi|255085784|ref|XP_002505323.1| predicted protein [Micromonas sp. RCC299]
 gi|226520592|gb|ACO66581.1| predicted protein [Micromonas sp. RCC299]
          Length = 215

 Score = 83.6 bits (205), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 41/85 (48%), Positives = 54/85 (63%), Gaps = 8/85 (9%)

Query: 76  WVEVISWEPRAFVYHNFLSKEECEYLINLATP---HMRKSTVVDSDTGKSKDSRVRTSSG 132
           W+E ISWEPRAFVYHNFL+ EEC +L+NLA      ++++TV D+ TG +        SG
Sbjct: 1   WIEQISWEPRAFVYHNFLTPEECAHLVNLAKATDGGLKRATVADARTGGT-----FPGSG 55

Query: 133 TFLARGRDKIIRDIEKRIADFTFFP 157
            FL R  D I+  IE+RI+ F   P
Sbjct: 56  AFLLRNHDPIVTRIEERISAFAMIP 80


>gi|357162904|ref|XP_003579560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 266

 Score = 83.6 bits (205), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 44/83 (53%), Positives = 55/83 (66%), Gaps = 2/83 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVISW PR  V+HNFLS EEC++L  +A P +  STVVD  TGK   S VRTSSG F+  
Sbjct: 61  EVISWTPRIIVFHNFLSSEECDFLKEIARPRLEISTVVDVATGKGVKSDVRTSSGMFVNS 120

Query: 138 GRDK--IIRDIEKRIADFTFFPL 158
              K  +I+ IEKRI+ F+  P+
Sbjct: 121 EERKFPVIQAIEKRISVFSQIPV 143


>gi|302834449|ref|XP_002948787.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
           nagariensis]
 gi|300265978|gb|EFJ50167.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
           nagariensis]
          Length = 329

 Score = 83.2 bits (204), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 35/80 (43%), Positives = 54/80 (67%)

Query: 79  VISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
           V+SW+PR F+Y   L++EEC+YLI +A   + +S V D+ TG+   S +RTSSG F  RG
Sbjct: 52  VLSWQPRVFLYKGILTQEECDYLIKIAQGRLERSGVSDATTGEGGVSDIRTSSGMFYTRG 111

Query: 139 RDKIIRDIEKRIADFTFFPL 158
            + +++ IE R+A +T  P+
Sbjct: 112 ENDVVKRIETRLAMWTMLPV 131


>gi|108706361|gb|ABF94156.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|222624253|gb|EEE58385.1| hypothetical protein OsJ_09545 [Oryza sativa Japonica Group]
          Length = 299

 Score = 83.2 bits (204), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 41/79 (51%), Positives = 55/79 (69%), Gaps = 1/79 (1%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLA-TPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
           +SW PR F+Y  FLS  ECE+LI LA    M +STVV+  +G+S  S+ RTSSG FL R 
Sbjct: 40  VSWSPRVFLYEGFLSDVECEHLIALAKQGRMERSTVVNGKSGESVMSKTRTSSGMFLIRK 99

Query: 139 RDKIIRDIEKRIADFTFFP 157
           +D+++  IE+RIA +T FP
Sbjct: 100 QDEVVARIEERIAAWTMFP 118


>gi|226495689|ref|NP_001149322.1| LOC100282945 precursor [Zea mays]
 gi|194697650|gb|ACF82909.1| unknown [Zea mays]
 gi|194708468|gb|ACF88318.1| unknown [Zea mays]
 gi|195626376|gb|ACG35018.1| oxidoreductase [Zea mays]
 gi|347978842|gb|AEP37763.1| prolyl 4-hydroxylase 9 [Zea mays]
 gi|413945802|gb|AFW78451.1| oxidoreductase [Zea mays]
          Length = 308

 Score = 83.2 bits (204), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 40/78 (51%), Positives = 56/78 (71%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           IS +PR F+Y +FLS +E  +LI+LA   +++S V D+ +GKS  S VRTSSGTFL +G+
Sbjct: 54  ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSEVRTSSGTFLRKGQ 113

Query: 140 DKIIRDIEKRIADFTFFP 157
           D I+  IE +IA +TF P
Sbjct: 114 DPIVEGIEDKIAAWTFLP 131


>gi|363814557|ref|NP_001242754.1| uncharacterized protein LOC100794585 [Glycine max]
 gi|255628535|gb|ACU14612.1| unknown [Glycine max]
          Length = 238

 Score = 82.8 bits (203), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 42/83 (50%), Positives = 55/83 (66%), Gaps = 2/83 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV++W PR  + HNFLS EEC+YL  LA P +  STVVD+ TGK   S VRTSSG FL  
Sbjct: 82  EVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTKTGKGIKSDVRTSSGMFLNS 141

Query: 138 GRDK--IIRDIEKRIADFTFFPL 158
              K  +++ IEKRI+ ++  P+
Sbjct: 142 KERKYPMVQAIEKRISVYSQIPI 164


>gi|363543299|ref|NP_001241865.1| prolyl 4-hydroxylase 5-1 [Zea mays]
 gi|347978814|gb|AEP37749.1| prolyl 4-hydroxylase 5-1 [Zea mays]
          Length = 180

 Score = 82.8 bits (203), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 42/66 (63%), Positives = 48/66 (72%)

Query: 109 MRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFF 168
           M KSTVVDS TGKSKDSRVRTSSG FL RGRDK+IR IEKRI D+TF P+       +  
Sbjct: 1   MVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRVIEKRITDYTFIPVDHGEGLQVLH 60

Query: 169 FYVLKK 174
           + V +K
Sbjct: 61  YEVGQK 66


>gi|302764100|ref|XP_002965471.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
 gi|300166285|gb|EFJ32891.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
          Length = 264

 Score = 82.8 bits (203), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 42/86 (48%), Positives = 60/86 (69%), Gaps = 5/86 (5%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSK---DSRVRTSSGTF 134
           EV++W PR  + H FLS EEC+YLI +A P + KSTVVD+ TGK++   +S+VRTS+G F
Sbjct: 60  EVLNWSPRITLLHKFLSAEECDYLIAIAGPRLAKSTVVDTSTGKARHGIESKVRTSTGMF 119

Query: 135 LAR--GRDKIIRDIEKRIADFTFFPL 158
           L+    R  +I  IE+RIA ++  P+
Sbjct: 120 LSNYDRRYPMIEAIERRIAVYSMIPV 145


>gi|255637879|gb|ACU19258.1| unknown [Glycine max]
          Length = 287

 Score = 82.4 bits (202), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 42/83 (50%), Positives = 55/83 (66%), Gaps = 2/83 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV++W PR  + HNFLS EEC+YL  LA P +  STVVD+ TGK   S VRTSSG FL  
Sbjct: 82  EVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTKTGKGIKSDVRTSSGMFLNS 141

Query: 138 GRDK--IIRDIEKRIADFTFFPL 158
              K  +++ IEKRI+ ++  P+
Sbjct: 142 KERKYPMVQAIEKRISVYSQIPI 164


>gi|125552794|gb|EAY98503.1| hypothetical protein OsI_20415 [Oryza sativa Indica Group]
          Length = 319

 Score = 82.4 bits (202), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 37/78 (47%), Positives = 57/78 (73%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           ISW+PR F+Y +FLS +E  +L++LA   +++S V D+ +GKS+ S  RTSSGTF+ + +
Sbjct: 61  ISWKPRVFLYQHFLSDDEANHLVSLARAELKRSAVADNLSGKSELSDARTSSGTFIRKSQ 120

Query: 140 DKIIRDIEKRIADFTFFP 157
           D I+  IE++IA +TF P
Sbjct: 121 DPIVAGIEEKIAAWTFLP 138


>gi|115464581|ref|NP_001055890.1| Os05g0489100 [Oryza sativa Japonica Group]
 gi|50511363|gb|AAT77286.1| putative prolyl 4-hydroxylase alpha subunit [Oryza sativa Japonica
           Group]
 gi|113579441|dbj|BAF17804.1| Os05g0489100 [Oryza sativa Japonica Group]
 gi|125587281|gb|EAZ27945.1| hypothetical protein OsJ_11906 [Oryza sativa Japonica Group]
 gi|215737307|dbj|BAG96236.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 319

 Score = 82.0 bits (201), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 37/78 (47%), Positives = 57/78 (73%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           ISW+PR F+Y +FLS +E  +L++LA   +++S V D+ +GKS+ S  RTSSGTF+ + +
Sbjct: 61  ISWKPRVFLYQHFLSDDEANHLVSLARTELKRSAVADNLSGKSELSDARTSSGTFIRKSQ 120

Query: 140 DKIIRDIEKRIADFTFFP 157
           D I+  IE++IA +TF P
Sbjct: 121 DPIVAGIEEKIAAWTFLP 138


>gi|195627276|gb|ACG35468.1| prolyl 4-hydroxylase [Zea mays]
          Length = 298

 Score = 82.0 bits (201), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 38/78 (48%), Positives = 54/78 (69%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           +SW PRAF++  FL   EC++LI LA   + KS V D+ +GKS  S VRTSSG FL + +
Sbjct: 38  LSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSEVRTSSGMFLEKKQ 97

Query: 140 DKIIRDIEKRIADFTFFP 157
           D+++  IE+RI+ +TF P
Sbjct: 98  DEVVTRIEERISAWTFLP 115


>gi|212720650|ref|NP_001132477.1| uncharacterized protein LOC100193935 precursor [Zea mays]
 gi|194694488|gb|ACF81328.1| unknown [Zea mays]
 gi|347978828|gb|AEP37756.1| prolyl 4-hydroxylase 7 [Zea mays]
 gi|413934218|gb|AFW68769.1| prolyl 4-hydroxylase [Zea mays]
          Length = 298

 Score = 82.0 bits (201), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 38/78 (48%), Positives = 54/78 (69%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           +SW PRAF++  FL   EC++LI LA   + KS V D+ +GKS  S VRTSSG FL + +
Sbjct: 38  LSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSEVRTSSGMFLEKKQ 97

Query: 140 DKIIRDIEKRIADFTFFP 157
           D+++  IE+RI+ +TF P
Sbjct: 98  DEVVTRIEERISAWTFLP 115


>gi|255085592|ref|XP_002505227.1| predicted protein [Micromonas sp. RCC299]
 gi|226520496|gb|ACO66485.1| predicted protein [Micromonas sp. RCC299]
          Length = 267

 Score = 81.6 bits (200), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 40/95 (42%), Positives = 58/95 (61%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           +S +P+A++Y  FL + EC+Y+   A P + KSTVVD+ TG+S  S +RTS G F  R  
Sbjct: 8   LSEKPKAYLYRGFLRQAECDYIKERAKPKLEKSTVVDNKTGQSVPSNIRTSDGMFFDRHE 67

Query: 140 DKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           D II DIE+RIA++T  P        +  + V +K
Sbjct: 68  DDIIEDIERRIAEWTNVPWENGEGIQVLRYEVGQK 102


>gi|303287328|ref|XP_003062953.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455589|gb|EEH52892.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 259

 Score = 81.3 bits (199), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 40/81 (49%), Positives = 52/81 (64%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE ISW PRAF  HN ++  EC+ ++ LA   +R+STVVDS TG+SK   +RTS   FL 
Sbjct: 1   VEPISWHPRAFHLHNIMTDAECDEVLELARTRVRRSTVVDSTTGESKVDPIRTSEQCFLN 60

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           RG   I+  IEKR+  +T  P
Sbjct: 61  RGHFPIVSVIEKRLERYTMLP 81


>gi|307102962|gb|EFN51227.1| hypothetical protein CHLNCDRAFT_28161 [Chlorella variabilis]
          Length = 300

 Score = 80.9 bits (198), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 33/84 (39%), Positives = 58/84 (69%), Gaps = 1/84 (1%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDT-GKSKDSRVRTSSGTFL 135
           ++V+SW+PR F+Y   L++EEC++++  A P + +S VVD D  G    S +RTS G F 
Sbjct: 16  LKVLSWDPRIFLYQRLLTEEECDHMMTKAGPRLTRSGVVDVDNPGGESVSDIRTSYGMFF 75

Query: 136 ARGRDKIIRDIEKRIADFTFFPLG 159
            RG D+++R++E+R+++++  P G
Sbjct: 76  DRGEDEVVREVERRLSEWSLIPPG 99


>gi|255641158|gb|ACU20856.1| unknown [Glycine max]
          Length = 195

 Score = 80.9 bits (198), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 40/83 (48%), Positives = 54/83 (65%), Gaps = 2/83 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV++W PR  + HNFLS EEC+YL  +A P +  S VVD+ TGK   S VRTSSG FL  
Sbjct: 82  EVLNWSPRIILLHNFLSMEECDYLRAIALPRLHISNVVDTKTGKGIKSDVRTSSGMFLNP 141

Query: 138 GRDK--IIRDIEKRIADFTFFPL 158
              K  +++ IEKRI+ ++  P+
Sbjct: 142 QERKYPMVQAIEKRISVYSQIPI 164


>gi|225433714|ref|XP_002268409.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296089634|emb|CBI39453.3| unnamed protein product [Vitis vinifera]
          Length = 287

 Score = 80.5 bits (197), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 40/83 (48%), Positives = 56/83 (67%), Gaps = 2/83 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           E+++W PR  + H+FLS EEC+YL  +A P ++ STVVD+ TGK   S VRTSSG FL+ 
Sbjct: 82  EILNWSPRIILLHSFLSSEECDYLRAMAEPLLQISTVVDAQTGKGIQSDVRTSSGMFLSP 141

Query: 138 GRD--KIIRDIEKRIADFTFFPL 158
                 I+R IEKRI+ ++  P+
Sbjct: 142 DDSTYPIVRAIEKRISVYSQVPV 164


>gi|302844249|ref|XP_002953665.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
 gi|300261074|gb|EFJ45289.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
          Length = 245

 Score = 80.5 bits (197), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 45/136 (33%), Positives = 75/136 (55%), Gaps = 2/136 (1%)

Query: 26  MFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPR 85
           +  F +L+ L   +L++  ++        L S +    E  +  +  A  WVE +   PR
Sbjct: 4   LLAFTVLLFLR-AVLALSENTWGGLPERLLPSALVMHHEADKQFDEEATPWVEQVGLHPR 62

Query: 86  AFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRD 145
           A+++HNFL+K E  +++ LA P +++STVV +D G+     +RTS G F+ R  D +I  
Sbjct: 63  AYLFHNFLTKAERAHMVRLAAPKLKRSTVVGND-GEGVVDEIRTSYGMFIRRLADPVITR 121

Query: 146 IEKRIADFTFFPLGMQ 161
           IEKRI+ +T  P+  Q
Sbjct: 122 IEKRISLWTHLPIEHQ 137


>gi|302850293|ref|XP_002956674.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
           nagariensis]
 gi|300258035|gb|EFJ42276.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
           nagariensis]
          Length = 325

 Score = 80.1 bits (196), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 39/93 (41%), Positives = 58/93 (62%), Gaps = 1/93 (1%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           ++ ISW+PRA VYHNFLS +E  ++I+LA   M++STVV +      D  +RTS GTFL 
Sbjct: 41  IQTISWKPRAVVYHNFLSDQEARHIIDLAHEQMKRSTVVGNKNEGVVDD-IRTSYGTFLR 99

Query: 137 RGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFF 169
           R +D +I  IE+R+A ++  P   Q    +  +
Sbjct: 100 RAQDPVIMAIEERLALWSHMPPSHQEDMQVLRY 132


>gi|356576923|ref|XP_003556579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 287

 Score = 79.7 bits (195), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 40/83 (48%), Positives = 54/83 (65%), Gaps = 2/83 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV++W PR  + HNFLS EEC+YL  +A P +  S VVD+ TGK   S VRTSSG FL  
Sbjct: 82  EVLNWSPRIILLHNFLSMEECDYLRAIALPRLHISNVVDTKTGKGIKSDVRTSSGMFLNP 141

Query: 138 GRDK--IIRDIEKRIADFTFFPL 158
              K  +++ IEKRI+ ++  P+
Sbjct: 142 QERKYPMVQAIEKRISVYSQIPI 164


>gi|145343778|ref|XP_001416487.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576712|gb|ABO94780.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 255

 Score = 79.3 bits (194), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 39/100 (39%), Positives = 59/100 (59%), Gaps = 13/100 (13%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PRAFVY  FL+ EEC++++ L+  H+ KS VVD+ TG S  S +RTS+GTF++R  D  I
Sbjct: 1   PRAFVYEGFLTDEECDHILALSKGHLHKSGVVDAKTGGSTTSDIRTSTGTFISRAHDPTI 60

Query: 144 RDIEKRIADFTFFPL-------------GMQFKFSLFFFY 170
             IE+RI  ++  P+             G ++K    +F+
Sbjct: 61  TAIEERIELWSQIPVDHGEALQVLRYENGQEYKAHFDYFF 100


>gi|308801080|ref|XP_003075321.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
 gi|116061875|emb|CAL52593.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
          Length = 541

 Score = 79.0 bits (193), Expect = 6e-13,   Method: Composition-based stats.
 Identities = 39/86 (45%), Positives = 56/86 (65%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PRAF+Y NFLS++ECE+L+ L+   + KS VVD+ TG S  S VRTS+GTF++R  D II
Sbjct: 265 PRAFLYENFLSEKECEHLLALSKGKLHKSGVVDAQTGGSSLSEVRTSTGTFISRKYDDII 324

Query: 144 RDIEKRIADFTFFPLGMQFKFSLFFF 169
             +E+RI  ++  P      F +  +
Sbjct: 325 AGVEERIELWSQIPQSHHEAFQILRY 350


>gi|302835042|ref|XP_002949083.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
           nagariensis]
 gi|300265828|gb|EFJ50018.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
           nagariensis]
          Length = 263

 Score = 78.6 bits (192), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 38/81 (46%), Positives = 51/81 (62%), Gaps = 2/81 (2%)

Query: 76  WVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
           WVE +SW PRAFVYH FL+  EC++LI LATP + +S VV +D+    D  +RTS    +
Sbjct: 58  WVETVSWMPRAFVYHQFLTPAECDHLIELATPKLERSMVVGTDSDLIDD--IRTSFSASI 115

Query: 136 ARGRDKIIRDIEKRIADFTFF 156
             G   I+  IE+RIA +T  
Sbjct: 116 MYGETSIVSSIEERIARWTVL 136


>gi|303282201|ref|XP_003060392.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457863|gb|EEH55161.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 369

 Score = 78.6 bits (192), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 52/82 (63%), Gaps = 4/82 (4%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PRA+VY  FL+  EC++ I  A+P + KS VVD+DTG+   S +RTS G F  RG D ++
Sbjct: 83  PRAYVYRGFLTDAECDHFIARASPKLAKSNVVDTDTGEGVPSAIRTSDGMFFDRGEDDVV 142

Query: 144 RDIEKRIADFTFFPL----GMQ 161
             +E+RI+ +T  P     GMQ
Sbjct: 143 DAVERRISAWTRLPTENGEGMQ 164


>gi|302845026|ref|XP_002954052.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
           nagariensis]
 gi|300260551|gb|EFJ44769.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
           nagariensis]
          Length = 311

 Score = 78.6 bits (192), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 43/99 (43%), Positives = 60/99 (60%), Gaps = 6/99 (6%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V VISW+PRAFV  NFL++ EC ++ +LA  HMR+STVV +D G S     RTS GTF+ 
Sbjct: 1   VSVISWQPRAFVIRNFLTEHECTHIADLAQVHMRRSTVV-ADNGSSVLDDYRTSYGTFIN 59

Query: 137 RGRDKIIRDIEKRIADFTFFPLGMQ-----FKFSLFFFY 170
           R +  +I  +E R+A  T  P+  Q      ++ L  +Y
Sbjct: 60  RYQTPVIAAVEDRVALLTRTPVVYQEDMQVLRYGLGQYY 98


>gi|226503309|ref|NP_001141668.1| uncharacterized protein LOC100273794 [Zea mays]
 gi|194705482|gb|ACF86825.1| unknown [Zea mays]
 gi|414587757|tpg|DAA38328.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
 gi|414587758|tpg|DAA38329.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
          Length = 165

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 36/58 (62%), Positives = 43/58 (74%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
           EVISW PR  V+HNFLS EEC+YL+ +A P ++ STVVD  TGK   S VRTSSG F+
Sbjct: 58  EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSDVRTSSGMFV 115


>gi|294461211|gb|ADE76168.1| unknown [Picea sitchensis]
          Length = 280

 Score = 78.2 bits (191), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 36/75 (48%), Positives = 53/75 (70%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P  F+Y NFL+  EC++LI LA   ++KS V D+++GKS  S +RTSSG FL + +D+I+
Sbjct: 28  PGLFLYKNFLTDAECDHLIFLARDKLQKSMVADNESGKSVMSEIRTSSGMFLNKAQDEIV 87

Query: 144 RDIEKRIADFTFFPL 158
             +E RIA +TF P+
Sbjct: 88  ASVEDRIAAWTFLPI 102


>gi|449468746|ref|XP_004152082.1| PREDICTED: putative prolyl 4-hydroxylase-like [Cucumis sativus]
          Length = 290

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 39/83 (46%), Positives = 53/83 (63%), Gaps = 2/83 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV+SW PR  V HNFLS +EC+YL  +A   +  STVVD+ TGK   S  RTSSG FL+ 
Sbjct: 83  EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSSGMFLSH 142

Query: 138 GRDK--IIRDIEKRIADFTFFPL 158
                 +++ IEKRI+ ++  P+
Sbjct: 143 HEKNFPMVQAIEKRISVYSQVPV 165


>gi|412985583|emb|CCO19029.1| predicted protein [Bathycoccus prasinos]
          Length = 458

 Score = 77.8 bits (190), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 36/99 (36%), Positives = 66/99 (66%), Gaps = 1/99 (1%)

Query: 77  VEVISWE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
           +++IS + PRAF+Y  F++ EEC++LI+ +   M KS VVD++TG +  S +RTS+G+F+
Sbjct: 177 MQIISLDHPRAFLYKRFMTDEECDFLIDHSKSRMSKSGVVDAETGGTAKSDIRTSTGSFV 236

Query: 136 ARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
             G + +++ +EKR+A F+  P+  Q    +  + V ++
Sbjct: 237 GIGANDLMKKLEKRVATFSMLPVKHQEATQVLRYEVKQE 275


>gi|449459442|ref|XP_004147455.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
 gi|449515722|ref|XP_004164897.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 319

 Score = 77.4 bits (189), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 39/79 (49%), Positives = 56/79 (70%), Gaps = 1/79 (1%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           +S +PRAF+Y  FLS EEC++LIN A   + +S +V + TG+S  S+ RTS+G FL + +
Sbjct: 63  LSSKPRAFLYKGFLSAEECQHLINSAKGKLHQS-LVAAGTGQSVTSKERTSTGMFLHKAQ 121

Query: 140 DKIIRDIEKRIADFTFFPL 158
           D+I+  IE RIA +TF PL
Sbjct: 122 DEIVARIESRIAAWTFLPL 140


>gi|145347188|ref|XP_001418057.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578285|gb|ABO96350.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 317

 Score = 76.6 bits (187), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 44/112 (39%), Positives = 62/112 (55%), Gaps = 3/112 (2%)

Query: 60  RKSMERSEGDEGRAE--QWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS 117
           R S     GDE   E  + VE +SW PR F+  NFLS EECE+LI L    + +STVV+S
Sbjct: 17  RTSTSGGGGDEDDVERSKVVETLSWSPRVFLLKNFLSDEECEHLIELGEKKLERSTVVNS 76

Query: 118 DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFF 169
           D   +  S  RTS GTF+ R   + ++ +E R+A ++  P   Q +  L  +
Sbjct: 77  DESGAV-STARTSFGTFVTRRLTETLQRVEDRVAKYSGIPWEHQEQLQLLRY 127


>gi|159485424|ref|XP_001700744.1| hypothetical protein CHLREDRAFT_187378 [Chlamydomonas reinhardtii]
 gi|158281243|gb|EDP06998.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 253

 Score = 76.3 bits (186), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 41/104 (39%), Positives = 54/104 (51%), Gaps = 8/104 (7%)

Query: 68  GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD--S 125
           GD G    W+E ISW PRAF+YH FLS  EC++LI LA P + +S VV    G   D   
Sbjct: 31  GDVGAP--WIETISWVPRAFIYHGFLSHAECDHLIGLALPKLERSLVV----GNKSDEVD 84

Query: 126 RVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFF 169
            +RTS    +      ++ DIE RIA +T  P   Q    +  +
Sbjct: 85  PIRTSYSASIGYNETDVVADIEGRIARWTHLPRSHQEPMEVLRY 128


>gi|302838815|ref|XP_002950965.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
           nagariensis]
 gi|300263660|gb|EFJ47859.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
           nagariensis]
          Length = 298

 Score = 75.9 bits (185), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 35/81 (43%), Positives = 51/81 (62%), Gaps = 1/81 (1%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E +SW PR F+YHNFL+  EC ++   A P M++S+VV  + G S    +RTS GTF+ 
Sbjct: 2   IEAVSWNPRVFIYHNFLTDGECRHIKRTAAPMMKRSSVVGQN-GSSVTDNIRTSYGTFIR 60

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           R  D +I  I +R+A +T  P
Sbjct: 61  RRHDPVIERILRRVAAWTKAP 81


>gi|302844247|ref|XP_002953664.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
           nagariensis]
 gi|300261073|gb|EFJ45288.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
           nagariensis]
          Length = 364

 Score = 75.5 bits (184), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 43/114 (37%), Positives = 66/114 (57%), Gaps = 2/114 (1%)

Query: 56  SSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV 115
           S++V  +    + DE  A  WVE +   PRA+++HNFL+K E  +++ LA P +++STVV
Sbjct: 28  SALVMHTEADKQFDE-EATPWVEQVGLHPRAYLFHNFLTKAERAHMVRLAAPKLKRSTVV 86

Query: 116 DSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFF 169
            S  G+     +RTS G F+ R  D II  IEKRI+ +T  P+  Q    +  +
Sbjct: 87  GS-KGEGVVDNIRTSFGMFIRRLSDPIIARIEKRISLWTHLPIEHQEDIQVLRY 139


>gi|159486447|ref|XP_001701251.1| hypothetical protein CHLREDRAFT_122372 [Chlamydomonas reinhardtii]
 gi|158271833|gb|EDO97644.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 251

 Score = 75.1 bits (183), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 35/93 (37%), Positives = 55/93 (59%), Gaps = 1/93 (1%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E +SW PR F+YHNFLS  EC ++   A P M++S+VV ++ G S    +RTS GTF+ 
Sbjct: 2   IETVSWNPRVFIYHNFLSDAECRHIKRTAAPMMKRSSVVGTN-GSSVLDTIRTSYGTFIR 60

Query: 137 RGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFF 169
           R  D ++  + +R+A +T  P   Q    +  +
Sbjct: 61  RRHDPVVERVLRRVAAWTKAPPENQEDLQVLRY 93


>gi|255072321|ref|XP_002499835.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
 gi|226515097|gb|ACO61093.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
          Length = 454

 Score = 73.9 bits (180), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 34/75 (45%), Positives = 49/75 (65%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           EP+A+++ NFL+  ECE+L+ LA   +  STVV      S  S++RTS+G FL RG+D  
Sbjct: 176 EPKAYMFRNFLTPHECEHLMQLAKKQLAPSTVVGDKGSGSMVSKIRTSAGMFLGRGQDPT 235

Query: 143 IRDIEKRIADFTFFP 157
           +R IE+RIA  +  P
Sbjct: 236 VRAIEERIAAASGLP 250


>gi|159489450|ref|XP_001702710.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280732|gb|EDP06489.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 252

 Score = 73.6 bits (179), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 39/93 (41%), Positives = 55/93 (59%), Gaps = 1/93 (1%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V VISWEPRAFV  NFL+ +E  ++ ++A  HMR+STVV +D G S     RTS GTF+ 
Sbjct: 1   VSVISWEPRAFVIRNFLTDQEATHIADVAQVHMRRSTVV-ADNGSSVLDDYRTSYGTFIN 59

Query: 137 RGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFF 169
           R    ++  +E R+A  T  P+  Q    +  +
Sbjct: 60  RYATPVVARVEDRVAVLTRVPVHYQEDMQVLRY 92


>gi|145345836|ref|XP_001417405.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577632|gb|ABO95698.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 330

 Score = 73.2 bits (178), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 38/84 (45%), Positives = 56/84 (66%), Gaps = 5/84 (5%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +P+A++  NFLS EEC++L+ LA   +  STVV  + G S  S +RTS+G FL +G+DKI
Sbjct: 48  QPKAYLLRNFLSAEECDHLMKLAKRELAPSTVV-GEAGDSVPSDIRTSAGMFLRKGQDKI 106

Query: 143 IRDIEKRIADFTFFPL----GMQF 162
           ++ IE+RIA  +  P+    GMQ 
Sbjct: 107 VKAIEERIARLSGTPVDNGEGMQI 130


>gi|159476104|ref|XP_001696154.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
           [Chlamydomonas reinhardtii]
 gi|158275325|gb|EDP01103.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
           [Chlamydomonas reinhardtii]
          Length = 343

 Score = 72.8 bits (177), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 32/80 (40%), Positives = 50/80 (62%)

Query: 79  VISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
           V+SW PR F+Y   L+ EEC+ L++ +   + +S V D+ TG    S +RTSSG F  RG
Sbjct: 69  VLSWHPRVFLYKGILTHEECDQLMDNSRSRLERSGVSDATTGAGAVSDIRTSSGMFYERG 128

Query: 139 RDKIIRDIEKRIADFTFFPL 158
             ++++ IE R+A +T  P+
Sbjct: 129 ETELVKRIENRLAMWTMLPV 148


>gi|302765413|ref|XP_002966127.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
 gi|300165547|gb|EFJ32154.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
          Length = 201

 Score = 72.8 bits (177), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 34/64 (53%), Positives = 43/64 (67%)

Query: 94  SKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADF 153
           S +EC++LI LA P +R+S+V+D  TG  KDSR RTS G FL R  D I+  IE RI+  
Sbjct: 8   SDDECDHLIGLALPRLRRSSVIDEKTGLGKDSRNRTSWGAFLRRDHDNIVSGIEDRISSI 67

Query: 154 TFFP 157
           TF P
Sbjct: 68  TFIP 71


>gi|171059332|ref|YP_001791681.1| procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
 gi|170776777|gb|ACB34916.1| Procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
          Length = 287

 Score = 72.8 bits (177), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 34/76 (44%), Positives = 47/76 (61%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR  V+  FLS +EC+ L+ LA P + +S  VD+DTG S+ +  RTS G F  RG  ++
Sbjct: 99  DPRVVVFGGFLSHDECDALVALAQPRLARSETVDNDTGGSEVNEARTSQGMFFMRGEGEL 158

Query: 143 IRDIEKRIADFTFFPL 158
           I  IE RIA    +PL
Sbjct: 159 ISRIEARIAALLDWPL 174


>gi|303279839|ref|XP_003059212.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226459048|gb|EEH56344.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 409

 Score = 72.4 bits (176), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 59/100 (59%), Gaps = 4/100 (4%)

Query: 69  DEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDT----GKSKD 124
           D G  +  VE +S  PRA+++  FL+KEEC +LI ++TPH+++STVV  D        + 
Sbjct: 75  DIGVGDARVEKLSDSPRAYLFREFLTKEECAHLIEISTPHLKRSTVVGDDALLGEADGRR 134

Query: 125 SRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKF 164
           S  RTS+G FL +  D ++  +E+R+  F+  P   Q + 
Sbjct: 135 SDYRTSTGAFLPKLYDDVVTRVERRVEAFSRLPFENQEQL 174


>gi|308804269|ref|XP_003079447.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
 gi|116057902|emb|CAL54105.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
          Length = 363

 Score = 72.4 bits (176), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 34/91 (37%), Positives = 52/91 (57%)

Query: 79  VISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
            +SW PRAF+Y NFL+++ECE+LI L    + +STVV S   +      RTS GTF+ R 
Sbjct: 93  TLSWSPRAFLYQNFLTEDECEHLIALGEKKLERSTVVGSKGKEGDVHSARTSFGTFITRR 152

Query: 139 RDKIIRDIEKRIADFTFFPLGMQFKFSLFFF 169
               +  +E R+A+++  P   Q +  L  +
Sbjct: 153 LTPTLSAVEDRVAEYSGIPWRHQEQLQLLRY 183


>gi|148537204|dbj|BAF63493.1| prolyl 4-hydroxylase [Potamogeton distinctus]
          Length = 246

 Score = 71.6 bits (174), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 36/66 (54%), Positives = 46/66 (69%)

Query: 92  FLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIA 151
           FLS EEC++LI L    + KS V D+++GKS  S +RTSSG FL R +D+ I  IEKRIA
Sbjct: 3   FLSHEECDHLIALGKDKLEKSMVADNESGKSVMSEIRTSSGMFLERRQDETITRIEKRIA 62

Query: 152 DFTFFP 157
            +TF P
Sbjct: 63  AWTFLP 68


>gi|255071007|ref|XP_002507585.1| predicted protein [Micromonas sp. RCC299]
 gi|226522860|gb|ACO68843.1| predicted protein [Micromonas sp. RCC299]
          Length = 433

 Score = 71.6 bits (174), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 37/86 (43%), Positives = 57/86 (66%), Gaps = 5/86 (5%)

Query: 77  VEVISWE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
           ++V+S + PRAF++  FLS+ EC+ L+  A P+M KS VVD+  G S  S +RTS+G+F+
Sbjct: 158 IQVVSLDNPRAFMHIGFLSERECDLLVEYARPNMYKSGVVDASNGGSSFSNIRTSTGSFV 217

Query: 136 AR----GRDKIIRDIEKRIADFTFFP 157
                 G + ++R IE+RIA +T  P
Sbjct: 218 PTVFPLGMNDVVRRIERRIAAWTQIP 243


>gi|308812133|ref|XP_003083374.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
           [Ostreococcus tauri]
 gi|116055254|emb|CAL57650.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
           [Ostreococcus tauri]
          Length = 311

 Score = 71.2 bits (173), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 32/89 (35%), Positives = 54/89 (60%)

Query: 70  EGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRT 129
           +G +  W+E IS  PRA+V+  FL+  EC+ +I  A P M  S V D D+G+++    R+
Sbjct: 61  DGGSSGWIEKISDSPRAYVFREFLTDAECDRVIERAYPTMEASEVTDDDSGEARPDDARS 120

Query: 130 SSGTFLARGRDKIIRDIEKRIADFTFFPL 158
           S G +++   D++IR+IE R + +   P+
Sbjct: 121 SIGGWVSGDDDEVIRNIELRASTWAMLPM 149


>gi|159487419|ref|XP_001701720.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280939|gb|EDP06695.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 274

 Score = 71.2 bits (173), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 35/85 (41%), Positives = 55/85 (64%), Gaps = 1/85 (1%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +   PRA+ +HNFL+K E  +L+ LA P +++STVV +D G+     +RTS G F+ 
Sbjct: 1   VQQVGLHPRAYYFHNFLTKAERGHLVKLAAPKLKRSTVVGND-GEGVVDNIRTSYGMFIR 59

Query: 137 RGRDKIIRDIEKRIADFTFFPLGMQ 161
           R +D ++  IEKRI+ +T  P+  Q
Sbjct: 60  RLQDPVVARIEKRISLWTHLPVEHQ 84


>gi|363543305|ref|NP_001241868.1| prolyl 4-hydroxylase 6-1 precursor [Zea mays]
 gi|347978820|gb|AEP37752.1| prolyl 4-hydroxylase 6-1 [Zea mays]
          Length = 122

 Score = 71.2 bits (173), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 34/61 (55%), Positives = 44/61 (72%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +S  PRAF+Y  FLS  EC++L++LA   M KS V D+D+GKS  S+ RTSSGTFLA
Sbjct: 35  VTQLSSRPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVADNDSGKSVASQARTSSGTFLA 94

Query: 137 R 137
           +
Sbjct: 95  K 95


>gi|307102975|gb|EFN51240.1| hypothetical protein CHLNCDRAFT_28187 [Chlorella variabilis]
          Length = 322

 Score = 70.1 bits (170), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 35/102 (34%), Positives = 58/102 (56%), Gaps = 4/102 (3%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSD-TGKSKDSRVR---TSSG 132
           +E++SW+PRA + H FL+  EC+++I+LA   +  S VV  D +GK    R R   +SSG
Sbjct: 15  IELVSWKPRALLLHGFLAHSECDHMISLAEARLEPSKVVSRDGSGKLDSVRTRQGLSSSG 74

Query: 133 TFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           TFL + +D ++  +E RI   T  P     +  +  + + +K
Sbjct: 75  TFLTKRQDSVVAGVEDRIELATHLPFSHSEQLQVLKYELGQK 116


>gi|332526359|ref|ZP_08402485.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
 gi|332110495|gb|EGJ10818.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
          Length = 224

 Score = 69.3 bits (168), Expect = 5e-10,   Method: Compositional matrix adjust.
 Identities = 36/99 (36%), Positives = 52/99 (52%), Gaps = 13/99 (13%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V+   LS++EC+ L+ LA P + +S  VD+ TG S+ +  RTS G F  RG   +I
Sbjct: 37  PRVVVFGGLLSEQECDELVALAQPRLLRSETVDNSTGGSEVNAARTSDGMFFERGETPLI 96

Query: 144 RDIEKRIADFTFFPL-------------GMQFKFSLFFF 169
             IE+RIA+   +P+             G Q+K    FF
Sbjct: 97  ERIERRIAELVHWPVERGEGLQVLHYRPGAQYKPHHDFF 135


>gi|383757171|ref|YP_005436156.1| putative prolyl 4-hydroxylase alpha subunit [Rubrivivax gelatinosus
           IL144]
 gi|381377840|dbj|BAL94657.1| putative prolyl 4-hydroxylase alpha subunit homologue
           oxidoreductase protein [Rubrivivax gelatinosus IL144]
          Length = 279

 Score = 69.3 bits (168), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 32/75 (42%), Positives = 45/75 (60%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V+   LS EEC+ L+ LA P + +S  VD+ TG S+ +  RTS G F  RG   +I
Sbjct: 92  PRVVVFGGLLSDEECDELVALARPRLARSETVDNSTGGSEVNAARTSDGMFFERGEKPLI 151

Query: 144 RDIEKRIADFTFFPL 158
             IE+RIA+   +P+
Sbjct: 152 ERIERRIAELVRWPV 166


>gi|357417854|ref|YP_004930874.1| procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
 gi|355335432|gb|AER56833.1| Procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
          Length = 283

 Score = 68.6 bits (166), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 31/76 (40%), Positives = 44/76 (57%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
            PR  V+ N L+ EEC+ LI LA   +++S V D DTG+ +  + RTS G F  RG + +
Sbjct: 95  HPRVIVFGNLLAAEECDALIALARRQIKRSPVFDPDTGQDQQHQARTSEGMFFGRGANPL 154

Query: 143 IRDIEKRIADFTFFPL 158
              +E RIA    +PL
Sbjct: 155 CARVEARIAALLNWPL 170


>gi|299532490|ref|ZP_07045880.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
 gi|298719437|gb|EFI60404.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
          Length = 299

 Score = 68.6 bits (166), Expect = 8e-10,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 49/82 (59%), Gaps = 4/82 (4%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V+ N LS EEC+ +I  A P MR+S  VD+ +G    +  RTS+G F  RG +++I
Sbjct: 112 PRVVVFGNLLSDEECDAIIAAARPRMRRSLTVDNQSGGEAVNDDRTSNGMFFQRGENELI 171

Query: 144 RDIEKRIADFTFFPL----GMQ 161
             +E+RIA    +PL    GMQ
Sbjct: 172 SLVEQRIARLLNWPLENGEGMQ 193


>gi|159469311|ref|XP_001692811.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158278064|gb|EDP03830.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 273

 Score = 68.2 bits (165), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 32/80 (40%), Positives = 50/80 (62%)

Query: 79  VISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
           V+  + R +++  FL+ EEC+Y+   A   + +S VVD+ +G S  S +RTS G F  RG
Sbjct: 38  VLDPDARIYLWKGFLTPEECDYIRMKAEKRLERSGVVDTGSGGSVVSDIRTSDGMFFERG 97

Query: 139 RDKIIRDIEKRIADFTFFPL 158
            D II  +E+R+AD+T  P+
Sbjct: 98  EDAIIEAVEQRLADWTMTPI 117


>gi|264677094|ref|YP_003277000.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
 gi|262207606|gb|ACY31704.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
          Length = 306

 Score = 68.2 bits (165), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 35/83 (42%), Positives = 48/83 (57%), Gaps = 4/83 (4%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
            PR  V+ N LS EEC+ +I  A P MR+S  VD+ +G    +  RTS+G F  RG + +
Sbjct: 118 HPRVVVFGNLLSDEECDAIIAAARPRMRRSLTVDNQSGGEAVNDDRTSNGMFFQRGENDL 177

Query: 143 IRDIEKRIADFTFFPL----GMQ 161
           I  +E+RIA    +PL    GMQ
Sbjct: 178 ISLVEQRIARLLNWPLENGEGMQ 200


>gi|308802438|ref|XP_003078532.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
           tauri]
 gi|116056985|emb|CAL51412.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
           tauri]
          Length = 369

 Score = 68.2 bits (165), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 37/86 (43%), Positives = 55/86 (63%), Gaps = 5/86 (5%)

Query: 81  SWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           S +P+A++  NFLS +EC++L+ LA   +  STVV  D G S  S +RTS+G FL + +D
Sbjct: 87  SKKPKAYLMRNFLSPQECDHLMMLAKRELAPSTVV-GDGGSSVASEIRTSAGMFLRKSQD 145

Query: 141 KIIRDIEKRIADFTFFPL----GMQF 162
             +R+IE+RIA  +  P+    GMQ 
Sbjct: 146 DTVREIEERIARLSGVPVDNGEGMQI 171


>gi|242047774|ref|XP_002461633.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
 gi|241925010|gb|EER98154.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
          Length = 275

 Score = 67.8 bits (164), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 35/81 (43%), Positives = 52/81 (64%), Gaps = 5/81 (6%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +SW+PR FVY  FLS +EC++L+ LA    +K T+V +    S   + RTSSG FL 
Sbjct: 48  VKALSWQPRIFVYKGFLSDDECDHLVTLA----KKGTMV-AHNRSSYYRQTRTSSGMFLR 102

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           + +D ++  IE+RIA +T  P
Sbjct: 103 KRQDPVVSRIEERIAAWTLLP 123


>gi|303285562|ref|XP_003062071.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226456482|gb|EEH53783.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 522

 Score = 67.8 bits (164), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 34/83 (40%), Positives = 50/83 (60%), Gaps = 4/83 (4%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P+A+++ NFL++EEC +LI LA   +  STVV     KS  S +RTS+G FL +G+   +
Sbjct: 236 PKAYLFRNFLTEEECRHLIALAKAQLAPSTVVADGGKKSTKSGIRTSAGMFLTKGQTPTV 295

Query: 144 RDIEKRIADFTFFP----LGMQF 162
           R +E+R+A     P     GMQ 
Sbjct: 296 RMVEERVAAAVGLPEENGEGMQI 318


>gi|418530659|ref|ZP_13096582.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
 gi|371452378|gb|EHN65407.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
          Length = 299

 Score = 67.8 bits (164), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 34/82 (41%), Positives = 48/82 (58%), Gaps = 4/82 (4%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V+ N LS EEC+ +I  A P M++S  VD+ +G    +  RTS+G F  RG + +I
Sbjct: 112 PRVVVFGNLLSNEECDAIIAAARPRMQRSLTVDNQSGGEAVNDDRTSNGMFFQRGENDLI 171

Query: 144 RDIEKRIADFTFFPL----GMQ 161
             +E+RIA    +PL    GMQ
Sbjct: 172 SRVEQRIARLLNWPLENGEGMQ 193


>gi|159462456|ref|XP_001689458.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158283446|gb|EDP09196.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 221

 Score = 67.4 bits (163), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 35/83 (42%), Positives = 47/83 (56%), Gaps = 1/83 (1%)

Query: 87  FVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDI 146
            VYHNFLS  EC ++I+LA   M++STVV S      D  +RTS GTFL R  D +I  I
Sbjct: 1   MVYHNFLSDRECRHIIDLAHAQMKRSTVVGSKNAGVVDD-IRTSYGTFLRRVPDPVIAAI 59

Query: 147 EKRIADFTFFPLGMQFKFSLFFF 169
           E R+A ++  P   Q    +  +
Sbjct: 60  EHRLALWSHLPASHQEDMQVLRY 82


>gi|319786559|ref|YP_004146034.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
 gi|317465071|gb|ADV26803.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
          Length = 289

 Score = 67.4 bits (163), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 33/75 (44%), Positives = 43/75 (57%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V    LS EEC+ L+ L+ P +R+ST VD+ TG S+    RTS GTF  RG   + 
Sbjct: 102 PRVVVLGGLLSDEECDALVELSRPRLRRSTTVDAQTGGSQVHADRTSRGTFFERGAHPVC 161

Query: 144 RDIEKRIADFTFFPL 158
             IE RIA    +P+
Sbjct: 162 ATIEARIARLLEWPV 176


>gi|145354086|ref|XP_001421326.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581563|gb|ABO99619.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 309

 Score = 67.0 bits (162), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 52/159 (32%), Positives = 78/159 (49%), Gaps = 16/159 (10%)

Query: 12  RKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSI------------- 58
           R  +S   +L L+ + T A           +P  + D R  +D + +             
Sbjct: 3   RGGASRVAVLALIALATSAAPRRATADRARLPRDARDERLDDDDARLRAEEHVAYASDAR 62

Query: 59  VRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSD 118
            R  + R   D   A QW+E IS  PRA+VY NFL++EE E  I  A   MR+S VV+  
Sbjct: 63  SRVGLRRDGAD---ARQWIERISESPRAYVYRNFLTREEAEATIAAARRTMRRSEVVNEA 119

Query: 119 TGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFP 157
            G SK S  RTSSG +++    +++ +IE+R+A +T  P
Sbjct: 120 DGTSKTSDERTSSGGWVSGEDSEVMANIERRVAAWTMLP 158


>gi|241767624|ref|ZP_04765273.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
 gi|241361463|gb|EER57922.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
          Length = 318

 Score = 67.0 bits (162), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 32/75 (42%), Positives = 43/75 (57%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V+ N LS EECE LI  A P M +S  V + TG  + +  RTS G F  RG   ++
Sbjct: 131 PRVVVFGNLLSPEECEALIAAAAPRMARSLTVATQTGGEEVNDDRTSHGMFFQRGESPLV 190

Query: 144 RDIEKRIADFTFFPL 158
           + IE+RIA    +P+
Sbjct: 191 QRIEERIASLLNWPI 205


>gi|221068712|ref|ZP_03544817.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
 gi|220713735|gb|EED69103.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
          Length = 299

 Score = 67.0 bits (162), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 34/82 (41%), Positives = 48/82 (58%), Gaps = 4/82 (4%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V+ N LS EEC+ +I  A P M++S  VD+ +G    +  RTS+G F  RG + +I
Sbjct: 112 PRVVVFGNLLSDEECDAIIAAAGPRMQRSLTVDNQSGGEAVNDDRTSNGMFFQRGENDLI 171

Query: 144 RDIEKRIADFTFFPL----GMQ 161
             +E+RIA    +PL    GMQ
Sbjct: 172 CRVEQRIARLLNWPLENGEGMQ 193


>gi|302844281|ref|XP_002953681.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
           nagariensis]
 gi|300261090|gb|EFJ45305.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
           nagariensis]
          Length = 304

 Score = 67.0 bits (162), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 51/78 (65%), Gaps = 1/78 (1%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E ++W+PR F+YHNF++  E +++I LA P M++STVV +  G+S +   RT     + 
Sbjct: 1   IEHVAWKPRVFIYHNFITDMEAKHMIELAAPQMKRSTVVGA-GGQSVEDSYRTLYTAGVR 59

Query: 137 RGRDKIIRDIEKRIADFT 154
           R +D ++  IE R+A +T
Sbjct: 60  RYQDDVVERIENRVAAWT 77


>gi|91789558|ref|YP_550510.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
           JS666]
 gi|91698783|gb|ABE45612.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
           JS666]
          Length = 277

 Score = 66.6 bits (161), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 32/88 (36%), Positives = 51/88 (57%), Gaps = 2/88 (2%)

Query: 73  AEQWVEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTS 130
            ++WV+V+     P   V+ N LS  ECE L+ +A P + +S  V+  TG  + +R RTS
Sbjct: 77  GDRWVDVLQRLQLPDLVVFGNLLSDSECEALMEVAQPRLARSLTVNIKTGGEERNRDRTS 136

Query: 131 SGTFLARGRDKIIRDIEKRIADFTFFPL 158
            G F ARG + +++ +E RIA    +P+
Sbjct: 137 QGMFFARGENPLVQRVEARIARLVGWPV 164


>gi|302841711|ref|XP_002952400.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
           nagariensis]
 gi|300262336|gb|EFJ46543.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
           nagariensis]
          Length = 269

 Score = 66.2 bits (160), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 30/76 (39%), Positives = 48/76 (63%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           + R +++  FL+ EEC+Y+   A   + +S VVD+ +G S  S +RTS G F  RG D I
Sbjct: 42  DARIYLWRGFLTPEECDYIRMKAEKRLERSGVVDTASGSSVVSDIRTSDGMFFERGEDAI 101

Query: 143 IRDIEKRIADFTFFPL 158
           +  +E+R+AD+T  P+
Sbjct: 102 LEAVEQRLADWTMTPI 117


>gi|159487421|ref|XP_001701721.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280940|gb|EDP06696.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 336

 Score = 66.2 bits (160), Expect = 5e-09,   Method: Compositional matrix adjust.
 Identities = 36/100 (36%), Positives = 56/100 (56%), Gaps = 1/100 (1%)

Query: 70  EGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRT 129
           E  A  WV+ +   PRA+ +HNFL+K E  +L+ +A P +++STVV        D  +RT
Sbjct: 12  EEDATPWVQQVGLHPRAYYFHNFLTKAERAHLVRVAAPKLKRSTVVGGKGEGVVDD-IRT 70

Query: 130 SSGTFLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFF 169
           S G F+ R  D ++  IEKRI+ +T  P+  Q    +  +
Sbjct: 71  SYGMFIRRLSDPVVTRIEKRISLWTHLPVEHQEDIQILRY 110


>gi|428170517|gb|EKX39441.1| hypothetical protein GUITHDRAFT_114401 [Guillardia theta CCMP2712]
          Length = 322

 Score = 65.9 bits (159), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 41/136 (30%), Positives = 71/136 (52%), Gaps = 14/136 (10%)

Query: 29  FAILILLAFGILSM---------PSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEV 79
           F  L+++ + +LSM         P S     KA    S+ R+SM  + G   +   W+E 
Sbjct: 22  FLFLVIVGYAVLSMLLQSLWMTGPKSDALLSKA---PSLERRSM-TNLGGMAKKSTWIET 77

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLA-TPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
           +S +PR F+ HN L++EEC++L++LA    +  S +    T K  +S  RT+   +L   
Sbjct: 78  VSVDPRIFIVHNLLTEEECDHLVSLALQKGLSASLITPYGTNKLVESTTRTNKQAWLDFQ 137

Query: 139 RDKIIRDIEKRIADFT 154
           +D +++ +E +IA  T
Sbjct: 138 QDDVVKRVEDKIAKLT 153


>gi|385206010|ref|ZP_10032880.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
 gi|385185901|gb|EIF35175.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
          Length = 296

 Score = 65.5 bits (158), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 34/88 (38%), Positives = 48/88 (54%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P A +  +FLS  ECE LI+LA P + +STVVD  TG++  +  R+S G F   G   +I
Sbjct: 102 PAAILLDDFLSANECEQLISLARPRLSRSTVVDPVTGRNVVAGHRSSDGMFFRLGETPLI 161

Query: 144 RDIEKRIADFTFFPLGMQFKFSLFFFYV 171
             +E RIA+ T  P+       L  + V
Sbjct: 162 ARLEARIAELTGLPVENGEGLQLLHYEV 189


>gi|337280547|ref|YP_004620019.1| hypothetical protein Rta_28970 [Ramlibacter tataouinensis TTB310]
 gi|334731624|gb|AEG94000.1| conserved hypothetical protein [Ramlibacter tataouinensis TTB310]
          Length = 286

 Score = 65.5 bits (158), Expect = 8e-09,   Method: Compositional matrix adjust.
 Identities = 31/77 (40%), Positives = 46/77 (59%)

Query: 82  WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDK 141
           + PR  V+ + LS +ECE LI LA P + +S  V + TG  + +  RTSSG F  RG ++
Sbjct: 97  YNPRVVVFGSLLSDQECEQLIGLAKPRLARSLTVATKTGGEEVNEDRTSSGMFFQRGENE 156

Query: 142 IIRDIEKRIADFTFFPL 158
           ++  IE RIA    +P+
Sbjct: 157 LVARIEARIARLVNWPV 173


>gi|334185677|ref|NP_001189994.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
 gi|332643930|gb|AEE77451.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
          Length = 324

 Score = 65.1 bits (157), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 49/144 (34%), Positives = 74/144 (51%), Gaps = 15/144 (10%)

Query: 25  IMFTFAILILLAFGILSMPSSSGDSRKAN--DLSSIVRKSMERSEG-DEGRAEQWVEVIS 81
           I   F++  L    ++S   +   +R +N  D S I  K+   S G D  R  Q    +S
Sbjct: 5   IFLAFSLCFLFTLPLISSAPNRFLTRSSNTRDGSVIKMKTSASSFGFDPTRVTQ----LS 60

Query: 82  WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR----VRTSSGTFLAR 137
           W PR F+Y  FLS EEC++ I LA   + KS V D+D+G+S +S     V   S +F+A 
Sbjct: 61  WTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEDSVSVVRQSSSFIAN 120

Query: 138 GR----DKIIRDIEKRIADFTFFP 157
                 D I+ ++E ++A +TF P
Sbjct: 121 MDSLEIDDIVSNVEAKLAAWTFLP 144


>gi|145341735|ref|XP_001415959.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576182|gb|ABO94251.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 254

 Score = 65.1 bits (157), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 35/98 (35%), Positives = 56/98 (57%), Gaps = 1/98 (1%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE +SW PRAF   + L++ +CE ++      +R+STVVDS TG+SK   +RTS  TFL 
Sbjct: 3   VEPLSWYPRAFALRDALTEAQCEAVLRATRARVRRSTVVDSVTGESKVDPIRTSKQTFLN 62

Query: 137 RGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           R  ++++R+I   ++  T  P        +  + V +K
Sbjct: 63  RD-EEVVREIYDALSAVTMLPWTHNEDMQVLEYRVGEK 99


>gi|395003644|ref|ZP_10387769.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
 gi|394318439|gb|EJE54870.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
          Length = 299

 Score = 64.7 bits (156), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 31/75 (41%), Positives = 44/75 (58%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V+ N LS EEC+ LI  A P M +S  V + TG  + +  RTS G F  RG + ++
Sbjct: 112 PRIVVFGNLLSAEECDALIAAAAPRMARSLTVATKTGGEEVNDDRTSDGMFFQRGENPVV 171

Query: 144 RDIEKRIADFTFFPL 158
           + IE+RIA    +P+
Sbjct: 172 QRIEERIARLLDWPI 186


>gi|255584898|ref|XP_002533164.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223527036|gb|EEF29223.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 290

 Score = 64.7 bits (156), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 33/100 (33%), Positives = 60/100 (60%), Gaps = 3/100 (3%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR-VRTSSGTFLA 136
           +V+SW+PRA  + NF + E+C+ +IN+A P++  ST+        ++++ +RTSSG FL+
Sbjct: 81  QVLSWKPRALYFPNFATAEQCQSVINMAKPNLTPSTLALRKGETEENTKGIRTSSGMFLS 140

Query: 137 RGRDK--IIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
              DK  ++  IE++IA  T  P      F++  + + +K
Sbjct: 141 ASEDKTGVLDAIEEKIARATMLPRANGEAFNILRYEIGQK 180


>gi|412993142|emb|CCO16675.1| predicted protein [Bathycoccus prasinos]
          Length = 564

 Score = 64.7 bits (156), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 36/84 (42%), Positives = 52/84 (61%), Gaps = 5/84 (5%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +P+A+++ NFLS EEC++L+ LA   +  STVV +  G S  S +RTS+G FL +  DK 
Sbjct: 285 KPKAYLFRNFLSAEECDHLMKLAKAELAPSTVVGA-GGTSVPSTIRTSAGMFLRKAADKT 343

Query: 143 IRDIEKRIADFTFFP----LGMQF 162
           + +IE RIA  +  P     GMQ 
Sbjct: 344 LENIEYRIAAASGTPEPNGEGMQI 367


>gi|407938132|ref|YP_006853773.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
 gi|407895926|gb|AFU45135.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
          Length = 303

 Score = 64.7 bits (156), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 45/76 (59%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR  V+ N LS EEC+ LI  A P M +S  V + TG  + +  RTS G F  RG+  +
Sbjct: 115 QPRIVVFGNLLSPEECDALIAAAEPRMARSLTVATKTGGEEINADRTSDGMFFQRGQSPL 174

Query: 143 IRDIEKRIADFTFFPL 158
           I+ IE+RIA    +P+
Sbjct: 175 IQRIEERIARLLQWPI 190


>gi|412986386|emb|CCO14812.1| predicted protein [Bathycoccus prasinos]
          Length = 337

 Score = 64.3 bits (155), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/93 (36%), Positives = 52/93 (55%), Gaps = 15/93 (16%)

Query: 65  RSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD 124
           R++  +G    W+E +SWEPR FVYHNFLS++E +YL +    H + S  +D ++ K   
Sbjct: 91  RTDASDGNT-PWIEHVSWEPRVFVYHNFLSEKEAKYLRDA---HKKASKAMDDESMK--- 143

Query: 125 SRVRTSSGTFLARGRDKIIRDIEKRIADFTFFP 157
                   T   RG+D I+  IE+R++ F   P
Sbjct: 144 --------TTFKRGQDPIVNVIEQRLSAFVMLP 168


>gi|351731158|ref|ZP_08948849.1| 2OG-Fe(II) oxygenase [Acidovorax radicis N35]
          Length = 303

 Score = 64.3 bits (155), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 32/76 (42%), Positives = 45/76 (59%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR  V+ N LS EEC+ LI  A P M +S  V + TG  + +  RTS G F  RG+  +
Sbjct: 115 QPRVVVFGNLLSPEECDALIADAAPRMARSLTVATKTGGEEINDDRTSDGMFFQRGQSPL 174

Query: 143 IRDIEKRIADFTFFPL 158
           I+ IE+RIA    +P+
Sbjct: 175 IQRIEERIARLLNWPI 190


>gi|388567209|ref|ZP_10153646.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
 gi|388265592|gb|EIK91145.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
          Length = 296

 Score = 64.3 bits (155), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 32/88 (36%), Positives = 50/88 (56%), Gaps = 2/88 (2%)

Query: 73  AEQWVEVISW--EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTS 130
            ++WV++I+    PR  V  N LS EEC+ +I  A P + +S  V + TG  + +  RTS
Sbjct: 96  GDRWVDIITHMNHPRVVVLGNLLSAEECDAIIESAKPKLARSLTVQTATGGEELNADRTS 155

Query: 131 SGTFLARGRDKIIRDIEKRIADFTFFPL 158
           SG F  RG+   +  +E+RIA    +P+
Sbjct: 156 SGMFFTRGQTPEVTAVERRIARLVGWPV 183


>gi|91779740|ref|YP_554948.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
 gi|91692400|gb|ABE35598.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
          Length = 296

 Score = 63.5 bits (153), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 33/86 (38%), Positives = 46/86 (53%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P A +  +FLS  ECE LI LA P + +STVVD  TG++  +  R+S G F   G   +I
Sbjct: 102 PAAVLLDDFLSANECEQLIALARPRLSRSTVVDPVTGRNVVAGHRSSDGMFFRLGETPLI 161

Query: 144 RDIEKRIADFTFFPLGMQFKFSLFFF 169
             +E RIA+ T  P+       L  +
Sbjct: 162 ARLEARIAELTGLPVENGEGLQLLHY 187


>gi|224122338|ref|XP_002318810.1| predicted protein [Populus trichocarpa]
 gi|222859483|gb|EEE97030.1| predicted protein [Populus trichocarpa]
          Length = 310

 Score = 63.5 bits (153), Expect = 3e-08,   Method: Compositional matrix adjust.
 Identities = 48/145 (33%), Positives = 76/145 (52%), Gaps = 12/145 (8%)

Query: 16  SSTLILTLLIMFTFAILILLAFGILSMPSSSGDSR-KANDLSSIVR--KSMERSEGDEGR 72
           +S + L L ++ T      L FG     SS  + R K   L ++++   S++ +  D  R
Sbjct: 2   ASFVYLLLFMVLTLTTQFSLCFG----KSSRKELRNKEAHLETMIQFGSSIQTNWVDPSR 57

Query: 73  AEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSG 132
               V  +SW+PR FVY  FL+ EEC++LI+LA      S   D D+G+ + +R+  SS 
Sbjct: 58  ----VVTVSWQPRVFVYKGFLTDEECDHLISLAQGTKETSEGKDDDSGRIERNRLFASS- 112

Query: 133 TFLARGRDKIIRDIEKRIADFTFFP 157
           T L    D I+  IE+R++ +T  P
Sbjct: 113 TSLLNMDDNILSRIEERVSAWTLLP 137


>gi|330821584|ref|YP_004350446.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           gladioli BSR3]
 gi|327373579|gb|AEA64934.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           gladioli BSR3]
          Length = 302

 Score = 63.2 bits (152), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 33/75 (44%), Positives = 43/75 (57%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P A +   FLS  EC  LI LA P + +STVVD  TG++  +  R+S G F   G   +I
Sbjct: 102 PAAVLLDGFLSAGECRQLIELARPRLNRSTVVDPVTGRNIVAGHRSSDGMFFRLGETPLI 161

Query: 144 RDIEKRIADFTFFPL 158
             IE+RIA  T FP+
Sbjct: 162 SRIEQRIAALTGFPV 176


>gi|222111817|ref|YP_002554081.1| procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
 gi|221731261|gb|ACM34081.1| Procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
          Length = 289

 Score = 63.2 bits (152), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 44/75 (58%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  ++ N LS EEC+ +I+ A P M +S  V + TG  + +  RTS G F  RG   ++
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMARSLTVQTTTGGEEVNADRTSDGMFFQRGETPVV 161

Query: 144 RDIEKRIADFTFFPL 158
           + +E+RIA    +P+
Sbjct: 162 QRLEERIARLVRWPI 176


>gi|121595595|ref|YP_987491.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
 gi|120607675|gb|ABM43415.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
          Length = 289

 Score = 63.2 bits (152), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 44/75 (58%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  ++ N LS EEC+ +I+ A P M +S  V + TG  + +  RTS G F  RG   ++
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMARSLTVQTTTGGEEVNADRTSDGMFFQRGETPVV 161

Query: 144 RDIEKRIADFTFFPL 158
           + +E+RIA    +P+
Sbjct: 162 QRLEERIARLVRWPI 176


>gi|340787855|ref|YP_004753320.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
           [Collimonas fungivorans Ter331]
 gi|340553122|gb|AEK62497.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit
           [Collimonas fungivorans Ter331]
          Length = 289

 Score = 63.2 bits (152), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 33/75 (44%), Positives = 44/75 (58%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PRA ++ N LS +EC+ LI L+   + +S VVD  TG +K    RTSSGTF  RG    
Sbjct: 99  KPRAILFGNVLSHDECDQLIALSKTKLLRSGVVDHQTGNTKLHEHRTSSGTFFHRGTTPF 158

Query: 143 IRDIEKRIADFTFFP 157
           I  I+KR+A     P
Sbjct: 159 IAMIDKRLAALMQVP 173


>gi|375106426|ref|ZP_09752687.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
           JOSHI_001]
 gi|374667157|gb|EHR71942.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
           JOSHI_001]
          Length = 295

 Score = 62.8 bits (151), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 44/75 (58%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V+   LS EEC+ +++LA P + +S  V + +G S+ +  RTS G F  RG   + 
Sbjct: 108 PRVMVFGGLLSDEECDAMVDLARPRLARSETVHNGSGGSEVNAARTSDGMFFDRGEFPLC 167

Query: 144 RDIEKRIADFTFFPL 158
           R IE+RIA    +P+
Sbjct: 168 RTIEQRIAALVNWPV 182


>gi|365090417|ref|ZP_09328465.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
 gi|363416516|gb|EHL23626.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
          Length = 302

 Score = 62.4 bits (150), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 31/76 (40%), Positives = 45/76 (59%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR  V+ N LS EEC+ LI  A P + +S  V + TG  + +  RTS G F  RG+  +
Sbjct: 114 QPRIVVFGNLLSPEECDALIADAQPRLARSLTVATKTGGEEINDDRTSDGMFFQRGQSPL 173

Query: 143 IRDIEKRIADFTFFPL 158
           I+ IE+RIA    +P+
Sbjct: 174 IQRIEERIARLLNWPI 189


>gi|325925807|ref|ZP_08187179.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
           91-118]
 gi|325543793|gb|EGD15204.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
           91-118]
          Length = 286

 Score = 62.4 bits (150), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 41/113 (36%), Positives = 55/113 (48%), Gaps = 5/113 (4%)

Query: 51  KANDLSSIVRKSMERSEGDE-----GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLA 105
           +ANDL   VR    + + D      G  +  V V    PR  V   FLS EEC+ LI LA
Sbjct: 58  QANDLPMPVRVPALQQDADASLLALGDRDVRVLVSLLLPRVVVLGGFLSDEECDALIALA 117

Query: 106 TPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPL 158
            PH+ +S  VD+  G+      RTS    L  G+D + + IE RIA    +P+
Sbjct: 118 RPHLARSRTVDNANGEHVVHAARTSDSMCLRLGQDALCQRIEARIARLLDWPV 170


>gi|295700439|ref|YP_003608332.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
 gi|295439652|gb|ADG18821.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
          Length = 296

 Score = 62.4 bits (150), Expect = 6e-08,   Method: Compositional matrix adjust.
 Identities = 33/75 (44%), Positives = 42/75 (56%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P A    NFLS +ECE LI LA P + +S VVD  TG+   +  R+S G F   G   +I
Sbjct: 102 PAAVHLANFLSADECEQLIALAQPRLDRSAVVDPVTGRDVIATHRSSHGMFFRLGETPLI 161

Query: 144 RDIEKRIADFTFFPL 158
             IE RIA+ T  P+
Sbjct: 162 ARIEARIAELTATPV 176


>gi|159464219|ref|XP_001690339.1| hypothetical protein CHLREDRAFT_114525 [Chlamydomonas reinhardtii]
 gi|158279839|gb|EDP05598.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 244

 Score = 62.0 bits (149), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 51/75 (68%), Gaps = 1/75 (1%)

Query: 85  RAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIR 144
           R F+  +FL+ EE ++++ ++   + +S VV ++ G S++S++RTS G FL RG D +++
Sbjct: 1   RIFLIEHFLTDEEADHIVQVSERRLERSGVVATNGG-SEESQIRTSFGVFLERGEDPVVK 59

Query: 145 DIEKRIADFTFFPLG 159
            +E+RI+  T  P+G
Sbjct: 60  GVEERISALTLMPVG 74


>gi|308799555|ref|XP_003074558.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
 gi|116000729|emb|CAL50409.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
          Length = 274

 Score = 62.0 bits (149), Expect = 8e-08,   Method: Compositional matrix adjust.
 Identities = 33/81 (40%), Positives = 51/81 (62%), Gaps = 1/81 (1%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE +SW PRAF   N L + E   ++ LA   + +STV+DS++GKS  + +RTS  TFL+
Sbjct: 9   VEPLSWYPRAFALRNALDETEMRAILALARTRVARSTVIDSESGKSVVNPIRTSKQTFLS 68

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           R  D ++R + +R++  T  P
Sbjct: 69  R-NDPVVRKVLERMSSVTHLP 88


>gi|224071291|ref|XP_002303388.1| predicted protein [Populus trichocarpa]
 gi|222840820|gb|EEE78367.1| predicted protein [Populus trichocarpa]
          Length = 297

 Score = 62.0 bits (149), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 60/101 (59%), Gaps = 5/101 (4%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTGKSKDSRVRTSSGTFL 135
           +V+SW PRA  Y  F++ E+C+++IN+A P ++ ST+     +T ++    +RTSSG F+
Sbjct: 90  QVLSWRPRALYYPGFITAEQCQHIINMAKPSLQPSTLALRKGETAETTKG-IRTSSGMFV 148

Query: 136 ARGRDK--IIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
               D+  +++ IE++IA  T  P      F++  + + +K
Sbjct: 149 FSSEDQAGVLQVIEEKIARATMIPSTHGEAFNVLRYEIGQK 189


>gi|299115886|emb|CBN75895.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
           [Ectocarpus siliculosus]
          Length = 404

 Score = 61.6 bits (148), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 33/95 (34%), Positives = 54/95 (56%), Gaps = 3/95 (3%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTV--VDSDTGKSKDSRVRTSSGTF 134
           ++ +S EP  F   NFL  EEC+++   A PHM+ S V  +D D GK  D+  RTS+  F
Sbjct: 193 MKTLSMEPLVFEARNFLLDEECKHIREKADPHMKPSPVSLMDHDKGKP-DTNWRTSTTYF 251

Query: 135 LARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFF 169
           +   RD +++ I++R+ +FT  P   Q +  +  +
Sbjct: 252 MPSTRDPLLQGIDRRVEEFTRVPKSHQEQVQVLKY 286


>gi|333912984|ref|YP_004486716.1| procollagen-proline dioxygenase [Delftia sp. Cs1-4]
 gi|333743184|gb|AEF88361.1| Procollagen-proline dioxygenase [Delftia sp. Cs1-4]
          Length = 294

 Score = 61.2 bits (147), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 45/86 (52%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V+ N LS EEC+ +I  A P M +S  V + +G  + +  RTS+G F  RG   I+
Sbjct: 107 PRIVVFGNLLSHEECDAIIAAARPRMARSLTVATQSGGEEINDDRTSNGMFFQRGETGIV 166

Query: 144 RDIEKRIADFTFFPLGMQFKFSLFFF 169
             +E+RIA    +PL       +  +
Sbjct: 167 SQLEERIARLLRWPLDHGEGLQVLHY 192


>gi|160900716|ref|YP_001566298.1| procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
 gi|160366300|gb|ABX37913.1| Procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
          Length = 294

 Score = 61.2 bits (147), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 45/86 (52%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V+ N LS EEC+ +I  A P M +S  V + +G  + +  RTS+G F  RG   I+
Sbjct: 107 PRIVVFGNLLSHEECDAIIAAARPRMARSLTVATQSGGEEINDDRTSNGMFFQRGETGIV 166

Query: 144 RDIEKRIADFTFFPLGMQFKFSLFFF 169
             +E+RIA    +PL       +  +
Sbjct: 167 SQLEERIARLLRWPLDHGEGLQVLHY 192


>gi|168043388|ref|XP_001774167.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674574|gb|EDQ61081.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 284

 Score = 61.2 bits (147), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 34/93 (36%), Positives = 53/93 (56%), Gaps = 3/93 (3%)

Query: 68  GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTV-VDSDTGKSKDSR 126
           GD    +   +V+SW+PRA +Y NF SKE+CE +I LA   +  S + +     ++    
Sbjct: 66  GDSSVTDIPFQVLSWKPRALLYPNFASKEQCEAIIKLARTRLAPSGLALRKGESEATTKE 125

Query: 127 VRTSSGTFLARGRDKI--IRDIEKRIADFTFFP 157
           +RTSSGTFL    DK   + ++E+++A  T  P
Sbjct: 126 IRTSSGTFLRASEDKTQSLAEVEEKMARATMIP 158


>gi|346723630|ref|YP_004850299.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346648377|gb|AEO41001.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 286

 Score = 61.2 bits (147), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 31/75 (41%), Positives = 41/75 (54%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V   FLS EEC+ LI LA PH+ +S  VD+  G+      RTS    L  G+D + 
Sbjct: 96  PRVVVLGGFLSDEECDALIALAQPHLARSRTVDNANGEHVVHAARTSDSMCLRLGQDALC 155

Query: 144 RDIEKRIADFTFFPL 158
           + IE RIA    +P+
Sbjct: 156 QRIEARIARLLDWPV 170


>gi|398804098|ref|ZP_10563100.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
 gi|398094921|gb|EJL85274.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
          Length = 277

 Score = 60.8 bits (146), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 31/75 (41%), Positives = 42/75 (56%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P  +V+ N LS  ECE LI  A   + +S  VD  TG  + +  RTS G F  RG +++I
Sbjct: 90  PELWVFDNLLSAAECEALIAAAESRLARSLTVDIRTGGEELNHDRTSHGMFYTRGENEVI 149

Query: 144 RDIEKRIADFTFFPL 158
           R IE RIA    +P+
Sbjct: 150 RRIEARIARLLNWPV 164


>gi|307725787|ref|YP_003909000.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
 gi|307586312|gb|ADN59709.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
          Length = 313

 Score = 60.8 bits (146), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 44/75 (58%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P+  V+ N LS +EC  +I  +   +++ST+VD  TG+    R RTS G +  RG D +I
Sbjct: 124 PQVIVFGNVLSPDECAEMIERSRHRLKRSTIVDPATGREDVIRNRTSEGIWYQRGEDALI 183

Query: 144 RDIEKRIADFTFFPL 158
             +++RIA    +PL
Sbjct: 184 ERLDQRIASLMNWPL 198


>gi|397568865|gb|EJK46391.1| hypothetical protein THAOC_34939 [Thalassiosira oceanica]
          Length = 488

 Score = 60.8 bits (146), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 37/100 (37%), Positives = 53/100 (53%), Gaps = 3/100 (3%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTGKSKDSRVRTSSGTF 134
           +E +S +P       FL+ EEC+Y++  A P M+ S V   D+D G+   S  RTS  TF
Sbjct: 267 IETLSMKPLVLSISGFLADEECDYIMEKAAPTMKYSGVSLKDADKGRPA-SDWRTSQSTF 325

Query: 135 LARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +A   D I+RDIE R A  T  P+  Q    +  + V +K
Sbjct: 326 VAAMGDPILRDIELRTASLTRVPVTHQEFVQVLRYGVTEK 365


>gi|209522122|ref|ZP_03270769.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
 gi|209497434|gb|EDZ97642.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
          Length = 296

 Score = 60.5 bits (145), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 37/88 (42%), Positives = 49/88 (55%), Gaps = 2/88 (2%)

Query: 73  AEQWVEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTS 130
            E+ V V+S    P A    +FLS +ECE LI LA P + +STVVD  TG++  +  R+S
Sbjct: 89  GERKVRVLSRLQRPAAVHLADFLSADECEQLIALAQPRLDRSTVVDPVTGRNVVAGHRSS 148

Query: 131 SGTFLARGRDKIIRDIEKRIADFTFFPL 158
            G F   G   +I  IE RIA  T  P+
Sbjct: 149 HGMFFRLGETPLIVRIEARIAALTGTPV 176


>gi|356574299|ref|XP_003555286.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 290

 Score = 60.5 bits (145), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 37/102 (36%), Positives = 59/102 (57%), Gaps = 7/102 (6%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTGKS-KDSRVRTSSGTF 134
           +++SW PRA  + NF S E C+ +I +A P +  S +     +T +S KD+  RTSSGTF
Sbjct: 76  QILSWRPRAVYFPNFTSVEVCQQIIEMAKPKLEPSKLALRKGETAESTKDT--RTSSGTF 133

Query: 135 LARGRDK--IIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           ++   DK  I+  +E++IA  T  P     KF++  + V +K
Sbjct: 134 ISASEDKSGILDFVERKIAKVTMIPRTHGEKFNILKYEVAQK 175


>gi|195061068|ref|XP_001995918.1| GH14106 [Drosophila grimshawi]
 gi|193891710|gb|EDV90576.1| GH14106 [Drosophila grimshawi]
          Length = 511

 Score = 60.5 bits (145), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 34/85 (40%), Positives = 50/85 (58%), Gaps = 4/85 (4%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E++   P   V+H+ LS +E +YL NLA P ++++TV     GK    RVRTS G +L 
Sbjct: 310 MEIVLLNPFIVVFHDALSPQEIDYLQNLARPLLKRTTV--HVNGKYVSRRVRTSKGAWLE 367

Query: 137 RGRDKIIRDIEKRIADFTFFPLGMQ 161
           R  + + R IE+R+ D T   L MQ
Sbjct: 368 RDLNNLTRRIERRVVDMT--ELSMQ 390


>gi|357459545|ref|XP_003600053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
 gi|355489101|gb|AES70304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
          Length = 156

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 43/67 (64%), Gaps = 3/67 (4%)

Query: 92  FLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIA 151
           + SKEECE+LI L  P++ +S + D  TGK  ++R   + G F+   +DKII++IE+RI 
Sbjct: 25  YESKEECEHLIKLGKPYLERSRISDKRTGKGIENRFAYACGGFV---KDKIIKNIEQRIP 81

Query: 152 DFTFFPL 158
           D    P+
Sbjct: 82  DIISIPV 88


>gi|407708877|ref|YP_006792741.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
 gi|407237560|gb|AFT87758.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
          Length = 300

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 44/75 (58%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P+  V+ N LS EEC+ +I  +   +++ST+VD  TG+    R RTS G +  RG D  I
Sbjct: 111 PQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEGVIRNRTSEGIWYQRGEDAFI 170

Query: 144 RDIEKRIADFTFFPL 158
             +++RIA    +P+
Sbjct: 171 ERLDRRIASLMNWPV 185


>gi|323528042|ref|YP_004230194.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
 gi|323385044|gb|ADX57134.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
          Length = 300

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 44/75 (58%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P+  V+ N LS EEC+ +I  +   +++ST+VD  TG+    R RTS G +  RG D  I
Sbjct: 111 PQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEGVIRNRTSEGIWYQRGEDAFI 170

Query: 144 RDIEKRIADFTFFPL 158
             +++RIA    +P+
Sbjct: 171 ERLDQRIASLMNWPV 185


>gi|403238305|ref|ZP_10916891.1| procollagen-proline dioxygenase [Bacillus sp. 10403023]
          Length = 296

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 31/84 (36%), Positives = 51/84 (60%), Gaps = 2/84 (2%)

Query: 77  VEVISWEPRAFVYH--NFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTF 134
           ++V+S   + F+ H   FLS+EEC+ LI ++   ++ STV+D  TG+ K +  RTS G  
Sbjct: 100 IKVLSKVEKPFILHLDYFLSEEECDQLIEMSRERLKPSTVIDPKTGEEKAATGRTSKGMS 159

Query: 135 LARGRDKIIRDIEKRIADFTFFPL 158
                ++ I+ +EKRIA+   FP+
Sbjct: 160 FYLQENEFIKKVEKRIAELIEFPV 183


>gi|170690448|ref|ZP_02881615.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
 gi|170144883|gb|EDT13044.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
          Length = 307

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 44/75 (58%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P+  V+ N LS EEC+ +I  +   +++ST+VD  TG+    R RTS G +  RG D  I
Sbjct: 118 PQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEDVIRNRTSEGIWYQRGEDAFI 177

Query: 144 RDIEKRIADFTFFPL 158
             +++RIA    +P+
Sbjct: 178 ERLDQRIASLMNWPV 192


>gi|354334983|gb|AER23925.1| procollagen-proline dioxygenase [Variovorax sp. HH01]
          Length = 280

 Score = 60.1 bits (144), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 31/75 (41%), Positives = 43/75 (57%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V+ N LS EECE LI  A   + +S  V++ TG    +  RTS G F  RG ++I+
Sbjct: 93  PRVVVFGNLLSAEECEGLIAAARVRLARSLTVETRTGGEVLNVDRTSDGMFFERGENEIV 152

Query: 144 RDIEKRIADFTFFPL 158
             +E+RIA    +PL
Sbjct: 153 ARVEQRIAALLRWPL 167


>gi|225428938|ref|XP_002262952.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296083079|emb|CBI22483.3| unnamed protein product [Vitis vinifera]
          Length = 284

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 46/144 (31%), Positives = 73/144 (50%), Gaps = 7/144 (4%)

Query: 20  ILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKS-MERSEGDEGRAEQW-V 77
           +L L I ++F  L  L   +L     +G   +   L S+   S M   E  E   +    
Sbjct: 15  LLLLFISWSFFFLAGLFGSMLFSQDVNGVRSQPRLLESVEEYSPMPHGETGESSVDMIPF 74

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTGKSKDSRVRTSSGTFL 135
           +V+SW+PRA  +  F + E+C+ +I +A  H+R ST+     +T +S     RTSSGTF+
Sbjct: 75  QVLSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLALRQGETDESTKG-TRTSSGTFI 133

Query: 136 ARGRDK--IIRDIEKRIADFTFFP 157
           +   DK  I+  +E++IA  T  P
Sbjct: 134 SASEDKTGILDFVERKIAKATMIP 157


>gi|224001336|ref|XP_002290340.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220973762|gb|EED92092.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 483

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 31/86 (36%), Positives = 47/86 (54%), Gaps = 1/86 (1%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFL 135
           +E +S  P       FLS EEC+Y+  +A+P ++ S+V   D  K KD S  RTS   FL
Sbjct: 261 IETLSLRPLVVSVEGFLSDEECDYIAEIASPQVKYSSVSLKDADKGKDSSEWRTSQSAFL 320

Query: 136 ARGRDKIIRDIEKRIADFTFFPLGMQ 161
           +   D+++ +I+ R+A  T  P   Q
Sbjct: 321 SARDDEVLTEIDHRVASLTRIPRNHQ 346


>gi|147823227|emb|CAN70872.1| hypothetical protein VITISV_009065 [Vitis vinifera]
          Length = 276

 Score = 59.7 bits (143), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 32/84 (38%), Positives = 51/84 (60%), Gaps = 5/84 (5%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTGKSKDSRVRTSSGTFL 135
           +V+SW+PRA  +  F + E+C+ +I +A  H+R ST+     +T +S     RTSSGTF+
Sbjct: 67  QVLSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLALRQGETDESTKG-TRTSSGTFI 125

Query: 136 ARGRDK--IIRDIEKRIADFTFFP 157
           +   DK  I+  +E++IA  T  P
Sbjct: 126 SASEDKTGILDFVERKIAKATMIP 149


>gi|299065638|emb|CBJ36810.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           CMR15]
          Length = 289

 Score = 59.7 bits (143), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 44/86 (51%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  ++ +FLS EEC+ LI L    +++S VV+ +TG+      RTS G     G   +I
Sbjct: 97  PRIVLFQHFLSDEECDQLITLGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLI 156

Query: 144 RDIEKRIADFTFFPLGMQFKFSLFFF 169
             IE RIA  T  P+     F +  +
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHY 182


>gi|168006299|ref|XP_001755847.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693166|gb|EDQ79520.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 299

 Score = 59.7 bits (143), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 40/113 (35%), Positives = 63/113 (55%), Gaps = 5/113 (4%)

Query: 66  SEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDS 125
           S GD   A+   +V+SW+PRA +Y  F SKE+CE ++ LA   +  S +     G+S+DS
Sbjct: 79  STGDNFIADIPFQVLSWKPRALLYPRFASKEQCEAIMKLARTRLAPSALA-LRKGESEDS 137

Query: 126 R--VRTSSGTFLARGRD--KIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
              +RTSSGTFL    D  + +  +E+++A  T  P      F++  + V +K
Sbjct: 138 TKDIRTSSGTFLRADEDTTRSLEQVEEKMAKATMIPRENGEAFNVLKYNVGQK 190


>gi|449448264|ref|XP_004141886.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 294

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 34/95 (35%), Positives = 53/95 (55%), Gaps = 5/95 (5%)

Query: 67  EGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTGKSKD 124
            GD+  +    +V+SW PRA  +  F + E+C+ ++NLA P +R ST+     +T +S  
Sbjct: 71  NGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLALRKGETAESTK 130

Query: 125 SRVRTSSGTFLARGRDK--IIRDIEKRIADFTFFP 157
             VRTSSG F +   D+   +  IE++IA  T  P
Sbjct: 131 G-VRTSSGVFFSASEDESGTLGVIEEKIARATMIP 164


>gi|325922187|ref|ZP_08183974.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
           19865]
 gi|325547306|gb|EGD18373.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
           19865]
          Length = 285

 Score = 59.3 bits (142), Expect = 5e-07,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 40/75 (53%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V  +FLS  EC+ LI LA P + +S  VD+D G       RTS    L  G+D + 
Sbjct: 96  PRVVVLGDFLSDAECDALIALAQPRLARSRTVDNDNGAQIVHAARTSDSMCLQLGQDALC 155

Query: 144 RDIEKRIADFTFFPL 158
           + IE RIA    +P+
Sbjct: 156 QRIEARIARLLDWPV 170


>gi|388505024|gb|AFK40578.1| unknown [Medicago truncatula]
          Length = 297

 Score = 59.3 bits (142), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 63/110 (57%), Gaps = 3/110 (2%)

Query: 68  GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR- 126
           GD+       +V+SW+PRA  + NF + E+CE ++++A   ++ S++       +++++ 
Sbjct: 76  GDDSITSIPFQVLSWKPRALYFPNFATAEQCENIVSVAKAGLKPSSLALRKGETTENTKG 135

Query: 127 VRTSSGTFLARGRD--KIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +RTSSG FL+  RD  K +  IE++IA  T  P      F++  + V ++
Sbjct: 136 IRTSSGVFLSASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYEVGQR 185


>gi|372266874|ref|ZP_09502922.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
           [Alteromonas sp. S89]
          Length = 294

 Score = 59.3 bits (142), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 46/89 (51%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +P   ++ NFL++ EC+ L+ ++ P++  S VV++  G  +    RTS GT  ARG   +
Sbjct: 102 QPNIVLFANFLAEWECDALVEMSRPNLSPSRVVNTQHGAFELKPSRTSGGTHFARGETPL 161

Query: 143 IRDIEKRIADFTFFPLGMQFKFSLFFFYV 171
           I DIE RIA     P        +  + V
Sbjct: 162 IADIEARIASLLKVPEAHGEPLQILHYPV 190


>gi|357476355|ref|XP_003608463.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355509518|gb|AES90660.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
          Length = 297

 Score = 59.3 bits (142), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 33/110 (30%), Positives = 63/110 (57%), Gaps = 3/110 (2%)

Query: 68  GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR- 126
           GD+       +V+SW+PRA  + NF + E+CE ++++A   ++ S++       +++++ 
Sbjct: 76  GDDSITSIPFQVLSWKPRALYFPNFATAEQCENIVSVAKAGLKPSSLALRKGETTENTKG 135

Query: 127 VRTSSGTFLARGRD--KIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +RTSSG FL+  RD  K +  IE++IA  T  P      F++  + V ++
Sbjct: 136 IRTSSGVFLSASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYEVGQR 185


>gi|120609859|ref|YP_969537.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
 gi|120588323|gb|ABM31763.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
          Length = 309

 Score = 59.3 bits (142), Expect = 6e-07,   Method: Compositional matrix adjust.
 Identities = 28/76 (36%), Positives = 44/76 (57%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR  ++ N LS EEC+ +I+ A P M +S  V + TG  + +  RTS+G F  R  + +
Sbjct: 121 QPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGGEEVNDDRTSNGMFFQREENPV 180

Query: 143 IRDIEKRIADFTFFPL 158
           +  +E RIA    +PL
Sbjct: 181 VARLEARIARLVNWPL 196


>gi|344172475|emb|CCA85118.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
          Length = 289

 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 44/86 (51%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  ++ +FLS EEC+ LI L    +++S VV+ +TG+      RTS G     G   +I
Sbjct: 97  PRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLI 156

Query: 144 RDIEKRIADFTFFPLGMQFKFSLFFF 169
             IE RIA  T  P+     F +  +
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHY 182


>gi|344169181|emb|CCA81504.1| putative Prolyl 4-hydroxylase alpha subunit [blood disease
           bacterium R229]
          Length = 289

 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 44/86 (51%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  ++ +FLS EEC+ LI L    +++S VV+ +TG+      RTS G     G   +I
Sbjct: 97  PRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLI 156

Query: 144 RDIEKRIADFTFFPLGMQFKFSLFFF 169
             IE RIA  T  P+     F +  +
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHY 182


>gi|319943342|ref|ZP_08017624.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
 gi|319743157|gb|EFV95562.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
          Length = 311

 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 45/86 (52%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P   V    LS EEC+ +I L+   M+ S VVD ++G S +S VR S G+   RG ++++
Sbjct: 121 PNIAVIRGLLSDEECDEVIRLSRGKMKTSQVVDRESGGSYESSVRKSEGSHFERGENELV 180

Query: 144 RDIEKRIADFTFFPLGMQFKFSLFFF 169
           R IE R++     P+       +  +
Sbjct: 181 RRIEARLSALVDLPVNRGEPLQILHY 206


>gi|300690371|ref|YP_003751366.1| prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum PSI07]
 gi|299077431|emb|CBJ50057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           PSI07]
          Length = 289

 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 30/86 (34%), Positives = 44/86 (51%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  ++ +FLS EEC+ LI L    +++S VV+ +TG+      RTS G     G   +I
Sbjct: 97  PRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLI 156

Query: 144 RDIEKRIADFTFFPLGMQFKFSLFFF 169
             IE RIA  T  P+     F +  +
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHY 182


>gi|326316001|ref|YP_004233673.1| procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
           ATCC 19860]
 gi|323372837|gb|ADX45106.1| Procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
           ATCC 19860]
          Length = 298

 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 28/76 (36%), Positives = 44/76 (57%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR  ++ N LS EEC+ +I+ A P M +S  V + TG  + +  RTS+G F  R  + +
Sbjct: 110 QPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGGEEVNDDRTSNGMFFQREENPM 169

Query: 143 IRDIEKRIADFTFFPL 158
           +  +E RIA    +PL
Sbjct: 170 VAKLEARIARLVNWPL 185


>gi|83746819|ref|ZP_00943867.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
 gi|83726588|gb|EAP73718.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
          Length = 289

 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 45/87 (51%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  ++ +FLS EEC+ LI L    +++S VV+ +TG+      RTS G     G   ++
Sbjct: 97  PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 156

Query: 144 RDIEKRIADFTFFPLGMQFKFSLFFFY 170
             IE RIA  T  P+     F +  ++
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYH 183


>gi|421895470|ref|ZP_16325871.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
           solanacearum MolK2]
 gi|206586635|emb|CAQ17221.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
           solanacearum MolK2]
          Length = 283

 Score = 58.9 bits (141), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 45/87 (51%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  ++ +FLS EEC+ LI L    +++S VV+ +TG+      RTS G     G   ++
Sbjct: 91  PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 150

Query: 144 RDIEKRIADFTFFPLGMQFKFSLFFFY 170
             IE RIA  T  P+     F +  ++
Sbjct: 151 ARIEARIAQATGVPVEHGEGFQVLHYH 177


>gi|386332363|ref|YP_006028532.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
 gi|334194811|gb|AEG67996.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
          Length = 292

 Score = 58.9 bits (141), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 45/87 (51%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  ++ +FLS EEC+ LI L    +++S VV+ +TG+      RTS G     G   ++
Sbjct: 100 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 159

Query: 144 RDIEKRIADFTFFPLGMQFKFSLFFFY 170
             IE RIA  T  P+     F +  ++
Sbjct: 160 ARIEARIAQATGVPVEHGEGFQVLHYH 186


>gi|207744371|ref|YP_002260763.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum IPO1609]
 gi|206595776|emb|CAQ62703.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
           solanacearum IPO1609]
          Length = 280

 Score = 58.9 bits (141), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 45/87 (51%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  ++ +FLS EEC+ LI L    +++S VV+ +TG+      RTS G     G   ++
Sbjct: 88  PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 147

Query: 144 RDIEKRIADFTFFPLGMQFKFSLFFFY 170
             IE RIA  T  P+     F +  ++
Sbjct: 148 ARIEARIAQATGVPVEHGEGFQVLHYH 174


>gi|242038031|ref|XP_002466410.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
 gi|241920264|gb|EER93408.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
          Length = 294

 Score = 58.9 bits (141), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 57/101 (56%), Gaps = 5/101 (4%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTGKSKDSRVRTSSGTFL 135
           +++SW+PRA  +  F + E+CE ++  A   ++ ST+     +T +S    +RTSSGTFL
Sbjct: 87  QILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKG-IRTSSGTFL 145

Query: 136 ARGRD--KIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +   D  + + +IEK+IA  T  P      F++  + + ++
Sbjct: 146 SANEDPTRTLAEIEKKIARATMIPRNHGEPFNVLRYNIGQR 186


>gi|17547533|ref|NP_520935.1| hypothetical protein RSc2814 [Ralstonia solanacearum GMI1000]
 gi|17429837|emb|CAD16521.1| putative prolyl 4-hydroxylase alpha subunit homologue
           oxidoreductase protein [Ralstonia solanacearum GMI1000]
          Length = 289

 Score = 58.9 bits (141), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 29/86 (33%), Positives = 44/86 (51%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  ++ +FLS EEC+ LI L    +++S VV+ +TG+      RTS G     G   ++
Sbjct: 97  PRIVLFQHFLSDEECDQLIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLV 156

Query: 144 RDIEKRIADFTFFPLGMQFKFSLFFF 169
             IE RIA  T  P+     F +  +
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHY 182


>gi|357114580|ref|XP_003559078.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 295

 Score = 58.9 bits (141), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 59/101 (58%), Gaps = 5/101 (4%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR--VRTSSGTFL 135
           +++SW+PRA  +  F + E+CE ++  A   +R ST+     G+++++   +RTSSGTFL
Sbjct: 88  QILSWQPRALYFPQFATSEQCENVVKTAKARLRPSTLA-LRKGETEETTKGIRTSSGTFL 146

Query: 136 ARGRD--KIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +   D  + + ++EK+IA  T  P      F++  + + +K
Sbjct: 147 SADEDPTRTLAEVEKKIAKATMIPRSHGEPFNVLRYEIGQK 187


>gi|300702992|ref|YP_003744594.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum
           CFBP2957]
 gi|299070655|emb|CBJ41950.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           CFBP2957]
          Length = 289

 Score = 58.9 bits (141), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 45/87 (51%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  ++ +FLS EEC+ LI L    +++S VV+ +TG+      RTS G     G   ++
Sbjct: 97  PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 156

Query: 144 RDIEKRIADFTFFPLGMQFKFSLFFFY 170
             IE RIA  T  P+     F +  ++
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYH 183


>gi|449502290|ref|XP_004161600.1| PREDICTED: putative prolyl 4-hydroxylase-like [Cucumis sativus]
          Length = 137

 Score = 58.9 bits (141), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 25/44 (56%), Positives = 31/44 (70%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGK 121
           EV+SW PR  V HNFLS +EC+YL  +A   +  STVVD+ TGK
Sbjct: 83  EVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGK 126


>gi|148653656|ref|YP_001280749.1| procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
 gi|148572740|gb|ABQ94799.1| Procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
          Length = 268

 Score = 58.5 bits (140), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 46/79 (58%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           + ++P   V ++FLS EEC+ LI+ A   ++ S VVD + G   +   RTS+ T   RG 
Sbjct: 75  VCYKPFVTVINDFLSPEECDALISDADQKLKASRVVDPEDGSFVEHSARTSTSTGYHRGE 134

Query: 140 DKIIRDIEKRIADFTFFPL 158
             II+ IE RIAD   +P+
Sbjct: 135 IDIIKTIEARIADLINWPV 153


>gi|223945827|gb|ACN26997.1| unknown [Zea mays]
 gi|414872966|tpg|DAA51523.1| TPA: prolyl 4-hydroxylase [Zea mays]
          Length = 294

 Score = 58.5 bits (140), Expect = 8e-07,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 57/101 (56%), Gaps = 5/101 (4%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTGKSKDSRVRTSSGTFL 135
           +++SW+PRA  +  F + E+CE ++  A   ++ ST+     +T +S    +RTSSGTFL
Sbjct: 87  QILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKG-IRTSSGTFL 145

Query: 136 ARGRD--KIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +   D  + + +IEK+IA  T  P      F++  + + ++
Sbjct: 146 SANEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQR 186


>gi|226499492|ref|NP_001150030.1| LOC100283657 [Zea mays]
 gi|195636206|gb|ACG37571.1| prolyl 4-hydroxylase [Zea mays]
 gi|347978804|gb|AEP37744.1| prolyl 4-hydroxylase 3 [Zea mays]
          Length = 294

 Score = 58.5 bits (140), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 57/101 (56%), Gaps = 5/101 (4%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTGKSKDSRVRTSSGTFL 135
           +++SW+PRA  +  F + E+CE ++  A   ++ ST+     +T +S    +RTSSGTFL
Sbjct: 87  QILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKG-IRTSSGTFL 145

Query: 136 ARGRD--KIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +   D  + + +IEK+IA  T  P      F++  + + ++
Sbjct: 146 SANEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQR 186


>gi|421890664|ref|ZP_16321519.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           K60-1]
 gi|378964031|emb|CCF98267.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           K60-1]
          Length = 288

 Score = 58.5 bits (140), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 29/87 (33%), Positives = 45/87 (51%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  ++ +FLS EEC+ LI L    +++S VV+ +TG+      RTS G     G   ++
Sbjct: 96  PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 155

Query: 144 RDIEKRIADFTFFPLGMQFKFSLFFFY 170
             IE RIA  T  P+     F +  ++
Sbjct: 156 ARIEARIAQATGVPVEHGEGFQVLHYH 182


>gi|398808448|ref|ZP_10567311.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
 gi|398087480|gb|EJL78066.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
          Length = 280

 Score = 58.5 bits (140), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 43/75 (57%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V+ N LS EECE LI  A   + +S  V++ TG    +  RTS G F  RG ++I+
Sbjct: 93  PRVVVFGNLLSAEECEGLIAAARVRLARSLTVETRTGGEVLNVDRTSDGMFFERGENEIV 152

Query: 144 RDIEKRIADFTFFPL 158
             +E+R+A    +PL
Sbjct: 153 ARLEQRLATLLRWPL 167


>gi|254254263|ref|ZP_04947580.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
 gi|124898908|gb|EAY70751.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
          Length = 285

 Score = 58.2 bits (139), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 44/75 (58%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P+  V+ N L ++EC+ +I  +   + +ST V+++TG  +  R RTS GT+   G D +I
Sbjct: 96  PQIVVFGNVLDQDECDEMIQRSMHKLEQSTTVNAETGTQEVIRHRTSHGTWFQNGEDALI 155

Query: 144 RDIEKRIADFTFFPL 158
           R IE R+A     P+
Sbjct: 156 RRIETRLAALMNCPV 170


>gi|295699617|ref|YP_003607510.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
 gi|295438830|gb|ADG17999.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
          Length = 286

 Score = 58.2 bits (139), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 44/75 (58%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P+  V+ + LS  EC  LI  +   +++ST V+  TG+    R RTS G +  RG D++I
Sbjct: 97  PQLVVFADVLSAAECAELIERSRHRLKRSTTVNPLTGREDVIRNRTSEGVWYRRGEDQLI 156

Query: 144 RDIEKRIADFTFFPL 158
             +E+RIA  T +PL
Sbjct: 157 ARVERRIASLTNWPL 171


>gi|40809925|dbj|BAD07294.1| prolyl 4-hydroxylase [Nicotiana tabacum]
          Length = 286

 Score = 58.2 bits (139), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 33/84 (39%), Positives = 54/84 (64%), Gaps = 5/84 (5%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR--VRTSSGTFL 135
           +V+SW PRA  + NF S E+C+ +I +A  +M  S++    TG+++++   +RTSSGTF+
Sbjct: 77  QVLSWFPRALYFPNFASIEQCQSIIKMAKANMEPSSLA-LRTGETEETTKGIRTSSGTFI 135

Query: 136 ARGRDK--IIRDIEKRIADFTFFP 157
           +   DK  I+  IE++IA  T  P
Sbjct: 136 SASEDKTGILDLIEEKIAKATMIP 159


>gi|319792090|ref|YP_004153730.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
 gi|315594553|gb|ADU35619.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
          Length = 280

 Score = 57.8 bits (138), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 43/75 (57%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V+ N LS EECE LI  A   + +S  V++ TG    +  RTS G F  RG ++I+
Sbjct: 93  PRVIVFGNLLSTEECEGLIAAARVRLARSLTVETRTGGEVLNVDRTSDGMFFERGENEIV 152

Query: 144 RDIEKRIADFTFFPL 158
             +E+R+A    +PL
Sbjct: 153 ARLEQRLAMLLRWPL 167


>gi|449488641|ref|XP_004158125.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101218968
           [Cucumis sativus]
          Length = 311

 Score = 57.8 bits (138), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 48/93 (51%), Gaps = 1/93 (1%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +SW PR F+Y  FLS EEC++LI+LA+      +   + +G +  + +  SSG  L 
Sbjct: 56  VVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVIL- 114

Query: 137 RGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFF 169
              D I+  IE R+A +T  P      F +  +
Sbjct: 115 NTTDDIVARIENRLAIWTLLPKDHSMPFQIMQY 147


>gi|449469338|ref|XP_004152378.1| PREDICTED: uncharacterized protein LOC101218968 [Cucumis sativus]
          Length = 311

 Score = 57.8 bits (138), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 31/93 (33%), Positives = 48/93 (51%), Gaps = 1/93 (1%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +SW PR F+Y  FLS EEC++LI+LA+      +   + +G +  + +  SSG  L 
Sbjct: 56  VVQVSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVIL- 114

Query: 137 RGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFF 169
              D I+  IE R+A +T  P      F +  +
Sbjct: 115 NTTDDIVARIENRLAIWTLLPKDHSMPFQIMQY 147


>gi|359490628|ref|XP_002271805.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Vitis
           vinifera]
          Length = 312

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 53/97 (54%), Gaps = 7/97 (7%)

Query: 62  SMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGK 121
           S+E +  D  R  Q    +SW+PRAF+Y  FLS EEC++LI+LA     +      D+G 
Sbjct: 46  SIEYNRVDPSRVIQ----LSWQPRAFLYRGFLSDEECDHLISLALGKKEELATNGGDSGN 101

Query: 122 SKDSR-VRTSSGTFLARGRDKIIRDIEKRIADFTFFP 157
               R +++S G       D++   IEKRI+ +TF P
Sbjct: 102 VVLKRLLKSSEGPLYI--DDEVAARIEKRISAWTFLP 136


>gi|302143843|emb|CBI22704.3| unnamed protein product [Vitis vinifera]
          Length = 317

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 36/97 (37%), Positives = 53/97 (54%), Gaps = 7/97 (7%)

Query: 62  SMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGK 121
           S+E +  D  R  Q    +SW+PRAF+Y  FLS EEC++LI+LA     +      D+G 
Sbjct: 46  SIEYNRVDPSRVIQ----LSWQPRAFLYRGFLSDEECDHLISLALGKKEELATNGGDSGN 101

Query: 122 SKDSR-VRTSSGTFLARGRDKIIRDIEKRIADFTFFP 157
               R +++S G       D++   IEKRI+ +TF P
Sbjct: 102 VVLKRLLKSSEGPLYI--DDEVAARIEKRISAWTFLP 136


>gi|356496957|ref|XP_003517331.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 299

 Score = 57.4 bits (137), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 35/101 (34%), Positives = 60/101 (59%), Gaps = 5/101 (4%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR--VRTSSGTFL 135
           +V+SW PRA  + NF+S E+CE +I +A   ++ ST+V    G++++S   +RTS G F+
Sbjct: 90  QVLSWYPRALYFPNFVSAEQCETIIEMARGGLKPSTLV-LRKGETEESTKGIRTSYGVFM 148

Query: 136 ARGRDK--IIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +   D+  I+  IE++IA  T  P      F++  + V +K
Sbjct: 149 SASEDETGILDSIEEKIAKATKIPRTHGEAFNILRYEVGQK 189


>gi|449511009|ref|XP_004163837.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-1-like [Cucumis sativus]
          Length = 294

 Score = 57.0 bits (136), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 33/95 (34%), Positives = 52/95 (54%), Gaps = 5/95 (5%)

Query: 67  EGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTGKSKD 124
            GD+  +    +V+SW PRA  +  F + E+C+ ++NLA P +R ST+     +T +S  
Sbjct: 71  NGDDSISSIPFQVLSWRPRALYFPKFATAEQCQSIVNLAKPKLRPSTLALRKGETAESTK 130

Query: 125 SRVRTSSGTFLARGRDK--IIRDIEKRIADFTFFP 157
             VRTSSG F +   D+   +  IE++ A  T  P
Sbjct: 131 G-VRTSSGVFFSASEDESGTLGVIEEKXARATMIP 164


>gi|326492085|dbj|BAJ98267.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 347

 Score = 57.0 bits (136), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 32/101 (31%), Positives = 57/101 (56%), Gaps = 5/101 (4%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR--VRTSSGTFL 135
           +++SW+PRA  +  F + E+CE ++  A   +R ST+     G+S+++   +RTSSGTFL
Sbjct: 140 QILSWQPRALYFPQFATAEQCENVVKTAKARLRPSTLA-LRKGESEETTKGIRTSSGTFL 198

Query: 136 ARGRDKI--IRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +   D    + +IE +IA  T  P      F++  + + +K
Sbjct: 199 SAEEDPTGALAEIETKIAKATMMPRSHGEPFNVLRYEIGQK 239


>gi|356541677|ref|XP_003539300.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 297

 Score = 57.0 bits (136), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 35/101 (34%), Positives = 59/101 (58%), Gaps = 5/101 (4%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR--VRTSSGTFL 135
           +V+SW PRA  + NF S E+CE +I +A   ++ ST+     G++++S   +RTSSG F+
Sbjct: 90  QVLSWYPRALYFPNFASAEQCESIIEMARGGLKSSTLA-LRKGETEESTKGIRTSSGVFM 148

Query: 136 ARGRDK--IIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +   D+  I+  IE++IA  T  P      F++  + V +K
Sbjct: 149 SASEDETGILDAIEEKIAKATKIPRTHGEAFNILRYEVGQK 189


>gi|356563543|ref|XP_003550021.1| PREDICTED: putative prolyl 4-hydroxylase-like [Glycine max]
          Length = 293

 Score = 57.0 bits (136), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 33/101 (32%), Positives = 59/101 (58%), Gaps = 5/101 (4%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR--VRTSSGTFL 135
           +V+SW PRA  + NF + E+CE +I++A   ++ ST+     G+++D+   +RTSSG F+
Sbjct: 84  QVLSWRPRAVYFPNFATAEQCESIIDVAKDGLKPSTLA-LRQGETEDNTKGIRTSSGVFV 142

Query: 136 ARGRDK--IIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +   DK   +  IE++IA  T  P      F++  + V ++
Sbjct: 143 SASEDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQR 183


>gi|255647903|gb|ACU24410.1| unknown [Glycine max]
          Length = 293

 Score = 57.0 bits (136), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 33/101 (32%), Positives = 59/101 (58%), Gaps = 5/101 (4%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR--VRTSSGTFL 135
           +V+SW PRA  + NF + E+CE +I++A   ++ ST+     G+++D+   +RTSSG F+
Sbjct: 84  QVLSWRPRAVYFPNFATAEQCESIIDVAKDGLKPSTLA-LRQGETEDNTKGIRTSSGVFV 142

Query: 136 ARGRDK--IIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +   DK   +  IE++IA  T  P      F++  + V ++
Sbjct: 143 SASEDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQR 183


>gi|356536125|ref|XP_003536590.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 286

 Score = 57.0 bits (136), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 53/177 (29%), Positives = 85/177 (48%), Gaps = 20/177 (11%)

Query: 13  KSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERS-----E 67
           K  SS L L +  +F    L     G    P    D       S I+++S+++       
Sbjct: 5   KVKSSKLKLGVPTLFILCALFFFV-GFFVSPLLFQDLDDVGPRSRILQESVKKEYEPLEH 63

Query: 68  GDEGRAEQWV-----EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTG 120
           G+ G  E +V     +++SW PRA  + NF S E C+ +I +A P +  S +     +T 
Sbjct: 64  GESG--EPFVDSIPSQILSWRPRAVFFPNFTSVEVCQQIIEMAKPKLEPSKLALRKGETA 121

Query: 121 KS-KDSRVRTSSGTFLARGRDK--IIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +S KD+  RTSSGTF++   DK  I+  +E++IA  T  P      F++  + V +K
Sbjct: 122 ESTKDT--RTSSGTFISASEDKSGILDLVERKIAKVTMIPRTHGEIFNILKYEVGQK 176


>gi|268536692|ref|XP_002633481.1| C. briggsae CBR-PHY-2 protein [Caenorhabditis briggsae]
 gi|94442973|emb|CAJ98659.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
          Length = 539

 Score = 57.0 bits (136), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 26/78 (33%), Positives = 47/78 (60%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE++ ++P A ++ N +S  E E +  LA+P ++++TV +S TG+ + +  R S   +L 
Sbjct: 318 VEILRFDPLAVLFKNVISDSEIEVIKELASPKLKRATVQNSKTGELEHATYRISKSAWLK 377

Query: 137 RGRDKIIRDIEKRIADFT 154
              D +I  + +RI DFT
Sbjct: 378 GDLDPVIDRVNRRIEDFT 395


>gi|319763870|ref|YP_004127807.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
 gi|330823866|ref|YP_004387169.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
 gi|317118431|gb|ADV00920.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
 gi|329309238|gb|AEB83653.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
          Length = 284

 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 43/75 (57%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  ++ N LS EEC+ +I  A   M +S  V + +G  + ++ RTS G F  RG ++ +
Sbjct: 97  PRVVLFGNLLSPEECQAVIEAARTRMARSLTVQAASGGEEVNKDRTSDGMFFQRGENEAV 156

Query: 144 RDIEKRIADFTFFPL 158
             +E+RIA    +P+
Sbjct: 157 ARLEERIARLVRWPV 171


>gi|78046308|ref|YP_362483.1| 2OG-Fe(II) oxygenase [Xanthomonas campestris pv. vesicatoria str.
           85-10]
 gi|78034738|emb|CAJ22383.1| putative 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas
           campestris pv. vesicatoria str. 85-10]
          Length = 296

 Score = 56.6 bits (135), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 40/75 (53%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V   FLS EEC+ LI LA P + +S  VD+  G+      RTS    L  G+D + 
Sbjct: 106 PRVVVLGGFLSDEECDALIALARPRLARSRTVDNANGEHVVHAARTSDSMCLRLGQDALC 165

Query: 144 RDIEKRIADFTFFPL 158
           + IE RIA    +P+
Sbjct: 166 QRIEARIARLLDWPV 180


>gi|239814309|ref|YP_002943219.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
 gi|239800886|gb|ACS17953.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
          Length = 279

 Score = 56.6 bits (135), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 42/75 (56%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V+ N +S EECE LI  A   + +S  V++ TG    +  RTS G F  RG + I+
Sbjct: 92  PRVVVFGNLVSPEECEGLIAAARVRLARSLTVETRTGGEVLNVDRTSEGMFFERGENDIV 151

Query: 144 RDIEKRIADFTFFPL 158
             +E+RIA    +P+
Sbjct: 152 ARLEQRIAALLRWPV 166


>gi|363807682|ref|NP_001242420.1| uncharacterized protein LOC100775302 [Glycine max]
 gi|255641811|gb|ACU21174.1| unknown [Glycine max]
          Length = 293

 Score = 56.2 bits (134), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 34/113 (30%), Positives = 61/113 (53%), Gaps = 3/113 (2%)

Query: 65  RSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD 124
           R  GD+       +V+SW PRA  + NF + E+CE +I++A   ++ ST+        ++
Sbjct: 71  RDLGDDSITSIPFQVLSWRPRALYFPNFATAEQCENIIDVAKDGLKPSTLALRQGETEEN 130

Query: 125 SR-VRTSSGTFLARGRDK--IIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           ++ +RTSSG F++   DK   +  IE++IA  T  P      F++  + V ++
Sbjct: 131 TKGIRTSSGVFVSASGDKTGTLAVIEEKIARATMIPRSHGEAFNILRYEVDQR 183


>gi|196011902|ref|XP_002115814.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
 gi|190581590|gb|EDV21666.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
          Length = 534

 Score = 56.2 bits (134), Expect = 5e-06,   Method: Composition-based stats.
 Identities = 30/78 (38%), Positives = 43/78 (55%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VEVIS +P   +YHN L+  E E L  LA P ++++TV + DTGK + +  R S   +L 
Sbjct: 327 VEVISLQPYILIYHNLLNDLEVEALKTLAAPMLQRATVHNKDTGKLEYATYRISKSAWLN 386

Query: 137 RGRDKIIRDIEKRIADFT 154
                ++R I   I D T
Sbjct: 387 DDDHPLVRRISTLIEDVT 404


>gi|241664232|ref|YP_002982592.1| procollagen-proline dioxygenase [Ralstonia pickettii 12D]
 gi|309783051|ref|ZP_07677770.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
 gi|404397139|ref|ZP_10988932.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
 gi|240866259|gb|ACS63920.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12D]
 gi|308918159|gb|EFP63837.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
 gi|348610674|gb|EGY60360.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
          Length = 288

 Score = 55.8 bits (133), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 43/86 (50%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  ++ +FLS +EC+ LI +    +++S VV+ DTG+      RTS G     G   +I
Sbjct: 96  PRIVLFQHFLSDQECDELIAIGRNRLKRSPVVNPDTGEENLISARTSQGGMFQVGEHPLI 155

Query: 144 RDIEKRIADFTFFPLGMQFKFSLFFF 169
             IE RIA     P+     F +  +
Sbjct: 156 AKIEARIAQAVGVPVEHGEGFQVLNY 181


>gi|418523362|ref|ZP_13089380.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
 gi|410699993|gb|EKQ58573.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
          Length = 286

 Score = 55.8 bits (133), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 40/75 (53%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V   FLS  EC+ LI LA P + +S  VD+  G+      RTS G  L  G+D + 
Sbjct: 96  PRVVVLGGFLSDGECDALIALARPRLARSRTVDNANGEHLVHAARTSDGMCLRVGQDALC 155

Query: 144 RDIEKRIADFTFFPL 158
           + IE RIA    +P+
Sbjct: 156 QRIEARIARLFDWPV 170


>gi|124267278|ref|YP_001021282.1| hypothetical protein Mpe_A2091 [Methylibium petroleiphilum PM1]
 gi|124260053|gb|ABM95047.1| conserved hypothetical protein [Methylibium petroleiphilum PM1]
          Length = 289

 Score = 55.8 bits (133), Expect = 6e-06,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 40/76 (52%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR  V+   LS  EC+ ++ LA   + +S  VD+ TG S+ +  RTS G F  RG   +
Sbjct: 101 DPRVIVFSGLLSDAECDEIVALAGARLARSHTVDTATGASEVNAARTSDGMFFTRGEHPV 160

Query: 143 IRDIEKRIADFTFFPL 158
               E RIA    +P+
Sbjct: 161 CARFEARIAALLNWPV 176


>gi|225428943|ref|XP_002263094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296083076|emb|CBI22480.3| unnamed protein product [Vitis vinifera]
          Length = 282

 Score = 55.5 bits (132), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 32/84 (38%), Positives = 54/84 (64%), Gaps = 5/84 (5%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR--VRTSSGTFL 135
           +V+SW+PRA  + +F + E+C+ +I +A   +  ST+V    G++++S   +RTSSGTF+
Sbjct: 74  QVLSWKPRARYFPHFATAEQCQSIIEMAKSGLSPSTLV-LRKGETEESTKGIRTSSGTFI 132

Query: 136 ARGRDK--IIRDIEKRIADFTFFP 157
           +   DK  I+  IE++IA  T  P
Sbjct: 133 SASEDKTGILDFIERKIAKATMIP 156


>gi|321474898|gb|EFX85862.1| hypothetical protein DAPPUDRAFT_309117 [Daphnia pulex]
          Length = 541

 Score = 55.5 bits (132), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 28/78 (35%), Positives = 43/78 (55%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E+ S +PR  +YHN ++ EE E    LA   +R+STV +S TG S+ ++ R +   FL 
Sbjct: 335 MELASLKPRLVIYHNVVTDEEIETAKKLAQSRLRRSTVQNSLTGASEPTKYRIAKAAFLQ 394

Query: 137 RGRDKIIRDIEKRIADFT 154
                 I  + +RI D T
Sbjct: 395 NSEHDHIVKMTRRIGDVT 412


>gi|186474111|ref|YP_001861453.1| procollagen-proline dioxygenase [Burkholderia phymatum STM815]
 gi|184196443|gb|ACC74407.1| Procollagen-proline dioxygenase [Burkholderia phymatum STM815]
          Length = 305

 Score = 55.5 bits (132), Expect = 9e-06,   Method: Compositional matrix adjust.
 Identities = 33/123 (26%), Positives = 61/123 (49%), Gaps = 18/123 (14%)

Query: 36  AFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSK 95
           A+   + P ++G+  +A+D    VR   ER                  P+  V+ + LS+
Sbjct: 86  AYDYDACPVAAGNIVRAHDRDVAVRVRFER------------------PQVIVFDDVLSR 127

Query: 96  EECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTF 155
           +EC+ LI  A   +++ST V+ ++G+    ++RTS G +  R  D  I  +++RI+    
Sbjct: 128 DECDELIERARHRLKRSTTVNPESGREDVIQLRTSEGFWFQRCEDAFIERLDRRISALMN 187

Query: 156 FPL 158
           +PL
Sbjct: 188 WPL 190


>gi|17541712|ref|NP_502317.1| Protein PHY-2 [Caenorhabditis elegans]
 gi|32171589|sp|Q20065.1|P4HA2_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|3876769|emb|CAA93469.1| Protein PHY-2 [Caenorhabditis elegans]
          Length = 539

 Score = 55.1 bits (131), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 25/78 (32%), Positives = 46/78 (58%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE++ ++P A ++ N +   E E +  LA+P ++++TV +S TG+ + +  R S   +L 
Sbjct: 318 VEILRFDPLAVLFKNVIHDSEIEVIKELASPKLKRATVQNSKTGELEHATYRISKSAWLK 377

Query: 137 RGRDKIIRDIEKRIADFT 154
              D +I  + +RI DFT
Sbjct: 378 GDLDPVIDRVNRRIEDFT 395


>gi|325915062|ref|ZP_08177391.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325538760|gb|EGD10427.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 286

 Score = 54.7 bits (130), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 38/75 (50%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V   FLS  EC+ +I LA P + +S  VD+  G       RTS    L  G+D + 
Sbjct: 96  PRVMVLGGFLSDAECDAMIALAQPRLARSRTVDNANGAHVVHAARTSDSMCLQLGQDALC 155

Query: 144 RDIEKRIADFTFFPL 158
           + IE RIA    +P+
Sbjct: 156 QRIEARIARLLDWPV 170


>gi|374620441|ref|ZP_09692975.1| 2OG-Fe(II) oxygenase superfamily enzyme [gamma proteobacterium
           HIMB55]
 gi|374303668|gb|EHQ57852.1| 2OG-Fe(II) oxygenase superfamily enzyme [gamma proteobacterium
           HIMB55]
          Length = 570

 Score = 54.7 bits (130), Expect = 1e-05,   Method: Composition-based stats.
 Identities = 31/81 (38%), Positives = 46/81 (56%), Gaps = 2/81 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           E  S +P   V +N +S  EC YLI LA PH++++ VV  D G  K+S  RT S  +L  
Sbjct: 15  EAYSLDPLVGVRNNVISPVECAYLIELAKPHIKRAGVV-LDEG-YKESEGRTGSNHWLKY 72

Query: 138 GRDKIIRDIEKRIADFTFFPL 158
             D +++ + +RI+D    PL
Sbjct: 73  DEDDVVQSVGQRISDIVGLPL 93


>gi|377810637|ref|YP_005043077.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
           YI23]
 gi|357939998|gb|AET93554.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
           YI23]
          Length = 297

 Score = 54.7 bits (130), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 32/84 (38%), Positives = 45/84 (53%), Gaps = 2/84 (2%)

Query: 73  AEQWVEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTS 130
           AE+   VI+    P A +   FL+  EC+ LI LA P + +STVVD  TG+   +  R+S
Sbjct: 89  AERKTRVIARLQRPAAVLLDEFLTGSECDQLIALARPRLSRSTVVDPVTGRDVAAGHRSS 148

Query: 131 SGTFLARGRDKIIRDIEKRIADFT 154
            GTF       ++  +E RIA  T
Sbjct: 149 DGTFFRLAETPLVARLEMRIAALT 172


>gi|406665340|ref|ZP_11073114.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
 gi|405387266|gb|EKB46691.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
          Length = 211

 Score = 54.7 bits (130), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 50/105 (47%), Gaps = 18/105 (17%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV+  EP    + N LS EEC+ LI+ A+  + +S +      K + S +RTSSG F   
Sbjct: 24  EVLHEEPLIVKFLNVLSDEECQNLIDCASSRLERSKL-----AKKEISSIRTSSGMFFEE 78

Query: 138 GRDKIIRDIEKRIADFTFFPL-------------GMQFKFSLFFF 169
             + +I +IEKRI+     P+             G +FK    FF
Sbjct: 79  NENPLISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKAHFDFF 123


>gi|260802724|ref|XP_002596242.1| hypothetical protein BRAFLDRAFT_117983 [Branchiostoma floridae]
 gi|229281496|gb|EEN52254.1| hypothetical protein BRAFLDRAFT_117983 [Branchiostoma floridae]
          Length = 527

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 25/82 (30%), Positives = 49/82 (59%)

Query: 73  AEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSG 132
           A Q +E +  +P+ +++HN L+  E + + +LA P +R++T+ +S TG+ + +  R S  
Sbjct: 346 APQKLEQVFDKPKMWIFHNILTDPEMKVIKDLAQPRLRRATIQNSITGELEHASYRISKS 405

Query: 133 TFLARGRDKIIRDIEKRIADFT 154
            +L     K+IR + +R+ D T
Sbjct: 406 AWLQGWEHKVIRRVNQRVEDVT 427


>gi|225438938|ref|XP_002279411.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296087348|emb|CBI33722.3| unnamed protein product [Vitis vinifera]
          Length = 285

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 32/102 (31%), Positives = 56/102 (54%), Gaps = 7/102 (6%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTV---VDSDTGKSKDSRVRTSSGTF 134
           +V+SW PRA  + NF + E+C+ +IN+A  ++  STV   V    G ++   +RTSSG F
Sbjct: 76  QVLSWRPRALYFPNFATSEQCQSIINMAKSNLTPSTVALRVGEIRGNTEG--IRTSSGVF 133

Query: 135 LARGRDK--IIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           ++   DK   +  IE++IA     P      F++  + + ++
Sbjct: 134 ISASEDKTGTLDLIEQKIARVIMIPRTHGEAFNVLRYEIGQR 175


>gi|412986224|emb|CCO17424.1| predicted protein [Bathycoccus prasinos]
          Length = 557

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/76 (39%), Positives = 45/76 (59%), Gaps = 7/76 (9%)

Query: 79  VISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL--A 136
            +S  P  FV+ NFL + ECE+L  LA   +++S V D      K S  RTSS  FL  A
Sbjct: 318 CVSLSPLLFVFENFLHESECEFLRTLADKDLKRSRVTD-----GKLSNGRTSSSCFLIGA 372

Query: 137 RGRDKIIRDIEKRIAD 152
           +G++ +++ IE+R+ D
Sbjct: 373 KGKEDVVKTIERRMLD 388


>gi|187930127|ref|YP_001900614.1| procollagen-proline dioxygenase [Ralstonia pickettii 12J]
 gi|187727017|gb|ACD28182.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12J]
          Length = 288

 Score = 54.3 bits (129), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 42/86 (48%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  ++ +FLS  EC+ LI +    +++S VV+ DTG+      RTS G     G   +I
Sbjct: 96  PRIVLFQHFLSDAECDELIAIGRNRLKRSPVVNPDTGEENLISARTSQGGMFQVGEHPLI 155

Query: 144 RDIEKRIADFTFFPLGMQFKFSLFFF 169
             IE RIA     P+     F +  +
Sbjct: 156 AKIEVRIAQAVGVPVEHGEGFQVLNY 181


>gi|115455509|ref|NP_001051355.1| Os03g0761900 [Oryza sativa Japonica Group]
 gi|14488368|gb|AAK63935.1|AC084282_16 putative dioxygenase [Oryza sativa Japonica Group]
 gi|17027263|gb|AAL34117.1|AC090713_4 putative hydroxylase subunit [Oryza sativa Japonica Group]
 gi|108711218|gb|ABF99013.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|113549826|dbj|BAF13269.1| Os03g0761900 [Oryza sativa Japonica Group]
 gi|125545807|gb|EAY91946.1| hypothetical protein OsI_13633 [Oryza sativa Indica Group]
          Length = 310

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 55/97 (56%), Gaps = 8/97 (8%)

Query: 68  GDEGRAEQWV---EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD 124
           G+ G  E  +   +++SW+PRA  +  F + ++CE ++  A   +  ST+     G++++
Sbjct: 88  GESGEPEPSLIPYQILSWQPRALYFPQFATSQQCENIVKTAKQRLMPSTLA-LRKGETEE 146

Query: 125 SR--VRTSSGTFLARGRDK--IIRDIEKRIADFTFFP 157
           S   +RTSSGTFL+   D    + ++EK+IA  T  P
Sbjct: 147 STKGIRTSSGTFLSSDEDPTGTLAEVEKKIAKATMIP 183


>gi|224034451|gb|ACN36301.1| unknown [Zea mays]
 gi|413945801|gb|AFW78450.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
          Length = 295

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 44/78 (56%), Gaps = 13/78 (16%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           IS +PR F+Y +FLS +E  +LI+LA   +++S V D+ +GKS  S              
Sbjct: 54  ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLS-------------E 100

Query: 140 DKIIRDIEKRIADFTFFP 157
           D I+  IE +IA +TF P
Sbjct: 101 DPIVEGIEDKIAAWTFLP 118


>gi|145344669|ref|XP_001416850.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577076|gb|ABO95143.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 225

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 36/96 (37%), Positives = 54/96 (56%), Gaps = 7/96 (7%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL- 135
           V V+S +   FV  +FLS+EE + LI +A P M++S V D      K S  RTS+ TFL 
Sbjct: 21  VRVLSNDCLLFVLEDFLSEEEGDQLIEIARPSMQRSRVTD-----GKLSEGRTSTSTFLT 75

Query: 136 -ARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFY 170
            AR  D ++ +IE+RI      PL ++ + ++   Y
Sbjct: 76  GARAHDDLVLEIERRIQAAIRLPLIVERRKNVKVMY 111


>gi|430808003|ref|ZP_19435118.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
 gi|429499635|gb|EKZ98045.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
          Length = 293

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 42/86 (48%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  +  N L   EC+ ++ LA   +++S VV+ DTG       RTS G     G   ++
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPVVNPDTGDENLIDARTSMGAMFQVGEHALL 160

Query: 144 RDIEKRIADFTFFPLGMQFKFSLFFF 169
           + IE RIA  T +P+     F +  +
Sbjct: 161 QRIEARIAAVTGWPVEHGEGFQVLNY 186


>gi|125588006|gb|EAZ28670.1| hypothetical protein OsJ_12681 [Oryza sativa Japonica Group]
          Length = 280

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 31/97 (31%), Positives = 55/97 (56%), Gaps = 8/97 (8%)

Query: 68  GDEGRAEQWV---EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD 124
           G+ G  E  +   +++SW+PRA  +  F + ++CE ++  A   +  ST+     G++++
Sbjct: 58  GESGEPEPSLIPYQILSWQPRALYFPQFATSQQCENIVKTAKQRLMPSTLA-LRKGETEE 116

Query: 125 SR--VRTSSGTFLARGRDK--IIRDIEKRIADFTFFP 157
           S   +RTSSGTFL+   D    + ++EK+IA  T  P
Sbjct: 117 STKGIRTSSGTFLSSDEDPTGTLAEVEKKIAKATMIP 153


>gi|187920106|ref|YP_001889137.1| procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
 gi|187718544|gb|ACD19767.1| Procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
          Length = 295

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 43/75 (57%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P+  V+ + LS +EC  +I  +   +++ST V+ +TGK    R RTS G +  RG D  I
Sbjct: 106 PQVIVFGDVLSPDECAEMIERSRHRLKRSTTVNPETGKEDVIRNRTSEGIWYQRGEDAFI 165

Query: 144 RDIEKRIADFTFFPL 158
             +++RI+    +P+
Sbjct: 166 ERMDRRISSLMNWPV 180


>gi|393200372|ref|YP_006462214.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
 gi|327439703|dbj|BAK16068.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
          Length = 211

 Score = 53.9 bits (128), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 44/81 (54%), Gaps = 5/81 (6%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV+  EP    + N LS EEC+ LI+ A+  + +S +      K + S +RTSSG F   
Sbjct: 24  EVLHEEPLIVKFLNVLSDEECQNLIDCASSRLERSKL-----AKKEISSIRTSSGMFFEE 78

Query: 138 GRDKIIRDIEKRIADFTFFPL 158
             + +I +IEKRI+     P+
Sbjct: 79  NENPLISEIEKRISSLMHLPI 99


>gi|94312029|ref|YP_585239.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
 gi|93355881|gb|ABF09970.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
          Length = 293

 Score = 53.9 bits (128), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 42/86 (48%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  +  N L   EC+ ++ LA   +++S VV+ DTG       RTS G     G   ++
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPVVNPDTGDENLIDARTSMGAMFQVGEHALL 160

Query: 144 RDIEKRIADFTFFPLGMQFKFSLFFF 169
           + IE RIA  T +P+     F +  +
Sbjct: 161 QRIEARIAAVTGWPVEHGEGFQVLNY 186


>gi|77747935|ref|NP_638775.2| hypothetical protein XCC3429 [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
          Length = 288

 Score = 53.5 bits (127), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 38/75 (50%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V    L+ +EC+ LI LA P + +S  VD+  G       RTS    L  G+D + 
Sbjct: 98  PRVVVLGGLLADDECDALIALARPQLARSRTVDNRDGSEIVHAARTSHSMALQPGQDALC 157

Query: 144 RDIEKRIADFTFFPL 158
           + IE RIA    +P+
Sbjct: 158 QRIEARIAQLLEWPV 172


>gi|428175714|gb|EKX44602.1| hypothetical protein GUITHDRAFT_71994 [Guillardia theta CCMP2712]
          Length = 244

 Score = 53.5 bits (127), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 33/86 (38%), Positives = 46/86 (53%), Gaps = 4/86 (4%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV---DSDTGKSK-DSRVRTSSG 132
           V+ +S  PR FV  NFLS EECE +I  ATP +  STV+   D   G+ K    VRTS  
Sbjct: 20  VKRLSSTPRLFVVENFLSAEECEEIIKTATPLLAPSTVLKQGDQSNGEEKVKDEVRTSET 79

Query: 133 TFLARGRDKIIRDIEKRIADFTFFPL 158
            +L   +  I+  I +R+ +    P+
Sbjct: 80  AWLMDKKVPIVAKIRQRVEELIRIPM 105


>gi|77761111|ref|YP_241833.2| hypothetical protein XC_0735 [Xanthomonas campestris pv. campestris
           str. 8004]
          Length = 288

 Score = 53.5 bits (127), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 38/75 (50%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V    L+ +EC+ LI LA P + +S  VD+  G       RTS    L  G+D + 
Sbjct: 98  PRVVVLGGLLADDECDALIALARPQLARSRTVDNRDGSEIVHAARTSHSMALQPGQDALC 157

Query: 144 RDIEKRIADFTFFPL 158
           + IE RIA    +P+
Sbjct: 158 QRIEARIAQLLEWPV 172


>gi|77748547|ref|NP_641044.2| hypothetical protein XAC0691 [Xanthomonas axonopodis pv. citri str.
           306]
 gi|381169877|ref|ZP_09879039.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
 gi|380689647|emb|CCG35526.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
          Length = 286

 Score = 53.5 bits (127), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 39/75 (52%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V   FLS  EC+ LI LA P + +S  VD+  G+      RTS    L  G+D + 
Sbjct: 96  PRVVVLGGFLSDGECDALIALARPRLARSRTVDNANGEHMVHAARTSDSMCLRVGQDALC 155

Query: 144 RDIEKRIADFTFFPL 158
           + IE RIA    +P+
Sbjct: 156 QRIEARIARLFDWPV 170


>gi|21114687|gb|AAM42699.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris str. ATCC 33913]
          Length = 308

 Score = 53.5 bits (127), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 38/75 (50%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V    L+ +EC+ LI LA P + +S  VD+  G       RTS    L  G+D + 
Sbjct: 118 PRVVVLGGLLADDECDALIALARPQLARSRTVDNRDGSEIVHAARTSHSMALQPGQDALC 177

Query: 144 RDIEKRIADFTFFPL 158
           + IE RIA    +P+
Sbjct: 178 QRIEARIAQLLEWPV 192


>gi|66572403|gb|AAY47813.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris str. 8004]
          Length = 308

 Score = 53.5 bits (127), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 38/75 (50%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V    L+ +EC+ LI LA P + +S  VD+  G       RTS    L  G+D + 
Sbjct: 118 PRVVVLGGLLADDECDALIALARPQLARSRTVDNRDGSEIVHAARTSHSMALQPGQDALC 177

Query: 144 RDIEKRIADFTFFPL 158
           + IE RIA    +P+
Sbjct: 178 QRIEARIAQLLEWPV 192


>gi|384429387|ref|YP_005638747.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
           campestris pv. raphani 756C]
 gi|341938490|gb|AEL08629.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
           campestris pv. raphani 756C]
          Length = 286

 Score = 53.5 bits (127), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 38/75 (50%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V    LS +EC+ LI LA P + +S  VD+  G       RTS    L  G+D + 
Sbjct: 96  PRVVVLGGLLSDDECDALIALARPQLARSRTVDNRDGSEIVHAARTSHSMALQPGQDALC 155

Query: 144 RDIEKRIADFTFFPL 158
           + IE RIA    +P+
Sbjct: 156 QRIEARIARLLEWPV 170


>gi|195390831|ref|XP_002054071.1| GJ22995 [Drosophila virilis]
 gi|194152157|gb|EDW67591.1| GJ22995 [Drosophila virilis]
          Length = 485

 Score = 53.1 bits (126), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 47/83 (56%), Gaps = 1/83 (1%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR-VRTSSGTFL 135
           +EV+  +P    +H+ LS  E   L  LA P ++++TV DS+ G     +  RTS G +L
Sbjct: 297 MEVLVVKPFIVAFHDVLSPHEIGELQQLAMPLLKRTTVYDSNAGLHGSVKGTRTSKGIWL 356

Query: 136 ARGRDKIIRDIEKRIADFTFFPL 158
           +R  + + + I +RI+D T F L
Sbjct: 357 SRSHNNLTKRIGRRISDMTGFHL 379


>gi|21106803|gb|AAM35580.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
           str. 306]
          Length = 306

 Score = 53.1 bits (126), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 39/75 (52%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V   FLS  EC+ LI LA P + +S  VD+  G+      RTS    L  G+D + 
Sbjct: 116 PRVVVLGGFLSDGECDALIALARPRLARSRTVDNANGEHMVHAARTSDSMCLRVGQDALC 175

Query: 144 RDIEKRIADFTFFPL 158
           + IE RIA    +P+
Sbjct: 176 QRIEARIARLFDWPV 190


>gi|308451420|ref|XP_003088665.1| CRE-PHY-2 protein [Caenorhabditis remanei]
 gi|308246199|gb|EFO90151.1| CRE-PHY-2 protein [Caenorhabditis remanei]
          Length = 609

 Score = 53.1 bits (126), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 24/78 (30%), Positives = 46/78 (58%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE++ ++P A ++ N +S  E + +  LA+P ++++TV +S TG+ + +  R S   +L 
Sbjct: 334 VEILRFDPLAVLFKNVISDSEIKVIKELASPKLKRATVQNSKTGELEHATYRISKSAWLK 393

Query: 137 RGRDKIIRDIEKRIADFT 154
                +I  + +RI DFT
Sbjct: 394 GDLHPVIERVNRRIEDFT 411


>gi|346724248|ref|YP_004850917.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346648995|gb|AEO41619.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 418

 Score = 53.1 bits (126), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 31/91 (34%), Positives = 42/91 (46%), Gaps = 9/91 (9%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR   Y   LS +EC  L+ LA PH+R S V+D +   +  + VRTS G  L    D II
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKVIDPNDASTGRAPVRTSHGATL----DPII 283

Query: 144 RD-----IEKRIADFTFFPLGMQFKFSLFFF 169
            D      + R+A     PL      S+  +
Sbjct: 284 EDFAARAAQSRLAACAQLPLAHAEPLSVLCY 314


>gi|224103711|ref|XP_002313164.1| predicted protein [Populus trichocarpa]
 gi|222849572|gb|EEE87119.1| predicted protein [Populus trichocarpa]
          Length = 294

 Score = 53.1 bits (126), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 38/128 (29%), Positives = 65/128 (50%), Gaps = 18/128 (14%)

Query: 63  MERSEGDEGRAEQWV-----------EVISWEPRAFVYHNFLSKEECEYLINLATPHMRK 111
           +E +EGD       V           +V+SW+PRA  +  F + E+CE +I +    ++ 
Sbjct: 57  LEVAEGDHDTMPHGVTGEASVESIPFQVLSWKPRALYFPKFATPEQCESIIKMVESKLKP 116

Query: 112 STVV--DSDTGKS-KDSRVRTSSGTFLARGRDK--IIRDIEKRIADFTFFPLGMQFKFSL 166
           ST+     +T +S KD+  RTSSG+F++   D+   +  IEK+IA  T  P      F++
Sbjct: 117 STLALRKGETAESTKDT--RTSSGSFVSGSEDETGTLDFIEKKIAKATMIPQSHGEAFNI 174

Query: 167 FFFYVLKK 174
             + + +K
Sbjct: 175 LRYEIGQK 182


>gi|418515355|ref|ZP_13081536.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB1386]
 gi|410708074|gb|EKQ66523.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB1386]
          Length = 216

 Score = 53.1 bits (126), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 39/75 (52%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V   FLS  EC+ LI LA P + +S  VD+  G+      RTS    L  G+D + 
Sbjct: 26  PRVVVLGGFLSDGECDALIALARPRLARSRTVDNANGEHLVHAARTSDSMCLRVGQDALC 85

Query: 144 RDIEKRIADFTFFPL 158
           + IE RIA    +P+
Sbjct: 86  QRIEARIARLFDWPV 100


>gi|308476969|ref|XP_003100699.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
 gi|308264511|gb|EFP08464.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
          Length = 573

 Score = 53.1 bits (126), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 24/78 (30%), Positives = 46/78 (58%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE++ ++P A ++ N +S  E + +  LA+P ++++TV +S TG+ + +  R S   +L 
Sbjct: 334 VEILRFDPLAVLFKNVISDSEIKVIKELASPKLKRATVQNSKTGELEHATYRISKSAWLK 393

Query: 137 RGRDKIIRDIEKRIADFT 154
                +I  + +RI DFT
Sbjct: 394 GDLHPVIERVNRRIEDFT 411


>gi|294889729|ref|XP_002772943.1| prolyl 4-hydroxylase alpha subunit, putative [Perkinsus marinus
           ATCC 50983]
 gi|239877523|gb|EER04759.1| prolyl 4-hydroxylase alpha subunit, putative [Perkinsus marinus
           ATCC 50983]
          Length = 383

 Score = 53.1 bits (126), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 33/88 (37%), Positives = 46/88 (52%), Gaps = 9/88 (10%)

Query: 79  VISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTV---------VDSDTGKSKDSRVRT 129
           VI   P+  +  +FL+ EECEY+I+LA    R STV           SD   +K S+ RT
Sbjct: 170 VICRSPKVRLVPDFLTPEECEYMISLAEGKWRPSTVGRSSSSISDGKSDKYVNKRSKGRT 229

Query: 130 SSGTFLARGRDKIIRDIEKRIADFTFFP 157
           SS   L   +D ++ +IE+R A    FP
Sbjct: 230 SSSFMLLHSQDDVVAEIERRAASLVGFP 257


>gi|78046960|ref|YP_363135.1| hypothetical protein XCV1404 [Xanthomonas campestris pv.
           vesicatoria str. 85-10]
 gi|78035390|emb|CAJ23035.1| conserved hypothetical protein [Xanthomonas campestris pv.
           vesicatoria str. 85-10]
          Length = 418

 Score = 52.8 bits (125), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 31/91 (34%), Positives = 42/91 (46%), Gaps = 9/91 (9%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR   Y   LS +EC  L+ LA PH+R S V+D +   +  + VRTS G  L    D II
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKVIDPNDASTGRAPVRTSHGATL----DPII 283

Query: 144 RD-----IEKRIADFTFFPLGMQFKFSLFFF 169
            D      + R+A     PL      S+  +
Sbjct: 284 EDFAARAAQSRLAACAQLPLAHAEPLSVLCY 314


>gi|388523073|gb|AFK49598.1| unknown [Lotus japonicus]
          Length = 318

 Score = 52.8 bits (125), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 34/101 (33%), Positives = 57/101 (56%), Gaps = 5/101 (4%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR--VRTSSGTFL 135
           +V+SW P A  + NF + E+CE +I  A   ++ ST+V    G++ +S   +RTSSG F+
Sbjct: 93  QVLSWNPHALYFPNFATAEQCESIIETAKEGLKPSTLV-LRVGETDESTTGIRTSSGVFI 151

Query: 136 ARGRDK--IIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +   DK  ++  IE++IA  T  P      F++  + V +K
Sbjct: 152 SAFEDKTGVLDVIEEKIARATKIPRTHGEAFNVLRYKVGQK 192


>gi|311032645|ref|ZP_07710735.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
           m3-13]
          Length = 137

 Score = 52.8 bits (125), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 34/104 (32%), Positives = 50/104 (48%), Gaps = 17/104 (16%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           EP   V  N LS EEC+ LI L+   M++S V     G   D+ +RTSS TF   G +++
Sbjct: 38  EPLVVVLANVLSDEECDALIRLSKDKMKRSKV---SNGLEVDA-IRTSSSTFFHEGENEL 93

Query: 143 IRDIEKRIADFTFFP-------------LGMQFKFSLFFFYVLK 173
           +  IEKR++     P             +G ++K    FF V +
Sbjct: 94  VARIEKRVSQIMNVPVEHGEGLQILNYQVGQEYKAHFDFFQVFQ 137


>gi|91778899|ref|YP_554107.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
 gi|91691559|gb|ABE34757.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
          Length = 292

 Score = 52.8 bits (125), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 42/75 (56%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P+  V+ + LS +EC  +I  +   +++ST V+  TGK    R RTS G +  RG D  I
Sbjct: 103 PQVIVFADVLSPDECAEMIERSRHRLKRSTTVNPATGKEDVIRNRTSEGIWYQRGEDPFI 162

Query: 144 RDIEKRIADFTFFPL 158
             +++RI+    +P+
Sbjct: 163 ERMDRRISSLMNWPV 177


>gi|157111033|ref|XP_001651361.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
 gi|108878552|gb|EAT42777.1| AAEL005714-PA, partial [Aedes aegypti]
          Length = 522

 Score = 52.8 bits (125), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 26/78 (33%), Positives = 44/78 (56%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E +  +P+  ++H+ LS  E E L  LA P + ++T+ +  TGK++ S+ R S  ++  
Sbjct: 322 LEEMHLKPKIVIFHDVLSDTEIELLKRLAKPILERATIANQQTGKAERSKDRVSKSSWFP 381

Query: 137 RGRDKIIRDIEKRIADFT 154
                 IR I KR+AD T
Sbjct: 382 DEYHSTIRTITKRVADMT 399


>gi|195341590|ref|XP_002037389.1| GM12139 [Drosophila sechellia]
 gi|194131505|gb|EDW53548.1| GM12139 [Drosophila sechellia]
          Length = 525

 Score = 52.4 bits (124), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 25/82 (30%), Positives = 47/82 (57%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E++  EP   +YH+ LS +E   L  +ATP ++++TV  + +G+++  + RTS   +  
Sbjct: 322 MELVGLEPYMVLYHDVLSPKEITELQGMATPGLKRATVYQASSGRNEVVKTRTSKVAWFP 381

Query: 137 RGRDKIIRDIEKRIADFTFFPL 158
            G + +   +  RI+D T F L
Sbjct: 382 DGYNPLTVRLNARISDMTGFNL 403


>gi|384250599|gb|EIE24078.1| hypothetical protein COCSUDRAFT_47131 [Coccomyxa subellipsoidea
           C-169]
          Length = 327

 Score = 52.4 bits (124), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 30/103 (29%), Positives = 53/103 (51%), Gaps = 3/103 (2%)

Query: 68  GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDT-GKSKDSR 126
           G++  + Q  ++ISW PR  +Y  F+  E C++ + +A   +  S +    T G  +   
Sbjct: 100 GNDFYSVQPQQLISWYPRIILYPGFIDPERCKHFVKVAKARLAPSGLALRTTEGPQETEN 159

Query: 127 VRTSSGTFLARGRDK--IIRDIEKRIADFTFFPLGMQFKFSLF 167
           VRTS GTF++R  D   +I  +E++ A  T  P+     F++ 
Sbjct: 160 VRTSQGTFMSRKDDPAGVIAWVEEKAAQVTGLPVSHGEPFNVL 202


>gi|333981907|ref|YP_004511117.1| procollagen-proline dioxygenase [Methylomonas methanica MC09]
 gi|333805948|gb|AEF98617.1| Procollagen-proline dioxygenase [Methylomonas methanica MC09]
          Length = 286

 Score = 52.4 bits (124), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 30/99 (30%), Positives = 47/99 (47%), Gaps = 13/99 (13%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P   V   F+S EECE LI  +   +  S +VD  TGK +    R+S GT+  RG   +I
Sbjct: 96  PDIVVVDEFMSGEECEQLIEQSRRKLTPSAIVDPQTGKFQVIADRSSEGTYFQRGESPLI 155

Query: 144 RDIEKRIADFTFFP-------------LGMQFKFSLFFF 169
             +++RI++   +P             +G Q+K    +F
Sbjct: 156 SRLDRRISELMNWPEDHGEGIQILHYGVGAQYKPHFDYF 194


>gi|385205097|ref|ZP_10031967.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
 gi|385184988|gb|EIF34262.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
          Length = 292

 Score = 52.4 bits (124), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 42/75 (56%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P+  V+ + LS +EC  +I  +   +++ST V+  TGK    R RTS G +  RG D  I
Sbjct: 103 PQMIVFADVLSPDECAEMIERSRHRLKRSTTVNPATGKEDVIRNRTSEGIWYQRGEDPFI 162

Query: 144 RDIEKRIADFTFFPL 158
             +++RI+    +P+
Sbjct: 163 ERMDRRISSLMNWPV 177


>gi|302845120|ref|XP_002954099.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
           nagariensis]
 gi|300260598|gb|EFJ44816.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
           nagariensis]
          Length = 231

 Score = 52.4 bits (124), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 32/96 (33%), Positives = 51/96 (53%), Gaps = 5/96 (5%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD--SRVRTSSGTFL 135
           +++SW PR  V+  F+ K   EY+I LA+  M  S +     G++ D   + RTS+GTFL
Sbjct: 18  QILSWYPRVVVFPGFIDKARAEYVIKLASKFMYPSGLA-YRPGETVDPSQQTRTSTGTFL 76

Query: 136 ARGRDK--IIRDIEKRIADFTFFPLGMQFKFSLFFF 169
           A   D   ++  +E+RIA  T  P      F++  +
Sbjct: 77  AAAMDPEGVLGWVEQRIAAATLLPAENGEAFNVLHY 112


>gi|21711777|gb|AAM75079.1| RE70601p [Drosophila melanogaster]
          Length = 316

 Score = 52.4 bits (124), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 24/82 (29%), Positives = 48/82 (58%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E++  +P   +YH+ LS +E + L  +ATP ++++TV  + +G+++  + RTS   +  
Sbjct: 113 MELVGLDPYMVLYHDVLSPKEIKELQGMATPSLKRATVYQASSGRNEVVKTRTSKVAWFP 172

Query: 137 RGRDKIIRDIEKRIADFTFFPL 158
            G + +   +  RI+D T F L
Sbjct: 173 DGYNPLTVRLNARISDMTGFNL 194


>gi|325267002|ref|ZP_08133672.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
 gi|324981502|gb|EGC17144.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
          Length = 279

 Score = 52.4 bits (124), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 40/82 (48%), Gaps = 4/82 (4%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P   V  NF++ EEC  LI LA   +  +TVVD  TG+    + RTS     AR    +I
Sbjct: 91  PEVVVLDNFITAEECAQLIALAEGKVEDATVVDPATGEFVKHQDRTSMNAAFARAEHPLI 150

Query: 144 RDIEKRIADFTFFPL----GMQ 161
             +E RIA    +P     GMQ
Sbjct: 151 ARLEARIAAAIHWPAENGEGMQ 172


>gi|420246706|ref|ZP_14750139.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
 gi|398073616|gb|EJL64785.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
          Length = 282

 Score = 52.0 bits (123), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 33/123 (26%), Positives = 55/123 (44%), Gaps = 18/123 (14%)

Query: 36  AFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSK 95
           A+   + P ++G++  A+D    VR   ER                  P+   + + LS 
Sbjct: 63  AYHYDACPVAAGNTVHAHDRDVTVRIRFER------------------PQVIAFDDVLSG 104

Query: 96  EECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTF 155
           EEC  LI  A   +++ST V+ + G     ++RTS G +  R  D  I  ++ RI+    
Sbjct: 105 EECAELIERARHRLKRSTTVNPENGSEDVIQLRTSEGFWFQRCEDAFIERLDHRISALMN 164

Query: 156 FPL 158
           +PL
Sbjct: 165 WPL 167


>gi|390570433|ref|ZP_10250698.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
 gi|389937613|gb|EIM99476.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
          Length = 285

 Score = 52.0 bits (123), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 33/123 (26%), Positives = 55/123 (44%), Gaps = 18/123 (14%)

Query: 36  AFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSK 95
           A+   + P ++G++  A+D    VR   ER                  P+   + + LS 
Sbjct: 66  AYHYDACPVAAGNTVHAHDRDVTVRIRFER------------------PQVIAFDDVLSG 107

Query: 96  EECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTF 155
           EEC  LI  A   +++ST V+ + G     ++RTS G +  R  D  I  ++ RI+    
Sbjct: 108 EECAELIERARHRLKRSTTVNPENGSEDVIQLRTSEGFWFQRCEDAFIERLDHRISALMN 167

Query: 156 FPL 158
           +PL
Sbjct: 168 WPL 170


>gi|242051901|ref|XP_002455096.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
 gi|241927071|gb|EES00216.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
          Length = 303

 Score = 52.0 bits (123), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 41/78 (52%), Gaps = 10/78 (12%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           +SW PR F+Y  FLS  EC++LI++A    + S VV    G +       S G  +    
Sbjct: 62  LSWHPRVFLYEGFLSDMECDHLISMAHGKKQSSLVVGGSAGNN-------SQGASI---E 111

Query: 140 DKIIRDIEKRIADFTFFP 157
           D I+  IE RI+ ++F P
Sbjct: 112 DTIVSTIEDRISVWSFLP 129


>gi|389775678|ref|ZP_10193553.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
 gi|388437120|gb|EIL93940.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
          Length = 284

 Score = 52.0 bits (123), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 40/86 (46%), Gaps = 1/86 (1%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P   V  N L+ EECE LI LA P ++++  V SD     D R RTS G F       ++
Sbjct: 95  PALRVLENLLAAEECEELIALAQPRLKRALTVASDGSNQVDQR-RTSEGMFFTLNELPLV 153

Query: 144 RDIEKRIADFTFFPLGMQFKFSLFFF 169
             IE+R+A     P+       +  +
Sbjct: 154 GRIEQRLATLLGMPVSHGEGLQILHY 179


>gi|9294584|dbj|BAB02865.1| unnamed protein product [Arabidopsis thaliana]
          Length = 328

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 22/44 (50%), Positives = 33/44 (75%)

Query: 114 VVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFP 157
           V D D+G+S+DS VRTSSG FL + +D I+ ++E ++A +TF P
Sbjct: 3   VADVDSGESEDSEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLP 46


>gi|352086439|ref|ZP_08953941.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
 gi|389799401|ref|ZP_10202396.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
 gi|351679404|gb|EHA62545.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
 gi|388442818|gb|EIL98985.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
          Length = 284

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 40/74 (54%), Gaps = 1/74 (1%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P   V  N LS +ECE LI LA P ++++  VDS+ G+ +  R RTS G F       ++
Sbjct: 95  PALRVLENILSTQECEELIALARPRLQRALTVDSE-GRQQVDRRRTSEGMFFTLNEVPLV 153

Query: 144 RDIEKRIADFTFFP 157
             IE+R+A     P
Sbjct: 154 GRIEQRLAALLRVP 167


>gi|389793983|ref|ZP_10197143.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
 gi|388433014|gb|EIL89992.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
          Length = 282

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/74 (37%), Positives = 39/74 (52%), Gaps = 1/74 (1%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P   V    LS+ EC  LI LA P ++++  VDSD GK +  + RTS G F   G   ++
Sbjct: 93  PALRVLDGLLSERECADLIELARPRLQRALTVDSD-GKQQIDQRRTSEGMFFRAGETPLV 151

Query: 144 RDIEKRIADFTFFP 157
             IE+R+A     P
Sbjct: 152 AAIEQRLAQLLGVP 165


>gi|302802700|ref|XP_002983104.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
 gi|300149257|gb|EFJ15913.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
          Length = 292

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 42/168 (25%), Positives = 82/168 (48%), Gaps = 9/168 (5%)

Query: 16  SSTLILTLLIMFTFAILILLAFGIL------SMPSSSGDSRKANDLSSIVRKSMERSEGD 69
           S+ L+L L ++       +  F  +      S+PS S  + + ++L   V  +     GD
Sbjct: 15  SAPLVLVLCVLAFLVGYFIPEFQQVILVTKHSIPSFSSFANQRHELLEDVTVAEHGVTGD 74

Query: 70  EGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR-VR 128
           +  +    +V+SW PRA ++  F S  +CE +I+LA   +  S++       + +++ VR
Sbjct: 75  DQLSFIPFQVLSWTPRALLFPKFASPAQCEAIISLAKTKLTPSSLALRKGETATETQDVR 134

Query: 129 TSSGTFLARGRDK--IIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           TS G FL+  +DK   +  +E+++A  T  P      F++  + + +K
Sbjct: 135 TSHGCFLSSRQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEIGQK 182


>gi|195505255|ref|XP_002099425.1| GE23368 [Drosophila yakuba]
 gi|194185526|gb|EDW99137.1| GE23368 [Drosophila yakuba]
          Length = 528

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 25/82 (30%), Positives = 46/82 (56%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E++  +P   +YH+ LS +E + L  +ATP ++++TV  + +G+++  R RTS   +  
Sbjct: 325 MELVGLDPYMVLYHDVLSAKEIKELQGMATPGLKRATVFQAASGRNEVVRTRTSKVAWFP 384

Query: 137 RGRDKIIRDIEKRIADFTFFPL 158
            G   +   +  RI D T F L
Sbjct: 385 DGYSPLTVRLNARITDMTGFNL 406


>gi|219121927|ref|XP_002181308.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407294|gb|EEC47231.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 226

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 33/100 (33%), Positives = 46/100 (46%), Gaps = 3/100 (3%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKS--TVVDSDTGKSKDSRVRTSSGTF 134
           +E +S  P       FLS +EC Y+   A PHM  S  T++D D G+   S  RTS   F
Sbjct: 7   LETLSLVPLVLSVEGFLSDDECTYIQETAEPHMEYSEVTLMDKDQGRPA-SDFRTSQSAF 65

Query: 135 LARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +    D I+ DI+ R A     P   Q    +  + V +K
Sbjct: 66  IRAHDDAILTDIDYRTASLVRIPRRHQEDVQVLRYDVTEK 105


>gi|24651477|ref|NP_733395.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
 gi|20269812|gb|AAM18061.1|AF495539_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]PV [Drosophila
           melanogaster]
 gi|23172718|gb|AAN14252.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
          Length = 525

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 24/82 (29%), Positives = 48/82 (58%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E++  +P   +YH+ LS +E + L  +ATP ++++TV  + +G+++  + RTS   +  
Sbjct: 322 MELVGLDPYMVLYHDVLSPKEIKELQGMATPGLKRATVYQASSGRNEVVKTRTSKVAWFP 381

Query: 137 RGRDKIIRDIEKRIADFTFFPL 158
            G + +   +  RI+D T F L
Sbjct: 382 DGYNPLTVRLNARISDMTGFNL 403


>gi|321474875|gb|EFX85839.1| hypothetical protein DAPPUDRAFT_309105 [Daphnia pulex]
          Length = 545

 Score = 51.6 bits (122), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 24/72 (33%), Positives = 38/72 (52%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR  VYH+ +S EE E +  LA P   ++TV   ++G+ + SR R +   +L       
Sbjct: 345 KPRIVVYHDIISDEEIETIKRLAQPRFERATVQKKESGEREFSRYRIAKSAWLKHEEHDY 404

Query: 143 IRDIEKRIADFT 154
           + DI  R+ D T
Sbjct: 405 VSDINFRVGDIT 416


>gi|410295850|gb|JAA26525.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
 gi|410295854|gb|JAA26527.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 534

 Score = 51.2 bits (121), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 24/72 (33%), Positives = 41/72 (56%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H+ +S  E E + +LA P +R++TV D +TGK   ++ R S   +L+   + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 394 VSRINMRIQDLT 405


>gi|294666178|ref|ZP_06731433.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292604043|gb|EFF47439.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 296

 Score = 51.2 bits (121), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 38/75 (50%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P   V   FLS  EC+ LI LA P + +S  VD+  G+      RTS    L  G+D + 
Sbjct: 106 PCVVVLGGFLSGGECDALIALARPRLARSRTVDNANGEHVVHAARTSDSMCLRVGQDALC 165

Query: 144 RDIEKRIADFTFFPL 158
           + IE RIA    +P+
Sbjct: 166 QRIEARIARLLDWPV 180


>gi|195159148|ref|XP_002020444.1| GL13996 [Drosophila persimilis]
 gi|194117213|gb|EDW39256.1| GL13996 [Drosophila persimilis]
          Length = 559

 Score = 51.2 bits (121), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 29/83 (34%), Positives = 46/83 (55%), Gaps = 5/83 (6%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS----- 131
           +E +S +P   +YHN LS EE   L N++TP + ++ + D +T K K S VR++      
Sbjct: 343 MEELSLDPYIVLYHNVLSDEEMARLENMSTPLLHRARIFDKETKKPKISPVRSADEVGIP 402

Query: 132 GTFLARGRDKIIRDIEKRIADFT 154
              L  G  +++  I+KRI D T
Sbjct: 403 NPKLVTGDIQLVECIQKRITDLT 425


>gi|195110931|ref|XP_002000033.1| GI24862 [Drosophila mojavensis]
 gi|193916627|gb|EDW15494.1| GI24862 [Drosophila mojavensis]
          Length = 549

 Score = 51.2 bits (121), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 27/78 (34%), Positives = 47/78 (60%), Gaps = 2/78 (2%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE +S +P   ++H+ + + E + L+ LA   + ++TV   ++  S  S  RTS  TFL 
Sbjct: 327 VEELSHDPLLVLFHDVIYQSEIDTLMRLAKNKIHRATVTGHNS--SVVSNARTSQFTFLP 384

Query: 137 RGRDKIIRDIEKRIADFT 154
           + R K++R I++R+AD T
Sbjct: 385 KTRHKVLRTIDQRVADMT 402


>gi|294627644|ref|ZP_06706226.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
 gi|292597996|gb|EFF42151.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
          Length = 296

 Score = 51.2 bits (121), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/75 (37%), Positives = 38/75 (50%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P   V   FLS  EC+ LI LA P + +S  VD+  G+      RTS    L  G+D + 
Sbjct: 106 PCVVVLGGFLSGGECDALIALARPRLARSRTVDNANGEHVVHAARTSDSMCLRVGQDALC 165

Query: 144 RDIEKRIADFTFFPL 158
           + IE RIA    +P+
Sbjct: 166 QRIEARIARLLDWPV 180


>gi|194765168|ref|XP_001964699.1| GF22909 [Drosophila ananassae]
 gi|190614971|gb|EDV30495.1| GF22909 [Drosophila ananassae]
          Length = 525

 Score = 51.2 bits (121), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 30/93 (32%), Positives = 51/93 (54%), Gaps = 7/93 (7%)

Query: 69  DEGRAEQWV-----EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSK 123
           DE     W+     E +S +P   +YH+ + + E + +  L T  ++++T+  + T +S 
Sbjct: 282 DETHPFLWIAPLKAEELSRDPLLILYHDVIYQSEIDTIRKLTTNKLKRATI--TSTNESV 339

Query: 124 DSRVRTSSGTFLARGRDKIIRDIEKRIADFTFF 156
            S VRTS  TFL    DK++  I++R+AD T F
Sbjct: 340 VSNVRTSQFTFLPVTEDKVLATIDRRVADMTNF 372


>gi|289662828|ref|ZP_06484409.1| hypothetical protein XcampvN_06993, partial [Xanthomonas campestris
           pv. vasculorum NCPPB 702]
          Length = 301

 Score = 51.2 bits (121), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/63 (41%), Positives = 35/63 (55%), Gaps = 4/63 (6%)

Query: 89  YHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEK 148
           Y   LS +EC  L+ LA PH+R S V+D +   ++ + VRTS G  L    D II D   
Sbjct: 116 YAGVLSADECRLLMLLARPHLRDSQVIDPNDASTQRAPVRTSRGATL----DPIIEDFAA 171

Query: 149 RIA 151
           R+A
Sbjct: 172 RVA 174


>gi|319652187|ref|ZP_08006306.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
 gi|317396176|gb|EFV76895.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
          Length = 283

 Score = 51.2 bits (121), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 29/83 (34%), Positives = 46/83 (55%), Gaps = 2/83 (2%)

Query: 77  VEVISWEPRAFVYH--NFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTF 134
           V+V+S   + FV H    LS EEC+ LI+L+   ++ S VVD  +G+ +    RTS    
Sbjct: 87  VKVLSRNEKPFVLHLDQVLSSEECDELISLSRSRLQPSLVVDRGSGEERAGSGRTSKSMA 146

Query: 135 LARGRDKIIRDIEKRIADFTFFP 157
                ++++  IE RIA+ T +P
Sbjct: 147 FRLKENELVERIETRIAELTGYP 169


>gi|405965633|gb|EKC30995.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
          Length = 617

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 24/77 (31%), Positives = 43/77 (55%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           E +   P   +YH+ +S +E + +  +ATP + ++TV +  TGK + +  R S   +L  
Sbjct: 412 EEVYLNPWIVIYHDVVSDKEIDTIKRIATPLLSRATVHNPRTGKLETAEYRVSKSAWLKD 471

Query: 138 GRDKIIRDIEKRIADFT 154
           G D +I ++  RI+D T
Sbjct: 472 GDDPVIHNVNNRISDIT 488


>gi|195575145|ref|XP_002105540.1| GD16902 [Drosophila simulans]
 gi|194201467|gb|EDX15043.1| GD16902 [Drosophila simulans]
          Length = 525

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 24/82 (29%), Positives = 47/82 (57%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E++  +P   +YH+ LS +E   L  +ATP ++++TV  + +G+++  + RTS   +  
Sbjct: 322 MELVGLDPYMVLYHDVLSPKEITELQGMATPGLKRATVYQASSGRNEVVKTRTSKVAWFP 381

Query: 137 RGRDKIIRDIEKRIADFTFFPL 158
            G + +   +  RI+D T F L
Sbjct: 382 DGYNPLTVRLNARISDMTGFNL 403


>gi|297798522|ref|XP_002867145.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297312981|gb|EFH43404.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 288

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 30/110 (27%), Positives = 58/110 (52%), Gaps = 3/110 (2%)

Query: 68  GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR- 126
           G+E       +V+SW PRA  + NF + E+C+ +I  A  +++ S +       +++++ 
Sbjct: 69  GEESVGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKG 128

Query: 127 VRTSSGTFLARGRDKI--IRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
            RTSSGTF++   D    +  +E++IA  T  P      F++  + + +K
Sbjct: 129 TRTSSGTFISASEDSTGALDFVERKIARATMIPRSHGESFNILRYELGQK 178


>gi|421749438|ref|ZP_16186877.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
 gi|409771699|gb|EKN53918.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
          Length = 319

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 37/75 (49%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  ++   L  +ECE LI L+   + +S VV+ DTG       RTS G     G   +I
Sbjct: 127 PRIALFQRLLMPDECEALIALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQVGEHPLI 186

Query: 144 RDIEKRIADFTFFPL 158
             +E RIA  T  P+
Sbjct: 187 ERLEARIAAVTGVPV 201


>gi|348688210|gb|EGZ28024.1| hypothetical protein PHYSODRAFT_321730 [Phytophthora sojae]
          Length = 487

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/92 (30%), Positives = 47/92 (51%), Gaps = 1/92 (1%)

Query: 71  GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR-VRT 129
           GR +  +E IS  P  F    FL  +E + ++ L+ PH+  S V   D  +++ +   RT
Sbjct: 262 GRGDLVMETISMTPLVFSVEEFLRDDEIDVVLELSMPHLAPSGVTLQDGHENRPATDWRT 321

Query: 130 SSGTFLARGRDKIIRDIEKRIADFTFFPLGMQ 161
           S+  +L      +++DI+KR AD    P+  Q
Sbjct: 322 STTYWLESSSHPVVQDIDKRTADLVKVPISHQ 353


>gi|51036657|ref|NP_742059.2| prolyl 4-hydroxylase subunit alpha-1 precursor [Rattus norvegicus]
 gi|90111077|sp|P54001.2|P4HA1_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|50927553|gb|AAH78703.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [Rattus norvegicus]
 gi|149038787|gb|EDL93076.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_a
           [Rattus norvegicus]
          Length = 534

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 25/74 (33%), Positives = 41/74 (55%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   D
Sbjct: 332 WDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYED 391

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 392 PVVSRINMRIQDLT 405


>gi|195341588|ref|XP_002037388.1| GM12140 [Drosophila sechellia]
 gi|194131504|gb|EDW53547.1| GM12140 [Drosophila sechellia]
          Length = 534

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 30/95 (31%), Positives = 48/95 (50%), Gaps = 1/95 (1%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E I  +P   +YH  LS  E   LI  AT +M K+T V  + G  K +R RT+ G +  
Sbjct: 319 IEQIGLDPYVVLYHEVLSAREISMLIGKATQNM-KNTRVHKEQGVPKKNRGRTAKGFWFK 377

Query: 137 RGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYV 171
           +  +++ + I +RI D T F L     F +  + +
Sbjct: 378 KESNELTKGITRRIMDMTGFDLADSEGFQVINYGI 412


>gi|474940|emb|CAA55546.1| gamma-butyrobetaine,2-oxoglutarate dioxygenase [Rattus norvegicus]
          Length = 534

 Score = 50.8 bits (120), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 25/74 (33%), Positives = 41/74 (55%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   D
Sbjct: 332 WDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYED 391

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 392 PVVSRINMRIQDLT 405


>gi|354483223|ref|XP_003503794.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
           [Cricetulus griseus]
          Length = 534

 Score = 50.4 bits (119), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 25/74 (33%), Positives = 41/74 (55%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   D
Sbjct: 332 WDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYED 391

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 392 PVVSRINMRIQDLT 405


>gi|319652240|ref|ZP_08006358.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
 gi|317396063|gb|EFV76783.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
          Length = 216

 Score = 50.4 bits (119), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 28/76 (36%), Positives = 41/76 (53%), Gaps = 4/76 (5%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           EP   +  N LS EEC+ LI  +   M++S V +S     +   +RTSS TF   G ++I
Sbjct: 37  EPLIVILGNVLSDEECDQLIQQSKDRMQRSKVANS----LEVDELRTSSSTFFHEGENEI 92

Query: 143 IRDIEKRIADFTFFPL 158
           +  IEKRI+     P+
Sbjct: 93  VARIEKRISQIMNIPV 108


>gi|26336999|dbj|BAC32183.1| unnamed protein product [Mus musculus]
 gi|148700261|gb|EDL32208.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_b [Mus
           musculus]
          Length = 534

 Score = 50.4 bits (119), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 24/72 (33%), Positives = 40/72 (55%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPV 393

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 394 VSRINMRIQDLT 405


>gi|389770666|ref|ZP_10192118.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
 gi|388429637|gb|EIL86932.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
          Length = 286

 Score = 50.4 bits (119), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 40/76 (52%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +P   V    LS EEC+ LI  A   +++ST+VD  TGK +    R+S GTF     D  
Sbjct: 94  QPVLAVLDGVLSHEECDELIRRAAAKLQRSTIVDPTTGKHETIADRSSEGTFFEINADDF 153

Query: 143 IRDIEKRIADFTFFPL 158
           I  +++RI+     P+
Sbjct: 154 IARLDRRISALMNLPV 169


>gi|357135727|ref|XP_003569460.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 2
           [Brachypodium distachyon]
          Length = 314

 Score = 50.4 bits (119), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 47/82 (57%), Gaps = 8/82 (9%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSD----TGKSKDSRVRTSSGTFL 135
           ++W PR F+Y  FLS  EC++L+ +A  ++  S +V++     T  S D+R +      L
Sbjct: 63  LAWHPRVFLYEGFLSGMECDHLVYVARLNIESSLLVNAGARNITQNSTDARFKFQ----L 118

Query: 136 ARGRDKIIRDIEKRIADFTFFP 157
           A  +D ++  IE RI+ ++F P
Sbjct: 119 ADSKDIVVSKIEDRISLWSFIP 140


>gi|339236271|ref|XP_003379690.1| prolyl 4-hydroxylase subunit alpha-1 [Trichinella spiralis]
 gi|316977627|gb|EFV60702.1| prolyl 4-hydroxylase subunit alpha-1 [Trichinella spiralis]
          Length = 558

 Score = 50.4 bits (119), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 22/74 (29%), Positives = 43/74 (58%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VEV+ W+P+   +   +S EE   +  LA+P ++++TV ++DTG+ + +  R S   +L 
Sbjct: 329 VEVMHWKPKIVYFRGVISDEEIAVIKQLASPLLKRATVHNADTGQLETASYRISKSAWLK 388

Query: 137 RGRDKIIRDIEKRI 150
               ++++ I  RI
Sbjct: 389 DTEHEVVKRISDRI 402


>gi|149180354|ref|ZP_01858859.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
 gi|148852546|gb|EDL66691.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
          Length = 212

 Score = 50.4 bits (119), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 49/100 (49%), Gaps = 17/100 (17%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           EP   V  N LS EEC+ LI L+   +++S + ++      ++ +RTSS TF+  G  ++
Sbjct: 36  EPLIVVLGNVLSDEECDALIGLSKDKLKRSKIGNT----RNENDMRTSSSTFMEEGESEV 91

Query: 143 IRDIEKRIADFTFFP-------------LGMQFKFSLFFF 169
           +  +EKRI+     P             +G ++K    FF
Sbjct: 92  VTRVEKRISQIMNIPYENGEGLQILNYKIGQEYKAHFDFF 131


>gi|159489502|ref|XP_001702736.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280758|gb|EDP06515.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 231

 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 50/95 (52%), Gaps = 3/95 (3%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDS-RVRTSSGTFLA 136
           +++SW PR  V+  F+ K   E+++ LA   M  S +      + + S + RTS+GTFL+
Sbjct: 18  QILSWYPRIVVFPGFIDKARAEHIVKLAGKFMYPSGLAYRPGEQVESSQQTRTSTGTFLS 77

Query: 137 RGRD--KIIRDIEKRIADFTFFPLGMQFKFSLFFF 169
            G D   ++  +E+RIA  T  P      F++  +
Sbjct: 78  SGMDTEGVLGWVEQRIAAATLLPADNGEAFNVLHY 112


>gi|297727581|ref|NP_001176154.1| Os10g0415128 [Oryza sativa Japonica Group]
 gi|255679404|dbj|BAH94882.1| Os10g0415128 [Oryza sativa Japonica Group]
          Length = 241

 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 23/44 (52%), Positives = 33/44 (75%)

Query: 114 VVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFP 157
           V D+++GKS  S VRTSSG FL + +D+++  IE+RIA +TF P
Sbjct: 2   VADNESGKSVMSEVRTSSGMFLEKKQDEVVARIEERIAAWTFLP 45


>gi|345305838|ref|XP_001508476.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Ornithorhynchus
           anatinus]
          Length = 493

 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 40/74 (54%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   YH  +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   D
Sbjct: 291 WDKPRIVRYHEIISDAEIETVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYED 350

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 351 PVVSRINMRIQDLT 364


>gi|389728965|ref|ZP_10189244.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
 gi|388441204|gb|EIL97500.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
          Length = 285

 Score = 50.1 bits (118), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 38/86 (44%), Gaps = 1/86 (1%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P   V+   LS +EC  LI LA P ++++  V  D  +  D   RTS G F   G   +I
Sbjct: 95  PPLRVFDGLLSDDECAALIELAKPRLQRARTVAEDGAQQIDEH-RTSDGMFFGLGEQPLI 153

Query: 144 RDIEKRIADFTFFPLGMQFKFSLFFF 169
             IE RIA     P+       +  +
Sbjct: 154 ERIEARIAALLGIPVDHGEGLQVLHY 179


>gi|440635830|gb|ELR05749.1| hypothetical protein GMDG_07591 [Geomyces destructans 20631-21]
          Length = 264

 Score = 50.1 bits (118), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 41/78 (52%), Gaps = 1/78 (1%)

Query: 79  VISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
           ++S++P      NF++  E  YL N+   ++R+STV   D G       RTSS  FL   
Sbjct: 61  IVSYDPLIIHLENFITPAERAYLANVTNVNLRRSTVASRDGGPPVHKPSRTSSTAFLPH- 119

Query: 139 RDKIIRDIEKRIADFTFF 156
            D + + I+KR ADF  F
Sbjct: 120 NDTVGQCIQKRAADFQGF 137


>gi|73542634|ref|YP_297154.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
           eutropha JMP134]
 gi|72120047|gb|AAZ62310.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
           eutropha JMP134]
          Length = 282

 Score = 50.1 bits (118), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 37/75 (49%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P   +Y + LS  EC+ L+ LA   + +S V++ DTG       RTS G     G   +I
Sbjct: 90  PSIRLYQHLLSDAECDALVELARGRLARSPVINPDTGDENLIDARTSMGAMFQVGEHTLI 149

Query: 144 RDIEKRIADFTFFPL 158
           + IE RIA     P+
Sbjct: 150 QRIEDRIAAVLGVPV 164


>gi|195391766|ref|XP_002054531.1| GJ24504 [Drosophila virilis]
 gi|194152617|gb|EDW68051.1| GJ24504 [Drosophila virilis]
          Length = 545

 Score = 50.1 bits (118), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 26/78 (33%), Positives = 45/78 (57%), Gaps = 2/78 (2%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE +S +P   +YH+ + + E + L  L    + ++TV  ++   S  S  RTS  TF+ 
Sbjct: 323 VEELSHDPLLVLYHDVIYQSEIDTLAKLTKNKIHRATVTGNNA--SVVSNARTSQFTFIP 380

Query: 137 RGRDKIIRDIEKRIADFT 154
           + R K++R I++R+AD T
Sbjct: 381 KTRHKVLRTIDQRVADMT 398


>gi|170591592|ref|XP_001900554.1| prolyl 4-hydroxylase [Brugia malayi]
 gi|16415740|emb|CAC82616.1| prolyl 4-hydroxylase [Brugia malayi]
 gi|21425621|emb|CAD19314.1| prolyl 4-hydroxylase [Brugia malayi]
 gi|158592166|gb|EDP30768.1| prolyl 4-hydroxylase, putative [Brugia malayi]
          Length = 541

 Score = 49.7 bits (117), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 24/74 (32%), Positives = 44/74 (59%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE++ + P A ++ + ++ EE   +  LATP +R++TV +S TG+ + +  RTS   +L 
Sbjct: 325 VEILRFNPLAVLFRDVITDEEVTMIQMLATPRLRRATVQNSITGELETASYRTSKSAWLK 384

Query: 137 RGRDKIIRDIEKRI 150
               +++  I KRI
Sbjct: 385 DEEHEVVHRINKRI 398


>gi|357453665|ref|XP_003597113.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|357482683|ref|XP_003611628.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355486161|gb|AES67364.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355512963|gb|AES94586.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
          Length = 294

 Score = 49.7 bits (117), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 28/100 (28%), Positives = 55/100 (55%), Gaps = 3/100 (3%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR-VRTSSGTFLA 136
           +V+SW PRA  + NF S E+C+ +I +A   +  S ++  +    + ++ +RTSSG F++
Sbjct: 84  QVLSWNPRALYFPNFASAEQCDRIIEMAKAELSPSRLMLREGETEEGTKGIRTSSGMFIS 143

Query: 137 RGRDK--IIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
              DK  ++  I+++IA     P      +++  + V +K
Sbjct: 144 ASEDKTGLLEVIDEKIARAAKIPKTHGGAYNILRYKVGQK 183


>gi|427795421|gb|JAA63162.1| Putative prolyl-4-hydroxylase-alpha efb, partial [Rhipicephalus
           pulchellus]
          Length = 568

 Score = 49.7 bits (117), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 24/77 (31%), Positives = 42/77 (54%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV+  +PR  +YH+ LS+ E   +  LA P +R++TV +  +G+ + +  R S   +L  
Sbjct: 395 EVMFPKPRIVIYHDVLSEHEMNVIKTLAQPRLRRATVQNYKSGELETASYRISKSAWLKN 454

Query: 138 GRDKIIRDIEKRIADFT 154
               +I  + +RI D T
Sbjct: 455 EEHGVIARVTRRIEDIT 471


>gi|334314085|ref|XP_001363658.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
           [Monodelphis domestica]
          Length = 537

 Score = 49.7 bits (117), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 25/74 (33%), Positives = 40/74 (54%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H  +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   D
Sbjct: 335 WDKPRIVRFHEIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYED 394

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 395 PVVSRINMRIQDLT 408


>gi|402593814|gb|EJW87741.1| hypothetical protein WUBG_01349 [Wuchereria bancrofti]
          Length = 541

 Score = 49.7 bits (117), Expect = 4e-04,   Method: Composition-based stats.
 Identities = 24/74 (32%), Positives = 44/74 (59%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE++ + P A ++ + ++ EE   +  LATP +R++TV +S TG+ + +  RTS   +L 
Sbjct: 325 VEILRFNPLAVLFRDVITDEEITMIQMLATPRLRRATVQNSITGELETASYRTSKSAWLK 384

Query: 137 RGRDKIIRDIEKRI 150
               +++  I KRI
Sbjct: 385 DEEHEVVHRINKRI 398


>gi|195113237|ref|XP_002001174.1| GI10637 [Drosophila mojavensis]
 gi|193917768|gb|EDW16635.1| GI10637 [Drosophila mojavensis]
          Length = 529

 Score = 49.7 bits (117), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 24/82 (29%), Positives = 47/82 (57%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E+IS +P   +YH+ +S  E   L +LA P ++++TV +  + ++   + RTS  T+L 
Sbjct: 323 MELISLDPYMVIYHDVISPSEISELQSLAVPGLKRATVFNQQSMRNHVVKTRTSKVTWLL 382

Query: 137 RGRDKIIRDIEKRIADFTFFPL 158
              +++   + +RI D T F +
Sbjct: 383 DTLNQLTIRLNRRITDMTGFDM 404


>gi|255573113|ref|XP_002527486.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223533126|gb|EEF34884.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 286

 Score = 49.7 bits (117), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 31/101 (30%), Positives = 54/101 (53%), Gaps = 5/101 (4%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV--DSDTGKSKDSRVRTSSGTFL 135
           +V+SW+PRA  + +F + E+C+ +I +A   ++ S +     +T +S     RTSSGTFL
Sbjct: 77  QVLSWKPRAVYFPDFATPEQCKNIIEMAKLRLKPSGLALRKGETAESTKG-TRTSSGTFL 135

Query: 136 ARGRD--KIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           +   D    +  IE +IA  T  P      F++  + + +K
Sbjct: 136 SASEDGTGTLDFIEHKIARATMIPRSHGEAFNILRYEIGQK 176


>gi|194905290|ref|XP_001981166.1| GG11918 [Drosophila erecta]
 gi|190655804|gb|EDV53036.1| GG11918 [Drosophila erecta]
          Length = 525

 Score = 49.3 bits (116), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 24/82 (29%), Positives = 46/82 (56%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E++  +P   +YH+ LS +E + L  +ATP + ++TV  + +G+++  + RTS   +  
Sbjct: 322 MELVGLDPYMVLYHDVLSAKEIKELQGMATPGLTRATVFQASSGRNEVVKTRTSKVAWFP 381

Query: 137 RGRDKIIRDIEKRIADFTFFPL 158
              + +   +  RIAD T F L
Sbjct: 382 DSYNPLTVRLNARIADMTGFNL 403


>gi|229086310|ref|ZP_04218488.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
 gi|228697005|gb|EEL49812.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
          Length = 220

 Score = 49.3 bits (116), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 30/77 (38%), Positives = 43/77 (55%), Gaps = 8/77 (10%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
           EP   V  N LS EECE LI L+   M++S +     G S++   +RTSSGTFL    ++
Sbjct: 42  EPLIVVLENVLSDEECESLIELSKDSMKRSKI-----GASREVDNIRTSSGTFLE--ENE 94

Query: 142 IIRDIEKRIADFTFFPL 158
            +  IEKR++     P+
Sbjct: 95  TVAIIEKRVSSIMNIPV 111


>gi|170064953|ref|XP_001867740.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
 gi|167882143|gb|EDS45526.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
          Length = 509

 Score = 49.3 bits (116), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 43/77 (55%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV++ +P   VYH+  S  E   LI LA   + ++T+   D G+ + S  RTS   +L  
Sbjct: 303 EVLNLDPFITVYHDVASDREISKLIELAKSRISRATI--RDDGEPQVSNARTSQNAWLDA 360

Query: 138 GRDKIIRDIEKRIADFT 154
           G D+++  +++R+ D T
Sbjct: 361 GDDRVVTTLDRRVGDMT 377


>gi|836898|gb|AAC52197.1| prolyl 4-hydroxylase alpha(I)-subunit, partial [Mus musculus]
 gi|1096887|prf||2112362A Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=I
          Length = 526

 Score = 49.3 bits (116), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 24/72 (33%), Positives = 39/72 (54%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H+ +S  E E +  LA P + ++TV D +TGK   ++ R S   +L+   D +
Sbjct: 326 KPRIIRFHDIISDAEIEIVKYLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPV 385

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 386 VSRINMRIQDLT 397


>gi|324511726|gb|ADY44875.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
          Length = 550

 Score = 49.3 bits (116), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 24/78 (30%), Positives = 46/78 (58%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE++ + P A ++ + +S EE + +  +ATP ++++TV +S TG+ + +  R S   +L 
Sbjct: 322 VEILRFNPLAVLFVDIISDEEAKMIQQIATPRLKRATVQNSKTGELETAAYRISKSAWLK 381

Query: 137 RGRDKIIRDIEKRIADFT 154
            G  ++I  I +RI   T
Sbjct: 382 GGDHELIDRINRRIELMT 399


>gi|432926124|ref|XP_004080841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
           latipes]
          Length = 523

 Score = 48.9 bits (115), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 26/74 (35%), Positives = 39/74 (52%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ P    YH+  S++E E +  LA P +R++TV D  TGK   ++ R S   +L     
Sbjct: 321 WDSPYIVRYHDVASEKEMETVKELAKPRLRRATVHDPQTGKLTTAQYRVSKSAWLGSHEH 380

Query: 141 KIIRDIEKRIADFT 154
            I+  I +RI D T
Sbjct: 381 PIVDRINQRIEDIT 394


>gi|47213360|emb|CAF90979.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 511

 Score = 48.9 bits (115), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 31/92 (33%), Positives = 45/92 (48%), Gaps = 6/92 (6%)

Query: 69  DEGRAEQWV-----EVISWE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKS 122
           D GR  ++V     +   W+ PR   YH+ LS  E E +  LA P +R++TV D  TG+ 
Sbjct: 286 DNGRHPKYVIGPVKQEDEWDHPRIVRYHDVLSNREMEKVKELARPRLRRATVHDPRTGQL 345

Query: 123 KDSRVRTSSGTFLARGRDKIIRDIEKRIADFT 154
             +  R S   +L      I+  I +RI D T
Sbjct: 346 TTAPYRVSKSAWLGAFEHPIVDQINQRIEDIT 377


>gi|291190274|ref|NP_001167096.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide precursor [Salmo
           salar]
 gi|223648100|gb|ACN10808.1| Prolyl 4-hydroxylase subunit alpha-1 precursor [Salmo salar]
          Length = 545

 Score = 48.9 bits (115), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 25/74 (33%), Positives = 39/74 (52%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   YH+ LS  E E +  LA P +R++T+ +  TG  + +  R S   +L    D
Sbjct: 343 WDRPRIIRYHDVLSNSEIEKVKELAKPRLRRATISNPITGVLETAHYRISKSAWLTAYED 402

Query: 141 KIIRDIEKRIADFT 154
            ++  I +RI D T
Sbjct: 403 PVVDKINQRIEDIT 416


>gi|255545252|ref|XP_002513687.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223547595|gb|EEF49090.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 309

 Score = 48.9 bits (115), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 43/78 (55%), Gaps = 4/78 (5%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           +SW PR F+Y  FL+ EEC+ LI+LA  H  K   +    G    + ++ +S    +   
Sbjct: 61  LSWRPRVFLYKGFLTDEECDRLISLA--HGAKE--ISKGKGDGSRNNIQLASSESRSHIY 116

Query: 140 DKIIRDIEKRIADFTFFP 157
           D ++  IE+RI+ +TF P
Sbjct: 117 DDLLARIEERISAWTFIP 134


>gi|3297815|emb|CAA19873.1| putative protein [Arabidopsis thaliana]
 gi|7270340|emb|CAB80108.1| putative protein [Arabidopsis thaliana]
          Length = 257

 Score = 48.9 bits (115), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 27/100 (27%), Positives = 55/100 (55%), Gaps = 3/100 (3%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR-VRTSSGTFLA 136
           +V+SW PRA  + NF + E+C+ +I  A  +++ S +       +++++  RTSSGTF++
Sbjct: 28  QVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTRTSSGTFIS 87

Query: 137 RGRDKI--IRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
              +    +  +E++IA  T  P      F++  + + +K
Sbjct: 88  ASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQK 127


>gi|347964867|ref|XP_309164.4| AGAP000971-PA [Anopheles gambiae str. PEST]
 gi|333466515|gb|EAA04901.5| AGAP000971-PA [Anopheles gambiae str. PEST]
          Length = 553

 Score = 48.9 bits (115), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 23/71 (32%), Positives = 39/71 (54%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P   +YH+ +S  E E + + A P  R++TV +  TG+ + +  R S   +L    D++I
Sbjct: 349 PYIVIYHDVMSDREIERIKHYARPRFRRATVQNYKTGELEFANYRISKSAWLKDAEDEMI 408

Query: 144 RDIEKRIADFT 154
           R I +R+ D T
Sbjct: 409 RTISQRVEDMT 419


>gi|413963357|ref|ZP_11402584.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
 gi|413929189|gb|EKS68477.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
          Length = 286

 Score = 48.9 bits (115), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 42/78 (53%)

Query: 81  SWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           S +P   +  + L   EC+ LI +   H+++S+VVD D+GK      R S G F+    D
Sbjct: 91  SEQPVIALVADVLDDTECDRLIEIGREHVQRSSVVDPDSGKEITIEERRSEGAFVNASTD 150

Query: 141 KIIRDIEKRIADFTFFPL 158
            ++  I++RIA+    P+
Sbjct: 151 ALVETIDRRIAELFRQPV 168


>gi|18418321|ref|NP_567941.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|17381226|gb|AAL36425.1| unknown protein [Arabidopsis thaliana]
 gi|20465827|gb|AAM20018.1| unknown protein [Arabidopsis thaliana]
 gi|21592377|gb|AAM64328.1| putative dioxygenase [Arabidopsis thaliana]
 gi|332660892|gb|AEE86292.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 288

 Score = 48.9 bits (115), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 29/110 (26%), Positives = 58/110 (52%), Gaps = 3/110 (2%)

Query: 68  GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR- 126
           G+E       +V+SW PRA  + NF + E+C+ +I  A  +++ S +       +++++ 
Sbjct: 69  GEESIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKG 128

Query: 127 VRTSSGTFLARGRDKI--IRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
            RTSSGTF++   +    +  +E++IA  T  P      F++  + + +K
Sbjct: 129 TRTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQK 178


>gi|348576112|ref|XP_003473831.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cavia
           porcellus]
          Length = 534

 Score = 48.9 bits (115), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 24/74 (32%), Positives = 41/74 (55%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   +
Sbjct: 332 WDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 392 PVVSRINMRIQDLT 405


>gi|296220402|ref|XP_002756291.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Callithrix
           jacchus]
          Length = 534

 Score = 48.9 bits (115), Expect = 7e-04,   Method: Composition-based stats.
 Identities = 24/74 (32%), Positives = 41/74 (55%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   +
Sbjct: 332 WDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 392 PVVSRINMRIQDLT 405


>gi|385137888|gb|AFI41205.1| oxygenase protein, partial [Arabidopsis thaliana]
          Length = 288

 Score = 48.9 bits (115), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 29/110 (26%), Positives = 58/110 (52%), Gaps = 3/110 (2%)

Query: 68  GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR- 126
           G+E       +V+SW PRA  + NF + E+C+ +I  A  +++ S +       +++++ 
Sbjct: 69  GEESIGSIPFQVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKG 128

Query: 127 VRTSSGTFLARGRDKI--IRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
            RTSSGTF++   +    +  +E++IA  T  P      F++  + + +K
Sbjct: 129 TRTSSGTFISASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQK 178


>gi|195575143|ref|XP_002105539.1| GD16913 [Drosophila simulans]
 gi|194201466|gb|EDX15042.1| GD16913 [Drosophila simulans]
          Length = 534

 Score = 48.9 bits (115), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 29/95 (30%), Positives = 47/95 (49%), Gaps = 1/95 (1%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E I  +P   +YH  LS  E   LI  A  +M K+T V  + G  K +R RT+ G +  
Sbjct: 319 IEQIGLDPYVVLYHEVLSAREISMLIGKAAQNM-KNTRVHKEQGVPKKNRGRTAKGFWFK 377

Query: 137 RGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYV 171
           +  +++ + I +RI D T F L     F +  + +
Sbjct: 378 KESNELTKGITRRIMDMTGFDLADSEGFQVINYGI 412


>gi|74148153|dbj|BAE36242.1| unnamed protein product [Mus musculus]
          Length = 454

 Score = 48.9 bits (115), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 41/74 (55%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   D
Sbjct: 252 WDKPRIIRFHDIISDAENEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYED 311

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 312 PVVSRINMRIQDLT 325


>gi|198449506|ref|XP_002136910.1| GA26925 [Drosophila pseudoobscura pseudoobscura]
 gi|198130637|gb|EDY67468.1| GA26925 [Drosophila pseudoobscura pseudoobscura]
          Length = 543

 Score = 48.9 bits (115), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 27/83 (32%), Positives = 46/83 (55%), Gaps = 5/83 (6%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E +S +P   +YHN LS EE   L N++TP + ++ + D +T K K S VR++    + 
Sbjct: 327 MEELSLDPYIVLYHNVLSDEEMARLENMSTPLLHRARIFDKETKKPKISPVRSADEVGIP 386

Query: 137 RGR-----DKIIRDIEKRIADFT 154
             +      +++  I+KRI D T
Sbjct: 387 NPKLVTEDIQLVECIQKRITDLT 409


>gi|301770069|ref|XP_002920453.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Ailuropoda
           melanoleuca]
          Length = 534

 Score = 48.9 bits (115), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 24/74 (32%), Positives = 41/74 (55%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   +
Sbjct: 332 WDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 392 PVVSRINMRIQDLT 405


>gi|195452746|ref|XP_002073482.1| GK14141 [Drosophila willistoni]
 gi|194169567|gb|EDW84468.1| GK14141 [Drosophila willistoni]
          Length = 541

 Score = 48.9 bits (115), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 24/78 (30%), Positives = 48/78 (61%), Gaps = 2/78 (2%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE ++  P   +YH+ + + E + + NL    + ++TV+ +    S+ S+VRTS  TF+ 
Sbjct: 323 VEELNHNPLLVLYHDVIYQSEIDVIRNLTENEISRATVIGAK--GSEVSKVRTSQFTFIP 380

Query: 137 RGRDKIIRDIEKRIADFT 154
           + R K+++ I++R+AD +
Sbjct: 381 KTRHKVLQTIDQRVADMS 398


>gi|151556370|gb|AAI47868.1| P4HA1 protein [Bos taurus]
          Length = 534

 Score = 48.9 bits (115), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 23/72 (31%), Positives = 40/72 (55%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   + +
Sbjct: 334 KPRIIRFHDIISDAEIEVVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 394 VSRINMRIQDLT 405


>gi|190786|gb|AAA36534.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
          Length = 534

 Score = 48.9 bits (115), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 23/72 (31%), Positives = 40/72 (55%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 394 VSRINMRIQDLT 405


>gi|380813208|gb|AFE78478.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
           mulatta]
 gi|384947330|gb|AFI37270.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
           mulatta]
          Length = 534

 Score = 48.9 bits (115), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 23/72 (31%), Positives = 40/72 (55%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 394 VSRINMRIQDLT 405


>gi|302764866|ref|XP_002965854.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
 gi|300166668|gb|EFJ33274.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
          Length = 231

 Score = 48.9 bits (115), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 55/100 (55%), Gaps = 3/100 (3%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR-VRTSSGTFLA 136
           +V+SW PRA ++  F S  +CE +I+LA   +  S++       + +++ VRTS G FL+
Sbjct: 22  QVLSWTPRALLFPKFASPAQCEAIISLAKTKLTPSSLALRKGETATETQDVRTSHGCFLS 81

Query: 137 RGRDK--IIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
             +DK   +  +E+++A  T  P      F++  + + +K
Sbjct: 82  SRQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEIGQK 121


>gi|251794605|ref|YP_003009336.1| procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
 gi|247542231|gb|ACS99249.1| Procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
          Length = 209

 Score = 48.9 bits (115), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 41/77 (53%), Gaps = 6/77 (7%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
           EP   +  N LS  EC+ LI+LA+  M+++ +     G S D S VRTSS  F     ++
Sbjct: 31  EPLILILDNVLSWAECDLLIDLASARMQRAKI-----GSSHDVSEVRTSSSMFFEESENE 85

Query: 142 IIRDIEKRIADFTFFPL 158
            I  +E R+A+    P+
Sbjct: 86  CIGQVEARVAELMNIPV 102


>gi|63252886|ref|NP_000908.2| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Homo
           sapiens]
 gi|114631173|ref|XP_508168.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 13 [Pan
           troglodytes]
 gi|602676|gb|AAA59069.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
 gi|62897481|dbj|BAD96680.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I variant [Homo
           sapiens]
 gi|119574852|gb|EAW54467.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_a [Homo
           sapiens]
 gi|119574853|gb|EAW54468.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_b [Homo
           sapiens]
 gi|410349609|gb|JAA41408.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
 gi|410349613|gb|JAA41410.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 534

 Score = 48.9 bits (115), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 23/72 (31%), Positives = 40/72 (55%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 394 VSRINMRIQDLT 405


>gi|432106758|gb|ELK32410.1| Prolyl 4-hydroxylase subunit alpha-1 [Myotis davidii]
          Length = 534

 Score = 48.9 bits (115), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 24/74 (32%), Positives = 41/74 (55%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   +
Sbjct: 332 WDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 392 PVVSRINMRIQDLT 405


>gi|426255746|ref|XP_004021509.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Ovis
           aries]
          Length = 534

 Score = 48.9 bits (115), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 24/74 (32%), Positives = 41/74 (55%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   +
Sbjct: 332 WDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 392 PVVSRINMRIQDLT 405


>gi|73952886|ref|XP_850682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Canis
           lupus familiaris]
          Length = 534

 Score = 48.9 bits (115), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 24/74 (32%), Positives = 41/74 (55%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   +
Sbjct: 332 WDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 392 PVVSRINMRIQDLT 405


>gi|356559784|ref|XP_003548177.1| PREDICTED: uncharacterized protein LOC100795761 [Glycine max]
          Length = 264

 Score = 48.9 bits (115), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 44/78 (56%), Gaps = 10/78 (12%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           ISW+PR F+Y  FLS +EC+YL++LA     KS+    + G S+   V TS         
Sbjct: 20  ISWQPRVFLYKGFLSDKECDYLVSLAYAVKEKSS---GNGGLSEG--VETSLDM-----E 69

Query: 140 DKIIRDIEKRIADFTFFP 157
           D I+  IE+R++ + F P
Sbjct: 70  DDILARIEERLSVWAFLP 87


>gi|410251924|gb|JAA13929.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 566

 Score = 48.9 bits (115), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 24/74 (32%), Positives = 41/74 (55%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   +
Sbjct: 332 WDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 392 PVVSRINMRIQDLT 405


>gi|393909803|gb|EFO21561.2| prolyl 4-hydroxylase 2 [Loa loa]
          Length = 542

 Score = 48.9 bits (115), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 24/74 (32%), Positives = 43/74 (58%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE++ + P A  + + ++ EE   +  LATP +R++TV +S TG+ + +  RTS   +L 
Sbjct: 326 VEILRFSPLAVFFRDVITDEEVTIIQMLATPRLRRATVQNSITGELETASYRTSKSAWLK 385

Query: 137 RGRDKIIRDIEKRI 150
               +I+  I +RI
Sbjct: 386 DEEHEIVHRINRRI 399


>gi|383418721|gb|AFH32574.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
           mulatta]
          Length = 534

 Score = 48.9 bits (115), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 23/72 (31%), Positives = 40/72 (55%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 394 VSRINMRIQDLT 405


>gi|312080225|ref|XP_003142509.1| prolyl 4-hydroxylase 2 [Loa loa]
          Length = 541

 Score = 48.9 bits (115), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 24/74 (32%), Positives = 43/74 (58%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE++ + P A  + + ++ EE   +  LATP +R++TV +S TG+ + +  RTS   +L 
Sbjct: 325 VEILRFSPLAVFFRDVITDEEVTIIQMLATPRLRRATVQNSITGELETASYRTSKSAWLK 384

Query: 137 RGRDKIIRDIEKRI 150
               +I+  I +RI
Sbjct: 385 DEEHEIVHRINRRI 398


>gi|291404182|ref|XP_002718471.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 1
           [Oryctolagus cuniculus]
          Length = 534

 Score = 48.9 bits (115), Expect = 8e-04,   Method: Composition-based stats.
 Identities = 23/72 (31%), Positives = 40/72 (55%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 394 VSRINMRIQDLT 405


>gi|389809938|ref|ZP_10205598.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
 gi|388441354|gb|EIL97635.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
          Length = 284

 Score = 48.5 bits (114), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 41/86 (47%), Gaps = 1/86 (1%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P   V  N LS  EC+ LI LA P ++++  VDS+ G+ +  R RTS G F       ++
Sbjct: 95  PALRVLENILSARECDELIALARPRLQRALTVDSE-GRQQVDRRRTSEGMFFTLDEVPLV 153

Query: 144 RDIEKRIADFTFFPLGMQFKFSLFFF 169
             IE+R+A     P        +  +
Sbjct: 154 GRIERRVAALLDVPASHGEGLQILHY 179


>gi|30686940|ref|NP_194290.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
 gi|26451153|dbj|BAC42680.1| unknown protein [Arabidopsis thaliana]
 gi|29893542|gb|AAP06823.1| unknown protein [Arabidopsis thaliana]
 gi|332659681|gb|AEE85081.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
          Length = 291

 Score = 48.5 bits (114), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 42/139 (30%), Positives = 64/139 (46%), Gaps = 37/139 (26%)

Query: 31  ILILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEG--------DEGRAEQWVEVISW 82
           +++++     S P  SG SRK      I  KS +            D  R  Q    +SW
Sbjct: 9   LILMITMSSSSPPFCSGGSRKELRDKEITSKSDDTQASYVLGSKFVDPTRVLQ----LSW 64

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTV----VDSDTGKSKDSRVRTSSGTFLARG 138
            PR F+Y  FLS+EEC++LI+L     RK T     VD+D GK++               
Sbjct: 65  LPRVFLYRGFLSEEECDHLISL-----RKETTEVYSVDAD-GKTQ--------------- 103

Query: 139 RDKIIRDIEKRIADFTFFP 157
            D ++  IE++++ +TF P
Sbjct: 104 LDPVVAGIEEKVSAWTFLP 122


>gi|395820524|ref|XP_003783614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Otolemur
           garnettii]
          Length = 534

 Score = 48.5 bits (114), Expect = 9e-04,   Method: Composition-based stats.
 Identities = 24/74 (32%), Positives = 41/74 (55%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   +
Sbjct: 332 WDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 392 PVVSRINMRIQDLT 405


>gi|343171882|gb|AEL98645.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
           [Silene latifolia]
 gi|343171884|gb|AEL98646.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
           [Silene latifolia]
          Length = 162

 Score = 48.5 bits (114), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 30/94 (31%), Positives = 52/94 (55%), Gaps = 5/94 (5%)

Query: 68  GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDS-- 125
           GD+  +    +V+SW PR   +  F + + CE +I++A   ++ S +     G++ DS  
Sbjct: 66  GDDSISSIPFQVLSWRPRVLYFPKFATADHCETIISIARSQLKPSRLA-LRKGETLDSTR 124

Query: 126 RVRTSSGTFLARGRDK--IIRDIEKRIADFTFFP 157
            +RTSSG F++   DK  I+  I+++IA  T  P
Sbjct: 125 EIRTSSGMFISADEDKTGILDFIDEKIARATMIP 158


>gi|377811809|ref|YP_005044249.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
 gi|357941170|gb|AET94726.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
          Length = 283

 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 40/76 (52%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           EP   +  + LS  EC+ LI +    +R+S+VVD D+G       R S G F+    D +
Sbjct: 90  EPVVALLADVLSPRECDRLIEIGRERVRRSSVVDPDSGGEVLIDARKSEGAFVNGSTDPL 149

Query: 143 IRDIEKRIADFTFFPL 158
           +  I++RIA+    P+
Sbjct: 150 VATIDRRIAELVQQPV 165


>gi|356530852|ref|XP_003533993.1| PREDICTED: uncharacterized protein LOC100775928 [Glycine max]
          Length = 302

 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 30/78 (38%), Positives = 44/78 (56%), Gaps = 10/78 (12%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           ISW+PR F+Y  FLS +EC+YL++LA     KS+    + G S+         TFL    
Sbjct: 58  ISWQPRVFLYKGFLSDKECDYLVSLAYAVKEKSS---GNGGFSEGVE------TFLDI-E 107

Query: 140 DKIIRDIEKRIADFTFFP 157
           D I+  IE+R++ + F P
Sbjct: 108 DDILARIEERLSLWAFLP 125


>gi|195390835|ref|XP_002054073.1| GJ22993 [Drosophila virilis]
 gi|194152159|gb|EDW67593.1| GJ22993 [Drosophila virilis]
          Length = 525

 Score = 48.5 bits (114), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/81 (28%), Positives = 45/81 (55%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           E+++ +P   +YH+ ++  E   L  LA P ++++TV +   G++   + RTS  T+L  
Sbjct: 323 ELLALDPYMVLYHDVITPSEIRELQYLAVPTLKRATVFNQKMGRNTVVKTRTSKVTWLTD 382

Query: 138 GRDKIIRDIEKRIADFTFFPL 158
             + +   + +RI+D T F L
Sbjct: 383 SLNPLTVRLNRRISDMTGFDL 403


>gi|390363005|ref|XP_797519.3| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like
           [Strongylocentrotus purpuratus]
          Length = 579

 Score = 48.5 bits (114), Expect = 0.001,   Method: Composition-based stats.
 Identities = 24/81 (29%), Positives = 46/81 (56%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  +PR   Y N L+ +E  ++  LA+P ++++T+ ++ TG  + +  R S   ++ +
Sbjct: 368 EVIFDDPRLVFYRNILNDKEIAFVKRLASPRLQRATIQNAITGNLEFADYRISKSAWVKQ 427

Query: 138 GRDKIIRDIEKRIADFTFFPL 158
             D++IR I  R+  +T   L
Sbjct: 428 EEDQLIRSIRFRVQAYTGLEL 448


>gi|344274272|ref|XP_003408941.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
           [Loxodonta africana]
          Length = 534

 Score = 48.5 bits (114), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/72 (31%), Positives = 40/72 (55%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   + +
Sbjct: 334 KPRIVRFHDIISDAEIEVVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 394 VSRINMRIQDLT 405


>gi|195505251|ref|XP_002099423.1| GE23370 [Drosophila yakuba]
 gi|194185524|gb|EDW99135.1| GE23370 [Drosophila yakuba]
          Length = 534

 Score = 48.5 bits (114), Expect = 0.001,   Method: Composition-based stats.
 Identities = 31/82 (37%), Positives = 45/82 (54%), Gaps = 2/82 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           E I  +P   +YH  LS  E   LI+ A  +M K+T V  +T K K +R RT+ G +L +
Sbjct: 321 EQIGLDPYVVLYHEVLSAREISMLISKAAQNM-KNTRVHRET-KPKTNRGRTAKGHWLKK 378

Query: 138 GRDKIIRDIEKRIADFTFFPLG 159
             +++ R I +RI D T F L 
Sbjct: 379 ESNELTRRITRRIVDMTGFDLA 400


>gi|194765138|ref|XP_001964684.1| GF23317 [Drosophila ananassae]
 gi|190614956|gb|EDV30480.1| GF23317 [Drosophila ananassae]
          Length = 520

 Score = 48.5 bits (114), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/82 (28%), Positives = 43/82 (52%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E++   P   +YH+ LS  E + +  +ATP ++++TV  +  GK++  + RTS   +  
Sbjct: 320 MEIVGLNPYMVIYHDVLSSAEIDEMKEMATPSLKRATVYKASLGKNEVVKTRTSKVAWFP 379

Query: 137 RGRDKIIRDIEKRIADFTFFPL 158
              + +   +  RI D T F L
Sbjct: 380 DSYNSLTLRLNARIHDMTGFDL 401


>gi|325915856|ref|ZP_08178155.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
           [Xanthomonas vesicatoria ATCC 35937]
 gi|325537977|gb|EGD09674.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
           [Xanthomonas vesicatoria ATCC 35937]
          Length = 418

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/54 (46%), Positives = 31/54 (57%), Gaps = 4/54 (7%)

Query: 92  FLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRD 145
            LS +EC  LI LA PH+R S VVD D   S+ + +RTS G  L    D I+ D
Sbjct: 236 VLSADECRLLILLARPHLRASQVVDPDDASSQRTPIRTSRGATL----DPILED 285


>gi|443709455|gb|ELU04127.1| hypothetical protein CAPTEDRAFT_149240 [Capitella teleta]
          Length = 532

 Score = 48.1 bits (113), Expect = 0.001,   Method: Composition-based stats.
 Identities = 26/81 (32%), Positives = 42/81 (51%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           E  S EP   VYH  ++  E E +  +ATP + ++TV +S TG+ + ++ R S   +L  
Sbjct: 330 ETASLEPWIAVYHQLMNDHEIERIKEMATPRLARATVHNSATGQLEHAKYRISKSGWLRD 389

Query: 138 GRDKIIRDIEKRIADFTFFPL 158
             D +I  I +R +  T   L
Sbjct: 390 EEDPLIARISERCSALTNLSL 410


>gi|290243077|ref|YP_003494747.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
 gi|288945582|gb|ADC73280.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
          Length = 575

 Score = 48.1 bits (113), Expect = 0.001,   Method: Composition-based stats.
 Identities = 34/97 (35%), Positives = 50/97 (51%), Gaps = 4/97 (4%)

Query: 64  ERSEGDEGRAEQW--VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGK 121
           ER E +   AE++  +E +S +P       FL   ECE LI+LA   M+++ V  S  G 
Sbjct: 35  ERVERNRMPAERYDGMETLSQDPLVVYLDEFLEPGECEALIHLAQGRMKRALV--SLDGS 92

Query: 122 SKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPL 158
           S  S+ RT S  +L    + + R I +R+A    FPL
Sbjct: 93  SGVSQGRTGSNCWLRYQEEPLARRIGERVAKRVGFPL 129


>gi|194290782|ref|YP_002006689.1| prolyl 4-hydroxylase subunit alpha [Cupriavidus taiwanensis LMG
           19424]
 gi|193224617|emb|CAQ70628.1| putative Prolyl 4-hydroxylase alpha subunit [Cupriavidus
           taiwanensis LMG 19424]
          Length = 296

 Score = 48.1 bits (113), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 36/74 (48%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P+  ++   LS +EC+ L+ L+   + +S VV+ DTG       RTS G         +I
Sbjct: 104 PQVQLFQQLLSDDECDALVALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQVAEHALI 163

Query: 144 RDIEKRIADFTFFP 157
             IE RIA  T  P
Sbjct: 164 ARIEARIAAVTGVP 177


>gi|410927705|ref|XP_003977281.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
           rubripes]
          Length = 531

 Score = 48.1 bits (113), Expect = 0.001,   Method: Composition-based stats.
 Identities = 29/92 (31%), Positives = 44/92 (47%), Gaps = 6/92 (6%)

Query: 69  DEGRAEQWV-----EVISWE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKS 122
           D GR  ++V     +   W+ P    YH+ LS  E E +  LA P +R++TV D  TG+ 
Sbjct: 311 DNGRHPKYVIGPVKQEDEWDRPHIVRYHDILSNREMETVKELAKPRLRRATVHDPQTGQL 370

Query: 123 KDSRVRTSSGTFLARGRDKIIRDIEKRIADFT 154
             +  R S   +L      ++  I +RI D T
Sbjct: 371 TTAPYRVSKSAWLGAFEHPVVDRINQRIEDIT 402


>gi|198449500|ref|XP_001357604.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
 gi|198130634|gb|EAL26738.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
          Length = 528

 Score = 48.1 bits (113), Expect = 0.001,   Method: Composition-based stats.
 Identities = 23/82 (28%), Positives = 46/82 (56%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E++  +P   +YH+ +S  E   L ++ATP ++++TV  +   +S+  + RTS   +  
Sbjct: 323 MELVGLDPYMVLYHDVISAPEISQLQDMATPGLKRATVYKASGRRSEVVKTRTSKVAWFP 382

Query: 137 RGRDKIIRDIEKRIADFTFFPL 158
              +++   + +RIAD T F L
Sbjct: 383 DTFNELTERLNRRIADMTNFDL 404


>gi|77748579|ref|NP_641686.2| hypothetical protein XAC1351 [Xanthomonas axonopodis pv. citri str.
           306]
          Length = 418

 Score = 47.8 bits (112), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 34/62 (54%), Gaps = 4/62 (6%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR   Y   LS +EC  L+ LA PH+R S V+D +   ++ + +RTS G  L    D II
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKVIDPNDASTQRAPIRTSRGATL----DPII 283

Query: 144 RD 145
            D
Sbjct: 284 ED 285


>gi|403298096|ref|XP_003939871.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saimiri
           boliviensis boliviensis]
          Length = 412

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/93 (27%), Positives = 47/93 (50%), Gaps = 1/93 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   +
Sbjct: 307 WDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEN 366

Query: 141 KIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLK 173
            ++  I  RI D T   +    +  +    +LK
Sbjct: 367 PVVSRINMRIQDLTGLDVSTAEELQVGNHIILK 399


>gi|255607134|ref|XP_002538686.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223510975|gb|EEF23697.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 318

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 48/83 (57%), Gaps = 2/83 (2%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV-DSDTGKSKDSRVRTSSGTFL 135
           V ++   PR  ++ + LS  EC+ LI  +   +++S VV +  +G+  D   RTS G + 
Sbjct: 119 VVMVCTAPRIALFDDVLSDAECDALIAASRSRLQRSKVVANRGSGEFVDD-TRTSYGAYF 177

Query: 136 ARGRDKIIRDIEKRIADFTFFPL 158
            +G + ++  I++RIA+ T +PL
Sbjct: 178 NKGENSLVATIQRRIAELTRWPL 200


>gi|381173085|ref|ZP_09882194.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
 gi|380686458|emb|CCG38681.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
          Length = 418

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 34/62 (54%), Gaps = 4/62 (6%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR   Y   LS +EC  L+ LA PH+R S V+D +   ++ + +RTS G  L    D II
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKVIDPNDASTQRAPIRTSRGATL----DPII 283

Query: 144 RD 145
            D
Sbjct: 284 ED 285


>gi|418521653|ref|ZP_13087695.1| hypothetical protein WS7_11622 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
 gi|410702188|gb|EKQ60697.1| hypothetical protein WS7_11622 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
          Length = 418

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 34/62 (54%), Gaps = 4/62 (6%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR   Y   LS +EC  L+ LA PH+R S V+D +   ++ + +RTS G  L    D II
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKVIDPNDASTQRAPIRTSRGATL----DPII 283

Query: 144 RD 145
            D
Sbjct: 284 ED 285


>gi|242047776|ref|XP_002461634.1| hypothetical protein SORBIDRAFT_02g005770 [Sorghum bicolor]
 gi|241925011|gb|EER98155.1| hypothetical protein SORBIDRAFT_02g005770 [Sorghum bicolor]
          Length = 79

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 17/35 (48%), Positives = 26/35 (74%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRK 111
           V+ +SW+PR FVY  FLS +EC++L+ L + H R+
Sbjct: 45  VKAVSWQPRIFVYKGFLSDDECDHLVTLVSEHRRR 79


>gi|195113239|ref|XP_002001175.1| GI10638 [Drosophila mojavensis]
 gi|193917769|gb|EDW16636.1| GI10638 [Drosophila mojavensis]
          Length = 511

 Score = 47.8 bits (112), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/106 (28%), Positives = 50/106 (47%), Gaps = 15/106 (14%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +EV+  +P   ++H+ LS  E + L  +A PH+ +S VV          R+  S+GT++ 
Sbjct: 312 MEVLVLDPLVVIFHDVLSSREIDGLQEIARPHLERSMVVKYRANVQGKHRI--SAGTWVE 369

Query: 137 RGRDKIIRDIEKRIADFT-------------FFPLGMQFKFSLFFF 169
           R  + +   IE+RIAD                + +G Q+K    FF
Sbjct: 370 RKYNNLTWRIERRIADMVDLNLEGSEPFYVINYGIGGQYKAHWDFF 415


>gi|395501518|ref|XP_003755140.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Sarcophilus
           harrisii]
          Length = 385

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/71 (33%), Positives = 38/71 (53%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR   +H  +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   D ++
Sbjct: 186 PRIVRFHEIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPVV 245

Query: 144 RDIEKRIADFT 154
             I  RI D T
Sbjct: 246 SRINMRIQDLT 256


>gi|428178571|gb|EKX47446.1| hypothetical protein GUITHDRAFT_152114 [Guillardia theta CCMP2712]
          Length = 262

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 46/80 (57%), Gaps = 1/80 (1%)

Query: 76  WVEVISWEPRAFVYHNFLSKEECEYLINLA-TPHMRKSTVVDSDTGKSKDSRVRTSSGTF 134
           ++E I+  PR F   N L+K+ECE+L+ LA    + K+ ++   T K  +S  RT+ G +
Sbjct: 56  YLEQINASPRVFRIRNLLTKQECEHLMLLAFRKGLSKTMIMPYGTHKLVESTTRTNDGAW 115

Query: 135 LARGRDKIIRDIEKRIADFT 154
           L   +D ++R +E+ +   T
Sbjct: 116 LDFLQDDVVRRLEETLGKLT 135


>gi|398806116|ref|ZP_10565064.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
 gi|398089832|gb|EJL80333.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
          Length = 294

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 35/74 (47%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V  NFLS EEC+ L   A P    +TVVD        +  R++    L     +++
Sbjct: 95  PRIVVLDNFLSSEECDGLCEEARPAFAPATVVDPHQDAVHAAHFRSNDSAQLPAAGSELV 154

Query: 144 RDIEKRIADFTFFP 157
           R +E RI   T +P
Sbjct: 155 RRVEARIERLTGWP 168


>gi|198449635|ref|XP_001357660.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
 gi|198130694|gb|EAL26794.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
          Length = 549

 Score = 47.4 bits (111), Expect = 0.002,   Method: Composition-based stats.
 Identities = 25/77 (32%), Positives = 46/77 (59%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           E +S +P   +YH+ + + E + +  L T  M ++ V  + T +S  S VRTS  TF+A+
Sbjct: 322 EELSHDPLLVLYHDVIYQSEIDVIRQLTTNRMARAMV--TLTNQSTVSNVRTSQITFIAK 379

Query: 138 GRDKIIRDIEKRIADFT 154
              ++++ I++R+AD T
Sbjct: 380 TEHEVLQTIDRRVADMT 396


>gi|403234403|ref|ZP_10912989.1| Procollagen-proline dioxygenase [Bacillus sp. 10403023]
          Length = 217

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/75 (34%), Positives = 39/75 (52%), Gaps = 5/75 (6%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           EP   V  N LS EEC+ LI L+   + +S + +++        +RTSS TF+    + I
Sbjct: 38  EPLIVVLGNVLSDEECDELIRLSKDRINRSKIANANV-----DNMRTSSSTFIEENENII 92

Query: 143 IRDIEKRIADFTFFP 157
           +  IEKRI+     P
Sbjct: 93  VSRIEKRISQIMNIP 107


>gi|445499353|ref|ZP_21466208.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
 gi|444789348|gb|ELX10896.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
          Length = 272

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 38/84 (45%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +P+  +  N LS EEC+ +I        +STV     G S     RTS   F+ RG  ++
Sbjct: 82  QPQIILLGNVLSDEECDAIIAHCGTRYTRSTVTGEADGSSMVHEGRTSEMAFIQRGEAEV 141

Query: 143 IRDIEKRIADFTFFPLGMQFKFSL 166
              IE+R+A    +P      F L
Sbjct: 142 AERIERRLAALAHWPAECSEPFQL 165


>gi|297301157|ref|XP_001103971.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Macaca
           mulatta]
          Length = 512

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 41/74 (55%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   +
Sbjct: 332 WDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 392 PVVSRINMRIQDLT 405


>gi|294499597|ref|YP_003563297.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
 gi|294349534|gb|ADE69863.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
          Length = 219

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 41/70 (58%), Gaps = 6/70 (8%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
           EP   V  N LS EEC+ LI L+   M++S +     G +++ + +RTSSG F     ++
Sbjct: 38  EPLVLVLGNVLSNEECDELIQLSKDKMQRSKI-----GAAREVNSIRTSSGMFFEESENE 92

Query: 142 IIRDIEKRIA 151
           ++  IE+R++
Sbjct: 93  LVHQIERRLS 102


>gi|452752943|ref|ZP_21952682.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
           proteobacterium JLT2015]
 gi|451959765|gb|EMD82182.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
           proteobacterium JLT2015]
          Length = 314

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 42/83 (50%), Gaps = 2/83 (2%)

Query: 74  EQW-VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSG 132
           E W  E +S  P   +  +  S  EC YL  ++ P +R ST++D  TG  +   VRTS G
Sbjct: 118 ENWRTEPVSETPSIRMVRHLFSSAECAYLQQMSAPRLRPSTILDPQTGARRPDPVRTSVG 177

Query: 133 TFLAR-GRDKIIRDIEKRIADFT 154
             L+    D ++  + +RIA  T
Sbjct: 178 AALSPVEEDLVVGMLNRRIAAAT 200


>gi|195159297|ref|XP_002020518.1| GL13472 [Drosophila persimilis]
 gi|194117287|gb|EDW39330.1| GL13472 [Drosophila persimilis]
          Length = 526

 Score = 47.4 bits (111), Expect = 0.002,   Method: Composition-based stats.
 Identities = 25/77 (32%), Positives = 46/77 (59%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           E +S +P   +YH+ + + E + +  L T  M ++ V  + T +S  S VRTS  TF+A+
Sbjct: 293 EELSHDPLLVLYHDVIYQSEIDVIRQLTTNRMARAMV--TLTNQSTVSNVRTSQITFIAK 350

Query: 138 GRDKIIRDIEKRIADFT 154
              ++++ I++R+AD T
Sbjct: 351 TEHEVLQTIDRRVADMT 367


>gi|332244067|ref|XP_003271193.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-1 [Nomascus leucogenys]
          Length = 502

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 41/74 (55%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   +
Sbjct: 300 WDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEN 359

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 360 PVVSRINMRIQDLT 373


>gi|195425415|ref|XP_002061004.1| GK10713 [Drosophila willistoni]
 gi|194157089|gb|EDW71990.1| GK10713 [Drosophila willistoni]
          Length = 502

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 40/78 (51%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE+++  P   +YH+ L   E E L  LA P + +ST+ D D   +     RTS+  FL 
Sbjct: 280 VEILNNLPFVAIYHDVLYDREIEELKRLAVPTITRSTIYDYDKEGNVPVNFRTSNSVFLL 339

Query: 137 RGRDKIIRDIEKRIADFT 154
                ++  + +R+AD T
Sbjct: 340 NNASYLVDILRQRVADMT 357


>gi|325929527|ref|ZP_08190641.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
           [Xanthomonas perforans 91-118]
 gi|325540037|gb|EGD11665.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
           [Xanthomonas perforans 91-118]
          Length = 418

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 33/62 (53%), Gaps = 4/62 (6%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR   Y   LS +EC  L+ LA PH+R S V+D +   +  + +RTS G  L    D II
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKVIDPNDASTGRAPIRTSHGATL----DPII 283

Query: 144 RD 145
            D
Sbjct: 284 ED 285


>gi|395820528|ref|XP_003783616.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Otolemur
           garnettii]
          Length = 516

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 41/74 (55%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   +
Sbjct: 332 WDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 392 PVVSRINMRIQDLT 405


>gi|217272851|ref|NP_001136068.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Homo
           sapiens]
 gi|114631189|ref|XP_001140871.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 10 [Pan
           troglodytes]
          Length = 516

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 41/74 (55%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   +
Sbjct: 332 WDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 392 PVVSRINMRIQDLT 405


>gi|426255748|ref|XP_004021510.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Ovis
           aries]
          Length = 516

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 41/74 (55%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   +
Sbjct: 332 WDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 392 PVVSRINMRIQDLT 405


>gi|431904119|gb|ELK09541.1| Prolyl 4-hydroxylase subunit alpha-1 [Pteropus alecto]
          Length = 507

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 41/75 (54%), Gaps = 1/75 (1%)

Query: 81  SWE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
            W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   
Sbjct: 331 EWDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYE 390

Query: 140 DKIIRDIEKRIADFT 154
           + ++  I  RI D T
Sbjct: 391 NPVVSRINMRIQDLT 405


>gi|219124513|ref|XP_002182546.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217405892|gb|EEC45833.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 193

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 42/76 (55%), Gaps = 4/76 (5%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V+ +S  PRAF   NFL+  E ++++ L    ++K   +   +     S  RTSS T+LA
Sbjct: 1   VKALSCAPRAFQVENFLTDVEADHIVGL----VQKKNDMQRSSTNGHISETRTSSTTWLA 56

Query: 137 RGRDKIIRDIEKRIAD 152
           R  D +I  I +R+AD
Sbjct: 57  RHSDPVIDSIFRRVAD 72


>gi|90085216|dbj|BAE91349.1| unnamed protein product [Macaca fascicularis]
          Length = 244

 Score = 47.4 bits (111), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 40/72 (55%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   + +
Sbjct: 44  KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 103

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 104 VSRINMRIQDLT 115


>gi|397490069|ref|XP_003816032.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Pan paniscus]
          Length = 488

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 40/72 (55%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   + +
Sbjct: 288 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 347

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 348 VSRINMRIQDLT 359


>gi|21107513|gb|AAM36222.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
           str. 306]
          Length = 273

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 34/62 (54%), Gaps = 4/62 (6%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR   Y   LS +EC  L+ LA PH+R S V+D +   ++ + +RTS G  L    D II
Sbjct: 83  PRIEEYAAVLSADECRLLMLLARPHLRASKVIDPNDASTQRAPIRTSRGATL----DPII 138

Query: 144 RD 145
            D
Sbjct: 139 ED 140


>gi|291404186|ref|XP_002718473.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 3
           [Oryctolagus cuniculus]
          Length = 516

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 41/74 (55%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   +
Sbjct: 332 WDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 392 PVVSRINMRIQDLT 405


>gi|170064960|ref|XP_001867743.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
 gi|167882146|gb|EDS45529.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
          Length = 545

 Score = 47.0 bits (110), Expect = 0.003,   Method: Composition-based stats.
 Identities = 22/72 (30%), Positives = 37/72 (51%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           EP   +YH  +S  E E +  LA P  R++TV +  TG+ + +  R S   +L      +
Sbjct: 341 EPYIVIYHEVMSDAEIEVIKRLAKPRFRRATVQNYKTGELEVANYRISKSAWLKDEEHSV 400

Query: 143 IRDIEKRIADFT 154
           +R + +R+ D T
Sbjct: 401 VRTVGQRVEDMT 412


>gi|344274276|ref|XP_003408943.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3
           [Loxodonta africana]
          Length = 516

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 41/74 (55%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   +
Sbjct: 332 WDKPRIVRFHDIISDAEIEVVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEN 391

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 392 PVVSRINMRIQDLT 405


>gi|205374182|ref|ZP_03226981.1| prolyl 4-hydroxylase alpha subunit [Bacillus coahuilensis m4-4]
          Length = 210

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/84 (30%), Positives = 45/84 (53%), Gaps = 9/84 (10%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           EP   V  N LS EEC+ LI+L+   M +S +       ++++ +RTS+  FL     ++
Sbjct: 32  EPFVAVLGNVLSDEECDELISLSKDRMNRSKI-----AGNQENDIRTSTSVFLPEDASEV 86

Query: 143 IRDIEKRIADFTFFPL----GMQF 162
           ++ +EKRI+     P+    G+Q 
Sbjct: 87  VQRVEKRISQIMNIPVEHGEGLQL 110


>gi|357517889|ref|XP_003629233.1| hypothetical protein MTR_8g074850 [Medicago truncatula]
 gi|355523255|gb|AET03709.1| hypothetical protein MTR_8g074850 [Medicago truncatula]
          Length = 60

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 20/43 (46%), Positives = 33/43 (76%), Gaps = 4/43 (9%)

Query: 51 KANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFL 93
          K  +++ ++ K+  +SE D+ +  +WVE++SWEPRAF+YHNFL
Sbjct: 6  KGENVTVVLLKN--KSEDDDNK--RWVEIVSWEPRAFLYHNFL 44


>gi|339327280|ref|YP_004686973.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
 gi|338167437|gb|AEI78492.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
          Length = 297

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 36/74 (48%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P+  ++   L+ +EC+ L+ L+   + +S VV+ DTG       RTS G         +I
Sbjct: 105 PQVQLFQQLLTDDECDALVALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQVAEHALI 164

Query: 144 RDIEKRIADFTFFP 157
             IE RIA  T  P
Sbjct: 165 ARIEARIAAVTGVP 178


>gi|113869198|ref|YP_727687.1| prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
 gi|113527974|emb|CAJ94319.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
          Length = 297

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 36/74 (48%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P+  ++   L+ +EC+ L+ L+   + +S VV+ DTG       RTS G         +I
Sbjct: 105 PQVQLFQQLLTDDECDALVALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQVAEHPLI 164

Query: 144 RDIEKRIADFTFFP 157
             IE RIA  T  P
Sbjct: 165 TRIEARIAAVTGVP 178


>gi|291224083|ref|XP_002732036.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit-like [Saccoglossus
           kowalevskii]
          Length = 491

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 42/81 (51%), Gaps = 4/81 (4%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV----DSDTGKSKDSRVRTSSGT 133
           E++   PR  +YH+ +S EE   +I LA P +R+S VV          S D+  R SSG 
Sbjct: 296 EIMHHNPRVVLYHDVISDEEINEVIKLAKPKLRRSLVVTKGSSPSGTGSSDAEYRVSSGG 355

Query: 134 FLARGRDKIIRDIEKRIADFT 154
           +L      +I  + +RI+D +
Sbjct: 356 WLEDWDGTVIAKLTRRISDIS 376


>gi|224006261|ref|XP_002292091.1| hypothetical protein THAPSDRAFT_263436 [Thalassiosira pseudonana
           CCMP1335]
 gi|220972610|gb|EED90942.1| hypothetical protein THAPSDRAFT_263436 [Thalassiosira pseudonana
           CCMP1335]
          Length = 232

 Score = 47.0 bits (110), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 47/81 (58%), Gaps = 5/81 (6%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPH-----MRKSTVVDSDTGKSKDSRVRTSSG 132
           +V+S  PR      FLS  E ++LI+LA+       M++STV+ S+   +  +  R+SSG
Sbjct: 1   KVLSCAPRVLEVKKFLSPVEVQHLIDLASGAKGDVAMQRSTVLASNIRGATKTDTRSSSG 60

Query: 133 TFLARGRDKIIRDIEKRIADF 153
            ++ R +D I+  I +RIAD 
Sbjct: 61  GWIHREQDVIVDTIFRRIADL 81


>gi|24651407|ref|NP_733371.1| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
 gi|20269806|gb|AAM18058.1|AF495536_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]EFB
           [Drosophila melanogaster]
 gi|15292529|gb|AAK93533.1| SD05564p [Drosophila melanogaster]
 gi|23172692|gb|AAF57053.2| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
 gi|220946562|gb|ACL85824.1| PH4alphaEFB-PA [synthetic construct]
          Length = 550

 Score = 46.6 bits (109), Expect = 0.003,   Method: Composition-based stats.
 Identities = 22/78 (28%), Positives = 41/78 (52%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E +  +P   +YH+ +   E + +  +A P  R++TV +S TG  + +  R S   +L 
Sbjct: 339 LEEVHADPYIVIYHDAMYDSEIDLIKRMARPRFRRATVQNSVTGALETANYRISKSAWLK 398

Query: 137 RGRDKIIRDIEKRIADFT 154
              D++I  + +R AD T
Sbjct: 399 TQEDRVIETVVQRTADMT 416


>gi|402880501|ref|XP_003903839.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like, partial
           [Papio anubis]
          Length = 379

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 41/74 (55%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   +
Sbjct: 177 WDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEN 236

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 237 PVVSRINMRIQDLT 250


>gi|384046522|ref|YP_005494539.1| prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
 gi|345444213|gb|AEN89230.1| Prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
          Length = 219

 Score = 46.6 bits (109), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 40/70 (57%), Gaps = 6/70 (8%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
           EP   V  N LS EEC+ LI L+   M++S +     G  ++ + +RTSSG F     ++
Sbjct: 38  EPLVLVLGNVLSNEECDELIQLSKDKMQRSKI-----GAEREVNSIRTSSGMFFEESENE 92

Query: 142 IIRDIEKRIA 151
           ++  IE+R++
Sbjct: 93  LVHQIERRLS 102


>gi|195575089|ref|XP_002105512.1| GD21521 [Drosophila simulans]
 gi|194201439|gb|EDX15015.1| GD21521 [Drosophila simulans]
          Length = 550

 Score = 46.6 bits (109), Expect = 0.003,   Method: Composition-based stats.
 Identities = 22/78 (28%), Positives = 41/78 (52%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E +  +P   +YH+ +   E + +  +A P  R++TV +S TG  + +  R S   +L 
Sbjct: 339 LEEVHADPYIVIYHDAMYDSEIDLIKRMARPRFRRATVQNSVTGALETANYRISKSAWLK 398

Query: 137 RGRDKIIRDIEKRIADFT 154
              D++I  + +R AD T
Sbjct: 399 TQEDRVIETVVQRTADMT 416


>gi|195341536|ref|XP_002037362.1| GM12882 [Drosophila sechellia]
 gi|194131478|gb|EDW53521.1| GM12882 [Drosophila sechellia]
          Length = 550

 Score = 46.6 bits (109), Expect = 0.003,   Method: Composition-based stats.
 Identities = 22/78 (28%), Positives = 41/78 (52%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E +  +P   +YH+ +   E + +  +A P  R++TV +S TG  + +  R S   +L 
Sbjct: 339 LEEVHADPYIVIYHDAMYDSEIDLIKRMARPRFRRATVQNSVTGALETANYRISKSAWLK 398

Query: 137 RGRDKIIRDIEKRIADFT 154
              D++I  + +R AD T
Sbjct: 399 TQEDRVIETVVQRTADMT 416


>gi|91091610|ref|XP_969386.1| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
           putative [Tribolium castaneum]
 gi|270001037|gb|EEZ97484.1| hypothetical protein TcasGA2_TC011321 [Tribolium castaneum]
          Length = 536

 Score = 46.6 bits (109), Expect = 0.004,   Method: Composition-based stats.
 Identities = 22/78 (28%), Positives = 42/78 (53%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE     P  F++ + L+  E   +  +A P  +++TV ++DTG+ + ++ R S   +L 
Sbjct: 325 VEEAHHRPDIFIFRDVLADSEIATIKRMAQPRFKRATVQNTDTGELEIAQYRISKSAWLK 384

Query: 137 RGRDKIIRDIEKRIADFT 154
               K I D+ +R++D T
Sbjct: 385 EEEHKHIADVSQRVSDMT 402


>gi|390989336|ref|ZP_10259634.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
           pv. punicae str. LMG 859]
 gi|372555840|emb|CCF66609.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
           pv. punicae str. LMG 859]
          Length = 228

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 34/62 (54%), Gaps = 4/62 (6%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR   Y   LS +EC  L+ LA PH+R S V+D +   ++ + +RTS G  L    D II
Sbjct: 38  PRIEEYAAVLSADECRLLMLLARPHLRASKVIDPNDASTQRAPIRTSRGATL----DPII 93

Query: 144 RD 145
            D
Sbjct: 94  ED 95


>gi|423483822|ref|ZP_17460512.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
 gi|401141373|gb|EJQ48928.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
          Length = 216

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 52/98 (53%), Gaps = 10/98 (10%)

Query: 77  VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
           +++IS   EP   V  N +S EEC+ LI ++   +++ST+     G S+D + +RTSSG 
Sbjct: 30  IQIISKFEEPLIVVLGNVISDEECDELIEMSKNKIKRSTI-----GSSRDVNDIRTSSGA 84

Query: 134 FLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYV 171
           FL    +++   IEKRI+     P+       +  + V
Sbjct: 85  FLE--ENELTSKIEKRISSIMNVPVAHGEGLHILNYEV 120


>gi|242018356|ref|XP_002429643.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
           humanus corporis]
 gi|212514628|gb|EEB16905.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
           humanus corporis]
          Length = 534

 Score = 46.6 bits (109), Expect = 0.004,   Method: Composition-based stats.
 Identities = 22/72 (30%), Positives = 39/72 (54%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR  +YH+ LS  E + +  LA P  +++TV +S+TGK + +  R S   +L       
Sbjct: 333 DPRIVLYHDVLSDREIKTIQQLAVPRFKRATVQNSETGKLEVAHYRISKSAWLEDVDHPY 392

Query: 143 IRDIEKRIADFT 154
           +  + +R+ D T
Sbjct: 393 VAKVSQRVEDIT 404


>gi|423615424|ref|ZP_17591258.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
 gi|401259961|gb|EJR66134.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
          Length = 216

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 31/98 (31%), Positives = 52/98 (53%), Gaps = 10/98 (10%)

Query: 77  VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
           +++IS   EP   V  N +S EEC+ LI ++   +++ST+     G S+D + +RTSSG 
Sbjct: 30  IQIISKFEEPLIVVLGNVISDEECDELIEMSKNKIKRSTI-----GSSRDVNDIRTSSGA 84

Query: 134 FLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYV 171
           FL    +++   IEKRI+     P+       +  + V
Sbjct: 85  FLE--ENELTSKIEKRISSIMNVPVAHGEGLHILNYEV 120


>gi|423448819|ref|ZP_17425698.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
 gi|401129413|gb|EJQ37096.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
          Length = 216

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 49/85 (57%), Gaps = 10/85 (11%)

Query: 77  VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
           +++IS   EP   V  N +S EEC+ LI ++   +++ST+     G S+D + +RTSSG 
Sbjct: 30  IQIISKFEEPLIVVLGNVISDEECDELIEMSKNKIKRSTI-----GSSRDVNDIRTSSGA 84

Query: 134 FLARGRDKIIRDIEKRIADFTFFPL 158
           FL    +++   IEKRI+     P+
Sbjct: 85  FLE--ENELTSKIEKRISSIMNVPV 107


>gi|198449518|ref|XP_002136915.1| GA26872, isoform A [Drosophila pseudoobscura pseudoobscura]
 gi|198130643|gb|EDY67473.1| GA26872, isoform A [Drosophila pseudoobscura pseudoobscura]
          Length = 543

 Score = 46.6 bits (109), Expect = 0.004,   Method: Composition-based stats.
 Identities = 27/83 (32%), Positives = 44/83 (53%), Gaps = 5/83 (6%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E +S +P   +YHN LS EE   L N++TP + ++ V DS   K K S  RT+    + 
Sbjct: 327 MEELSLDPYIVLYHNVLSDEEMARLENMSTPLLHRARVFDSGIRKPKISPARTADEVQIP 386

Query: 137 RGR-----DKIIRDIEKRIADFT 154
             +      +++  I+KR+ D T
Sbjct: 387 NPKLVAEDIQLVERIQKRMTDLT 409


>gi|195390833|ref|XP_002054072.1| GJ22994 [Drosophila virilis]
 gi|194152158|gb|EDW67592.1| GJ22994 [Drosophila virilis]
          Length = 496

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 41/75 (54%), Gaps = 2/75 (2%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P   V+H+ LS  E + L  LA P + ++TVV       KDSR RTS GT++ R  + + 
Sbjct: 303 PFIIVFHDVLSPREIDELQKLARPLLERTTVVKFKK-YEKDSR-RTSKGTWIERDHNNLT 360

Query: 144 RDIEKRIADFTFFPL 158
           + IE+RI D     L
Sbjct: 361 KRIERRITDMVELDL 375


>gi|349604936|gb|AEQ00344.1| Prolyl 4-hydroxylase subunit alpha-1-like protein, partial [Equus
           caballus]
          Length = 302

 Score = 46.6 bits (109), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 40/72 (55%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S   +L+   + +
Sbjct: 102 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 161

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 162 VSRINMRIQDLT 173


>gi|260825357|ref|XP_002607633.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
 gi|229292981|gb|EEN63643.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
          Length = 520

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 22/71 (30%), Positives = 38/71 (53%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P+ +V HN L+  E E +  LA P +R++ V    TG+ + +  R S   +L     ++I
Sbjct: 322 PKLWVLHNILTDPEMEVIKKLAQPRLRRARVESPTTGEGELASYRISKSAWLYDWEHRVI 381

Query: 144 RDIEKRIADFT 154
           R + +R+ D T
Sbjct: 382 RRVNQRVEDVT 392


>gi|423541303|ref|ZP_17517694.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
 gi|401172491|gb|EJQ79712.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
          Length = 216

 Score = 46.2 bits (108), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 49/85 (57%), Gaps = 10/85 (11%)

Query: 77  VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
           +++IS   EP   V  N +S EEC+ LI ++   +++ST+     G S+D + +RTSSG 
Sbjct: 30  IQIISKFEEPLIVVLGNVISDEECDELIEMSKNKIKRSTI-----GSSRDVNDIRTSSGA 84

Query: 134 FLARGRDKIIRDIEKRIADFTFFPL 158
           FL    +++   IEKRI+     P+
Sbjct: 85  FLE--ENELTSKIEKRISSIMNVPV 107


>gi|295704991|ref|YP_003598066.1| 2OG-Fe(II) oxygenase [Bacillus megaterium DSM 319]
 gi|294802650|gb|ADF39716.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium DSM 319]
          Length = 219

 Score = 46.2 bits (108), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 41/70 (58%), Gaps = 6/70 (8%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
           EP   V  N LS EEC+ LI L+   M++S +     G +++ + +RTSSG F     ++
Sbjct: 38  EPLVLVLGNVLSNEECDELIRLSKDKMQRSKI-----GAAREVNSIRTSSGMFFDESENE 92

Query: 142 IIRDIEKRIA 151
           ++  IE+R++
Sbjct: 93  LVHQIERRLS 102


>gi|350014318|dbj|GAA37183.1| prolyl 4-hydroxylase [Clonorchis sinensis]
          Length = 595

 Score = 46.2 bits (108), Expect = 0.005,   Method: Composition-based stats.
 Identities = 23/77 (29%), Positives = 43/77 (55%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV+  +PR  ++++ +   E   +  LA P +R++TV +  TGK +++  RTS   +L  
Sbjct: 384 EVLYPDPRIVMWYDVIHPSEVGRIQELALPRLRRATVKNPVTGKLENAYYRTSKSAWLQD 443

Query: 138 GRDKIIRDIEKRIADFT 154
           G D++   + +RI   T
Sbjct: 444 GLDEVTHRLNQRIHALT 460


>gi|195061074|ref|XP_001995919.1| GH14105 [Drosophila grimshawi]
 gi|193891711|gb|EDV90577.1| GH14105 [Drosophila grimshawi]
          Length = 513

 Score = 46.2 bits (108), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 25/82 (30%), Positives = 44/82 (53%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E++  +P   +YH+ +S  E E L  LA P ++++ VVD  T ++   + RTS  T+L 
Sbjct: 314 MELLQLDPYMVLYHDAISPREIEDLQFLAMPRLKRAKVVDQVTHRNMMVKERTSKVTWLG 373

Query: 137 RGRDKIIRDIEKRIADFTFFPL 158
              +     + KRI D + F +
Sbjct: 374 DATNAFTMRLNKRIEDMSGFTM 395


>gi|195055767|ref|XP_001994784.1| GH14132 [Drosophila grimshawi]
 gi|193892547|gb|EDV91413.1| GH14132 [Drosophila grimshawi]
          Length = 537

 Score = 46.2 bits (108), Expect = 0.005,   Method: Composition-based stats.
 Identities = 29/86 (33%), Positives = 47/86 (54%), Gaps = 4/86 (4%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE ++  P   +YH+ + + E + L  L      ++ VV + T  S  S+ RTS   F+A
Sbjct: 328 VEELNRNPLLVLYHDVIYQSEIDVLNKLNRKRYERAGVVINST--STVSKKRTSQHIFIA 385

Query: 137 RGRDKIIRDIEKRIADFTFFPLGMQF 162
             R K++R I++R+AD T   L MQ+
Sbjct: 386 ATRHKVLRTIDQRVADMT--NLNMQY 409


>gi|354483225|ref|XP_003503795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
           [Cricetulus griseus]
          Length = 534

 Score = 46.2 bits (108), Expect = 0.005,   Method: Composition-based stats.
 Identities = 22/72 (30%), Positives = 38/72 (52%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H+ +S  E E + +LA P +R++T+ +  TG  +    R S   +L+   D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGNLETVHYRISKSAWLSGYEDPV 393

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 394 VSRINMRIQDLT 405


>gi|195159142|ref|XP_002020441.1| GL13994 [Drosophila persimilis]
 gi|194117210|gb|EDW39253.1| GL13994 [Drosophila persimilis]
          Length = 493

 Score = 46.2 bits (108), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 23/82 (28%), Positives = 46/82 (56%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E++  +P   +YH+ +S  E   L ++ATP ++++TV  +   +S+  + RTS   +  
Sbjct: 288 MELVGLDPYMVLYHDVISALEISQLQDMATPGLKRATVYKASGRRSEVVKTRTSKVAWFP 347

Query: 137 RGRDKIIRDIEKRIADFTFFPL 158
              +++   + +RIAD T F L
Sbjct: 348 DTFNELTERLNRRIADMTNFDL 369


>gi|390176836|ref|XP_003736216.1| GA26872, isoform B [Drosophila pseudoobscura pseudoobscura]
 gi|388858809|gb|EIM52289.1| GA26872, isoform B [Drosophila pseudoobscura pseudoobscura]
          Length = 567

 Score = 46.2 bits (108), Expect = 0.005,   Method: Composition-based stats.
 Identities = 27/83 (32%), Positives = 44/83 (53%), Gaps = 5/83 (6%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E +S +P   +YHN LS EE   L N++TP + ++ V DS   K K S  RT+    + 
Sbjct: 351 MEELSLDPYIVLYHNVLSDEEMARLENMSTPLLHRARVFDSGIRKPKISPARTADEVQIP 410

Query: 137 RGR-----DKIIRDIEKRIADFT 154
             +      +++  I+KR+ D T
Sbjct: 411 NPKLVAEDIQLVERIQKRMTDLT 433


>gi|149038788|gb|EDL93077.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_b
           [Rattus norvegicus]
          Length = 534

 Score = 46.2 bits (108), Expect = 0.005,   Method: Composition-based stats.
 Identities = 23/74 (31%), Positives = 39/74 (52%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P +R++T+ +  TG  +    R S   +L+   D
Sbjct: 332 WDKPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVHYRISKSAWLSGYED 391

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 392 PVVSRINMRIQDLT 405


>gi|33859596|ref|NP_035160.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Mus musculus]
 gi|20455506|sp|Q60715.2|P4HA1_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|16307134|gb|AAH09654.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide [Mus musculus]
 gi|74144306|dbj|BAE36020.1| unnamed protein product [Mus musculus]
 gi|74146660|dbj|BAE41331.1| unnamed protein product [Mus musculus]
 gi|148700260|gb|EDL32207.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_a [Mus
           musculus]
          Length = 534

 Score = 46.2 bits (108), Expect = 0.005,   Method: Composition-based stats.
 Identities = 22/72 (30%), Positives = 38/72 (52%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H+ +S  E E + +LA P +R++T+ +  TG  +    R S   +L+   D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVHYRISKSAWLSGYEDPV 393

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 394 VSRINMRIQDLT 405


>gi|74224984|dbj|BAE38205.1| unnamed protein product [Mus musculus]
          Length = 534

 Score = 45.8 bits (107), Expect = 0.006,   Method: Composition-based stats.
 Identities = 22/72 (30%), Positives = 38/72 (52%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H+ +S  E E + +LA P +R++T+ +  TG  +    R S   +L+   D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVHYRISKSAWLSGYEDPV 393

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 394 VSRINMRIQDLT 405


>gi|74225936|dbj|BAE28745.1| unnamed protein product [Mus musculus]
          Length = 561

 Score = 45.8 bits (107), Expect = 0.006,   Method: Composition-based stats.
 Identities = 22/72 (30%), Positives = 38/72 (52%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H+ +S  E E + +LA P +R++T+ +  TG  +    R S   +L+   D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVHYRISKSAWLSGYEDPV 393

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 394 VSRINMRIQDLT 405


>gi|3169183|gb|AAC17826.1| hypothetical protein [Arabidopsis thaliana]
          Length = 1036

 Score = 45.8 bits (107), Expect = 0.006,   Method: Composition-based stats.
 Identities = 27/95 (28%), Positives = 47/95 (49%), Gaps = 1/95 (1%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           +SW PR F   NF +K++CE +I++A P ++ ST+      K    + R S         
Sbjct: 800 LSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSTLALRKETKHFQMQYR-SLHQHTDEDE 858

Query: 140 DKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
             ++  IE++IA  T FP      F++  + + +K
Sbjct: 859 SGVLAAIEEKIALATRFPKDYYESFNILRYQLGQK 893


>gi|321474953|gb|EFX85917.1| hypothetical protein DAPPUDRAFT_309108 [Daphnia pulex]
          Length = 549

 Score = 45.8 bits (107), Expect = 0.006,   Method: Composition-based stats.
 Identities = 21/72 (29%), Positives = 38/72 (52%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +P   +YH+ +  EE E +  LA P  +++TV++S TGK + ++ R S   FL       
Sbjct: 346 KPLLVIYHDVIFDEEIETVKKLAHPRFKRTTVMNSATGKLETAKYRISKAAFLKNKEHHH 405

Query: 143 IRDIEKRIADFT 154
           +  + +R+   T
Sbjct: 406 VLKMSRRVGAIT 417


>gi|85857698|gb|ABC86384.1| IP10964p [Drosophila melanogaster]
          Length = 534

 Score = 45.8 bits (107), Expect = 0.006,   Method: Composition-based stats.
 Identities = 27/82 (32%), Positives = 42/82 (51%), Gaps = 1/82 (1%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           E I  +P   +YH  LS  E   LI  A  +M K+T +  +    K +R RT+ G +L +
Sbjct: 320 EQIGLDPYVVLYHEVLSAREISMLIGKAAQNM-KNTKIHKERAVPKKNRGRTAKGFWLKK 378

Query: 138 GRDKIIRDIEKRIADFTFFPLG 159
             +++ + I +RI D T F L 
Sbjct: 379 ESNELTKRITRRIMDMTGFDLA 400


>gi|89096248|ref|ZP_01169141.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
           NRRL B-14911]
 gi|89089102|gb|EAR68210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
           NRRL B-14911]
          Length = 217

 Score = 45.8 bits (107), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 23/89 (25%), Positives = 45/89 (50%), Gaps = 4/89 (4%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           EP   +  N LS EECE LI ++   +++S + ++ T       +RTSS  F   G +++
Sbjct: 38  EPLIVILGNVLSDEECEGLIRMSEDKLKRSKIGNTRTVDD----IRTSSSMFFEEGENEL 93

Query: 143 IRDIEKRIADFTFFPLGMQFKFSLFFFYV 171
           +  IE+R++     P+       +  +++
Sbjct: 94  VARIERRLSQIMNIPVEHGEGLQMLNYHI 122


>gi|303273602|ref|XP_003056161.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226462245|gb|EEH59537.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 750

 Score = 45.8 bits (107), Expect = 0.007,   Method: Composition-based stats.
 Identities = 26/65 (40%), Positives = 39/65 (60%), Gaps = 7/65 (10%)

Query: 88  VYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA--RGRDKIIRD 145
           V+ +FLS  EC+ L+ +A P +R+S V D      K S  RTSS TFL   +  + ++R 
Sbjct: 533 VFDHFLSAVECDDLVAIAAPDLRRSRVTD-----GKLSEGRTSSSTFLTGCKQEEPLVRA 587

Query: 146 IEKRI 150
           IE+R+
Sbjct: 588 IEQRL 592


>gi|221460681|ref|NP_733394.3| CG31013 [Drosophila melanogaster]
 gi|220903261|gb|AAF57073.4| CG31013 [Drosophila melanogaster]
          Length = 534

 Score = 45.8 bits (107), Expect = 0.007,   Method: Composition-based stats.
 Identities = 27/82 (32%), Positives = 42/82 (51%), Gaps = 1/82 (1%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           E I  +P   +YH  LS  E   LI  A  +M K+T +  +    K +R RT+ G +L +
Sbjct: 320 EQIGLDPYVVLYHEVLSAREISMLIGKAAQNM-KNTKIHKERAVPKKNRGRTAKGFWLKK 378

Query: 138 GRDKIIRDIEKRIADFTFFPLG 159
             +++ + I +RI D T F L 
Sbjct: 379 ESNELTKRITRRIMDMTGFDLA 400


>gi|292619367|ref|XP_001922562.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Danio rerio]
          Length = 541

 Score = 45.4 bits (106), Expect = 0.007,   Method: Composition-based stats.
 Identities = 22/74 (29%), Positives = 40/74 (54%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   YH  ++++E E +  L+ P +R++T+ +  TG  + +  R S   +LA    
Sbjct: 339 WDRPRIIRYHEIITEQEIEKIKELSKPRLRRATISNPITGVLETAHYRISKSAWLAAYEH 398

Query: 141 KIIRDIEKRIADFT 154
            ++  I +RI D T
Sbjct: 399 PVVDRINQRIEDIT 412


>gi|334314087|ref|XP_003339988.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
           [Monodelphis domestica]
          Length = 537

 Score = 45.4 bits (106), Expect = 0.007,   Method: Composition-based stats.
 Identities = 22/72 (30%), Positives = 38/72 (52%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H  +S  E E + +LA P +R++T+ +  TG  + +  R S   +L+   D +
Sbjct: 337 KPRIVRFHEIISDAEIEIVKDLAKPRLRRATISNPITGVLETAHYRISKSAWLSGYEDPV 396

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 397 VSRINMRIQDLT 408


>gi|412994121|emb|CCO14632.1| predicted protein [Bathycoccus prasinos]
          Length = 341

 Score = 45.4 bits (106), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 29/111 (26%), Positives = 59/111 (53%), Gaps = 5/111 (4%)

Query: 68  GDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDS-- 125
           GDE   E   +++S  PR+ +Y NF S  +C+ ++  A   + KS +     G++ ++  
Sbjct: 119 GDEYLTELKFQLLSTAPRSVMYRNFASDADCDAIVEAARSRLHKSGLA-LKRGETLETTK 177

Query: 126 RVRTSSGTFLARGRDK--IIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
            +RTSSGTFL    ++   ++ +E+++A  T  P      +++  + + +K
Sbjct: 178 NIRTSSGTFLTSKMEQSGALKRVEEKMARATHIPATHGEAYNILRYEIGQK 228


>gi|224009604|ref|XP_002293760.1| prolyl 4-hydroxylase alpha subunit [Thalassiosira pseudonana
           CCMP1335]
 gi|220970432|gb|EED88769.1| prolyl 4-hydroxylase alpha subunit [Thalassiosira pseudonana
           CCMP1335]
          Length = 206

 Score = 45.4 bits (106), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 31/85 (36%), Positives = 46/85 (54%), Gaps = 5/85 (5%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATP-HMRKSTVVDSD--TGKSKDS--RVRTSS 131
           ++V+S  PRAF   NFLS+ E ++++ L T   + +ST   SD  T   +DS    RTS 
Sbjct: 3   LKVLSCAPRAFEIENFLSQTEVDHIMYLTTGMKLHRSTTAGSDQITADERDSTRNTRTSL 62

Query: 132 GTFLARGRDKIIRDIEKRIADFTFF 156
            T++ R +  II  I +R AD    
Sbjct: 63  NTWVYREKSAIIDTIYRRAADLQLM 87


>gi|329913962|ref|ZP_08276011.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
           IMCC9480]
 gi|327545257|gb|EGF30515.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
           IMCC9480]
          Length = 280

 Score = 45.4 bits (106), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 41/75 (54%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  V  N LS +EC+ +  ++     +ST +D+ +G ++    RTS    + RG  ++I
Sbjct: 92  PRIVVLGNVLSDDECDAIAAMSRTRFARSTTIDNASGINRFDDSRTSESAHIQRGETELI 151

Query: 144 RDIEKRIADFTFFPL 158
             I+ R+A  + +P+
Sbjct: 152 ARIDARLAALSGWPV 166


>gi|253575459|ref|ZP_04852796.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
 gi|251845106|gb|EES73117.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
          Length = 215

 Score = 45.4 bits (106), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 38/80 (47%), Gaps = 5/80 (6%)

Query: 79  VISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
           V+  EP    +   L+ +EC  LI  A P +R+S +V+        S +RTS G F    
Sbjct: 25  VLHKEPLIMRFERLLTDDECRQLIEAAAPRLRESKLVNKVV-----SEIRTSRGMFFEEE 79

Query: 139 RDKIIRDIEKRIADFTFFPL 158
            +  I  IEKRI+     P+
Sbjct: 80  ENPFIHRIEKRISALMNVPI 99


>gi|430751569|ref|YP_007214477.1| 2OG-Fe(II) oxygenase [Thermobacillus composti KWC4]
 gi|430735534|gb|AGA59479.1| 2OG-Fe(II) oxygenase superfamily enzyme [Thermobacillus composti
           KWC4]
          Length = 215

 Score = 45.4 bits (106), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 29/90 (32%), Positives = 41/90 (45%), Gaps = 7/90 (7%)

Query: 69  DEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVR 128
           D+G  E    V+  EP    +   LS +EC  LI  A P +++S +V+        S +R
Sbjct: 17  DDGVVE--ATVLHQEPLIVRFERLLSDDECRQLIETAAPRLKESKLVNKVV-----SDIR 69

Query: 129 TSSGTFLARGRDKIIRDIEKRIADFTFFPL 158
           TS G F        I  IE+RIA     P+
Sbjct: 70  TSRGMFFEEEESPFIHRIERRIAQLMNVPI 99


>gi|30681957|ref|NP_850038.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|330252315|gb|AEC07409.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 274

 Score = 45.4 bits (106), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 41/78 (52%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           +SW PR F   NF +K++CE +I++A P ++ ST+       ++ ++   S         
Sbjct: 71  LSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSTLALRKGETAETTQNYRSLHQHTDEDE 130

Query: 140 DKIIRDIEKRIADFTFFP 157
             ++  IE++IA  T FP
Sbjct: 131 SGVLAAIEEKIALATRFP 148


>gi|424863736|ref|ZP_18287648.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
           SAR86A]
 gi|400757057|gb|EJP71269.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
           SAR86A]
          Length = 205

 Score = 45.4 bits (106), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 24/93 (25%), Positives = 46/93 (49%), Gaps = 2/93 (2%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V + S +P  +V +NFLS +ECE  + +    M ++ V+  D  +S+    RT+   +L 
Sbjct: 10  VTLYSADPIVYVVNNFLSDDECEAFVEMGKGKMERAKVISDD--ESEFHASRTNDFCWLE 67

Query: 137 RGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFF 169
                +I ++ KR +     P+    +F L ++
Sbjct: 68  HSASDVIHEVSKRFSVLVKMPINNAEQFQLVYY 100


>gi|170064956|ref|XP_001867741.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
 gi|167882144|gb|EDS45527.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
          Length = 520

 Score = 45.4 bits (106), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 39/78 (50%), Gaps = 2/78 (2%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +EV++ EP   VYH  +S  E   LI LA P +++S V   DT   + S++R S   +  
Sbjct: 317 LEVVNLEPLIVVYHEAVSDREIAKLIELARPLIKRSAV--GDTRSEQISKIRISQNAWFE 374

Query: 137 RGRDKIIRDIEKRIADFT 154
              D I+  + +R  D  
Sbjct: 375 NEHDPIVETLNQRARDMA 392


>gi|428183249|gb|EKX52107.1| hypothetical protein GUITHDRAFT_150687 [Guillardia theta CCMP2712]
          Length = 315

 Score = 45.4 bits (106), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 27/86 (31%), Positives = 45/86 (52%), Gaps = 3/86 (3%)

Query: 72  RAEQWVEVIS--WEPRAFVYHNFLSKEECEYLINLAT-PHMRKSTVVDSDTGKSKDSRVR 128
           + E W+E IS    PR +V HN L+KEECE L +L     M K+ ++     +  +S  R
Sbjct: 74  KNEFWIETISDLPGPRIYVLHNILTKEECESLKSLGVMAGMEKALIIPYGGKELVESSTR 133

Query: 129 TSSGTFLARGRDKIIRDIEKRIADFT 154
           T++  +L   +  ++  +E  +A  T
Sbjct: 134 TNTAAWLEYHQGPVVTKLENLLAKVT 159


>gi|321474876|gb|EFX85840.1| hypothetical protein DAPPUDRAFT_309107 [Daphnia pulex]
          Length = 528

 Score = 45.1 bits (105), Expect = 0.010,   Method: Composition-based stats.
 Identities = 20/72 (27%), Positives = 35/72 (48%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +P   +YH  +   E + +  LA P  +++ V D DTG+S   + R +   FL      +
Sbjct: 328 KPLLVIYHGVIFDAEIDVVKKLAQPRFKRTGVTDRDTGRSMPVQYRIAKAAFLKDSEHNL 387

Query: 143 IRDIEKRIADFT 154
           I  + +R+ D T
Sbjct: 388 IVKMSRRVGDIT 399


>gi|229104864|ref|ZP_04235524.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
 gi|228678581|gb|EEL32798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
          Length = 216

 Score = 45.1 bits (105), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 48/85 (56%), Gaps = 10/85 (11%)

Query: 77  VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
           +++IS   EP   V  N +S EEC  LI ++   +++ST+     G S+D + +RTSSG 
Sbjct: 30  IQIISKFEEPLIVVLGNVISDEECGELIEMSKNKIKRSTI-----GSSRDVNDIRTSSGA 84

Query: 134 FLARGRDKIIRDIEKRIADFTFFPL 158
           FL    +++   IEKRI+     P+
Sbjct: 85  FLE--ENELTSKIEKRISSIMNVPV 107


>gi|239816557|ref|YP_002945467.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
 gi|239803134|gb|ACS20201.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
          Length = 296

 Score = 45.1 bits (105), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 23/71 (32%), Positives = 38/71 (53%)

Query: 88  VYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIE 147
           V  +  S EECE LI LA P +  ST VD  TG+++    R+S G F     +  +  ++
Sbjct: 103 VLSDVFSAEECEALIALARPRLAPSTSVDPLTGRNRLGAQRSSLGMFFRLRENAFVARLD 162

Query: 148 KRIADFTFFPL 158
           +R+++    P+
Sbjct: 163 ERLSELMNLPV 173


>gi|312383453|gb|EFR28539.1| hypothetical protein AND_03427 [Anopheles darlingi]
          Length = 341

 Score = 45.1 bits (105), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 21/72 (29%), Positives = 39/72 (54%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +P   +YH+ +S  E E + + A P  R++TV +  TG+ + +  R S   +L     ++
Sbjct: 113 QPDIVIYHDVMSDREIELIKHYARPRFRRATVQNYKTGELEFANYRISKSAWLKDTEHEV 172

Query: 143 IRDIEKRIADFT 154
           IR + +R+ D T
Sbjct: 173 IRTVNQRVEDMT 184


>gi|229098707|ref|ZP_04229647.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
 gi|423441025|ref|ZP_17417931.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
 gi|423533441|ref|ZP_17509859.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
 gi|228684786|gb|EEL38724.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
 gi|402417686|gb|EJV49986.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
 gi|402463660|gb|EJV95360.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
          Length = 216

 Score = 44.7 bits (104), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 48/85 (56%), Gaps = 10/85 (11%)

Query: 77  VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
           +++IS   EP   V  N +S EEC  LI ++   +++ST+     G ++D + +RTSSG 
Sbjct: 30  IQIISKFEEPLIVVLGNVISDEECNELIEMSKNKIKRSTI-----GSARDVNDIRTSSGA 84

Query: 134 FLARGRDKIIRDIEKRIADFTFFPL 158
           FL    +++   IEKRI+     P+
Sbjct: 85  FLE--ENELTSKIEKRISSIMNVPV 107


>gi|428183403|gb|EKX52261.1| hypothetical protein GUITHDRAFT_84780 [Guillardia theta CCMP2712]
          Length = 199

 Score = 44.7 bits (104), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 28/73 (38%), Positives = 37/73 (50%), Gaps = 4/73 (5%)

Query: 74  EQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRK----STVVDSDTGKSKDSRVRT 129
           E W+E I   PR FV HN LSKEECE L +L      K    S V+  D  K + + +  
Sbjct: 32  EFWIEKIMESPRIFVLHNLLSKEECENLRDLGIARGMKRNAQSPVLGDDPRKHEVATLDF 91

Query: 130 SSGTFLARGRDKI 142
           +   F+ R  DK+
Sbjct: 92  NENDFVRRLEDKL 104


>gi|229075940|ref|ZP_04208916.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
 gi|229117732|ref|ZP_04247101.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
 gi|407706764|ref|YP_006830349.1| alpha/beta fold family hydrolase [Bacillus thuringiensis MC28]
 gi|423377905|ref|ZP_17355189.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
 gi|423464099|ref|ZP_17440867.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
 gi|423547540|ref|ZP_17523898.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
 gi|423622677|ref|ZP_17598455.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
 gi|228665709|gb|EEL21182.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
 gi|228707255|gb|EEL59452.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
 gi|401179261|gb|EJQ86434.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
 gi|401260797|gb|EJR66965.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
 gi|401636171|gb|EJS53925.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
 gi|402420366|gb|EJV52637.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
 gi|407384449|gb|AFU14950.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis MC28]
          Length = 216

 Score = 44.7 bits (104), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 29/85 (34%), Positives = 48/85 (56%), Gaps = 10/85 (11%)

Query: 77  VEVIS--WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGT 133
           +++IS   EP   V  N +S EEC  LI ++   +++ST+     G ++D + +RTSSG 
Sbjct: 30  IQIISKFEEPLIVVLGNVISDEECNELIEMSKNKIKRSTI-----GSARDVNDIRTSSGA 84

Query: 134 FLARGRDKIIRDIEKRIADFTFFPL 158
           FL    +++   IEKRI+     P+
Sbjct: 85  FLE--ENELTSKIEKRISSIMNVPV 107


>gi|449280261|gb|EMC87600.1| Prolyl 4-hydroxylase subunit alpha-1 [Columba livia]
          Length = 536

 Score = 44.7 bits (104), Expect = 0.013,   Method: Composition-based stats.
 Identities = 23/72 (31%), Positives = 37/72 (51%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   + + +S EE E +  LA P + ++TV D +TGK   +  R S   +L+     +
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPV 395

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 396 VSRINTRIQDLT 407


>gi|195159160|ref|XP_002020450.1| GL13507 [Drosophila persimilis]
 gi|194117219|gb|EDW39262.1| GL13507 [Drosophila persimilis]
          Length = 543

 Score = 44.7 bits (104), Expect = 0.013,   Method: Composition-based stats.
 Identities = 27/83 (32%), Positives = 44/83 (53%), Gaps = 5/83 (6%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E +S +P   +YH+ LS EE   L N++TP + ++ V DS   K K S  RT+    + 
Sbjct: 327 MEELSLDPYIVLYHSVLSDEEMARLENMSTPLLHRARVFDSGIRKPKISPARTADEVQIP 386

Query: 137 RGR-----DKIIRDIEKRIADFT 154
             +      +++  I+KRI D T
Sbjct: 387 NPKLVAEDIQLVECIQKRITDLT 409


>gi|115434812|ref|NP_001042164.1| Os01g0174500 [Oryza sativa Japonica Group]
 gi|55296794|dbj|BAD68120.1| prolyl 4-hydroxylase -like [Oryza sativa Japonica Group]
 gi|113531695|dbj|BAF04078.1| Os01g0174500 [Oryza sativa Japonica Group]
 gi|222617830|gb|EEE53962.1| hypothetical protein OsJ_00571 [Oryza sativa Japonica Group]
          Length = 303

 Score = 44.7 bits (104), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 26/79 (32%), Positives = 40/79 (50%), Gaps = 13/79 (16%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTV-VDSDTGKSKDSRVRTSSGTFLARG 138
           +SW PR F+Y  FLS  EC++L+++   +M  S    D D         R SS   +   
Sbjct: 63  LSWHPRIFLYEGFLSDMECDHLVSMGRGNMESSLAFTDGD---------RNSSYNNI--- 110

Query: 139 RDKIIRDIEKRIADFTFFP 157
            D ++  IE RI+ ++F P
Sbjct: 111 EDIVVSKIEDRISLWSFLP 129


>gi|307190793|gb|EFN74662.1| Prolyl 4-hydroxylase subunit alpha-2 [Camponotus floridanus]
          Length = 476

 Score = 44.7 bits (104), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 22/72 (30%), Positives = 36/72 (50%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR  VYHN +  EE E +  +A P  +++TV +  TG  + +  R S   +L     K 
Sbjct: 268 DPRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKH 327

Query: 143 IRDIEKRIADFT 154
           +  + KR+   T
Sbjct: 328 VAAVSKRVEHMT 339


>gi|302842389|ref|XP_002952738.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
 gi|300262082|gb|EFJ46291.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
          Length = 281

 Score = 44.7 bits (104), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 18/35 (51%), Positives = 26/35 (74%)

Query: 125 SRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLG 159
           S +RTS G FL RG D+I++ +E+RIA +T  P+G
Sbjct: 36  SNIRTSYGVFLDRGEDEIVKRVEERIAAWTLMPVG 70


>gi|224052167|ref|XP_002191912.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Taeniopygia
           guttata]
          Length = 536

 Score = 44.7 bits (104), Expect = 0.013,   Method: Composition-based stats.
 Identities = 23/72 (31%), Positives = 37/72 (51%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   + + +S EE E +  LA P + ++TV D +TGK   +  R S   +L+     +
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPV 395

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 396 VSRINTRIQDLT 407


>gi|301093292|ref|XP_002997494.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110636|gb|EEY68688.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 324

 Score = 44.7 bits (104), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 49/92 (53%), Gaps = 1/92 (1%)

Query: 71  GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSK-DSRVRT 129
           G+ +  +E +S  P  F    FL  +E + ++ L+  H++ STV   D  + +  +  RT
Sbjct: 102 GKGDVVLETLSLTPLVFSVDEFLKDDEIDIIMALSLEHLKPSTVTLMDGHEDRAATDWRT 161

Query: 130 SSGTFLARGRDKIIRDIEKRIADFTFFPLGMQ 161
           S+  FL+  +   + +I++R+AD T  P+  Q
Sbjct: 162 STTYFLSSSKHSKLDEIDQRVADLTKVPVDHQ 193


>gi|357467087|ref|XP_003603828.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492876|gb|AES74079.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 156

 Score = 44.7 bits (104), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 18/25 (72%), Positives = 23/25 (92%)

Query: 134 FLARGRDKIIRDIEKRIADFTFFPL 158
           FL RG+DKII++IE+RIADFTF P+
Sbjct: 2   FLKRGKDKIIQNIERRIADFTFIPV 26


>gi|348683507|gb|EGZ23322.1| hypothetical protein PHYSODRAFT_310730 [Phytophthora sojae]
          Length = 417

 Score = 44.7 bits (104), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 48/92 (52%), Gaps = 1/92 (1%)

Query: 71  GRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSR-VRT 129
           G+ +  +E +S  P  F    FL  +E + ++NL+  H++ S V   D  +++ +   RT
Sbjct: 195 GKGDVVLETLSMTPLVFSVEEFLKDDEIDIIMNLSLEHLKPSGVTLMDGHENRAATDWRT 254

Query: 130 SSGTFLARGRDKIIRDIEKRIADFTFFPLGMQ 161
           S+  FL       I +I++R++D T  P+  Q
Sbjct: 255 STTYFLPSDAHPKIDEIDQRVSDLTKVPIDHQ 286


>gi|383864775|ref|XP_003707853.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Megachile
           rotundata]
          Length = 550

 Score = 44.7 bits (104), Expect = 0.014,   Method: Composition-based stats.
 Identities = 21/72 (29%), Positives = 36/72 (50%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR  +YHN +  EE E +  +A P  +++TV +  TG  + +  R S   +L     K 
Sbjct: 342 DPRIVIYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKH 401

Query: 143 IRDIEKRIADFT 154
           +  + KR+   T
Sbjct: 402 VAAVSKRVEHMT 413


>gi|332026992|gb|EGI67088.1| Prolyl 4-hydroxylase subunit alpha-1 [Acromyrmex echinatior]
          Length = 415

 Score = 44.7 bits (104), Expect = 0.015,   Method: Compositional matrix adjust.
 Identities = 22/72 (30%), Positives = 36/72 (50%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR  VYHN +  EE E +  +A P  +++TV +  TG  + +  R S   +L     K 
Sbjct: 207 DPRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKH 266

Query: 143 IRDIEKRIADFT 154
           +  + KR+   T
Sbjct: 267 VAAVSKRVEHMT 278


>gi|312032354|ref|NP_001185664.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Gallus
           gallus]
          Length = 536

 Score = 44.7 bits (104), Expect = 0.016,   Method: Composition-based stats.
 Identities = 23/72 (31%), Positives = 37/72 (51%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   + + +S EE E +  LA P + ++TV D +TGK   +  R S   +L+     +
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPV 395

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 396 VSRINTRIQDLT 407


>gi|312032356|ref|NP_001185665.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Gallus
           gallus]
          Length = 536

 Score = 44.7 bits (104), Expect = 0.016,   Method: Composition-based stats.
 Identities = 23/72 (31%), Positives = 37/72 (51%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   + + +S EE E +  LA P + ++TV D +TGK   +  R S   +L+     +
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPV 395

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 396 VSRINTRIQDLT 407


>gi|321474952|gb|EFX85916.1| hypothetical protein DAPPUDRAFT_45616 [Daphnia pulex]
          Length = 537

 Score = 44.3 bits (103), Expect = 0.016,   Method: Composition-based stats.
 Identities = 20/72 (27%), Positives = 38/72 (52%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +P   VYH+ +S +E E +  +A P  +++T+ +S TG+ + +  R S   +L       
Sbjct: 335 KPMIVVYHDVMSDDEIETVKKMAKPRFKRATIRNSKTGELEPANYRISKSAWLKSEEHDH 394

Query: 143 IRDIEKRIADFT 154
           I  + +R+ D T
Sbjct: 395 ILKVTRRVGDIT 406


>gi|412988626|emb|CCO17962.1| predicted protein [Bathycoccus prasinos]
          Length = 312

 Score = 44.3 bits (103), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 28/91 (30%), Positives = 41/91 (45%), Gaps = 14/91 (15%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  ISW+P A  Y  FLS  EC+YLI                T     S   +S    + 
Sbjct: 85  VSQISWQPNAVKYEKFLSNSECDYLI--------------EKTFSLSSSSSSSSVNLVVE 130

Query: 137 RGRDKIIRDIEKRIADFTFFPLGMQFKFSLF 167
           R  D +++DIE+R+A++T  P     K  ++
Sbjct: 131 REGDSVVKDIERRLAEWTHTPSSHGEKLRVY 161


>gi|112984520|ref|NP_001037195.1| prolyl 4-hydroxylase alpha subunit precursor [Bombyx mori]
 gi|37543673|gb|AAM21932.1| prolyl 4-hydroxylase alpha subunit [Bombyx mori]
          Length = 550

 Score = 44.3 bits (103), Expect = 0.017,   Method: Composition-based stats.
 Identities = 21/78 (26%), Positives = 40/78 (51%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE +  +P  F++H  ++ +E E++   A P  +++ V D  TG+   +  R S  ++L 
Sbjct: 330 VEQMYVKPDIFMFHEVMTDDEIEFIKKRAKPRFKRAVVHDPKTGELTPAHYRISKSSWLR 389

Query: 137 RGRDKIIRDIEKRIADFT 154
                +I  I +R+ D T
Sbjct: 390 DEESPVIARITQRVTDMT 407


>gi|326432043|gb|EGD77613.1| hypothetical protein PTSG_12777 [Salpingoeca sp. ATCC 50818]
          Length = 500

 Score = 44.3 bits (103), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 27/83 (32%), Positives = 46/83 (55%)

Query: 72  RAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
           R  + +  I  EP  F   +FL++ E E+L +       + +  +S+ G    S+ RTS+
Sbjct: 51  RCRRKLHRIHAEPNIFFVEDFLTRHEVEHLSSFVKGMRLQRSFTESELGSQVISQERTST 110

Query: 132 GTFLARGRDKIIRDIEKRIADFT 154
             +L++G D+IIR IE+R A+ T
Sbjct: 111 FMWLSKGEDEIIRRIERRAAEMT 133


>gi|307106128|gb|EFN54375.1| hypothetical protein CHLNCDRAFT_135671 [Chlorella variabilis]
          Length = 486

 Score = 44.3 bits (103), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 42/81 (51%), Gaps = 5/81 (6%)

Query: 79  VISWE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL-A 136
           V+SW  PR  +  +FL+ +E E+LI  AT    +S VV  +   SK    RTS G++L  
Sbjct: 181 VVSWSSPRVLLIRDFLTPDETEHLIRQATGGFARSEVVAEE---SKQHEARTSYGSWLNG 237

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
             RD  + +I+ RI      P
Sbjct: 238 AKRDDKVLEIQNRIHRLVGIP 258


>gi|115495019|ref|NP_001069238.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
 gi|122144801|sp|Q1RMU3.1|P4HA1_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|92097479|gb|AAI14709.1| Prolyl 4-hydroxylase, alpha polypeptide I [Bos taurus]
 gi|296472132|tpg|DAA14247.1| TPA: prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
 gi|440892721|gb|ELR45796.1| Prolyl 4-hydroxylase subunit alpha-1 [Bos grunniens mutus]
          Length = 534

 Score = 44.3 bits (103), Expect = 0.018,   Method: Composition-based stats.
 Identities = 21/72 (29%), Positives = 38/72 (52%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H+ +S  E E + +LA P +R++T+ +  TG  +    R S   +L+   + +
Sbjct: 334 KPRIIRFHDIISDAEIEVVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 394 VSRINMRIQDLT 405


>gi|190788|gb|AAA36535.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
          Length = 534

 Score = 44.3 bits (103), Expect = 0.019,   Method: Composition-based stats.
 Identities = 21/72 (29%), Positives = 38/72 (52%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H+ +S  E E + +LA P +R++T+ +  TG  +    R S   +L+   + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 394 VSRINMRIQDLT 405


>gi|63252888|ref|NP_001017962.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
           sapiens]
 gi|197099666|ref|NP_001125733.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Pongo abelii]
 gi|217272849|ref|NP_001136067.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
           sapiens]
 gi|114631177|ref|XP_001140234.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Pan
           troglodytes]
 gi|114631181|ref|XP_001140652.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 7 [Pan
           troglodytes]
 gi|2507090|sp|P13674.2|P4HA1_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|75061858|sp|Q5RAG8.1|P4HA1_PONAB RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|602675|gb|AAA59068.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
 gi|23271226|gb|AAH34998.1| Prolyl 4-hydroxylase, alpha polypeptide I [Homo sapiens]
 gi|55729010|emb|CAH91242.1| hypothetical protein [Pongo abelii]
 gi|56403853|emb|CAI29712.1| hypothetical protein [Pongo abelii]
 gi|119574854|gb|EAW54469.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_c [Homo
           sapiens]
 gi|119574855|gb|EAW54470.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_d [Homo
           sapiens]
 gi|123981532|gb|ABM82595.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [synthetic
           construct]
 gi|123996359|gb|ABM85781.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [synthetic
           construct]
 gi|261861532|dbj|BAI47288.1| prolyl 4-hydroxylase, alpha polypeptide I [synthetic construct]
 gi|410295852|gb|JAA26526.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
 gi|410349611|gb|JAA41409.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 534

 Score = 44.3 bits (103), Expect = 0.019,   Method: Composition-based stats.
 Identities = 21/72 (29%), Positives = 38/72 (52%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H+ +S  E E + +LA P +R++T+ +  TG  +    R S   +L+   + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 394 VSRINMRIQDLT 405


>gi|37496185|emb|CAE47803.1| Prolyl 4-hydroxylase alpha subunit [Sus scrofa]
          Length = 263

 Score = 44.3 bits (103), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 22/74 (29%), Positives = 41/74 (55%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E + + +LA P + ++TV D +TGK   ++ R S   +L+   +
Sbjct: 101 WDKPRIIRFHDIISDAEIDIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEN 160

Query: 141 KIIRDIEKRIADFT 154
            ++  +  RI D T
Sbjct: 161 PVVSRLNMRIQDLT 174


>gi|195113247|ref|XP_002001179.1| GI22114 [Drosophila mojavensis]
 gi|193917773|gb|EDW16640.1| GI22114 [Drosophila mojavensis]
          Length = 487

 Score = 44.3 bits (103), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 19/63 (30%), Positives = 39/63 (61%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E++S +P   V+H+ + + E E+L  ++ P ++++TVV  D  +    + RT++G FL 
Sbjct: 298 MELLSEDPYMVVFHDVIYESEIEHLNRISKPFLQRATVVVEDNSEDTLIKFRTANGAFLY 357

Query: 137 RGR 139
           R +
Sbjct: 358 RDK 360


>gi|380813206|gb|AFE78477.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
           mulatta]
 gi|384947328|gb|AFI37269.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
           mulatta]
          Length = 534

 Score = 44.3 bits (103), Expect = 0.019,   Method: Composition-based stats.
 Identities = 21/72 (29%), Positives = 38/72 (52%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H+ +S  E E + +LA P +R++T+ +  TG  +    R S   +L+   + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 394 VSRINMRIQDLT 405


>gi|426255744|ref|XP_004021508.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Ovis
           aries]
          Length = 534

 Score = 44.3 bits (103), Expect = 0.020,   Method: Composition-based stats.
 Identities = 22/74 (29%), Positives = 39/74 (52%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P +R++T+ +  TG  +    R S   +L+   +
Sbjct: 332 WDKPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYEN 391

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 392 PVVSRINMRIQDLT 405


>gi|374370415|ref|ZP_09628419.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
 gi|373098067|gb|EHP39184.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
          Length = 454

 Score = 44.3 bits (103), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 24/71 (33%), Positives = 34/71 (47%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  ++   L+  EC+ L+ LA   + +S V++ DTG       RTS G     G   +I
Sbjct: 132 PRVTLFQQLLTDAECDALVALARGRLARSPVINPDTGDENLIEARTSLGAMFQVGEHPLI 191

Query: 144 RDIEKRIADFT 154
             IE  IA  T
Sbjct: 192 ERIEDCIAAVT 202


>gi|355562502|gb|EHH19096.1| hypothetical protein EGK_19739 [Macaca mulatta]
 gi|355782842|gb|EHH64763.1| hypothetical protein EGM_18071 [Macaca fascicularis]
 gi|383418719|gb|AFH32573.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
           mulatta]
          Length = 534

 Score = 44.3 bits (103), Expect = 0.020,   Method: Composition-based stats.
 Identities = 21/72 (29%), Positives = 38/72 (52%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H+ +S  E E + +LA P +R++T+ +  TG  +    R S   +L+   + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 394 VSRINMRIQDLT 405


>gi|291404184|ref|XP_002718472.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 2
           [Oryctolagus cuniculus]
          Length = 534

 Score = 44.3 bits (103), Expect = 0.020,   Method: Composition-based stats.
 Identities = 21/72 (29%), Positives = 38/72 (52%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H+ +S  E E + +LA P +R++T+ +  TG  +    R S   +L+   + +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 394 VSRINMRIQDLT 405


>gi|410251926|gb|JAA13930.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 566

 Score = 44.3 bits (103), Expect = 0.020,   Method: Composition-based stats.
 Identities = 22/74 (29%), Positives = 39/74 (52%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P +R++T+ +  TG  +    R S   +L+   +
Sbjct: 332 WDKPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYEN 391

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 392 PVVSRINMRIQDLT 405


>gi|395820526|ref|XP_003783615.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Otolemur
           garnettii]
          Length = 534

 Score = 44.3 bits (103), Expect = 0.021,   Method: Composition-based stats.
 Identities = 22/74 (29%), Positives = 39/74 (52%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P +R++T+ +  TG  +    R S   +L+   +
Sbjct: 332 WDKPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYEN 391

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 392 PVVSRINMRIQDLT 405


>gi|432891690|ref|XP_004075614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oryzias
           latipes]
          Length = 517

 Score = 43.9 bits (102), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 22/77 (28%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV+S +P   +YHNF++  E E +   A P +R+S V   +   + + R+  S+  +L  
Sbjct: 313 EVLSLQPYVVIYHNFITDREAEEIKGFAQPALRRSVVASGENQATVEYRISKSA--WLKG 370

Query: 138 GRDKIIRDIEKRIADFT 154
               I+  +++RI+  T
Sbjct: 371 SESCIVGKLDQRISMLT 387


>gi|344254200|gb|EGW10304.1| Prolyl 4-hydroxylase subunit alpha-1 [Cricetulus griseus]
          Length = 507

 Score = 43.9 bits (102), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 23/74 (31%), Positives = 39/74 (52%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P +R++T+ +  TG  +    R S   +L+   D
Sbjct: 305 WDKPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGNLETVHYRISKSAWLSGYED 364

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 365 PVVSRINMRIQDLT 378


>gi|323452216|gb|EGB08091.1| hypothetical protein AURANDRAFT_26622 [Aureococcus anophagefferens]
          Length = 190

 Score = 43.9 bits (102), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 36/69 (52%), Gaps = 3/69 (4%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR F+    LS+ EC+++I L T  +RKS V     G    S+ RTS   +L R    I+
Sbjct: 1   PRVFLVREMLSEFECDHIIELGTKVVRKSMV---GQGGGFTSKTRTSENGWLRRSASPIL 57

Query: 144 RDIEKRIAD 152
            +I KR  D
Sbjct: 58  ENIYKRFGD 66


>gi|194905294|ref|XP_001981167.1| GG11919 [Drosophila erecta]
 gi|190655805|gb|EDV53037.1| GG11919 [Drosophila erecta]
          Length = 533

 Score = 43.9 bits (102), Expect = 0.022,   Method: Composition-based stats.
 Identities = 28/82 (34%), Positives = 43/82 (52%), Gaps = 2/82 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           E I  +P   +YH  LS  E   L+  A  +M K+T V S+   + + R RT+ G +L +
Sbjct: 320 EQIGLKPYVVLYHEVLSAREISMLMGKAAQNM-KNTRVQSEKAVNTN-RERTAKGYWLKK 377

Query: 138 GRDKIIRDIEKRIADFTFFPLG 159
             +++ R I +RI D T F L 
Sbjct: 378 ESNEMTRRITRRIVDMTGFDLA 399


>gi|297803562|ref|XP_002869665.1| ShTK domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297315501|gb|EFH45924.1| ShTK domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 290

 Score = 43.9 bits (102), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 43/78 (55%), Gaps = 17/78 (21%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           +SW+PR F+Y  FLS+EE ++LI+L     RK    D+    S D+  +T          
Sbjct: 61  LSWQPRVFLYRGFLSEEESDHLISL-----RK----DTSEVTSGDADGKTQ--------L 103

Query: 140 DKIIRDIEKRIADFTFFP 157
           D ++  IE++I+ +TF P
Sbjct: 104 DPVVAGIEEKISAWTFLP 121


>gi|344274274|ref|XP_003408942.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
           [Loxodonta africana]
          Length = 534

 Score = 43.9 bits (102), Expect = 0.024,   Method: Composition-based stats.
 Identities = 21/72 (29%), Positives = 38/72 (52%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   +H+ +S  E E + +LA P +R++T+ +  TG  +    R S   +L+   + +
Sbjct: 334 KPRIVRFHDIISDAEIEVVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393

Query: 143 IRDIEKRIADFT 154
           +  I  RI D T
Sbjct: 394 VSRINMRIQDLT 405


>gi|326914688|ref|XP_003203656.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Meleagris
           gallopavo]
          Length = 539

 Score = 43.9 bits (102), Expect = 0.025,   Method: Composition-based stats.
 Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           E +  +P   +YH+F+S  E E +  LA P +++S V   +  K +    R S   +L  
Sbjct: 335 ETLRLQPYIVLYHDFVSDAEAETIKGLAGPWLQRSVVASGE--KQQKVEYRISKSAWLKD 392

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++R +E R+A  T
Sbjct: 393 TADPVVRALELRMAAIT 409


>gi|195452734|ref|XP_002073476.1| GK13124 [Drosophila willistoni]
 gi|194169561|gb|EDW84462.1| GK13124 [Drosophila willistoni]
          Length = 536

 Score = 43.9 bits (102), Expect = 0.027,   Method: Composition-based stats.
 Identities = 25/70 (35%), Positives = 36/70 (51%), Gaps = 4/70 (5%)

Query: 81  SWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR--- 137
           S +P    YH+ LS  +   L  +A PHMR+STV     G++K S  R S   +LA    
Sbjct: 327 SLDPFVVTYHDMLSPNKIAQLREMAVPHMRRSTVNPLPGGQNKKSSFRVSKNAWLAYETH 386

Query: 138 -GRDKIIRDI 146
               K++RD+
Sbjct: 387 PTMGKMLRDL 396


>gi|312092237|ref|XP_003147267.1| hypothetical protein LOAG_11701 [Loa loa]
          Length = 553

 Score = 43.5 bits (101), Expect = 0.029,   Method: Composition-based stats.
 Identities = 20/74 (27%), Positives = 40/74 (54%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE++   P A ++H+ +S EE   +  LA P + ++TV + +TG  + +  R S   +L 
Sbjct: 320 VEIVYQNPLAVLFHDIMSDEESRIIEMLAVPKLDRATVHNVETGNLETASYRISKSAWLR 379

Query: 137 RGRDKIIRDIEKRI 150
               +++  I +R+
Sbjct: 380 STEHEVVNRINRRL 393


>gi|219113719|ref|XP_002186443.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|209583293|gb|ACI65913.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 230

 Score = 43.5 bits (101), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 28/84 (33%), Positives = 45/84 (53%), Gaps = 8/84 (9%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG--------KSKDSRVR 128
           ++V+S  PRAF   NFLS++E E+++ LA+    K +     TG        ++   R R
Sbjct: 5   LKVLSCAPRAFEIENFLSRQEVEHIVQLASGVDLKLSSTGDITGHKETPKELQTDSRRTR 64

Query: 129 TSSGTFLARGRDKIIRDIEKRIAD 152
           TS  +++ R +  II  I +R AD
Sbjct: 65  TSYNSWVPREKSPIIDAIYRRAAD 88


>gi|157114985|ref|XP_001658091.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
 gi|108877086|gb|EAT41311.1| AAEL007038-PA [Aedes aegypti]
          Length = 545

 Score = 43.5 bits (101), Expect = 0.029,   Method: Composition-based stats.
 Identities = 23/78 (29%), Positives = 41/78 (52%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E  + +P   +YH+ +S+ E E +  LA P  R++TV +  TG+ + +  R S   +L 
Sbjct: 335 LEEANLKPYIVIYHDVISEAEMELVKRLAKPRFRRATVQNYKTGELEVANYRISKSAWLK 394

Query: 137 RGRDKIIRDIEKRIADFT 154
                 I+ I +R+ D T
Sbjct: 395 DHEHPYIKAIGERVEDMT 412


>gi|389795384|ref|ZP_10198508.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
 gi|388430823|gb|EIL87950.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
          Length = 293

 Score = 43.5 bits (101), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 36/75 (48%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P   V    L  EEC+ LI  +   +++ST VD   G  +    R+S GTF     D  I
Sbjct: 97  PTIAVLDQVLDDEECDELIRRSADKLQRSTTVDPVNGGYEVIAARSSEGTFFPVNADDFI 156

Query: 144 RDIEKRIADFTFFPL 158
             +++RIA+    P+
Sbjct: 157 ARLDRRIAELMNCPV 171


>gi|357605723|gb|EHJ64752.1| prolyl 4-hydroxylase alpha subunit [Danaus plexippus]
          Length = 235

 Score = 43.5 bits (101), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 22/78 (28%), Positives = 40/78 (51%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E +   P   V+++ LS  E +Y+  +A P  R++TV D  TG+   +  R S   +L 
Sbjct: 15  MEYLYRNPDIIVFNDVLSDYEIDYIKRIAQPRFRRATVHDPATGELVPAHYRISKSAWLK 74

Query: 137 RGRDKIIRDIEKRIADFT 154
                ++  + +R+AD T
Sbjct: 75  DEESAVVARVSRRVADIT 92


>gi|393903732|gb|EFO16802.2| hypothetical protein LOAG_11701 [Loa loa]
          Length = 531

 Score = 43.5 bits (101), Expect = 0.030,   Method: Composition-based stats.
 Identities = 20/74 (27%), Positives = 40/74 (54%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE++   P A ++H+ +S EE   +  LA P + ++TV + +TG  + +  R S   +L 
Sbjct: 321 VEIVYQNPLAVLFHDIMSDEESRIIEMLAVPKLDRATVHNVETGNLETASYRISKSAWLR 380

Query: 137 RGRDKIIRDIEKRI 150
               +++  I +R+
Sbjct: 381 STEHEVVNRINRRL 394


>gi|301613004|ref|XP_002936004.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
           (Silurana) tropicalis]
          Length = 526

 Score = 43.5 bits (101), Expect = 0.031,   Method: Composition-based stats.
 Identities = 21/74 (28%), Positives = 40/74 (54%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   YH+ +S EE   +  LA P +R++T+ +  TG  + ++ R +   +L+   D
Sbjct: 324 WDKPRIVRYHDIISDEEISKVKELAKPRLRRATISNPITGVLETAQYRITKSAWLSGYED 383

Query: 141 KIIRDIEKRIADFT 154
            ++  + +RI   T
Sbjct: 384 PVVARLNRRIEGVT 397


>gi|451851302|gb|EMD64600.1| hypothetical protein COCSADRAFT_51715, partial [Cochliobolus
           sativus ND90Pr]
          Length = 209

 Score = 43.5 bits (101), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 30/77 (38%), Positives = 40/77 (51%), Gaps = 5/77 (6%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V V+S EP      NFLS+ E  Y+++L  P    S V    TG S+D   R+S+  FL 
Sbjct: 5   VHVLSREPYVVYITNFLSQAEISYILSLGEPLFAPSLV---KTG-SEDVSTRSSTSAFLP 60

Query: 137 RGRDKIIRDIEKRIADF 153
              DKI  +I+ R A F
Sbjct: 61  T-NDKITSNIQARAASF 76


>gi|357135725|ref|XP_003569459.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 1
           [Brachypodium distachyon]
          Length = 303

 Score = 43.5 bits (101), Expect = 0.032,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 43/78 (55%), Gaps = 11/78 (14%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           ++W PR F+Y  FLS  EC++L+ +A  ++  S +V++       +R  T + T      
Sbjct: 63  LAWHPRVFLYEGFLSGMECDHLVYVARLNIESSLLVNA------GARNITQNST-----D 111

Query: 140 DKIIRDIEKRIADFTFFP 157
           D ++  IE RI+ ++F P
Sbjct: 112 DIVVSKIEDRISLWSFIP 129


>gi|268572523|ref|XP_002641343.1| C. briggsae CBR-DPY-18 protein [Caenorhabditis briggsae]
 gi|94442971|emb|CAJ98658.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
          Length = 559

 Score = 43.5 bits (101), Expect = 0.033,   Method: Composition-based stats.
 Identities = 23/74 (31%), Positives = 39/74 (52%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE+  + P A ++ + +S EE   +  LA P + ++TV DS TGK   +  R S   +L 
Sbjct: 321 VEIKRFNPLAVLFKDVISDEEVATIQELAKPKLARATVHDSVTGKLVTATYRISKSAWLK 380

Query: 137 RGRDKIIRDIEKRI 150
               +++  + KRI
Sbjct: 381 AWEHEVVERVNKRI 394


>gi|301626782|ref|XP_002942567.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Xenopus
           (Silurana) tropicalis]
          Length = 716

 Score = 43.5 bits (101), Expect = 0.035,   Method: Composition-based stats.
 Identities = 21/77 (27%), Positives = 41/77 (53%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           E++S  P+  +YH+F+S  E E +  LA+P + +S V   +  K  ++  R S   +L  
Sbjct: 533 EIVSLRPQVVLYHDFVSDLEAEKIKELASPWLHRSVVASGE--KQAEAEYRISKSAWLKD 590

Query: 138 GRDKIIRDIEKRIADFT 154
                +++++ RI+  T
Sbjct: 591 TIHPFVQNLDTRISGVT 607


>gi|308508235|ref|XP_003116301.1| CRE-PHY-3 protein [Caenorhabditis remanei]
 gi|308251245|gb|EFO95197.1| CRE-PHY-3 protein [Caenorhabditis remanei]
          Length = 253

 Score = 43.1 bits (100), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 24/86 (27%), Positives = 44/86 (51%), Gaps = 11/86 (12%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV---DSDTGKSKDSRVRTSSGT 133
           +E++SWEP   +Y N L+  +    +NL    + K  ++    SD G  K++  R ++G+
Sbjct: 21  LEIVSWEPALVIYRNLLTPRQTSDFMNL----IEKRDLIAQKTSDYGNQKETTHRRANGS 76

Query: 134 FLARGRDKIIRD----IEKRIADFTF 155
           F+  G  +I  +    ++KRI    F
Sbjct: 77  FIEHGEAEITAEVHNKVQKRIPALNF 102


>gi|325920649|ref|ZP_08182559.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
           [Xanthomonas gardneri ATCC 19865]
 gi|325548839|gb|EGD19783.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
           [Xanthomonas gardneri ATCC 19865]
          Length = 422

 Score = 43.1 bits (100), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 22/54 (40%), Positives = 30/54 (55%), Gaps = 4/54 (7%)

Query: 92  FLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRD 145
            LS +EC  L+ LA PH+R S VVD +   +  + +RTS G  L    D I+ D
Sbjct: 240 VLSADECRLLMLLARPHLRASQVVDPNDASTHRTPIRTSRGATL----DPILED 289


>gi|443709454|gb|ELU04126.1| hypothetical protein CAPTEDRAFT_167710 [Capitella teleta]
          Length = 535

 Score = 43.1 bits (100), Expect = 0.039,   Method: Composition-based stats.
 Identities = 23/81 (28%), Positives = 43/81 (53%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           E ++++P   VYH  +S ++ + +  LATP + ++TVV+S TG+ + ++ R S   +L  
Sbjct: 331 ETMNFDPWIAVYHQLMSDKDIDDIKALATPRLARATVVNSVTGELEFAKYRISKSGWLKD 390

Query: 138 GRDKIIRDIEKRIADFTFFPL 158
                +  I  R +  T   L
Sbjct: 391 EEHPTVAKISNRCSALTNLSL 411


>gi|308497208|ref|XP_003110791.1| CRE-DPY-18 protein [Caenorhabditis remanei]
 gi|308242671|gb|EFO86623.1| CRE-DPY-18 protein [Caenorhabditis remanei]
          Length = 559

 Score = 43.1 bits (100), Expect = 0.040,   Method: Composition-based stats.
 Identities = 23/78 (29%), Positives = 40/78 (51%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE+  + P A ++ + +S +E   +  LA P + ++TV DS TGK   +  R S   +L 
Sbjct: 321 VEIKRFNPLAVLFKDVISDDEVATIQELAKPKLARATVHDSATGKLVTATYRISKSAWLK 380

Query: 137 RGRDKIIRDIEKRIADFT 154
               +++  + KRI   T
Sbjct: 381 EWEHEVVERVNKRIELMT 398


>gi|345481336|ref|XP_001600680.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Nasonia
           vitripennis]
          Length = 556

 Score = 43.1 bits (100), Expect = 0.041,   Method: Composition-based stats.
 Identities = 19/72 (26%), Positives = 38/72 (52%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR  +YH+ +  +E E +  +A P  +++TV +  TG+ + +  R S   +L     K 
Sbjct: 348 DPRIVIYHDVIYDDEIETIKRMAQPRFKRATVQNYKTGELEIANYRISKSAWLQEHEHKH 407

Query: 143 IRDIEKRIADFT 154
           +R + +R+   T
Sbjct: 408 VRAVSQRVEHMT 419


>gi|407699315|ref|YP_006824102.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii str.
           'Black Sea 11']
 gi|407248462|gb|AFT77647.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
           'Black Sea 11']
          Length = 354

 Score = 43.1 bits (100), Expect = 0.042,   Method: Compositional matrix adjust.
 Identities = 37/127 (29%), Positives = 60/127 (47%), Gaps = 2/127 (1%)

Query: 32  LILLAFGILSMPSSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHN 91
           L+ LA  I +  +    +++ +D+  + R ++E      G+     EV+       +Y +
Sbjct: 103 LMSLAPNICNQLNLVPQAQRQSDVK-VYRLAVEEISSALGKIYAPTEVLDQTLPVELYVD 161

Query: 92  FLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR-DKIIRDIEKRI 150
            LS+ EC YLI   +  ++ S VVD  TG  K   VRTS    +A    D I R ++K I
Sbjct: 162 VLSEYECAYLITKFSSLLQPSMVVDPLTGNGKVDNVRTSYVAIIAPSYCDWITRKLDKVI 221

Query: 151 ADFTFFP 157
           +  T  P
Sbjct: 222 SQVTHTP 228


>gi|607947|gb|AAA62207.1| prolyl 4-hydroxylase alpha subunit [Caenorhabditis elegans]
          Length = 558

 Score = 43.1 bits (100), Expect = 0.045,   Method: Composition-based stats.
 Identities = 23/78 (29%), Positives = 39/78 (50%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE+  + P A ++ + +S +E   +  LA P + ++TV DS TGK   +  R S   +L 
Sbjct: 320 VEIKRFNPLAVLFKDVISDDEVAAIQELAKPKLARATVHDSVTGKLVTATYRISKSAWLK 379

Query: 137 RGRDKIIRDIEKRIADFT 154
                ++  + KRI   T
Sbjct: 380 EWEGDVVETVNKRIGYMT 397


>gi|323454062|gb|EGB09933.1| hypothetical protein AURANDRAFT_14928, partial [Aureococcus
           anophagefferens]
          Length = 182

 Score = 43.1 bits (100), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 23/54 (42%), Positives = 31/54 (57%), Gaps = 2/54 (3%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           P  +   NFL++EEC+ LI+ A  HM  + VV    G+   S  RTSS  +LAR
Sbjct: 1   PPIYTVQNFLTEEECDALIDSAKDHMTPAPVVGPGNGEVSVS--RTSSTCYLAR 52


>gi|255085194|ref|XP_002505028.1| predicted protein [Micromonas sp. RCC299]
 gi|226520297|gb|ACO66286.1| predicted protein [Micromonas sp. RCC299]
          Length = 439

 Score = 43.1 bits (100), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 20/49 (40%), Positives = 29/49 (59%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDS 125
           V+ +S  PR  V HNF+SKEE   ++++A P +  S VV   T K  D+
Sbjct: 202 VKQVSRHPRLAVIHNFISKEEAAAIVDVAAPELHPSLVVRHQTAKRGDT 250


>gi|38454288|ref|NP_942070.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Rattus norvegicus]
 gi|81870816|sp|Q6W3E9.1|P4HA3_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|36962768|gb|AAQ87605.1| collagen prolyl 4-hydroxylase alpha III subunit [Rattus norvegicus]
          Length = 544

 Score = 42.7 bits (99), Expect = 0.047,   Method: Composition-based stats.
 Identities = 24/77 (31%), Positives = 39/77 (50%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI   P   +YH+F+S EE + +  LA P +++S V   +  K      R S   +L  
Sbjct: 340 EVIHLRPLVALYHDFVSDEEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 397

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  +++RIA  T
Sbjct: 398 TVDPVLVTLDRRIAALT 414


>gi|195505190|ref|XP_002099397.1| GE10881 [Drosophila yakuba]
 gi|194185498|gb|EDW99109.1| GE10881 [Drosophila yakuba]
          Length = 487

 Score = 42.7 bits (99), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 22/78 (28%), Positives = 40/78 (51%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E    +P   +YH+ +   E + +  +A P  R++TV +S TG  + +  R S   +L 
Sbjct: 276 LEEAHADPYIVIYHDAMYDSEIDVIKRMARPRFRRATVQNSVTGALETANYRISKSAWLK 335

Query: 137 RGRDKIIRDIEKRIADFT 154
              D++I  + +R AD T
Sbjct: 336 THEDRVIGTVVQRTADMT 353


>gi|195452778|ref|XP_002073496.1| GK13116 [Drosophila willistoni]
 gi|194169581|gb|EDW84482.1| GK13116 [Drosophila willistoni]
          Length = 521

 Score = 42.7 bits (99), Expect = 0.048,   Method: Compositional matrix adjust.
 Identities = 21/83 (25%), Positives = 43/83 (51%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E+I  +P   +YH+ +S  E   L  +A P ++++TV +S    ++  + RT+   +  
Sbjct: 318 MELIGLDPYMVLYHDVISPNEIAELQEMAKPELKRATVYNSTKNTNQFVKTRTAKVAWFL 377

Query: 137 RGRDKIIRDIEKRIADFTFFPLG 159
              +++   + +RI D T F L 
Sbjct: 378 DTFNQLTERLNQRIMDMTNFVLN 400


>gi|326923465|ref|XP_003207956.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 3
           [Meleagris gallopavo]
          Length = 518

 Score = 42.7 bits (99), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 38/74 (51%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   + + +S EE E +  LA P + ++TV D +TGK   +  R S   +L+    
Sbjct: 334 WDKPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYES 393

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 394 PVVSRINTRIQDLT 407


>gi|350416719|ref|XP_003491070.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
           impatiens]
          Length = 557

 Score = 42.7 bits (99), Expect = 0.050,   Method: Composition-based stats.
 Identities = 20/72 (27%), Positives = 36/72 (50%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR  VYHN +  EE E +  +A P  +++TV +  TG  + +  R S   +L     + 
Sbjct: 349 DPRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHEH 408

Query: 143 IRDIEKRIADFT 154
           +  + +R+   T
Sbjct: 409 VAAVSRRVEHMT 420


>gi|340722330|ref|XP_003399560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
           terrestris]
          Length = 557

 Score = 42.7 bits (99), Expect = 0.050,   Method: Composition-based stats.
 Identities = 20/72 (27%), Positives = 36/72 (50%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR  VYHN +  EE E +  +A P  +++TV +  TG  + +  R S   +L     + 
Sbjct: 349 DPRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHEH 408

Query: 143 IRDIEKRIADFT 154
           +  + +R+   T
Sbjct: 409 VAAVSRRVEHMT 420


>gi|17552840|ref|NP_499464.1| Protein DPY-18 [Caenorhabditis elegans]
 gi|20455505|sp|Q10576.2|P4HA1_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; AltName: Full=Protein dumpy-18; Flags:
           Precursor
 gi|3881011|emb|CAA21045.1| Protein DPY-18 [Caenorhabditis elegans]
 gi|6900013|emb|CAB71298.1| prolyl 4-hydroxylase alpha subunit 1 [Caenorhabditis elegans]
          Length = 559

 Score = 42.7 bits (99), Expect = 0.051,   Method: Composition-based stats.
 Identities = 23/78 (29%), Positives = 39/78 (50%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE+  + P A ++ + +S +E   +  LA P + ++TV DS TGK   +  R S   +L 
Sbjct: 321 VEIKRFNPLAVLFKDVISDDEVAAIQELAKPKLARATVHDSVTGKLVTATYRISKSAWLK 380

Query: 137 RGRDKIIRDIEKRIADFT 154
                ++  + KRI   T
Sbjct: 381 EWEGDVVETVNKRIGYMT 398


>gi|326923463|ref|XP_003207955.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
           [Meleagris gallopavo]
          Length = 536

 Score = 42.7 bits (99), Expect = 0.051,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 38/74 (51%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   + + +S EE E +  LA P + ++TV D +TGK   +  R S   +L+    
Sbjct: 334 WDKPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYES 393

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 394 PVVSRINTRIQDLT 407


>gi|433460968|ref|ZP_20418587.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
 gi|432190746|gb|ELK47751.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
          Length = 211

 Score = 42.7 bits (99), Expect = 0.053,   Method: Compositional matrix adjust.
 Identities = 30/77 (38%), Positives = 40/77 (51%), Gaps = 8/77 (10%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDK 141
           EP+  +  N +S+EECE LI L+     K  V  S  G   D S +RTSS  FL    D+
Sbjct: 33  EPKIAILGNVVSEEECEALIRLS-----KDKVNRSKIGSDHDVSDIRTSSSAFLP--DDE 85

Query: 142 IIRDIEKRIADFTFFPL 158
           +   IEKR+A     P+
Sbjct: 86  LTGRIEKRLAQIMNVPV 102


>gi|348505573|ref|XP_003440335.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oreochromis
           niloticus]
          Length = 517

 Score = 42.7 bits (99), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 22/77 (28%), Positives = 43/77 (55%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           E++S +P   +YH+F++  E E + +LA P +R+S V   +   + D R+  S+  +L  
Sbjct: 313 ELVSLQPYVVLYHDFVTDTEAEDIKSLAHPGLRRSVVAAGEKQATADYRISKSA--WLKG 370

Query: 138 GRDKIIRDIEKRIADFT 154
               I+  +++RI+  T
Sbjct: 371 SAQSIVGKLDQRISLLT 387


>gi|319795182|ref|YP_004156822.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
 gi|315597645|gb|ADU38711.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
          Length = 296

 Score = 42.7 bits (99), Expect = 0.056,   Method: Compositional matrix adjust.
 Identities = 21/79 (26%), Positives = 38/79 (48%)

Query: 91  NFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRI 150
           N +   EC+ LI +A P +  ST+VD  +G+   S  R S G F     + ++  +++R+
Sbjct: 107 NVVDAHECKALIEMAKPRLAPSTLVDPMSGRDVVSDKRASWGMFFRLCENDLVARLDRRL 166

Query: 151 ADFTFFPLGMQFKFSLFFF 169
           +     PL       L ++
Sbjct: 167 SALMNLPLENGEGLHLLYY 185


>gi|195159313|ref|XP_002020526.1| GL14040 [Drosophila persimilis]
 gi|194117295|gb|EDW39338.1| GL14040 [Drosophila persimilis]
          Length = 549

 Score = 42.7 bits (99), Expect = 0.057,   Method: Composition-based stats.
 Identities = 25/78 (32%), Positives = 41/78 (52%), Gaps = 2/78 (2%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE +S +P    +H+ LS +E E +I      + +S +    TG S  S +RTS  T+L 
Sbjct: 339 VEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQVTRSEI--GQTGNSTVSEIRTSQNTWLW 396

Query: 137 RGRDKIIRDIEKRIADFT 154
              +  + DI++R+ D T
Sbjct: 397 YENNPWLADIKQRLEDIT 414


>gi|297825201|ref|XP_002880483.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297326322|gb|EFH56742.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 272

 Score = 42.7 bits (99), Expect = 0.058,   Method: Compositional matrix adjust.
 Identities = 25/100 (25%), Positives = 52/100 (52%), Gaps = 10/100 (10%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV-----DSDTGKSKDSRVRTSSGTF 134
           +SW PR F   NF +K++CE +I++A P ++ S +       ++T ++  +R++ +    
Sbjct: 69  LSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSLLALRKGETAETTQNVRTRLKKTD--- 125

Query: 135 LARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
                  I+  IE++IA  T  P+     F++  + + +K
Sbjct: 126 --EDESGILAAIEEKIALATRIPIDYYESFNILRYQLGQK 163


>gi|380025232|ref|XP_003696381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Apis florea]
          Length = 537

 Score = 42.7 bits (99), Expect = 0.058,   Method: Composition-based stats.
 Identities = 20/72 (27%), Positives = 36/72 (50%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR  VYHN +  +E E +  +A P  +++TV +  TG  + +  R S   +L     K 
Sbjct: 329 DPRIVVYHNVIYDDEIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKH 388

Query: 143 IRDIEKRIADFT 154
           +  + +R+   T
Sbjct: 389 VAAVSRRVEHMT 400


>gi|395521232|ref|XP_003764722.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Sarcophilus
           harrisii]
          Length = 521

 Score = 42.7 bits (99), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV+  EP   +YH+F+S  E + +   A P +++S V   +  K +    R S   +L  
Sbjct: 317 EVLHLEPYIVLYHDFVSDSEAQKIRGFAAPWLQRSVVASGE--KQQQVEYRISKSAWLKD 374

Query: 138 GRDKIIRDIEKRIADFT 154
             D I+  +++RIA  T
Sbjct: 375 TVDPILVSLDRRIAALT 391


>gi|449673565|ref|XP_002167120.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
           magnipapillata]
          Length = 571

 Score = 42.7 bits (99), Expect = 0.060,   Method: Composition-based stats.
 Identities = 20/75 (26%), Positives = 41/75 (54%)

Query: 75  QWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTF 134
           Q VE +  +P  F+  N +S+++   +   A+P +R++T+ D  TGK + +  R S   +
Sbjct: 364 QKVERVWVDPEIFILRNIISEKQINLIKEAASPMLRRATIQDPITGKLRHADYRISKSAW 423

Query: 135 LARGRDKIIRDIEKR 149
           L+  +   ++ +E R
Sbjct: 424 LSTNKYNFLQALEAR 438


>gi|195452738|ref|XP_002073478.1| GK14139 [Drosophila willistoni]
 gi|194169563|gb|EDW84464.1| GK14139 [Drosophila willistoni]
          Length = 215

 Score = 42.7 bits (99), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 39/78 (50%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +EV+S +P   +YH+F+   E E+L   +   +  +  VD DTGK      RT+   +  
Sbjct: 31  MEVLSLDPYIVLYHDFILSSEQEFLKAESIERLSVAETVDPDTGKWYADASRTAKAMWFY 90

Query: 137 RGRDKIIRDIEKRIADFT 154
                +IR I +RI + T
Sbjct: 91  DTSSVVIRRINQRIEEIT 108


>gi|308801327|ref|XP_003077977.1| Prolyl 4-hydroxylase alpha subunit (ISS) [Ostreococcus tauri]
 gi|116056428|emb|CAL52717.1| Prolyl 4-hydroxylase alpha subunit (ISS) [Ostreococcus tauri]
          Length = 281

 Score = 42.7 bits (99), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 29/84 (34%), Positives = 44/84 (52%), Gaps = 7/84 (8%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +  +S +   FV  +FL+ EE + LINLA P +++S V D      K S  RTSS  FL 
Sbjct: 72  IRCLSEDCLLFVLEDFLTDEEGDALINLARPALQRSRVTD-----GKLSEGRTSSSMFLT 126

Query: 137 --RGRDKIIRDIEKRIADFTFFPL 158
             R  + I+  +++RI      P+
Sbjct: 127 GNRSNNPIVIALQRRIQSVLRLPI 150


>gi|381200649|ref|ZP_09907785.1| Prolyl 4-hydroxylase alpha subunit [Sphingobium yanoikuyae XLDN2-5]
          Length = 305

 Score = 42.4 bits (98), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 32/132 (24%), Positives = 56/132 (42%), Gaps = 3/132 (2%)

Query: 44  SSSGDSRKANDLSSIVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLIN 103
           ++ GD      L+ I R ++          E       W+ R F    FL+ +EC ++I+
Sbjct: 83  AAQGDPVAQQQLALINRMALAPDGAPVAVPEGEALAGGWDVRLF--RQFLTGDECHHVIS 140

Query: 104 LATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR-DKIIRDIEKRIADFTFFPLGMQF 162
                +  + V+D  +G+     VRTS G      R D +I+ I +RIA  +   L    
Sbjct: 141 EGQALLEPAMVIDPRSGRPMPHPVRTSDGGIFGPAREDLVIQAINRRIAAASGTMLSGGE 200

Query: 163 KFSLFFFYVLKK 174
             +L  + V ++
Sbjct: 201 PLTLLRYAVGQQ 212


>gi|327267604|ref|XP_003218589.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Anolis
           carolinensis]
          Length = 542

 Score = 42.4 bits (98), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 25/74 (33%), Positives = 37/74 (50%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +   +S EE E +  LA P + ++TV D  TGK   +  R S   +L+   +
Sbjct: 340 WDRPRIVRFVEIISDEEIETVKELAKPRLSRATVHDPQTGKLTTAHYRVSKSAWLSGYEN 399

Query: 141 KIIRDIEKRIADFT 154
            I+  I  RI D T
Sbjct: 400 PIVARINTRIQDLT 413


>gi|340357957|ref|ZP_08680560.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
 gi|339616017|gb|EGQ20677.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
          Length = 211

 Score = 42.4 bits (98), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 29/100 (29%), Positives = 47/100 (47%), Gaps = 17/100 (17%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           EP   V  N LS EEC+ LI LA   +++S +  +     +++ +RTSS  F+    + I
Sbjct: 32  EPLIVVLGNVLSDEECDELIQLAGDKVKRSKIGTT----REENELRTSSSMFIEDDENLI 87

Query: 143 IRDIEKRIADFTFFPL-------------GMQFKFSLFFF 169
           +  ++KRI+     P+             G Q+K    FF
Sbjct: 88  VTRVKKRISAIMKIPMEHGEGLQILRYTPGQQYKAHHDFF 127


>gi|47550697|ref|NP_999856.1| prolyl 4-hydroxylase, alpha polypeptide I b precursor [Danio rerio]
 gi|28277826|gb|AAH45890.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [Danio rerio]
          Length = 536

 Score = 42.4 bits (98), Expect = 0.065,   Method: Composition-based stats.
 Identities = 23/74 (31%), Positives = 37/74 (50%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   YH  +S  E E +  +A P +R++T+ +  TG  + +  R S   +L+    
Sbjct: 334 WDRPRIVRYHEIISDSEIETVKEMAKPRLRRATISNPITGVLETAPYRISKSAWLSGYEH 393

Query: 141 KIIRDIEKRIADFT 154
             I  I +RI D T
Sbjct: 394 STIERINQRIEDVT 407


>gi|212530|gb|AAA49002.1| prolyl 4-hydroxylase, alpha subunit (EC 1.14.11.2), partial [Gallus
           gallus]
          Length = 489

 Score = 42.4 bits (98), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 38/75 (50%), Gaps = 1/75 (1%)

Query: 81  SWE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
            W+ PR   + + +S EE E +  LA P + ++TV D +TGK   +  R S   +L+   
Sbjct: 286 EWDKPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYE 345

Query: 140 DKIIRDIEKRIADFT 154
             ++  I  RI D T
Sbjct: 346 SPVVSRINTRIQDLT 360


>gi|307211752|gb|EFN87747.1| Prolyl 4-hydroxylase subunit alpha-1 [Harpegnathos saltator]
          Length = 415

 Score = 42.4 bits (98), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 21/72 (29%), Positives = 35/72 (48%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR   YHN +  EE E +  +A P  +++TV +  TG  + +  R S   +L     K 
Sbjct: 207 DPRIVFYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKH 266

Query: 143 IRDIEKRIADFT 154
           +  + KR+   T
Sbjct: 267 VAAVSKRVEHMT 278


>gi|129365|sp|P16924.1|P4HA1_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1
          Length = 516

 Score = 42.4 bits (98), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 38/75 (50%), Gaps = 1/75 (1%)

Query: 81  SWE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
            W+ PR   + + +S EE E +  LA P + ++TV D +TGK   +  R S   +L+   
Sbjct: 313 EWDKPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYE 372

Query: 140 DKIIRDIEKRIADFT 154
             ++  I  RI D T
Sbjct: 373 SPVVSRINTRIQDLT 387


>gi|321474877|gb|EFX85841.1| hypothetical protein DAPPUDRAFT_208740 [Daphnia pulex]
          Length = 545

 Score = 42.4 bits (98), Expect = 0.070,   Method: Composition-based stats.
 Identities = 19/72 (26%), Positives = 37/72 (51%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +P   +YHN ++ +E E +  +A P  +++TV +S TG  + +  R S   +L       
Sbjct: 343 KPLIVIYHNVINDDEIETVKKMAQPRFKRATVQNSVTGNLEPANYRISKSAWLKSEEHDH 402

Query: 143 IRDIEKRIADFT 154
           +  + +R+ D T
Sbjct: 403 VFKVTRRVGDVT 414


>gi|281350467|gb|EFB26051.1| hypothetical protein PANDA_009188 [Ailuropoda melanoleuca]
          Length = 511

 Score = 42.4 bits (98), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 22/74 (29%), Positives = 39/74 (52%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E E + +LA P +R++T+ +  TG  +    R S   +L+   +
Sbjct: 332 WDKPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYEN 391

Query: 141 KIIRDIEKRIADFT 154
            ++  I  RI D T
Sbjct: 392 PVVSRINMRIQDLT 405


>gi|424864983|ref|ZP_18288870.1| oxidoreductase, 2OG-Fe(II) oxygenase family [SAR86 cluster
           bacterium SAR86B]
 gi|400759083|gb|EJP73273.1| oxidoreductase, 2OG-Fe(II) oxygenase family [SAR86 cluster
           bacterium SAR86B]
          Length = 418

 Score = 42.4 bits (98), Expect = 0.071,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 37/62 (59%), Gaps = 1/62 (1%)

Query: 91  NFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRI 150
           NF++ EEC+ LI +   H  +S+VV   T +S  +  RTSS + L    + II+ I K+I
Sbjct: 6   NFITHEECDELIKMIDAHHTRSSVVVGGTDRSDITDHRTSSTSNLDP-NNVIIKSIHKKI 64

Query: 151 AD 152
           AD
Sbjct: 65  AD 66


>gi|449513594|ref|XP_002191636.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like, partial
           [Taeniopygia guttata]
          Length = 346

 Score = 42.4 bits (98), Expect = 0.076,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 38/75 (50%), Gaps = 1/75 (1%)

Query: 81  SWE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
            W+ PR   + + +S EE E +  LA P + ++TV D +TGK   +  R S   +L+   
Sbjct: 220 EWDKPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYE 279

Query: 140 DKIIRDIEKRIADFT 154
             ++  I  RI D T
Sbjct: 280 SPVVSRINTRIQDLT 294


>gi|15077349|gb|AAK83137.1| prolyl 4-hydroxylase alpha subunit [Cavia porcellus]
          Length = 141

 Score = 42.4 bits (98), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 39/75 (52%), Gaps = 1/75 (1%)

Query: 81  SWE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
            W+ PR   +H+ +S  E E + +LA P +R++T+ +  TG  +    R S   +L+   
Sbjct: 43  EWDKPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYE 102

Query: 140 DKIIRDIEKRIADFT 154
           + ++  I  RI D T
Sbjct: 103 NPVVSRINMRIQDLT 117


>gi|256083648|ref|XP_002578053.1| prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
 gi|360044447|emb|CCD81995.1| putative prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
          Length = 584

 Score = 42.4 bits (98), Expect = 0.078,   Method: Composition-based stats.
 Identities = 24/81 (29%), Positives = 42/81 (51%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           E ++ +PR  ++++ +   E E +  LATP +R++TV +  TG  + +  RTS   +L  
Sbjct: 373 ETLNPDPRIVMWYDLIFPSEIEKIKELATPRLRRATVKNPVTGILEIAFYRTSKSAWLPH 432

Query: 138 GRDKIIRDIEKRIADFTFFPL 158
              +I   I +RI   T   L
Sbjct: 433 SMSEITDQISQRIRAVTGLSL 453


>gi|240974259|ref|XP_002401836.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215491070|gb|EEC00711.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 490

 Score = 42.4 bits (98), Expect = 0.079,   Method: Compositional matrix adjust.
 Identities = 22/77 (28%), Positives = 42/77 (54%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV+  +PR  +YH+ +SK E + +  LA P ++++TV +  +G+ + +  R S   +L  
Sbjct: 285 EVMFPKPRIVIYHDVMSKHEMDVVKLLAQPRLKRATVQNYKSGELEVANYRISKSAWLRN 344

Query: 138 GRDKIIRDIEKRIADFT 154
               +I  + +RI   T
Sbjct: 345 EEHGVIARVTRRIEHIT 361


>gi|194764881|ref|XP_001964556.1| GF23245 [Drosophila ananassae]
 gi|190614828|gb|EDV30352.1| GF23245 [Drosophila ananassae]
          Length = 460

 Score = 42.0 bits (97), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 28/93 (30%), Positives = 46/93 (49%), Gaps = 7/93 (7%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E IS +P   VYH+ + + E  +L++      R S V     G+S+ S +RTS      
Sbjct: 262 MEEISTDPYMVVYHDVIYENEINWLLD--NSDFRTSLV-----GESQISTLRTSQDMPFG 314

Query: 137 RGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFF 169
               +++R+IEKRI D T   + +   F L  +
Sbjct: 315 ANSGEVMRNIEKRIKDMTGLSMDLSEDFMLINY 347


>gi|91806774|gb|ABE66114.1| oxidoreductase 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 125

 Score = 42.0 bits (97), Expect = 0.080,   Method: Compositional matrix adjust.
 Identities = 16/20 (80%), Positives = 19/20 (95%)

Query: 74  EQWVEVISWEPRAFVYHNFL 93
           E+W+EVI+ EPRAFVYHNFL
Sbjct: 85  ERWLEVITKEPRAFVYHNFL 104


>gi|351696981|gb|EHA99899.1| Prolyl 4-hydroxylase subunit alpha-3 [Heterocephalus glaber]
          Length = 572

 Score = 42.0 bits (97), Expect = 0.084,   Method: Composition-based stats.
 Identities = 24/77 (31%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  EP   +YH+F+S  E + +  LA P +++S V   +  K      R S   +L  
Sbjct: 368 EVIHLEPYVALYHDFVSDPEAQKIRKLAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 425

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  ++ RIA  T
Sbjct: 426 TADPVLVTLDHRIAALT 442


>gi|321463241|gb|EFX74258.1| hypothetical protein DAPPUDRAFT_22132 [Daphnia pulex]
          Length = 523

 Score = 42.0 bits (97), Expect = 0.085,   Method: Composition-based stats.
 Identities = 25/78 (32%), Positives = 39/78 (50%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E  S+EP  + +H+ LS EE E +  LA P + +S V        + S VRTS   +L 
Sbjct: 312 IEQHSFEPAIYTFHDVLSDEEIETIKELAKPLLARSMVQGKLGVGHEVSNVRTSKTAWLP 371

Query: 137 RGRDKIIRDIEKRIADFT 154
            G   ++  + +RI   T
Sbjct: 372 EGLHPLLNRLSRRIGLIT 389


>gi|440899661|gb|ELR50930.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Bos grunniens mutus]
          Length = 478

 Score = 42.0 bits (97), Expect = 0.086,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  EP   +YH+F+S  E + +  LA P +++S V   +  K      R S   +L  
Sbjct: 274 EVIHLEPYVVLYHDFVSDAEAQTIRGLAEPWLQRSVVASGE--KQLPVEYRISKSAWLKD 331

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  ++ RIA  T
Sbjct: 332 TVDPVLVTLDHRIAALT 348


>gi|198449643|ref|XP_001357664.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
 gi|198130698|gb|EAL26798.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
          Length = 549

 Score = 42.0 bits (97), Expect = 0.088,   Method: Composition-based stats.
 Identities = 25/78 (32%), Positives = 41/78 (52%), Gaps = 2/78 (2%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE +S +P    +H+ LS +E E +I      + +S +    TG S  S +RTS  T+L 
Sbjct: 339 VEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQVTRSEI--GQTGNSTVSDIRTSQNTWLW 396

Query: 137 RGRDKIIRDIEKRIADFT 154
              +  + DI++R+ D T
Sbjct: 397 YENNPWLADIKQRLEDIT 414


>gi|195452776|ref|XP_002073495.1| GK13117 [Drosophila willistoni]
 gi|194169580|gb|EDW84481.1| GK13117 [Drosophila willistoni]
          Length = 487

 Score = 42.0 bits (97), Expect = 0.088,   Method: Compositional matrix adjust.
 Identities = 21/83 (25%), Positives = 42/83 (50%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E+I  +P   +YH+ +S  E   L  +A P ++++ V +S     + S+ RT+   +  
Sbjct: 281 MELIGLDPYMVLYHDVISPNEIAELQEMAKPQLKRARVYNSTKNTDQLSKTRTAKLAWFL 340

Query: 137 RGRDKIIRDIEKRIADFTFFPLG 159
              +++   + +RI D T F L 
Sbjct: 341 DTFNQLTERLNQRIMDMTNFVLN 363


>gi|218187602|gb|EEC70029.1| hypothetical protein OsI_00603 [Oryza sativa Indica Group]
          Length = 549

 Score = 42.0 bits (97), Expect = 0.089,   Method: Compositional matrix adjust.
 Identities = 26/79 (32%), Positives = 39/79 (49%), Gaps = 13/79 (16%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTV-VDSDTGKSKDSRVRTSSGTFLARG 138
           +SW PR F+Y  FLS  EC++L++    +M  S    D D         R SS   +   
Sbjct: 309 LSWHPRIFLYEGFLSDMECDHLVSTGRGNMDSSLAFTDGD---------RNSSYNNI--- 356

Query: 139 RDKIIRDIEKRIADFTFFP 157
            D ++  IE RI+ ++F P
Sbjct: 357 EDIVVSKIEDRISLWSFLP 375


>gi|426245942|ref|XP_004016760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Ovis
           aries]
          Length = 514

 Score = 42.0 bits (97), Expect = 0.092,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  EP   +YH+F+S  E + +  LA P +++S V   +  K      R S   +L  
Sbjct: 310 EVIHLEPYVVLYHDFVSDAEAQKIRGLAEPWLQRSVVASGE--KQLPVEYRISKSAWLKD 367

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  ++ RIA  T
Sbjct: 368 TVDPVLVTLDHRIAALT 384


>gi|348555277|ref|XP_003463450.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cavia porcellus]
          Length = 584

 Score = 42.0 bits (97), Expect = 0.092,   Method: Composition-based stats.
 Identities = 25/77 (32%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  EP   +YH+F+S  E + +  LA P +++S V  +  GK      R S   +L  
Sbjct: 380 EVIHLEPYVALYHDFVSDPEAQKIRELAEPWLQRSVV--ASGGKQLQVEYRISKSAWLKD 437

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  +  RIA  T
Sbjct: 438 TVDPMLVTLNHRIAALT 454


>gi|219123691|ref|XP_002182153.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217406114|gb|EEC46054.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 188

 Score = 42.0 bits (97), Expect = 0.094,   Method: Compositional matrix adjust.
 Identities = 25/80 (31%), Positives = 43/80 (53%), Gaps = 5/80 (6%)

Query: 79  VISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
           V++  P  F   NFL+  ECE+LI++A      + VV    G+   S  RTSS  +L+R 
Sbjct: 1   VLNTSPPMFAVDNFLTPLECEFLIHMAQDSFGPAPVVGKGAGEVSPS--RTSSTCYLSR- 57

Query: 139 RDKIIRDIEKRIADFTFFPL 158
             + + D+ ++++  T  P+
Sbjct: 58  --EDLPDLMRKVSSLTGKPI 75


>gi|2980790|emb|CAA18166.1| hypothetical protein [Arabidopsis thaliana]
          Length = 316

 Score = 42.0 bits (97), Expect = 0.094,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 43/80 (53%), Gaps = 25/80 (31%)

Query: 85  RAFVYHNFLSKEECEYLINLATPHMRKSTV----VDSDTGKSKDSRVRTSSGTFLARGRD 140
           R F+Y  FLS+EEC++LI+L     RK T     VD+D GK++                D
Sbjct: 72  RVFLYRGFLSEEECDHLISL-----RKETTEVYSVDAD-GKTQ---------------LD 110

Query: 141 KIIRDIEKRIADFTFFPLGM 160
            ++  IE++++ +TF P G+
Sbjct: 111 PVVAGIEEKVSAWTFLPGGL 130


>gi|148233143|ref|NP_001090904.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Sus scrofa]
 gi|83778522|gb|ABC47142.1| procollagen-proline 2-oxoglutarate-4-dioxygenase [Sus scrofa]
          Length = 534

 Score = 42.0 bits (97), Expect = 0.095,   Method: Composition-based stats.
 Identities = 20/74 (27%), Positives = 39/74 (52%), Gaps = 1/74 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           W+ PR   +H+ +S  E + + +LA P +R++T+ +  TG  +    R S   +L+   +
Sbjct: 332 WDKPRIIRFHDIISDAEIDIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYEN 391

Query: 141 KIIRDIEKRIADFT 154
            ++  +  RI D T
Sbjct: 392 PVVSRLNMRIQDLT 405


>gi|149068803|gb|EDM18355.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide III [Rattus
           norvegicus]
          Length = 266

 Score = 42.0 bits (97), Expect = 0.096,   Method: Compositional matrix adjust.
 Identities = 27/94 (28%), Positives = 46/94 (48%), Gaps = 4/94 (4%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI   P   +YH+F+S EE + +  LA P +++S V   +  K      R S   +L  
Sbjct: 78  EVIHLRPLVALYHDFVSDEEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 135

Query: 138 GRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYV 171
             D ++  +++RIA  T   L +Q  ++ +   V
Sbjct: 136 TVDPVLVTLDRRIAALTG--LDIQPPYAEYLQVV 167


>gi|328790718|ref|XP_392392.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Apis mellifera]
          Length = 415

 Score = 42.0 bits (97), Expect = 0.098,   Method: Compositional matrix adjust.
 Identities = 20/72 (27%), Positives = 36/72 (50%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR  VYHN +  +E E +  +A P  +++TV +  TG  + +  R S   +L     K 
Sbjct: 207 DPRIVVYHNVIYDDEIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKH 266

Query: 143 IRDIEKRIADFT 154
           +  + +R+   T
Sbjct: 267 VAAVSRRVEHMT 278


>gi|427410797|ref|ZP_18900999.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
           51230]
 gi|425710785|gb|EKU73805.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
           51230]
          Length = 322

 Score = 42.0 bits (97), Expect = 0.100,   Method: Compositional matrix adjust.
 Identities = 25/95 (26%), Positives = 45/95 (47%), Gaps = 3/95 (3%)

Query: 81  SWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR- 139
            W+ R F    FL+ +EC ++I+     +  + V+D  +G+     +RTS G      R 
Sbjct: 137 GWDVRLF--RQFLTGDECHHVISEGQALLEPAMVIDPRSGRPMPHPIRTSDGGIFGPARE 194

Query: 140 DKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           D +I+ I +RIA  +   L      +L  + V ++
Sbjct: 195 DLVIQAINRRIAAASGTMLSGGEPLTLLRYAVGQQ 229


>gi|383642155|ref|ZP_09954561.1| hypothetical protein SeloA3_06917 [Sphingomonas elodea ATCC 31461]
          Length = 327

 Score = 42.0 bits (97), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 23/73 (31%), Positives = 38/73 (52%), Gaps = 1/73 (1%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDK- 141
           +PR   +  FLS+EEC ++   A   +  S V+D ++G+     +RTS G  +    +  
Sbjct: 139 DPRVEHFPGFLSREECAHVATTAQDLLEPSFVLDPNSGRPIPHPIRTSDGGAIGPTNENL 198

Query: 142 IIRDIEKRIADFT 154
           ++R I  RIA  T
Sbjct: 199 VVRAINLRIAAAT 211


>gi|307108823|gb|EFN57062.1| hypothetical protein CHLNCDRAFT_143806 [Chlorella variabilis]
          Length = 514

 Score = 42.0 bits (97), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 23/56 (41%), Positives = 31/56 (55%), Gaps = 5/56 (8%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFL 135
           +S EP   +   FLS EEC  +  L  PHM++S V   D     ++ +RTS GTFL
Sbjct: 283 VSEEPCLLLADAFLSPEECGEVRALGAPHMKRSKVSAGD-----ETPLRTSWGTFL 333


>gi|307103831|gb|EFN52088.1| hypothetical protein CHLNCDRAFT_139357 [Chlorella variabilis]
          Length = 1038

 Score = 42.0 bits (97), Expect = 0.10,   Method: Composition-based stats.
 Identities = 17/39 (43%), Positives = 26/39 (66%)

Query: 120 GKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPL 158
           G   D  +RTS GTFL R +D+++  IE R+A++T  P+
Sbjct: 3   GSISDDPIRTSWGTFLTRAQDEVVYAIEHRVANWTHLPV 41


>gi|48675383|ref|NP_001001598.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
 gi|75053350|sp|Q75UG4.1|P4HA3_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|47115494|dbj|BAD18888.1| Collagen prolyl 4-hydroxylase alpha III subunit [Bos taurus]
 gi|296479828|tpg|DAA21943.1| TPA: prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
          Length = 544

 Score = 42.0 bits (97), Expect = 0.10,   Method: Composition-based stats.
 Identities = 24/77 (31%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  EP   +YH+F+S  E + +  LA P +++S V   +  K      R S   +L  
Sbjct: 340 EVIHLEPYVVLYHDFVSDAEAQTIRGLAEPWLQRSVVASGE--KQLPVEYRISKSAWLKD 397

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  ++ RIA  T
Sbjct: 398 TVDPVLVTLDHRIAALT 414


>gi|328696638|ref|XP_003240086.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
           [Acyrthosiphon pisum]
          Length = 534

 Score = 41.6 bits (96), Expect = 0.11,   Method: Composition-based stats.
 Identities = 19/73 (26%), Positives = 38/73 (52%)

Query: 82  WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDK 141
           + PR  +Y + L   E E +  +A P ++++TV +  TG+ + +  R S   +L    D 
Sbjct: 327 FSPRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFADYRISKSAWLKEHEDV 386

Query: 142 IIRDIEKRIADFT 154
           ++ ++ KR+   T
Sbjct: 387 VVANVAKRVEVMT 399


>gi|193688213|ref|XP_001943683.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
           [Acyrthosiphon pisum]
          Length = 552

 Score = 41.6 bits (96), Expect = 0.11,   Method: Composition-based stats.
 Identities = 19/73 (26%), Positives = 38/73 (52%)

Query: 82  WEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDK 141
           + PR  +Y + L   E E +  +A P ++++TV +  TG+ + +  R S   +L    D 
Sbjct: 345 FSPRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFADYRISKSAWLKEHEDV 404

Query: 142 IIRDIEKRIADFT 154
           ++ ++ KR+   T
Sbjct: 405 VVANVAKRVEVMT 417


>gi|308475644|ref|XP_003100040.1| hypothetical protein CRE_20852 [Caenorhabditis remanei]
 gi|308266092|gb|EFP10045.1| hypothetical protein CRE_20852 [Caenorhabditis remanei]
          Length = 300

 Score = 41.6 bits (96), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 23/58 (39%), Positives = 35/58 (60%), Gaps = 2/58 (3%)

Query: 77  VEVISWEPRAFVYHNFLSKEECE-YLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGT 133
           +EV+SW P   +Y NF +K++ E YL  L    + +  VVD + GK   S+VR ++GT
Sbjct: 83  IEVLSWSPPLVIYRNFFTKKQVESYLQLLKIQSLEEQEVVD-EKGKPFISKVRVANGT 139


>gi|386712780|ref|YP_006179102.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
           2266]
 gi|384072335|emb|CCG43825.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
           2266]
          Length = 211

 Score = 41.6 bits (96), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 27/75 (36%), Positives = 39/75 (52%), Gaps = 6/75 (8%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P   +  N +S+EECE LI L+   M +S +      + + S +RTSS TFL    D + 
Sbjct: 34  PLIAILGNVVSEEECEELIFLSKNKMNRSKI----GSQHEVSDIRTSSSTFLP--EDDLT 87

Query: 144 RDIEKRIADFTFFPL 158
             IEKR+A     P+
Sbjct: 88  NRIEKRVAQIMNVPV 102


>gi|444512226|gb|ELV10078.1| Prolyl 4-hydroxylase subunit alpha-1 [Tupaia chinensis]
          Length = 474

 Score = 41.6 bits (96), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 39/75 (52%), Gaps = 1/75 (1%)

Query: 81  SWE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
            W+ PR   +H+ +S  E E + +LA P +R++T+ +  TG  +    R S   +L+   
Sbjct: 304 EWDKPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYE 363

Query: 140 DKIIRDIEKRIADFT 154
           + ++  I  RI D T
Sbjct: 364 NPVVSRINMRIQDLT 378


>gi|363729586|ref|XP_417248.3| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Gallus gallus]
          Length = 542

 Score = 41.2 bits (95), Expect = 0.14,   Method: Composition-based stats.
 Identities = 22/77 (28%), Positives = 39/77 (50%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           E +  +P   +YH+F+S  E E +  LA P +++S V   +  K +    R S   +L  
Sbjct: 338 ETLRLQPYIVLYHDFVSDAEAETIKGLAGPWLQRSVVASGE--KQQKVEYRISKSAWLKD 395

Query: 138 GRDKIIRDIEKRIADFT 154
             D +++ +E R+A  T
Sbjct: 396 TADPVVQALELRMAAIT 412


>gi|156370183|ref|XP_001628351.1| predicted protein [Nematostella vectensis]
 gi|156215325|gb|EDO36288.1| predicted protein [Nematostella vectensis]
          Length = 478

 Score = 41.2 bits (95), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 22/78 (28%), Positives = 41/78 (52%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE ++ +PR  ++   ++  E   +  +A+P + ++TV + DTG  + +  R S   +L 
Sbjct: 329 VEQVNDDPRVVIFRGLVTDRETARIKQIASPMLNRATVYNIDTGVLEYADYRVSKSAWLE 388

Query: 137 RGRDKIIRDIEKRIADFT 154
              D+ I  + KRIA  T
Sbjct: 389 DHLDETIATVNKRIAMVT 406


>gi|332140647|ref|YP_004426385.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
           'Deep ecotype']
 gi|327550669|gb|AEA97387.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
           'Deep ecotype']
          Length = 376

 Score = 41.2 bits (95), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 35/68 (51%), Gaps = 1/68 (1%)

Query: 88  VYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR-DKIIRDI 146
           VY + LS+ EC YLI      ++ S VVD  TG+ K   VRTS    +     D I R +
Sbjct: 180 VYESILSEYECRYLITKFNALLKPSMVVDPVTGRGKIDSVRTSYVAVIEPAHCDWITRKL 239

Query: 147 EKRIADFT 154
           +K I+  T
Sbjct: 240 DKTISQIT 247


>gi|395814850|ref|XP_003780953.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Otolemur
           garnettii]
          Length = 544

 Score = 41.2 bits (95), Expect = 0.15,   Method: Composition-based stats.
 Identities = 24/77 (31%), Positives = 40/77 (51%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  EP   +YH+F+S  E + +  LA P +++S V   +     D R+  S+  +L  
Sbjct: 340 EVIHLEPFVALYHDFVSDSEAQKIRELAEPWLQRSVVASGEKQLQVDYRISKSA--WLKD 397

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  ++ RIA  T
Sbjct: 398 TVDPMLVTLDHRIAALT 414


>gi|347972274|ref|XP_001237637.3| AGAP004611-PA [Anopheles gambiae str. PEST]
 gi|333469330|gb|EAU76664.3| AGAP004611-PA [Anopheles gambiae str. PEST]
          Length = 514

 Score = 41.2 bits (95), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 21/78 (26%), Positives = 43/78 (55%), Gaps = 2/78 (2%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           ++ ++ +P   +YH+ +S +E + +I+++ P M +S V   D  +   S+ RTSS  +L 
Sbjct: 309 LQEVNHDPMIVMYHDVISNKEIDAIISISKPLMHRSMV--GDDHEKAVSKTRTSSNAWLD 366

Query: 137 RGRDKIIRDIEKRIADFT 154
                ++R + +R  D T
Sbjct: 367 DVMHPVVRTLSQRTEDMT 384


>gi|291230950|ref|XP_002735430.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saccoglossus
           kowalevskii]
          Length = 533

 Score = 41.2 bits (95), Expect = 0.16,   Method: Composition-based stats.
 Identities = 19/77 (24%), Positives = 41/77 (53%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV+  +P+  ++H+ +   E   +  LA+P +R++T+ +S TG  + +  R S   +L+ 
Sbjct: 327 EVVFDKPKLIIFHDAILTNEIRKVKALASPRLRRATIQNSVTGNLEFAEYRISKSAWLSE 386

Query: 138 GRDKIIRDIEKRIADFT 154
               ++  +  RI  +T
Sbjct: 387 DDGDVVHRLNHRIEQYT 403


>gi|126327904|ref|XP_001367838.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Monodelphis
           domestica]
          Length = 559

 Score = 41.2 bits (95), Expect = 0.16,   Method: Composition-based stats.
 Identities = 22/77 (28%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV+  EP   +YH+F+S  E + +   A P +++S V   +  K +    R S   +L  
Sbjct: 355 EVLHLEPYIVLYHDFVSDSEAQKIRGFAAPWLQRSVVASGE--KQQQVEYRISKSAWLKD 412

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  ++ RIA  T
Sbjct: 413 TVDPMLVSLDHRIAALT 429


>gi|51490656|emb|CAF31507.1| prolyl 4-hydroxylase 2 precursor [Brugia malayi]
          Length = 551

 Score = 41.2 bits (95), Expect = 0.16,   Method: Composition-based stats.
 Identities = 20/74 (27%), Positives = 40/74 (54%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE++   P   ++ + +S EE   +  LA P + ++TV +  TG  + +  RTS  ++L 
Sbjct: 321 VEIVHQNPLVVLFRDIVSDEEMRIIEMLAVPKLARATVHNVVTGNIETAFYRTSQSSWLG 380

Query: 137 RGRDKIIRDIEKRI 150
               ++++ I KR+
Sbjct: 381 STEHEVVKRINKRL 394


>gi|398810140|ref|ZP_10568970.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
 gi|398083831|gb|EJL74535.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
          Length = 296

 Score = 41.2 bits (95), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 21/71 (29%), Positives = 35/71 (49%)

Query: 88  VYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIE 147
           V  +    +ECE LI LA P +  ST VD  +G+      R+S G F     +  I  ++
Sbjct: 103 VLDDVFDPQECEELIALARPRLAPSTTVDPLSGRDLVGEQRSSLGMFFRLRENAFIARLD 162

Query: 148 KRIADFTFFPL 158
           +R+++    P+
Sbjct: 163 QRVSELMNLPV 173


>gi|194765140|ref|XP_001964685.1| GF23318 [Drosophila ananassae]
 gi|190614957|gb|EDV30481.1| GF23318 [Drosophila ananassae]
          Length = 412

 Score = 41.2 bits (95), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 27/89 (30%), Positives = 43/89 (48%), Gaps = 3/89 (3%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           E I  +P   V+H+ LS  E   LI+L    + ++  V+    KS    VRT+   ++ R
Sbjct: 242 EQIGLDPYVVVFHDVLSPREISKLISLTDRKLVQAVTVNK---KSFKEMVRTAKAHWVYR 298

Query: 138 GRDKIIRDIEKRIADFTFFPLGMQFKFSL 166
           G  ++ + I +RI D + F L     F L
Sbjct: 299 GYQELTKRIYRRIHDMSGFELADAENFQL 327


>gi|410632646|ref|ZP_11343301.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
 gi|410147883|dbj|GAC20168.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
          Length = 480

 Score = 41.2 bits (95), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 27/92 (29%), Positives = 42/92 (45%), Gaps = 17/92 (18%)

Query: 91  NFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRI 150
           +FL  +EC+ LI L     + ST+    T ++ D + RTSS   L   +D +IR I+ +I
Sbjct: 103 DFLLPQECQALIELIEQAKQPSTI----TSENPDQQFRTSSTCHLGNMQDPVIRKIDLQI 158

Query: 151 ADF-------------TFFPLGMQFKFSLFFF 169
             +               + LG QFK    +F
Sbjct: 159 CQYLGIDPSYSEVIQGQHYQLGQQFKPHTDYF 190


>gi|344296798|ref|XP_003420090.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Loxodonta
           africana]
          Length = 544

 Score = 41.2 bits (95), Expect = 0.16,   Method: Composition-based stats.
 Identities = 23/77 (29%), Positives = 40/77 (51%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  EP   +YH+F++  E + +  LA P +++S V   +     D R+  S+  +L  
Sbjct: 340 EVIHLEPYVVLYHDFVNDMEAQKIKGLAEPWLQRSVVASGEKQLQVDYRISKSA--WLKD 397

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  ++ RIA  T
Sbjct: 398 SVDPMLVTLDHRIAALT 414


>gi|7269410|emb|CAB81370.1| hypothetical protein [Arabidopsis thaliana]
          Length = 315

 Score = 41.2 bits (95), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 41/76 (53%), Gaps = 17/76 (22%)

Query: 85  RAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIR 144
           R F+Y  FLS+EEC++LI+L      +   VD+D GK++                D ++ 
Sbjct: 71  RVFLYRGFLSEEECDHLISLGK-ETTEVYSVDAD-GKTQ---------------LDPVVA 113

Query: 145 DIEKRIADFTFFPLGM 160
            IE++++ +TF P G+
Sbjct: 114 GIEEKVSAWTFLPGGL 129


>gi|194213450|ref|XP_001495951.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Equus
           caballus]
          Length = 548

 Score = 41.2 bits (95), Expect = 0.17,   Method: Composition-based stats.
 Identities = 24/77 (31%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  EP   +YH+F+S  E + +  LA P +++S V   +  K      R S   +L  
Sbjct: 344 EVIHLEPYVVLYHDFVSDSEAQKIRGLAEPWLQRSVVASGE--KQLPVEYRISKSAWLKD 401

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  ++ RIA  T
Sbjct: 402 TVDPMLVTLDHRIAALT 418


>gi|324510827|gb|ADY44523.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
          Length = 551

 Score = 41.2 bits (95), Expect = 0.17,   Method: Composition-based stats.
 Identities = 20/74 (27%), Positives = 38/74 (51%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE++   P A ++H  +S EE   +  LA P + ++TV ++ TG  + +  R S   +L 
Sbjct: 322 VEIMRLNPLAVLFHQIMSDEEAHIIEMLAIPKLNRATVQNAMTGGLETASYRISKSAWLK 381

Query: 137 RGRDKIIRDIEKRI 150
               +++    KR+
Sbjct: 382 PHEHEVVDRFNKRL 395


>gi|421871431|ref|ZP_16303052.1| 2OG-Fe(II) oxygenase superfamily protein [Brevibacillus
           laterosporus GI-9]
 gi|372459315|emb|CCF12601.1| 2OG-Fe(II) oxygenase superfamily protein [Brevibacillus
           laterosporus GI-9]
          Length = 201

 Score = 41.2 bits (95), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 25/94 (26%), Positives = 48/94 (51%), Gaps = 6/94 (6%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKS--KDSRVRTSSGTFL 135
           ++++ +P    Y + +S E C+ LINLA   +  +TVV    G+S  + S VR S   + 
Sbjct: 6   QLLNQQPFIGCYPSLISSEACQSLINLARGQLTPATVV----GQSGLEVSHVRISELAWF 61

Query: 136 ARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFF 169
               +++++ I K+IA+    P+    K  +  +
Sbjct: 62  CHNYNEVVQSICKQIAEIVEQPIHYAEKLQVAHY 95


>gi|312385412|gb|EFR29925.1| hypothetical protein AND_00803 [Anopheles darlingi]
          Length = 468

 Score = 41.2 bits (95), Expect = 0.17,   Method: Compositional matrix adjust.
 Identities = 23/78 (29%), Positives = 40/78 (51%), Gaps = 2/78 (2%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E +S +P   VYH  +S  E + +I ++   +R++ V   D  K + S+ RTSS  +L 
Sbjct: 317 LEEVSLDPFIVVYHQVISDNEIKTIIEISRDSLRRAMV--GDVAKQEVSKARTSSNAWLD 374

Query: 137 RGRDKIIRDIEKRIADFT 154
                 +R + +R  D T
Sbjct: 375 DPMHPHVRSLSRRTEDMT 392


>gi|358398156|gb|EHK47514.1| hypothetical protein TRIATDRAFT_52078, partial [Trichoderma
           atroviride IMI 206040]
          Length = 1051

 Score = 40.8 bits (94), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 38/75 (50%), Gaps = 2/75 (2%)

Query: 79  VISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
           ++S EP       FLS+ E ++L+ ++ P    ST+ D  +   +DS +R S+   L   
Sbjct: 822 ILSREPLVVYLEGFLSESERKHLLEISEPIFEPSTITDDSSSTHRDSTIRDSAVALLP-- 879

Query: 139 RDKIIRDIEKRIADF 153
           R  I+R IE R   F
Sbjct: 880 RTDIVRCIEARAKAF 894


>gi|339009924|ref|ZP_08642495.1| 2OG-Fe(II) oxygenase [Brevibacillus laterosporus LMG 15441]
 gi|338773194|gb|EGP32726.1| 2OG-Fe(II) oxygenase [Brevibacillus laterosporus LMG 15441]
          Length = 201

 Score = 40.8 bits (94), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 25/94 (26%), Positives = 48/94 (51%), Gaps = 6/94 (6%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKS--KDSRVRTSSGTFL 135
           ++++ +P    Y + +S E C+ LINLA   +  +TVV    G+S  + S VR S   + 
Sbjct: 6   QLLNQQPFIGCYPSLISSEACQSLINLARGQLTPATVV----GQSGLEVSHVRISELAWF 61

Query: 136 ARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFF 169
               +++++ I K+IA+    P+    K  +  +
Sbjct: 62  CHNYNEVVQSICKQIAEIVEQPIHYAEKLQVAHY 95


>gi|443705944|gb|ELU02240.1| hypothetical protein CAPTEDRAFT_227850 [Capitella teleta]
          Length = 475

 Score = 40.8 bits (94), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 21/77 (27%), Positives = 41/77 (53%), Gaps = 1/77 (1%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           E++   P  +++H+F+S  E + L ++A P  + S V+D   G+S     R SS  F+  
Sbjct: 178 ELLHANPEIYLFHDFISDSEIQRLKDMAEPQFQSSAVLDDTGGESFFDVSRLSSTAFVND 237

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  + +R++  T
Sbjct: 238 SND-LVASLNRRVSKLT 253


>gi|339261892|ref|XP_003367679.1| prolyl 4-hydroxylase subunit alpha-2 [Trichinella spiralis]
 gi|316962562|gb|EFV48687.1| prolyl 4-hydroxylase subunit alpha-2 [Trichinella spiralis]
          Length = 319

 Score = 40.8 bits (94), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 18/54 (33%), Positives = 33/54 (61%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTS 130
           VEV+ W+P+  ++   +S  E   L  LA P + ++TV +S+TG+ + ++ R S
Sbjct: 195 VEVMHWKPKIVIFRQVISANEIAVLKTLAYPRLSRATVQNSETGELETAKYRIS 248


>gi|224107303|ref|XP_002314439.1| predicted protein [Populus trichocarpa]
 gi|222863479|gb|EEF00610.1| predicted protein [Populus trichocarpa]
          Length = 143

 Score = 40.8 bits (94), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 18/45 (40%), Positives = 29/45 (64%)

Query: 117 SDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFTFFPLGMQ 161
           ++   SK S+VRTSSG F+++ +D I+   E +IA +T  P G+ 
Sbjct: 37  AEVKPSKISQVRTSSGMFISKAKDPIVAGTEDKIATWTSLPKGVN 81


>gi|298712929|emb|CBJ26831.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 294

 Score = 40.8 bits (94), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 24/54 (44%), Positives = 31/54 (57%), Gaps = 2/54 (3%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           P  +V  +F S  EC+ LI LA  +M  S VV +  G+  +S  RTSS  FLAR
Sbjct: 101 PPLYVVDDFFSGPECDALIALAGNYMIVSPVVGAGAGEVSES--RTSSSCFLAR 152


>gi|431838427|gb|ELK00359.1| Prolyl 4-hydroxylase subunit alpha-3 [Pteropus alecto]
          Length = 483

 Score = 40.8 bits (94), Expect = 0.21,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  EP   +YH+F+S  E + +  LA P +++S V   +  K      R S   +L  
Sbjct: 279 EVIHLEPYVVLYHDFVSDLEAQKIRGLAEPWLQRSVVASGE--KQLPVEYRISKSAWLKD 336

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  ++ RIA  T
Sbjct: 337 TADPMLVTLDHRIAALT 353


>gi|196011900|ref|XP_002115813.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
 gi|190581589|gb|EDV21665.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
          Length = 581

 Score = 40.8 bits (94), Expect = 0.21,   Method: Composition-based stats.
 Identities = 23/77 (29%), Positives = 39/77 (50%), Gaps = 1/77 (1%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV-DSDTGKSKDSRVRTSSGTFL 135
           VEV+S +P   +YHN L+  E   L  LA+P ++++ VV   D    +++  R S   +L
Sbjct: 345 VEVLSLQPYIVIYHNLLTNSEVVLLKTLASPLLKRAVVVGKPDKEYGEETTYRISKTAWL 404

Query: 136 ARGRDKIIRDIEKRIAD 152
            +     ++ I   I D
Sbjct: 405 DKEDHPAVKRITTLIGD 421


>gi|355709028|gb|AES03457.1| prolyl 4-hydroxylase, alpha polypeptide III [Mustela putorius furo]
          Length = 477

 Score = 40.8 bits (94), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  EP   +YH+F+S  E + +  LA P +++S V   +  K      R S   +L  
Sbjct: 274 EVIHLEPYVVLYHDFVSDMEAQKIRGLAEPWLQRSVVASGE--KQLPVEYRISKSAWLKD 331

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++ +++ RI   T
Sbjct: 332 TVDPLLVNLDHRIGALT 348


>gi|219113023|ref|XP_002186095.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|209582945|gb|ACI65565.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 508

 Score = 40.8 bits (94), Expect = 0.22,   Method: Compositional matrix adjust.
 Identities = 25/77 (32%), Positives = 43/77 (55%), Gaps = 2/77 (2%)

Query: 79  VISWEPRAFVYHNFLSKEECEYLINLATPH-MRKSTV-VDSDTGKSKDSRVRTSSGTFLA 136
           V+S  PR F   +FLS  E E+L+N+A+   +++ST+     +  + +   RTS+  ++ 
Sbjct: 283 VLSCVPRVFEVKDFLSDMEVEHLLNIASKRKLKRSTMHAGGSSEATTNDDTRTSTNDWIP 342

Query: 137 RGRDKIIRDIEKRIADF 153
           R +D I   I +R AD 
Sbjct: 343 RHQDLITDTIYRRAADL 359


>gi|195505253|ref|XP_002099424.1| GE23369 [Drosophila yakuba]
 gi|194185525|gb|EDW99136.1| GE23369 [Drosophila yakuba]
          Length = 164

 Score = 40.4 bits (93), Expect = 0.23,   Method: Compositional matrix adjust.
 Identities = 26/90 (28%), Positives = 44/90 (48%), Gaps = 2/90 (2%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E +   P   +YH+ +S +E   LI LA   ++ S V  +     K  R+RT    ++ 
Sbjct: 1   MEQVGLNPYVVLYHDVISPQESAQLIELAASDLKASGVFQAKGSTFK--RLRTVKARWIK 58

Query: 137 RGRDKIIRDIEKRIADFTFFPLGMQFKFSL 166
           +  +++ + I +RI D T F L    KF L
Sbjct: 59  KEFNELTKRITRRIRDMTGFDLKEGEKFQL 88


>gi|339236275|ref|XP_003379692.1| prolyl 4-hydroxylase subunit alpha-2 [Trichinella spiralis]
 gi|316977629|gb|EFV60704.1| prolyl 4-hydroxylase subunit alpha-2 [Trichinella spiralis]
          Length = 441

 Score = 40.4 bits (93), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 18/54 (33%), Positives = 33/54 (61%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTS 130
           VEV+ W+P+  ++   +S  E   L  LA P + ++TV +S+TG+ + ++ R S
Sbjct: 208 VEVMHWKPKIVIFRQVISANEIAVLKTLAYPRLSRATVQNSETGELETAKYRIS 261


>gi|21358233|ref|NP_651814.1| prolyl-4-hydroxylase-alpha NE3 [Drosophila melanogaster]
 gi|20269810|gb|AAM18060.1|AF495538_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE3
           [Drosophila melanogaster]
 gi|15291443|gb|AAK92990.1| GH21465p [Drosophila melanogaster]
 gi|23172714|gb|AAN14251.1| prolyl-4-hydroxylase-alpha NE3 [Drosophila melanogaster]
 gi|220945610|gb|ACL85348.1| PH4alphaNE3-PA [synthetic construct]
 gi|220955396|gb|ACL90241.1| PH4alphaNE3-PA [synthetic construct]
          Length = 481

 Score = 40.4 bits (93), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 38/70 (54%), Gaps = 1/70 (1%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E IS EP   VYH+ L  ++ + LI LA P ++ + + D +  +++ S  RT  G  L 
Sbjct: 298 MEEISLEPHIVVYHDILPDKDIQQLITLAEPLLKPTEMFDDNKNEARSS-YRTPLGGPLL 356

Query: 137 RGRDKIIRDI 146
               + +RDI
Sbjct: 357 DSLTQRMRDI 366


>gi|170591594|ref|XP_001900555.1| prolyl 4-hydroxylase 2 precursor [Brugia malayi]
 gi|158592167|gb|EDP30769.1| prolyl 4-hydroxylase 2 precursor, putative [Brugia malayi]
          Length = 405

 Score = 40.0 bits (92), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 20/75 (26%), Positives = 40/75 (53%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE++   P   ++ + +S EE   +  LA P + ++TV +  TG  + +  RTS  ++L 
Sbjct: 321 VEIVHQNPLVVLFRDIVSDEEMRIIEMLAVPKLARATVHNVVTGNIETAFYRTSQSSWLG 380

Query: 137 RGRDKIIRDIEKRIA 151
               ++++ I KR+ 
Sbjct: 381 STEHEVVKRINKRLG 395


>gi|194765178|ref|XP_001964704.1| GF23330 [Drosophila ananassae]
 gi|190614976|gb|EDV30500.1| GF23330 [Drosophila ananassae]
          Length = 537

 Score = 40.0 bits (92), Expect = 0.32,   Method: Composition-based stats.
 Identities = 20/54 (37%), Positives = 29/54 (53%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +P    YH+ LS ++   L  +A P MR+STV     G++K S  R S   +LA
Sbjct: 330 DPYVVTYHDMLSAQKIRDLRQMAVPRMRRSTVNPLPGGQNKKSAFRVSKNAWLA 383


>gi|223996679|ref|XP_002288013.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220977129|gb|EED95456.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 520

 Score = 40.0 bits (92), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 24/94 (25%), Positives = 43/94 (45%), Gaps = 17/94 (18%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG------------KSKDS 125
           + +S  P  ++ HN L+++EC+ L+ +A     +   +D   G            K   +
Sbjct: 222 DAVSRNPVVYLIHNMLTEDECDELVQMANGKYER---IDDSNGINNYLENTMASQKKGQT 278

Query: 126 RVRTSSGTFLARG--RDKIIRDIEKRIADFTFFP 157
                  T+L +G    K I+DI++RI+  T FP
Sbjct: 279 TAVNVDKTYLWKGGVAGKTIKDIDERISQVTGFP 312


>gi|317127314|ref|YP_004093596.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
           2522]
 gi|315472262|gb|ADU28865.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
           2522]
          Length = 229

 Score = 40.0 bits (92), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 21/76 (27%), Positives = 38/76 (50%), Gaps = 5/76 (6%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           EP   +  N LS+EEC+ LI+L+   + +S + +          +RTSS  F     + +
Sbjct: 43  EPLIVLLGNVLSEEECDQLISLSKDRIERSKISNKSV-----HDLRTSSSMFFDDAENDV 97

Query: 143 IRDIEKRIADFTFFPL 158
           +  +EKR++     P+
Sbjct: 98  VSTVEKRVSQIMKIPV 113


>gi|81870817|sp|Q6W3F0.1|P4HA3_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|36962749|gb|AAQ87604.1| collagen prolyl 4-hydroxylase alpha III subunit [Mus musculus]
          Length = 542

 Score = 40.0 bits (92), Expect = 0.34,   Method: Composition-based stats.
 Identities = 23/77 (29%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV+   P   +YH+F+S EE + +  LA P +++S V   +  K      R S   +L  
Sbjct: 338 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 395

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  ++ RIA  T
Sbjct: 396 TVDPMLVTLDHRIAALT 412


>gi|332211329|ref|XP_003254773.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Nomascus
           leucogenys]
          Length = 544

 Score = 40.0 bits (92), Expect = 0.34,   Method: Composition-based stats.
 Identities = 24/77 (31%), Positives = 37/77 (48%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  EP   +YH+F+S  E + +  LA P +++S V   +  K      R S   +L  
Sbjct: 340 EVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 397

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  +  RIA  T
Sbjct: 398 TVDPMLVTLNHRIAALT 414


>gi|227908832|ref|NP_796135.3| prolyl 4-hydroxylase subunit alpha-3 precursor [Mus musculus]
          Length = 542

 Score = 40.0 bits (92), Expect = 0.34,   Method: Composition-based stats.
 Identities = 23/77 (29%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV+   P   +YH+F+S EE + +  LA P +++S V   +  K      R S   +L  
Sbjct: 338 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 395

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  ++ RIA  T
Sbjct: 396 TVDPMLVTLDHRIAALT 412


>gi|239792190|dbj|BAH72464.1| ACYPI007079 [Acyrthosiphon pisum]
          Length = 249

 Score = 40.0 bits (92), Expect = 0.35,   Method: Compositional matrix adjust.
 Identities = 19/71 (26%), Positives = 37/71 (52%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           PR  +Y + L   E E +  +A P ++++TV +  TG+ + +  R S   +L    D ++
Sbjct: 44  PRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFADYRISKSAWLKEHEDVVV 103

Query: 144 RDIEKRIADFT 154
            ++ KR+   T
Sbjct: 104 ANVAKRVEVMT 114


>gi|281353153|gb|EFB28737.1| hypothetical protein PANDA_003344 [Ailuropoda melanoleuca]
          Length = 456

 Score = 40.0 bits (92), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 37/77 (48%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  EP   +YH+F+S  E + +  LA P +++S V   +  K      R S   +L  
Sbjct: 275 EVIHLEPYVVLYHDFVSDGEAQKIRGLAEPWLQRSVVASGE--KQLPVEYRISKSAWLKD 332

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  ++ RI   T
Sbjct: 333 TVDPLLVTLDHRIGALT 349


>gi|396458036|ref|XP_003833631.1| hypothetical protein LEMA_P063920.1 [Leptosphaeria maculans JN3]
 gi|312210179|emb|CBX90266.1| hypothetical protein LEMA_P063920.1 [Leptosphaeria maculans JN3]
          Length = 312

 Score = 40.0 bits (92), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 38/157 (24%), Positives = 71/157 (45%), Gaps = 20/157 (12%)

Query: 12  RKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSG---------DSRKANDLSSIVRKS 62
           R+    ++ LT L+ + F  L  +A+ +   PS S           S+ A+  +++  ++
Sbjct: 2   RRLLPPSIPLTSLLQYLF--LAAIAYLLAGAPSPSSLSTFSFTADSSQAAHRTAALHAQT 59

Query: 63  MERSEGDEGRAE------QWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVD 116
           +E     E   E      + V V+S EP       F+++EE   ++ L+ PH   ST+ +
Sbjct: 60  LENLAHPEPGLECAQYAYRGVYVLSREPLVVYIEGFVAREEAGEIVGLSNPHFTPSTIWN 119

Query: 117 SDTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADF 153
               ++ D ++R S    L   R  +I+ IE+R   F
Sbjct: 120 QGH-ETLDPKIRHSEKARLP--RTPLIQCIEQRAHSF 153


>gi|195452742|ref|XP_002073480.1| GK13123 [Drosophila willistoni]
 gi|194169565|gb|EDW84466.1| GK13123 [Drosophila willistoni]
          Length = 540

 Score = 40.0 bits (92), Expect = 0.38,   Method: Composition-based stats.
 Identities = 26/85 (30%), Positives = 45/85 (52%), Gaps = 4/85 (4%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE ++ +P    +HN +S +E + LI      +++S V     G S  S VRTS  T+L 
Sbjct: 325 VEQLNLDPYVAYFHNVISDDETDDLIEHGMGQVKRSRV--GTVGNSTVSEVRTSQNTWLW 382

Query: 137 RGRDKIIRDIEKRIADFTFFPLGMQ 161
             +   +++++ R+ D T   LGM+
Sbjct: 383 YEQQPWLKNLKLRLEDIT--GLGME 405


>gi|452979728|gb|EME79490.1| hypothetical protein MYCFIDRAFT_142999 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 246

 Score = 39.7 bits (91), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 42/77 (54%), Gaps = 2/77 (2%)

Query: 80  ISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR 139
           +S++P      + +S EE +YLI LA P + KS V  ++ G   DS  RTSS  FL    
Sbjct: 1   MSYDPLIAHLEDLVSYEERQYLIKLARPLLHKSQVSLAN-GTQTDSPGRTSSTAFLP-SS 58

Query: 140 DKIIRDIEKRIADFTFF 156
           D ++  + +R A+F  +
Sbjct: 59  DPVVMHVLERAAEFQGY 75


>gi|358377859|gb|EHK15542.1| hypothetical protein TRIVIDRAFT_64561 [Trichoderma virens Gv29-8]
          Length = 219

 Score = 39.7 bits (91), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 40/80 (50%), Gaps = 4/80 (5%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           + ++S  P      NF++ EE  +L ++A  H + S V+    G S    VRTS  T + 
Sbjct: 1   MHLLSKSPLVIYIENFITDEERLHLQHIAEGHFKHSAVLKG--GNSAIHSVRTSQSTTVD 58

Query: 137 RGRDKIIRDIEKRIADFTFF 156
           R  D ++R IE R  DF  F
Sbjct: 59  R--DAVVRCIEARALDFQGF 76


>gi|297689698|ref|XP_002822285.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pongo abelii]
          Length = 544

 Score = 39.7 bits (91), Expect = 0.40,   Method: Composition-based stats.
 Identities = 24/77 (31%), Positives = 37/77 (48%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  EP   +YH+F+S  E + +  LA P +++S V   +  K      R S   +L  
Sbjct: 340 EVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 397

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  +  RIA  T
Sbjct: 398 TVDPMLVTLNHRIAALT 414


>gi|340516384|gb|EGR46633.1| predicted protein [Trichoderma reesei QM6a]
          Length = 1043

 Score = 39.7 bits (91), Expect = 0.45,   Method: Composition-based stats.
 Identities = 24/77 (31%), Positives = 36/77 (46%), Gaps = 2/77 (2%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V ++  EP       FLS+EE  +L+ ++ P    ST+    +   +D+ VR SS   L 
Sbjct: 831 VRILRREPLVVYLEGFLSEEERRHLLEISEPLFEPSTITHDASSTHRDTTVRDSSVALLP 890

Query: 137 RGRDKIIRDIEKRIADF 153
             R   +R IE R   F
Sbjct: 891 --RTDTVRCIEARSLAF 905


>gi|148684485|gb|EDL16432.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide III [Mus musculus]
          Length = 396

 Score = 39.7 bits (91), Expect = 0.47,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV+   P   +YH+F+S EE + +  LA P +++S V   +  K      R S   +L  
Sbjct: 181 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 238

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  ++ RIA  T
Sbjct: 239 TVDPMLVTLDHRIAALT 255


>gi|301115862|ref|XP_002905660.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110449|gb|EEY68501.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 215

 Score = 39.7 bits (91), Expect = 0.49,   Method: Compositional matrix adjust.
 Identities = 22/79 (27%), Positives = 39/79 (49%), Gaps = 1/79 (1%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFLARGRDKI 142
           P  F    FL  +E + ++ L+ PH+  S V   D  +++  +  RTS+  +L      +
Sbjct: 3   PLVFSVEEFLRDDEIDVILELSMPHLAPSGVTLQDGHENRPATDWRTSTTYWLDSSSHPV 62

Query: 143 IRDIEKRIADFTFFPLGMQ 161
           ++ I+KR AD    P+  Q
Sbjct: 63  VQTIDKRTADLVKVPISHQ 81


>gi|195505202|ref|XP_002099402.1| GE23382 [Drosophila yakuba]
 gi|194185503|gb|EDW99114.1| GE23382 [Drosophila yakuba]
          Length = 537

 Score = 39.7 bits (91), Expect = 0.51,   Method: Composition-based stats.
 Identities = 21/56 (37%), Positives = 28/56 (50%)

Query: 81  SWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           S +P    YH+ LS  +   L  +A P MR+STV     G+ K S  R S   +LA
Sbjct: 328 SLDPYVATYHDMLSPRKISQLREMAVPRMRRSTVNPLPGGQHKKSAFRVSKNAWLA 383


>gi|410860761|ref|YP_006975995.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii AltDE1]
 gi|410818023|gb|AFV84640.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii AltDE1]
          Length = 376

 Score = 39.7 bits (91), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 36/68 (52%), Gaps = 1/68 (1%)

Query: 88  VYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGR-DKIIRDI 146
           VY + LS+ EC YLI   +  ++ S VVD  TG+ K   VRTS    +     D I R +
Sbjct: 180 VYESILSEYECRYLIAKFSALLKPSMVVDPVTGRGKIDSVRTSYVAVIEPTHCDWITRKL 239

Query: 147 EKRIADFT 154
           +K I+  T
Sbjct: 240 DKIISQIT 247


>gi|380495790|emb|CCF32129.1| 2OG-Fe(II) oxygenase [Colletotrichum higginsianum]
          Length = 254

 Score = 39.7 bits (91), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 22/80 (27%), Positives = 44/80 (55%), Gaps = 2/80 (2%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +S++P      NF++ +E  +L+ +A P   +S  + +D G+S  ++ RTSS  +L 
Sbjct: 45  VRTLSYDPLIQHLENFITPQESRHLVAIAQPRFERSLAIRTD-GRSVAAQERTSSTAYLP 103

Query: 137 RGRDKIIRDIEKRIADFTFF 156
              D +++ I  R ++F  +
Sbjct: 104 PD-DPVVQCITSRASEFQGY 122


>gi|52139015|gb|AAH82538.1| P4ha3 protein [Mus musculus]
          Length = 404

 Score = 39.3 bits (90), Expect = 0.53,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV+   P   +YH+F+S EE + +  LA P +++S V   +  K      R S   +L  
Sbjct: 200 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 257

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  ++ RIA  T
Sbjct: 258 TVDPMLVTLDHRIAALT 274


>gi|407929772|gb|EKG22582.1| Oxoglutarate/iron-dependent oxygenase [Macrophomina phaseolina MS6]
          Length = 260

 Score = 39.3 bits (90), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 23/79 (29%), Positives = 39/79 (49%), Gaps = 2/79 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
            + S +P      ++LSK E EYL+ LA P+ R+S V      ++ D  +R+S    +  
Sbjct: 44  HIFSTDPLIIYIEDYLSKAETEYLLELAVPYYRQSPVSKGYQLEAYDREIRSSMSAVVP- 102

Query: 138 GRDKIIRDIEKRIADFTFF 156
             D ++  IE+R   F  +
Sbjct: 103 -DDPVVSCIEERSVGFQGY 120


>gi|391342914|ref|XP_003745760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Metaseiulus
           occidentalis]
          Length = 525

 Score = 39.3 bits (90), Expect = 0.56,   Method: Composition-based stats.
 Identities = 18/78 (23%), Positives = 42/78 (53%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +EVI   P   ++H+ +S +E + +I L+ P ++++TV ++ +G+ + +  R S   +L 
Sbjct: 319 LEVIHERPYLALFHDIMSDDEIQTVIELSAPRLKRATVQNAKSGELEVANYRISKSAWLK 378

Query: 137 RGRDKIIRDIEKRIADFT 154
               +++  +  R    T
Sbjct: 379 NHDHEVVERLSFRFEYLT 396


>gi|170064951|ref|XP_001867739.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
 gi|167882142|gb|EDS45525.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
          Length = 516

 Score = 39.3 bits (90), Expect = 0.57,   Method: Compositional matrix adjust.
 Identities = 22/78 (28%), Positives = 37/78 (47%), Gaps = 2/78 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV++ +P   VYH+  S  E   +I L  P + +S V   D  K + S+ RTS  ++L  
Sbjct: 314 EVVNLDPFVAVYHDAASDAEINKVIELGRPQINRSMV--GDAAKKEVSKSRTSQNSWLTD 371

Query: 138 GRDKIIRDIEKRIADFTF 155
               ++  + +R  D   
Sbjct: 372 YDHPVVAALSRRTKDMAL 389


>gi|156370133|ref|XP_001628326.1| predicted protein [Nematostella vectensis]
 gi|156215300|gb|EDO36263.1| predicted protein [Nematostella vectensis]
          Length = 526

 Score = 39.3 bits (90), Expect = 0.59,   Method: Composition-based stats.
 Identities = 25/78 (32%), Positives = 46/78 (58%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E++S  P+  ++HN LS+ E E ++ LA P +R++ V + +TG+ +D   R S   +L+
Sbjct: 319 MEIVSVNPQITLFHNVLSEMEIEQMLELARPRLRRARVNNLETGEIEDVDYRISQIAWLS 378

Query: 137 RGRDKIIRDIEKRIADFT 154
                I+R I +R+   T
Sbjct: 379 DSDGDIVRRINRRVGFIT 396


>gi|26352077|dbj|BAC39675.1| unnamed protein product [Mus musculus]
          Length = 383

 Score = 39.3 bits (90), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV+   P   +YH+F+S EE + +  LA P +++S V   +  K      R S   +L  
Sbjct: 200 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 257

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  ++ RIA  T
Sbjct: 258 TVDPMLVTLDHRIAALT 274


>gi|410447164|ref|ZP_11301266.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [SAR86 cluster
           bacterium SAR86E]
 gi|409980151|gb|EKO36903.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [SAR86 cluster
           bacterium SAR86E]
          Length = 214

 Score = 39.3 bits (90), Expect = 0.63,   Method: Compositional matrix adjust.
 Identities = 23/95 (24%), Positives = 49/95 (51%), Gaps = 8/95 (8%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRV--RTSSGTF 134
           V + S  P  ++  NFLS  EC+  IN A   ++ STV+    G + + ++  RTS   +
Sbjct: 14  VYLYSVNPIVYLVKNFLSDLECDAFINEAEGRLQDSTVI----GANDEIKLGARTSQNCW 69

Query: 135 LARGRDKIIRDIEKRIADFTFFPL--GMQFKFSLF 167
           +    ++++ ++ KR++     P+    Q++ + +
Sbjct: 70  IEHDANELVHEVSKRLSILAQIPIRNAEQYQLACY 104


>gi|242821944|ref|XP_002487784.1| prolyl 4-hydroxylase alpha subunit, putative [Talaromyces
           stipitatus ATCC 10500]
 gi|218712705|gb|EED12130.1| prolyl 4-hydroxylase alpha subunit, putative [Talaromyces
           stipitatus ATCC 10500]
          Length = 257

 Score = 39.3 bits (90), Expect = 0.65,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 37/76 (48%), Gaps = 1/76 (1%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +P      +F+S +E  +L+ LA      S++ D+D     D   R S    L R  D+I
Sbjct: 56  DPLVVYVKDFISTQEAAHLVQLANGRFTPSSIWDNDGKSHIDKAYRLSMTAKLDR-SDEI 114

Query: 143 IRDIEKRIADFTFFPL 158
           I+ IE+R   F F+ L
Sbjct: 115 IKKIEERAISFPFYKL 130


>gi|403263105|ref|XP_003923900.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-3, partial [Saimiri boliviensis boliviensis]
          Length = 534

 Score = 39.3 bits (90), Expect = 0.66,   Method: Composition-based stats.
 Identities = 23/77 (29%), Positives = 37/77 (48%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV+  EP   +YH+F+S  E + +  LA P +++S V   +  K      R S   +L  
Sbjct: 330 EVLHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 387

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  +  RIA  T
Sbjct: 388 TVDPMLVTLNHRIAALT 404


>gi|302903109|ref|XP_003048786.1| hypothetical protein NECHADRAFT_71135 [Nectria haematococca mpVI
           77-13-4]
 gi|256729720|gb|EEU43073.1| hypothetical protein NECHADRAFT_71135 [Nectria haematococca mpVI
           77-13-4]
          Length = 1080

 Score = 39.3 bits (90), Expect = 0.66,   Method: Composition-based stats.
 Identities = 22/72 (30%), Positives = 36/72 (50%), Gaps = 2/72 (2%)

Query: 79  VISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
           ++  EP       FLS+EE ++L+ ++ P    ST+ +      +DS VR SS   +   
Sbjct: 878 ILHREPLVVYLEGFLSEEERKHLLEISEPIFEPSTITNDGEATHRDSSVRDSSVAVIP-- 935

Query: 139 RDKIIRDIEKRI 150
           R   +R IE R+
Sbjct: 936 RTDAVRCIENRV 947


>gi|242085722|ref|XP_002443286.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
 gi|241943979|gb|EES17124.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
          Length = 147

 Score = 39.3 bits (90), Expect = 0.66,   Method: Compositional matrix adjust.
 Identities = 17/41 (41%), Positives = 25/41 (60%)

Query: 134 FLARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFFFYVLKK 174
           FL RG+D I+R IE+RIAD+T  P+       +  + V +K
Sbjct: 2   FLKRGQDTIVRTIEQRIADYTSVPIENGEPLQVLHYAVGQK 42


>gi|410665706|ref|YP_006918077.1| 2OG-Fe(II) oxygenase [Simiduia agarivorans SA1 = DSM 21679]
 gi|409028063|gb|AFV00348.1| 2OG-Fe(II) oxygenase [Simiduia agarivorans SA1 = DSM 21679]
          Length = 270

 Score = 38.9 bits (89), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 25/65 (38%), Positives = 34/65 (52%), Gaps = 4/65 (6%)

Query: 87  FVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDI 146
           FV  +FLS  ECE ++ L    +R STV   +         RTSS   L R    +IRDI
Sbjct: 84  FVLKDFLSAAECEGVMALMDSKLRPSTVTTGNLHYG----YRTSSTCDLGRMVSPLIRDI 139

Query: 147 EKRIA 151
           +++IA
Sbjct: 140 DQKIA 144


>gi|195159311|ref|XP_002020525.1| GL13465 [Drosophila persimilis]
 gi|194117294|gb|EDW39337.1| GL13465 [Drosophila persimilis]
          Length = 578

 Score = 38.9 bits (89), Expect = 0.68,   Method: Composition-based stats.
 Identities = 21/82 (25%), Positives = 41/82 (50%), Gaps = 1/82 (1%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHM-RKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           E++S  P   +YH+ ++  E   L NL+ PHM R++   +    +      RTS+  +L 
Sbjct: 379 ELLSLAPYMVLYHDVITPLESLTLKNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNSVWLT 438

Query: 137 RGRDKIIRDIEKRIADFTFFPL 158
              + ++  +E+R+   T F +
Sbjct: 439 SHENAVMERLERRVGVMTNFEM 460


>gi|195505214|ref|XP_002099407.1| GE23379 [Drosophila yakuba]
 gi|194185508|gb|EDW99119.1| GE23379 [Drosophila yakuba]
          Length = 547

 Score = 38.9 bits (89), Expect = 0.68,   Method: Compositional matrix adjust.
 Identities = 23/94 (24%), Positives = 46/94 (48%), Gaps = 6/94 (6%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD--SRVRTSSGTF 134
            E++S +P   ++H+ +S++E   + + +  HM  S   D D   S+D  +  RTS   +
Sbjct: 330 TEILSIDPFVLLFHDMISQKESTLIRSSSKEHMLPSATTDVDASGSEDHVATFRTSKSVW 389

Query: 135 LARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFF 168
            +   +   + I +R+ D T    G+   F+ +F
Sbjct: 390 YSSTSNDTTKRITERLGDAT----GLDMNFTEYF 419


>gi|89092696|ref|ZP_01165649.1| hypothetical protein MED92_15358 [Neptuniibacter caesariensis]
 gi|89083208|gb|EAR62427.1| hypothetical protein MED92_15358 [Oceanospirillum sp. MED92]
          Length = 441

 Score = 38.9 bits (89), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 39/81 (48%), Gaps = 7/81 (8%)

Query: 85  RAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIR 144
             F  +NFLS EEC  +I L   H R ST  + +TG  K    RTS    L+      + 
Sbjct: 71  EVFYVNNFLSPEECAQMIELIQHHQRPSTTTN-ETGHYK--HYRTSKTCDLSLLESTFVA 127

Query: 145 DIEKRIADFTFFPLGMQFKFS 165
           +I++RI       LG++  +S
Sbjct: 128 EIDQRICKM----LGIEPSYS 144


>gi|410975458|ref|XP_003994148.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Felis catus]
          Length = 567

 Score = 38.9 bits (89), Expect = 0.73,   Method: Composition-based stats.
 Identities = 18/50 (36%), Positives = 30/50 (60%), Gaps = 1/50 (2%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTS 130
           W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S
Sbjct: 332 WDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 381


>gi|301759032|ref|XP_002915381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Ailuropoda
           melanoleuca]
          Length = 539

 Score = 38.9 bits (89), Expect = 0.77,   Method: Composition-based stats.
 Identities = 23/77 (29%), Positives = 37/77 (48%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  EP   +YH+F+S  E + +  LA P +++S V   +  K      R S   +L  
Sbjct: 335 EVIHLEPYVVLYHDFVSDGEAQKIRGLAEPWLQRSVVASGE--KQLPVEYRISKSAWLKD 392

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  ++ RI   T
Sbjct: 393 TVDPLLVTLDHRIGALT 409


>gi|428172003|gb|EKX40915.1| hypothetical protein GUITHDRAFT_112917 [Guillardia theta CCMP2712]
          Length = 421

 Score = 38.9 bits (89), Expect = 0.78,   Method: Compositional matrix adjust.
 Identities = 27/88 (30%), Positives = 42/88 (47%), Gaps = 5/88 (5%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTV-VDSDTGKSKDSRVRTSSGTFL 135
            +V S  PR     +FL+ EEC  LI+ A P M +STV  + D+  S     RTSS  +L
Sbjct: 194 AKVRSISPRVLEVEDFLTPEECHELISSAKPLMSRSTVSAEGDSAVSLQESSRTSSTAWL 253

Query: 136 ARGRDKIIRDIEKRIADFTFFPLGMQFK 163
                 +   +  R++      +G+ F+
Sbjct: 254 PPHSHTLANKLYDRVSSL----VGIDFR 277


>gi|219116048|ref|XP_002178819.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217409586|gb|EEC49517.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 417

 Score = 38.9 bits (89), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 24/81 (29%), Positives = 40/81 (49%), Gaps = 3/81 (3%)

Query: 77  VEVISWEP--RAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG-KSKDSRVRTSSGT 133
           V+V+   P  +  V   F+S EEC+ + + A P + + TV D   G K  +SR    +G 
Sbjct: 185 VQVLHNRPSSQIHVLEGFISPEECQAIKDAAAPKLHRGTVADGKGGSKLSESRKAWQAGV 244

Query: 134 FLARGRDKIIRDIEKRIADFT 154
            +       I D++KR+  +T
Sbjct: 245 GVDYSHKNAISDLKKRLFAYT 265


>gi|194905372|ref|XP_001981184.1| GG11758 [Drosophila erecta]
 gi|190655822|gb|EDV53054.1| GG11758 [Drosophila erecta]
          Length = 550

 Score = 38.9 bits (89), Expect = 0.86,   Method: Composition-based stats.
 Identities = 21/72 (29%), Positives = 39/72 (54%), Gaps = 2/72 (2%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +P   +YH+ + + E + +  L    + ++TV   +  +S  S VRTS  TF+     K+
Sbjct: 329 DPLLVLYHDVIYQSEIDVIRKLTENRLMRATVTGHN--ESLVSNVRTSQFTFIPASAHKV 386

Query: 143 IRDIEKRIADFT 154
           +  I++R+AD T
Sbjct: 387 LSTIDQRVADMT 398


>gi|355752458|gb|EHH56578.1| hypothetical protein EGM_06023, partial [Macaca fascicularis]
          Length = 586

 Score = 38.5 bits (88), Expect = 0.91,   Method: Composition-based stats.
 Identities = 23/77 (29%), Positives = 36/77 (46%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  EP   +YH+F+S  E + +   A P +++S V   +  K      R S   +L  
Sbjct: 327 EVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 384

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  +  RIA  T
Sbjct: 385 TVDPMLVTLNHRIAALT 401


>gi|310795631|gb|EFQ31092.1| 2OG-Fe(II) oxygenase superfamily protein [Glomerella graminicola
           M1.001]
          Length = 259

 Score = 38.5 bits (88), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 44/80 (55%), Gaps = 3/80 (3%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V V+S  P      +FL+  E  +L+++A+     S+V+D+  GK+  + VRTS  T L 
Sbjct: 39  VHVVSKSPLVIYLADFLTPMERGHLLSMASGTFTHSSVIDASGGKTTHN-VRTSRSTSLW 97

Query: 137 RGRDKIIRDIEKRIADFTFF 156
             R+ ++R IE+R   F  +
Sbjct: 98  --RNDVVRCIEERAVAFQGY 115


>gi|355566863|gb|EHH23242.1| hypothetical protein EGK_06672, partial [Macaca mulatta]
          Length = 583

 Score = 38.5 bits (88), Expect = 0.94,   Method: Composition-based stats.
 Identities = 23/77 (29%), Positives = 36/77 (46%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  EP   +YH+F+S  E + +   A P +++S V   +  K      R S   +L  
Sbjct: 324 EVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 381

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  +  RIA  T
Sbjct: 382 TVDPMLVTLNHRIAALT 398


>gi|195055773|ref|XP_001994787.1| GH17427 [Drosophila grimshawi]
 gi|193892550|gb|EDV91416.1| GH17427 [Drosophila grimshawi]
          Length = 538

 Score = 38.5 bits (88), Expect = 0.94,   Method: Composition-based stats.
 Identities = 22/74 (29%), Positives = 37/74 (50%)

Query: 81  SWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           S +P    YH+ LS ++   L  +A PHM++STV      +SK S  R S   +L     
Sbjct: 329 SLDPLVVSYHDMLSPQQIIELRQMAVPHMKRSTVNPLPGRQSKKSAFRVSKNAWLEYDTH 388

Query: 141 KIIRDIEKRIADFT 154
            ++  + + ++D T
Sbjct: 389 PMMGRMLRDLSDAT 402


>gi|402894624|ref|XP_003910453.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-3 [Papio anubis]
          Length = 535

 Score = 38.5 bits (88), Expect = 0.97,   Method: Composition-based stats.
 Identities = 23/77 (29%), Positives = 36/77 (46%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  EP   +YH+F+S  E + +   A P +++S V   +  K      R S   +L  
Sbjct: 331 EVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 388

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  +  RIA  T
Sbjct: 389 TVDPMLVTLNHRIAALT 405


>gi|194373965|dbj|BAG62295.1| unnamed protein product [Homo sapiens]
          Length = 604

 Score = 38.5 bits (88), Expect = 0.99,   Method: Composition-based stats.
 Identities = 24/77 (31%), Positives = 36/77 (46%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  EP   +YH+F+S  E + +  LA P +++S V   +  K      R S   +L  
Sbjct: 340 EVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 397

Query: 138 GRDKIIRDIEKRIADFT 154
             D  +  +  RIA  T
Sbjct: 398 TVDPKLVTLNHRIAALT 414


>gi|224014480|ref|XP_002296902.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220968282|gb|EED86630.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 638

 Score = 38.5 bits (88), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 21/65 (32%), Positives = 29/65 (44%), Gaps = 9/65 (13%)

Query: 72  RAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
           +  QWV+VIS EP   + H FL  E C+ ++           V   DT  S DS+   + 
Sbjct: 69  QEHQWVDVISSEPPLLIVHGFLEPEHCDSIVQ---------AVNHDDTDPSSDSKTTLTR 119

Query: 132 GTFLA 136
            T  A
Sbjct: 120 STMGA 124


>gi|380484283|emb|CCF40094.1| hypothetical protein CH063_10756 [Colletotrichum higginsianum]
          Length = 254

 Score = 38.5 bits (88), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 43/78 (55%), Gaps = 2/78 (2%)

Query: 79  VISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
           ++S++P      NF+SK E E+L++L     + S+V + + G +  S  RTSS  +L  G
Sbjct: 50  ILSYDPLMIHLENFISKAEREHLLSLGQSSFQASSVTNWN-GTNVISEDRTSSSAYLPDG 108

Query: 139 RDKIIRDIEKRIADFTFF 156
            D I+  I  R ++F  +
Sbjct: 109 -DPIVERIISRASEFQGY 125


>gi|223993535|ref|XP_002286451.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220977766|gb|EED96092.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 679

 Score = 38.1 bits (87), Expect = 1.1,   Method: Composition-based stats.
 Identities = 24/71 (33%), Positives = 39/71 (54%), Gaps = 9/71 (12%)

Query: 76  WVEVISWEPRAFVYHNFLSKEECEYLINLA------TPHMRKSTVVDSDTGKSKDSRVRT 129
           W+EVIS +PR F   NF  KEE + +++ A      T  M++S+     +G + +S+ RT
Sbjct: 331 WLEVISLKPRVFDIFNFFDKEESKAIVDKAIAETSETHRMKRSST--GASGYNVNSQ-RT 387

Query: 130 SSGTFLARGRD 140
           S   F   G++
Sbjct: 388 SENGFDTHGKE 398


>gi|33589818|ref|NP_878907.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Homo sapiens]
 gi|114639354|ref|XP_001174896.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan troglodytes]
 gi|397487266|ref|XP_003814725.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan paniscus]
 gi|74738714|sp|Q7Z4N8.1|P4HA3_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|33188232|gb|AAP97874.1| prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
 gi|36962719|gb|AAQ87603.1| collagen prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
 gi|37182165|gb|AAQ88885.1| GPGA711 [Homo sapiens]
 gi|109658570|gb|AAI17334.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
 gi|119595341|gb|EAW74935.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide III, isoform CRA_b
           [Homo sapiens]
 gi|410219716|gb|JAA07077.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
 gi|410248278|gb|JAA12106.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
 gi|410336087|gb|JAA36990.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
          Length = 544

 Score = 38.1 bits (87), Expect = 1.2,   Method: Composition-based stats.
 Identities = 24/77 (31%), Positives = 36/77 (46%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  EP   +YH+F+S  E + +  LA P +++S V   +  K      R S   +L  
Sbjct: 340 EVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 397

Query: 138 GRDKIIRDIEKRIADFT 154
             D  +  +  RIA  T
Sbjct: 398 TVDPKLVTLNHRIAALT 414


>gi|390176896|ref|XP_002136934.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
 gi|388858831|gb|EDY67492.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
          Length = 513

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 21/82 (25%), Positives = 41/82 (50%), Gaps = 1/82 (1%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHM-RKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           E++S  P   +YH+ ++  E   L NL+ PHM R++   +    +      RTS+  +L 
Sbjct: 316 EILSLSPYMVLYHDVITPLESLTLKNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNSVWLT 375

Query: 137 RGRDKIIRDIEKRIADFTFFPL 158
              + ++  +E+R+   T F +
Sbjct: 376 SHENAVMERLERRVGVMTNFEM 397


>gi|358379380|gb|EHK17060.1| hypothetical protein TRIVIDRAFT_112751, partial [Trichoderma virens
           Gv29-8]
          Length = 1039

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 37/75 (49%), Gaps = 2/75 (2%)

Query: 79  VISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
           ++S EP       FLS+ E ++L+ ++ P    ST+    +   +D+ VR SS   L   
Sbjct: 807 ILSREPLVVYLEGFLSESERKHLLEISEPIFEPSTITHDSSATHRDTAVRDSSVAVLP-- 864

Query: 139 RDKIIRDIEKRIADF 153
           R  I+R IE R   F
Sbjct: 865 RTDIVRCIEARARAF 879


>gi|452987886|gb|EME87641.1| hypothetical protein MYCFIDRAFT_26183 [Pseudocercospora fijiensis
           CIRAD86]
          Length = 287

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 24/74 (32%), Positives = 36/74 (48%), Gaps = 3/74 (4%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKII 143
           P       FLS EE E+L+ L+      STVV+    +  D +VR S    +   RD ++
Sbjct: 79  PLVIYVEGFLSGEEAEHLVALSEDRWNVSTVVNQGV-EGIDDKVRKSEKASIP--RDHVV 135

Query: 144 RDIEKRIADFTFFP 157
           + IE+R   F  +P
Sbjct: 136 QCIEQRALSFQGWP 149


>gi|59809017|gb|AAH89446.1| P4HA3 protein [Homo sapiens]
          Length = 528

 Score = 38.1 bits (87), Expect = 1.2,   Method: Composition-based stats.
 Identities = 24/77 (31%), Positives = 36/77 (46%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  EP   +YH+F+S  E + +  LA P +++S V   +  K      R S   +L  
Sbjct: 324 EVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 381

Query: 138 GRDKIIRDIEKRIADFT 154
             D  +  +  RIA  T
Sbjct: 382 TVDPKLVTLNHRIAALT 398


>gi|119595340|gb|EAW74934.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide III, isoform CRA_a
           [Homo sapiens]
          Length = 657

 Score = 38.1 bits (87), Expect = 1.2,   Method: Composition-based stats.
 Identities = 24/77 (31%), Positives = 36/77 (46%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  EP   +YH+F+S  E + +  LA P +++S V   +  K      R S   +L  
Sbjct: 372 EVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 429

Query: 138 GRDKIIRDIEKRIADFT 154
             D  +  +  RIA  T
Sbjct: 430 TVDPKLVTLNHRIAALT 446


>gi|195391760|ref|XP_002054528.1| GJ22757 [Drosophila virilis]
 gi|194152614|gb|EDW68048.1| GJ22757 [Drosophila virilis]
          Length = 534

 Score = 38.1 bits (87), Expect = 1.2,   Method: Composition-based stats.
 Identities = 24/84 (28%), Positives = 39/84 (46%), Gaps = 5/84 (5%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E  S +P    +H+ LS+     L  +A PHM++STV     G+ + S  R S   +L 
Sbjct: 321 LEEHSLDPLVVTFHDMLSQHRIAELREMAVPHMQRSTVNPLPGGQRRKSAFRVSKNAWLP 380

Query: 137 RGRD----KIIRDIEKRIA-DFTF 155
                   +++RD+      D TF
Sbjct: 381 YSTHPTMGRMLRDVSDATGLDMTF 404


>gi|453089562|gb|EMF17602.1| hypothetical protein SEPMUDRAFT_35426 [Mycosphaerella populorum
           SO2202]
          Length = 331

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 24/81 (29%), Positives = 42/81 (51%), Gaps = 3/81 (3%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
             + S  P      +FLS EE E+LI L+T +   ST+ ++   ++ D ++R S    + 
Sbjct: 89  THIFSTSPLVIYIEDFLSAEEVEHLIALSTNNWTPSTIFNAGL-ETTDPKIRKSEKASIP 147

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
           R R  +++ IE+R   F  +P
Sbjct: 148 RSR--LVQCIEQRALAFQGWP 166


>gi|341893923|gb|EGT49858.1| CBN-PHY-3 protein [Caenorhabditis brenneri]
          Length = 275

 Score = 38.1 bits (87), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 22/86 (25%), Positives = 43/86 (50%), Gaps = 11/86 (12%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVV---DSDTGKSKDSRVRTSSGT 133
           +E++SW P   +Y N L+  +    ++     + K  +V    SD GKS ++  R ++G+
Sbjct: 43  MEIVSWSPTLVIYRNLLTPRQASEFLDF----IEKRDMVMQKTSDNGKSIETTHRRANGS 98

Query: 134 FLARGRDKIIRD----IEKRIADFTF 155
           F+  G  ++  +    ++KRI    F
Sbjct: 99  FIDHGATEVTSEVHNWVQKRIPALNF 124


>gi|195505218|ref|XP_002099409.1| GE10887 [Drosophila yakuba]
 gi|194185510|gb|EDW99121.1| GE10887 [Drosophila yakuba]
          Length = 521

 Score = 38.1 bits (87), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 21/72 (29%), Positives = 40/72 (55%), Gaps = 2/72 (2%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +P   +YH+ + + E + +  L    ++++TV   +  +S  S VRTS  TF+     K+
Sbjct: 301 DPLLVLYHDVIYQSEIDVIRKLTENRLKRATVTGHN--ESVVSNVRTSQFTFIPVSAHKV 358

Query: 143 IRDIEKRIADFT 154
           +  I++R+AD T
Sbjct: 359 LSTIDQRVADMT 370


>gi|194905392|ref|XP_001981188.1| GG11756 [Drosophila erecta]
 gi|190655826|gb|EDV53058.1| GG11756 [Drosophila erecta]
          Length = 509

 Score = 38.1 bits (87), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 22/80 (27%), Positives = 43/80 (53%), Gaps = 2/80 (2%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E++S +P   ++H+ +S ++   + NLA   + ++  V  D G  K+   RT+ GT+L 
Sbjct: 312 MELLSLDPYVVLFHDVVSDQDILSIRNLAKGGLARAVTVTQD-GNDKEDPARTTKGTWLV 370

Query: 137 RGRDKIIRDIEKRIADFTFF 156
               K+I+ + +   D T F
Sbjct: 371 EN-SKLIQRLSQLSQDMTNF 389


>gi|198477152|ref|XP_002136738.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
 gi|198145043|gb|EDY71755.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
          Length = 517

 Score = 38.1 bits (87), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 24/84 (28%), Positives = 45/84 (53%), Gaps = 4/84 (4%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSK---DSRVRTSSGTF 134
           E++S  P   +YH+ ++  E   L NL+ P M++  +V  +  K +   DS  RTS+  +
Sbjct: 316 EILSLSPYMVLYHDVITPLESLTLKNLSKPLMKRRAMVMVNNLKVRPFIDSG-RTSNSVW 374

Query: 135 LARGRDKIIRDIEKRIADFTFFPL 158
           LA   + ++  +E+R+   T F +
Sbjct: 375 LASHENAVMERLERRVGVMTNFEM 398


>gi|426365135|ref|XP_004049642.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Gorilla gorilla
           gorilla]
          Length = 500

 Score = 38.1 bits (87), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 19/53 (35%), Positives = 31/53 (58%), Gaps = 1/53 (1%)

Query: 82  WE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGT 133
           W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S  T
Sbjct: 299 WDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKRT 351


>gi|417402564|gb|JAA48127.1| Putative prolyl 4-hydroxylase alpha subunit [Desmodus rotundus]
          Length = 544

 Score = 38.1 bits (87), Expect = 1.4,   Method: Composition-based stats.
 Identities = 21/77 (27%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV+  EP   +YH+F++  E + +   A P +++S V   +  K      R S   +L  
Sbjct: 340 EVVHLEPYVVLYHDFVNDLEAQKIRGFAEPWLQRSVVASGE--KQLPVEYRISKSAWLKD 397

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  +++RIA  T
Sbjct: 398 TVDPMLVTLDRRIAALT 414


>gi|156352054|ref|XP_001622587.1| predicted protein [Nematostella vectensis]
 gi|156209158|gb|EDO30487.1| predicted protein [Nematostella vectensis]
          Length = 531

 Score = 38.1 bits (87), Expect = 1.4,   Method: Composition-based stats.
 Identities = 23/82 (28%), Positives = 42/82 (51%), Gaps = 4/82 (4%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE +  +P  ++  + +   E EY+   ATP +R++TV +  TG+ + +  R S   +L 
Sbjct: 320 VEELHSDPPIWMLRDVMYDSEIEYIKRTATPKLRRATVTNLKTGELEFADYRISKSGWLE 379

Query: 137 RGRD----KIIRDIEKRIADFT 154
             RD    KI+  + +R +  T
Sbjct: 380 DPRDDNEEKILNRVNRRTSIIT 401


>gi|452847027|gb|EME48959.1| hypothetical protein DOTSEDRAFT_67862 [Dothistroma septosporum
           NZE10]
          Length = 279

 Score = 38.1 bits (87), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 38/145 (26%), Positives = 64/145 (44%), Gaps = 17/145 (11%)

Query: 22  TLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRK--------SMERSEGDEGRA 73
           T++ +   AI++ +  G   + SS  D     D+     K         M + E    R 
Sbjct: 10  TVVEVAGIAIILYVLLGAPGLYSSGNDRTTQQDVPVAKAKVEQLVYPSPMLKCE----RH 65

Query: 74  EQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG-KSKDSRVRTSSG 132
           +    + S  P      NFLS++E ++LI+L+      STV  S+ G ++ D  VR S  
Sbjct: 66  DYGTHIFSTSPLVVYIENFLSEQEVQHLIDLSDDAWNVSTV--SNQGIEAIDDSVRKSEK 123

Query: 133 TFLARGRDKIIRDIEKRIADFTFFP 157
             +   RD+ ++ IE+R   F  +P
Sbjct: 124 AAIK--RDQTVQCIEQRALSFQGWP 146


>gi|115313004|gb|AAI24075.1| Zgc:152670 [Danio rerio]
          Length = 235

 Score = 38.1 bits (87), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 38/74 (51%), Gaps = 11/74 (14%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           EP+   YH+ +S  E E L ++A P + +     S TG    S +RTS   FL    D++
Sbjct: 52  EPKIIRYHDVISDTEIETLKDIARPELTR-----SQTGWGVISEIRTSQSVFL----DEV 102

Query: 143 --IRDIEKRIADFT 154
             +  I +RIAD T
Sbjct: 103 GTVARISQRIADIT 116


>gi|426369750|ref|XP_004051847.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Gorilla
           gorilla gorilla]
          Length = 517

 Score = 38.1 bits (87), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 24/77 (31%), Positives = 36/77 (46%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  EP   +YH+F+S  E + +  LA P +++S V   +  K      R S   +L  
Sbjct: 313 EVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 370

Query: 138 GRDKIIRDIEKRIADFT 154
             D  +  +  RIA  T
Sbjct: 371 TVDPKLVALNHRIAALT 387


>gi|397568977|gb|EJK46463.1| hypothetical protein THAOC_34869 [Thalassiosira oceanica]
          Length = 519

 Score = 38.1 bits (87), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 29/94 (30%), Positives = 43/94 (45%), Gaps = 17/94 (18%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKS---KDSRVRTSSG-- 132
           + +S  P  ++ HN LS EECE L+  A    +K   +D  +G S   + S      G  
Sbjct: 214 DAVSKSPVVYLIHNLLSDEECEALLKSAE---KKYERIDDTSGTSNYLESSMASEKQGVP 270

Query: 133 -------TFLARGR--DKIIRDIEKRIADFTFFP 157
                  T L +G    K  +DI++RI+  T FP
Sbjct: 271 TAIDVDQTMLWKGSVGGKTFKDIDERISQVTGFP 304


>gi|310789496|gb|EFQ25029.1| 2OG-Fe(II) oxygenase family Oxidoreductase [Glomerella graminicola
           M1.001]
          Length = 302

 Score = 38.1 bits (87), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 21/74 (28%), Positives = 35/74 (47%), Gaps = 2/74 (2%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V + S EP       FLS EE  +L++++ P    ST+     G  +D+ +R S    + 
Sbjct: 66  VRIFSKEPLVLYIEGFLSPEERAHLLDISQPLFEPSTITHDGAGVQRDTSIRESEVAMVP 125

Query: 137 RGRDKIIRDIEKRI 150
             R   +R +E R+
Sbjct: 126 --RTDTVRCVEARV 137


>gi|113682363|ref|NP_001038463.1| prolyl 4-hydroxylase, alpha polypeptide I a precursor [Danio rerio]
          Length = 522

 Score = 38.1 bits (87), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 25/97 (25%), Positives = 41/97 (42%), Gaps = 23/97 (23%)

Query: 81  SWE-PRAFVYHNFLSKEECEYLINLATPHM----------------------RKSTVVDS 117
            W+ PR   YH  ++++E E +  L+ P +                      R++TV D 
Sbjct: 297 EWDRPRIIRYHEIITEQEIEKIKELSKPRLRRATISNPITGVLETAHYRISKRRATVHDP 356

Query: 118 DTGKSKDSRVRTSSGTFLARGRDKIIRDIEKRIADFT 154
            TGK   ++ R S   +LA     ++  I +RI D T
Sbjct: 357 QTGKLTTAQYRVSKSAWLAAYEHPVVDRINQRIEDIT 393


>gi|397620797|gb|EJK65916.1| hypothetical protein THAOC_13183 [Thalassiosira oceanica]
          Length = 520

 Score = 37.7 bits (86), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 18/78 (23%), Positives = 40/78 (51%), Gaps = 3/78 (3%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +++I  +P  +   +FLS +EC+ ++  A PH++   V +   G+ +    RTS+   + 
Sbjct: 314 LKMIHQDPDIYSVDDFLSADECDRIVAKACPHLKPCLVNNERNGRVEQDPARTSTNANVP 373

Query: 137 RGRDKIIRDIEKRIADFT 154
           +     +  + ++I D T
Sbjct: 374 QVE---VPTVMRKITDLT 388


>gi|407698902|ref|YP_006823689.1| hypothetical protein AMBLS11_03220 [Alteromonas macleodii str.
           'Black Sea 11']
 gi|407248049|gb|AFT77234.1| hypothetical protein AMBLS11_03220 [Alteromonas macleodii str.
           'Black Sea 11']
          Length = 263

 Score = 37.7 bits (86), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 20/64 (31%), Positives = 35/64 (54%), Gaps = 4/64 (6%)

Query: 87  FVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDI 146
           F Y +FLS +EC+ ++ L    +  S +     G +    +RTSS   LA   +K+++D+
Sbjct: 83  FAYDDFLSSQECDDIVALTKDKLAPSKLA----GAASADDIRTSSTCELAFLGNKLVKDV 138

Query: 147 EKRI 150
           + RI
Sbjct: 139 DSRI 142


>gi|344253558|gb|EGW09662.1| Glucose 1,6-bisphosphate synthase [Cricetulus griseus]
          Length = 904

 Score = 37.7 bits (86), Expect = 1.6,   Method: Composition-based stats.
 Identities = 23/77 (29%), Positives = 37/77 (48%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI   P   +YH+F+S  E + +  LA P +++S V   +  K      R S   +L  
Sbjct: 732 EVIHLRPFVALYHDFVSDAEAQKIRELAEPWLQRSVVASGE--KQLPVEYRISKSAWLKD 789

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  ++ RIA  T
Sbjct: 790 TVDPMLGTLDHRIAALT 806


>gi|195110925|ref|XP_002000030.1| GI22756 [Drosophila mojavensis]
 gi|193916624|gb|EDW15491.1| GI22756 [Drosophila mojavensis]
          Length = 533

 Score = 37.7 bits (86), Expect = 1.6,   Method: Composition-based stats.
 Identities = 21/74 (28%), Positives = 35/74 (47%)

Query: 81  SWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRD 140
           S +P    YH+ LS ++   L  +A PHM++STV     G+   S  R S   +L     
Sbjct: 324 SLDPLVVSYHDMLSPQQIGELRAMAVPHMQRSTVNPLSGGQRMKSAFRVSKNAWLPYSTH 383

Query: 141 KIIRDIEKRIADFT 154
            ++  + + + D T
Sbjct: 384 PMMGRMLRDVGDAT 397


>gi|114799222|ref|YP_760562.1| 2OG-Fe(II) oxygenase [Hyphomonas neptunium ATCC 15444]
 gi|114739396|gb|ABI77521.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Hyphomonas neptunium
           ATCC 15444]
          Length = 298

 Score = 37.7 bits (86), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 24/84 (28%), Positives = 44/84 (52%), Gaps = 10/84 (11%)

Query: 84  PRA--FVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDK 141
           P+A  +V+ NFL+ E C+ LI L    +R ST  D+      D ++RTS  + +      
Sbjct: 100 PKAQLYVWPNFLAPETCDALIALTDERLRASTTTDA----FADPKIRTSRSSDIGTMGHN 155

Query: 142 IIRDIEKRIADFTFFPLGMQFKFS 165
           ++  +++ IA+     LG+ + +S
Sbjct: 156 LVMQLDELIAE----ALGIHWSYS 175


>gi|73988166|ref|XP_851718.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Canis lupus
           familiaris]
          Length = 544

 Score = 37.7 bits (86), Expect = 1.7,   Method: Composition-based stats.
 Identities = 22/77 (28%), Positives = 37/77 (48%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  EP   +YH+F++  E + +  LA P +++S V   +  K      R S   +L  
Sbjct: 340 EVIHLEPYVVLYHDFVNDVEAQKIRGLAEPWLQRSVVASGE--KQLPVEYRISKSAWLKD 397

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  ++ RI   T
Sbjct: 398 TVDPLLVTLDHRIGALT 414


>gi|297515507|gb|ADI44133.1| RT08151p [Drosophila melanogaster]
          Length = 546

 Score = 37.7 bits (86), Expect = 1.7,   Method: Composition-based stats.
 Identities = 20/72 (27%), Positives = 39/72 (54%), Gaps = 2/72 (2%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +P   +YH+ + + E + +  L    + ++T+   +  +S  S VRTS  TF+     K+
Sbjct: 332 DPLLVLYHDVIYQSEIDVIRKLTENRLMRATITSHN--ESVVSNVRTSQFTFIPVTAHKV 389

Query: 143 IRDIEKRIADFT 154
           +  I++R+AD T
Sbjct: 390 LSTIDQRVADMT 401


>gi|116008434|ref|NP_651806.2| CG9698 [Drosophila melanogaster]
 gi|113194862|gb|AAF57062.2| CG9698 [Drosophila melanogaster]
          Length = 547

 Score = 37.7 bits (86), Expect = 1.7,   Method: Composition-based stats.
 Identities = 20/72 (27%), Positives = 39/72 (54%), Gaps = 2/72 (2%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +P   +YH+ + + E + +  L    + ++T+   +  +S  S VRTS  TF+     K+
Sbjct: 332 DPLLVLYHDVIYQSEIDVIRKLTENRLMRATITSHN--ESVVSNVRTSQFTFIPVTAHKV 389

Query: 143 IRDIEKRIADFT 154
           +  I++R+AD T
Sbjct: 390 LSTIDQRVADMT 401


>gi|417402369|gb|JAA48034.1| Putative prolyl 4-hydroxylase alpha subunit [Desmodus rotundus]
          Length = 529

 Score = 37.7 bits (86), Expect = 1.8,   Method: Composition-based stats.
 Identities = 21/77 (27%), Positives = 38/77 (49%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EV+  EP   +YH+F++  E + +   A P +++S V   +  K      R S   +L  
Sbjct: 340 EVVHLEPYVVLYHDFVNDLEAQKIRGFAEPWLQRSVVASGE--KQLPVEYRISKSAWLKD 397

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  +++RIA  T
Sbjct: 398 TVDPMLVTLDRRIAALT 414


>gi|351714551|gb|EHB17470.1| Prolyl 4-hydroxylase subunit alpha-1 [Heterocephalus glaber]
          Length = 388

 Score = 37.4 bits (85), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 18/51 (35%), Positives = 30/51 (58%), Gaps = 1/51 (1%)

Query: 81  SWE-PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTS 130
            W+ PR   +H+ +S  E E + +LA P + ++TV D +TGK   ++ R S
Sbjct: 336 EWDKPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVS 386


>gi|406595590|ref|YP_006746720.1| hypothetical protein MASE_03040 [Alteromonas macleodii ATCC 27126]
 gi|407682553|ref|YP_006797727.1| hypothetical protein AMEC673_03255 [Alteromonas macleodii str.
           'English Channel 673']
 gi|406372911|gb|AFS36166.1| hypothetical protein MASE_03040 [Alteromonas macleodii ATCC 27126]
 gi|407244164|gb|AFT73350.1| hypothetical protein AMEC673_03255 [Alteromonas macleodii str.
           'English Channel 673']
          Length = 263

 Score = 37.4 bits (85), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 20/64 (31%), Positives = 35/64 (54%), Gaps = 4/64 (6%)

Query: 87  FVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDI 146
           F Y +FLS +EC+ ++ L    +  S +     G +    +RTSS   LA   +K+++D+
Sbjct: 83  FSYDDFLSSQECDDIVALTKDKLAPSKLA----GAASADDIRTSSTCELAFLGNKLVKDV 138

Query: 147 EKRI 150
           + RI
Sbjct: 139 DNRI 142


>gi|354504916|ref|XP_003514519.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cricetulus
           griseus]
          Length = 509

 Score = 37.4 bits (85), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 23/77 (29%), Positives = 37/77 (48%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI   P   +YH+F+S  E + +  LA P +++S V   +  K      R S   +L  
Sbjct: 305 EVIHLRPFVALYHDFVSDAEAQKIRELAEPWLQRSVVASGE--KQLPVEYRISKSAWLKD 362

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  ++ RIA  T
Sbjct: 363 TVDPMLGTLDHRIAALT 379


>gi|410972729|ref|XP_003992809.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Felis catus]
          Length = 533

 Score = 37.4 bits (85), Expect = 2.4,   Method: Composition-based stats.
 Identities = 22/77 (28%), Positives = 37/77 (48%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           EVI  EP   +YH+F++  E + +  LA P +++S V   +  K      R S   +L  
Sbjct: 329 EVIHLEPYVVLYHDFVNDLEAQKIRGLAEPWLQRSVVASGE--KQLPVEYRISKSAWLKD 386

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  ++ RI   T
Sbjct: 387 TVDPLLVTLDHRIGALT 403


>gi|239915958|ref|NP_001070123.2| prolyl 4-hydroxylase alpha II-like precursor [Danio rerio]
          Length = 490

 Score = 37.4 bits (85), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 35/72 (48%), Gaps = 7/72 (9%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           EP+   YH+ +S  E E L ++A P + +     S TG    S +RTS   FL       
Sbjct: 307 EPKIIRYHDVISDTEIETLKDIARPELTR-----SQTGWGVISDIRTSQSVFLEEV--GT 359

Query: 143 IRDIEKRIADFT 154
           +  I +RIAD T
Sbjct: 360 VARISQRIADIT 371


>gi|92096574|gb|AAI15350.1| LOC557059 protein [Danio rerio]
          Length = 508

 Score = 37.0 bits (84), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 25/72 (34%), Positives = 35/72 (48%), Gaps = 7/72 (9%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           EP+   YH+ +S  E E L ++A P + +     S TG    S +RTS   FL       
Sbjct: 325 EPKIIRYHDVISDTEIETLKDIARPELTR-----SQTGWGVISDIRTSQSVFLEEV--GT 377

Query: 143 IRDIEKRIADFT 154
           +  I +RIAD T
Sbjct: 378 VARISQRIADIT 389


>gi|449304751|gb|EMD00758.1| hypothetical protein BAUCODRAFT_144379 [Baudoinia compniacensis
           UAMH 10762]
          Length = 284

 Score = 37.0 bits (84), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 37/141 (26%), Positives = 60/141 (42%), Gaps = 8/141 (5%)

Query: 22  TLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRKSMER-----SEGDEGRAEQW 76
           T++ +   A +I +  G   +PS+    R A   +  VR  +E       E +    +  
Sbjct: 13  TVVEVAGIAAIIYVLLGAPGLPSAWRTGRSAEKDTPAVRTKIESLVSPAKELNCPTHKYE 72

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
             V S  P       FLS  E E+LI+++    + ST V +D  +  D  VR S    + 
Sbjct: 73  THVFSSSPLILYVDGFLSDSEAEHLIDISADKWQIST-VSTDGIERTDESVRKSEKALVQ 131

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
             RD  ++ IE+R   F  +P
Sbjct: 132 --RDLTVQCIEQRALSFQGWP 150


>gi|407686446|ref|YP_006801619.1| hypothetical protein AMBAS45_03290 [Alteromonas macleodii str.
           'Balearic Sea AD45']
 gi|407289826|gb|AFT94138.1| hypothetical protein AMBAS45_03290 [Alteromonas macleodii str.
           'Balearic Sea AD45']
          Length = 263

 Score = 37.0 bits (84), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 20/64 (31%), Positives = 35/64 (54%), Gaps = 4/64 (6%)

Query: 87  FVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKIIRDI 146
           F Y +FLS +EC+ ++ L    +  S +     G +    +RTSS   LA   +K+++D+
Sbjct: 83  FSYDDFLSSQECDDIVALTKDKLAPSKLA----GAASADDIRTSSTCELAFLGNKLVKDV 138

Query: 147 EKRI 150
           + RI
Sbjct: 139 DSRI 142


>gi|224011205|ref|XP_002295377.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|209583408|gb|ACI64094.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 207

 Score = 37.0 bits (84), Expect = 2.9,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 35/81 (43%), Gaps = 7/81 (8%)

Query: 84  PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDS-----DTGKSKDSRVRTSSGTFLARG 138
           P       FLS EEC   I L      +ST   S      T  SK+S  RTS+ T+   G
Sbjct: 9   PWVVAIEGFLSDEECNRFIELGGDRYERSTEYASTMNLDGTFDSKESSGRTSTNTWCGEG 68

Query: 139 --RDKIIRDIEKRIADFTFFP 157
              D II+ + +R+   T  P
Sbjct: 69  CRDDPIIKKVIERMESLTGIP 89


>gi|159490898|ref|XP_001703410.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
 gi|158280334|gb|EDP06092.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
          Length = 429

 Score = 37.0 bits (84), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 26/81 (32%), Positives = 40/81 (49%), Gaps = 1/81 (1%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGK-SKDSRVRTSSGTFLA 136
           +++S  PR  V+ NF+ K   E +I LA+  M  S +      +   + +VRTS GTFL 
Sbjct: 216 QILSLYPRIKVFPNFVDKARREEIIALASKFMYPSGLAYRPGEQVEAEQQVRTSKGTFLG 275

Query: 137 RGRDKIIRDIEKRIADFTFFP 157
                 +  +E +IA  T  P
Sbjct: 276 GDSSPALTWLESKIAAVTDIP 296


>gi|296217074|ref|XP_002754870.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Callithrix
           jacchus]
          Length = 544

 Score = 37.0 bits (84), Expect = 3.0,   Method: Composition-based stats.
 Identities = 21/77 (27%), Positives = 36/77 (46%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           E++  EP   +YH+F+S  E + +   A P +++S V   +  K      R S   +L  
Sbjct: 340 EILHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVASGE--KQLQVEYRISKSAWLKD 397

Query: 138 GRDKIIRDIEKRIADFT 154
             D ++  +  RIA  T
Sbjct: 398 TVDPMLVTLNHRIAALT 414


>gi|342871685|gb|EGU74177.1| hypothetical protein FOXB_15312 [Fusarium oxysporum Fo5176]
          Length = 274

 Score = 37.0 bits (84), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 35/136 (25%), Positives = 61/136 (44%), Gaps = 14/136 (10%)

Query: 26  MFTFAILILLAFGILSMPSSS----GDSRKANDLS-------SIVRKSMERSEGDEGRAE 74
           MF + I +L    + S P S       + + ND+S       S++      +   E   +
Sbjct: 1   MFQYVIGLLAIILVFSNPISDFLYPDRATRINDVSPRPKFNESLLAIDPPNATAPECAPD 60

Query: 75  QWV-EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGT 133
            +V  ++S EP      NFL++ E E+L++++ P    ST+ ++     +DS VR S   
Sbjct: 61  GYVARILSREPLIIYLENFLNEGEREHLLDISEPIFVPSTITNNGEATHRDSSVRDSHVA 120

Query: 134 FLARGRDKIIRDIEKR 149
            +   R   +R IE R
Sbjct: 121 LIP--RTDPVRCIESR 134


>gi|448930198|gb|AGE53763.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus IL-3A]
 gi|448931603|gb|AGE55164.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus MA-1E]
          Length = 239

 Score = 37.0 bits (84), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 33/67 (49%), Gaps = 11/67 (16%)

Query: 90  HNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDS-------RVRTSSGTFLARGRDKI 142
           H+FLS  EC+ LIN A     K  ++ S+ G + D        + R S  T+   G  KI
Sbjct: 56  HDFLSDAECDVLINAAI----KKGLIKSEVGGATDDDPIKLDPKSRNSEQTWFTPGEHKI 111

Query: 143 IRDIEKR 149
           I  I+K+
Sbjct: 112 IDKIQKK 118


>gi|46122191|ref|XP_385649.1| hypothetical protein FG05473.1 [Gibberella zeae PH-1]
          Length = 1067

 Score = 37.0 bits (84), Expect = 3.2,   Method: Composition-based stats.
 Identities = 21/71 (29%), Positives = 36/71 (50%), Gaps = 2/71 (2%)

Query: 79  VISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARG 138
           ++S EP       FL++EE E+L++++ P    ST+  +     +D+ VR S    + R 
Sbjct: 859 IVSREPLIVYLEGFLNQEEREHLLDISEPIFVPSTITSNGEATHRDASVRDSDVAVIPRT 918

Query: 139 RDKIIRDIEKR 149
               +R IE R
Sbjct: 919 DS--VRCIESR 927


>gi|223999645|ref|XP_002289495.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220974703|gb|EED93032.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 1578

 Score = 37.0 bits (84), Expect = 3.3,   Method: Composition-based stats.
 Identities = 18/48 (37%), Positives = 24/48 (50%)

Query: 84   PRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSS 131
            P  F   NFLS E+C+ ++  A PH R   V +  T   +    RTSS
Sbjct: 1379 PDIFTVDNFLSDEDCDRIVAKADPHFRPCLVKNESTEAVEQDPFRTSS 1426


>gi|448927821|gb|AGE51393.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus CviKI]
          Length = 239

 Score = 37.0 bits (84), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 23/67 (34%), Positives = 33/67 (49%), Gaps = 11/67 (16%)

Query: 90  HNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDS-------RVRTSSGTFLARGRDKI 142
           H+FLS  EC+ LIN A     K  ++ S+ G + D        + R S  T+   G  KI
Sbjct: 56  HDFLSDAECDVLINAAI----KKGLIKSEVGGATDDDPIKLDPKSRNSEQTWFTPGEHKI 111

Query: 143 IRDIEKR 149
           I  I+K+
Sbjct: 112 IDKIQKK 118


>gi|397620233|gb|EJK65613.1| hypothetical protein THAOC_13503 [Thalassiosira oceanica]
          Length = 643

 Score = 36.6 bits (83), Expect = 3.8,   Method: Composition-based stats.
 Identities = 26/84 (30%), Positives = 41/84 (48%), Gaps = 5/84 (5%)

Query: 76  WVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVD-SDTGKS--KDSRVRTSSG 132
           W+EVIS EPR F   NF  ++E   +++ A     +S  +  S TG S    +  RTS  
Sbjct: 291 WLEVISLEPRVFDVFNFFDRDESAAIVDKALKETSESHRIKRSSTGASGYNVNSQRTSEN 350

Query: 133 TFLARGRDKIIRDIEKRIADFTFF 156
            F   G  K+ + +++R  +   F
Sbjct: 351 GFDTHG--KVSQAVKRRCMNVLGF 372


>gi|380477665|emb|CCF44025.1| 2OG-Fe(II) oxygenase, partial [Colletotrichum higginsianum]
          Length = 233

 Score = 36.6 bits (83), Expect = 3.8,   Method: Compositional matrix adjust.
 Identities = 25/80 (31%), Positives = 42/80 (52%), Gaps = 3/80 (3%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
             ++S  P       F++  E ++L+++A+     S V+D+  GK+  S VRTS  T L 
Sbjct: 39  AHMVSKSPLVVYLAGFVTLAERKHLLDMASGTFTHSGVIDASGGKTTHS-VRTSQSTSLW 97

Query: 137 RGRDKIIRDIEKRIADFTFF 156
             RD ++R IE+R   F  +
Sbjct: 98  --RDDVVRCIEERAVAFQGY 115


>gi|292621357|ref|XP_691737.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Danio rerio]
          Length = 538

 Score = 36.6 bits (83), Expect = 4.0,   Method: Composition-based stats.
 Identities = 18/77 (23%), Positives = 40/77 (51%), Gaps = 2/77 (2%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLAR 137
           E+IS +P   ++H F+++ E + +   A P +R+S V       + + R+  S+  +L  
Sbjct: 334 EIISLQPYVVLFHGFVTQAEAKNIRKYAMPGLRRSVVASGMNQATAEYRISKSA--WLKE 391

Query: 138 GRDKIIRDIEKRIADFT 154
              +++  +++RI   T
Sbjct: 392 SAHEVVGKLDQRITLVT 408


>gi|429863917|gb|ELA38324.1| 2og-fe oxygenase family protein [Colletotrichum gloeosporioides
           Nara gc5]
          Length = 260

 Score = 36.6 bits (83), Expect = 4.1,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 41/80 (51%), Gaps = 3/80 (3%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           + ++S  P      +F++  E ++L+ L +     S V+D+  GK+   +VRTS  T L 
Sbjct: 40  MHIVSKSPLVIYIPDFVTPAERKHLLELGSETFTHSGVIDAAGGKTT-HQVRTSQSTALW 98

Query: 137 RGRDKIIRDIEKRIADFTFF 156
             RD ++R IE R   F  +
Sbjct: 99  --RDDVVRCIEDRAIAFQGY 116


>gi|405964867|gb|EKC30309.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
          Length = 591

 Score = 36.2 bits (82), Expect = 4.4,   Method: Compositional matrix adjust.
 Identities = 23/79 (29%), Positives = 40/79 (50%), Gaps = 6/79 (7%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGK------SKDSRVRTSS 131
           EV+++EPR  ++H+ +S    E+L ++A+    +STV   +TG        K   VR S 
Sbjct: 361 EVVNYEPRIAIFHDVISPTSIEHLKSVASKGFTRSTVFLENTGPDGHVTYGKLDNVRVSQ 420

Query: 132 GTFLARGRDKIIRDIEKRI 150
            ++L       +  +E RI
Sbjct: 421 TSWLGTDEYPELSRLENRI 439


>gi|169604152|ref|XP_001795497.1| hypothetical protein SNOG_05086 [Phaeosphaeria nodorum SN15]
 gi|111066357|gb|EAT87477.1| hypothetical protein SNOG_05086 [Phaeosphaeria nodorum SN15]
          Length = 257

 Score = 36.2 bits (82), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 39/80 (48%), Gaps = 3/80 (3%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
             V S +P      NF++ EE E+L  ++      S + DS +G+   +  R S  T L 
Sbjct: 33  THVFSTDPLVIYIPNFITPEESEHLQTVSAGKFTSSQIADS-SGQQHLASTRKSQSTSLE 91

Query: 137 RGRDKIIRDIEKRIADFTFF 156
              D+++R IE+R  +F  F
Sbjct: 92  --SDEVVRCIEQRALEFQGF 109


>gi|303289727|ref|XP_003064151.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226454467|gb|EEH51773.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 433

 Score = 36.2 bits (82), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 39/80 (48%), Gaps = 2/80 (2%)

Query: 70  EGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRT 129
           +GR E  V  IS  P  F    F+++ E E +I  A P MR+S  V +D GK+  +R R 
Sbjct: 202 DGR-ELTVRTISVTPPVFRVEGFVTEREREAVIRKAAPEMRESRTV-ADGGKAGPTRRRK 259

Query: 130 SSGTFLARGRDKIIRDIEKR 149
           S   +L    D  +  +  R
Sbjct: 260 SETAWLVTHGDDALEALSTR 279


>gi|312599252|gb|ADQ91275.1| hypothetical protein BpV2_108c [Bathycoccus sp. RCC1105 virus BpV2]
          Length = 197

 Score = 36.2 bits (82), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 22/76 (28%), Positives = 40/76 (52%), Gaps = 5/76 (6%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +PR  V  N LS++EC+++ N+A+  ++ STV  S   +  D  +R S   +L    D +
Sbjct: 23  KPR--VLKNVLSEDECKHIQNIASKKLQTSTVSKS---RDIDESIRKSETAWLKASEDPV 77

Query: 143 IRDIEKRIADFTFFPL 158
           +  + ++    T  PL
Sbjct: 78  VDKLIRKCVSMTDRPL 93


>gi|194905410|ref|XP_001981191.1| GG11931 [Drosophila erecta]
 gi|190655829|gb|EDV53061.1| GG11931 [Drosophila erecta]
          Length = 537

 Score = 35.8 bits (81), Expect = 5.9,   Method: Composition-based stats.
 Identities = 20/60 (33%), Positives = 30/60 (50%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           +E  S +P    +H+ LS  +   L  +A P M++STV     G+ K S  R S   +LA
Sbjct: 324 LEEHSLDPYVASFHDMLSPRKISQLREMAVPRMQRSTVNPRPGGQHKKSAFRVSKNAWLA 383


>gi|195341556|ref|XP_002037372.1| GM12148 [Drosophila sechellia]
 gi|194131488|gb|EDW53531.1| GM12148 [Drosophila sechellia]
          Length = 542

 Score = 35.8 bits (81), Expect = 6.0,   Method: Compositional matrix adjust.
 Identities = 25/94 (26%), Positives = 45/94 (47%), Gaps = 8/94 (8%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVD---SDTGKSKDSRVRTSSGTF 134
           E++S +P   + H+ +S++E   + N +  HM  S   D   SDT    D+  RTS   +
Sbjct: 326 EILSVDPFVVLLHDMISQKESTLIRNSSKEHMLPSATTDPDASDTETQVDT-YRTSKSVW 384

Query: 135 LARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFF 168
            +   +   + I +R+ D T    G+   F+ F+
Sbjct: 385 YSSDFNDTTKKITERLGDAT----GLDMNFTEFY 414


>gi|389639356|ref|XP_003717311.1| oxidoreductase [Magnaporthe oryzae 70-15]
 gi|351643130|gb|EHA50992.1| oxidoreductase [Magnaporthe oryzae 70-15]
          Length = 305

 Score = 35.8 bits (81), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 36/73 (49%), Gaps = 3/73 (4%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V V S +P      NFLS ++  +L++++ P    STV D D G  +D  +R S    + 
Sbjct: 82  VHVFSRQPLVLYVENFLSVDDRRHLLDISEPIYTPSTVTD-DAGPRRDKSIRDSEVALVP 140

Query: 137 RGRDKIIRDIEKR 149
             R   +R IE R
Sbjct: 141 --RTDTVRCIEAR 151


>gi|402087435|gb|EJT82333.1| oxidoreductase [Gaeumannomyces graminis var. tritici R3-111a-1]
          Length = 355

 Score = 35.8 bits (81), Expect = 6.1,   Method: Compositional matrix adjust.
 Identities = 23/79 (29%), Positives = 42/79 (53%), Gaps = 4/79 (5%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDS--RVRTSSGTF 134
           V+++S  P     H F++ +E E+L ++      +S V  +D   ++ +  +VRTS  T 
Sbjct: 113 VQIVSKSPLVIYLHGFITSQEREHLQDVTKNTFTRSGVTPNDGSDAEKAIHQVRTSRSTS 172

Query: 135 LARGRDKIIRDIEKRIADF 153
           ++R  D ++R IE R   F
Sbjct: 173 VSR--DAVVRCIEDRALAF 189


>gi|195159317|ref|XP_002020528.1| GL14042 [Drosophila persimilis]
 gi|194117297|gb|EDW39340.1| GL14042 [Drosophila persimilis]
          Length = 534

 Score = 35.8 bits (81), Expect = 6.2,   Method: Composition-based stats.
 Identities = 20/56 (35%), Positives = 28/56 (50%)

Query: 81  SWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           S +P    YH+ LS  +   L  +A P M +STV     G++K S  R S   +LA
Sbjct: 325 SLDPFVVTYHDMLSPRKIADLRLMAVPRMHRSTVNPLPGGQNKKSSFRVSKNAWLA 380


>gi|125772813|ref|XP_001357665.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
 gi|54637397|gb|EAL26799.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
          Length = 534

 Score = 35.8 bits (81), Expect = 6.3,   Method: Composition-based stats.
 Identities = 20/56 (35%), Positives = 28/56 (50%)

Query: 81  SWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           S +P    YH+ LS  +   L  +A P M +STV     G++K S  R S   +LA
Sbjct: 325 SLDPFVVTYHDMLSPRKIADLRLMAVPRMHRSTVNPLPGGQNKKSSFRVSKNAWLA 380


>gi|195575111|ref|XP_002105523.1| GD16991 [Drosophila simulans]
 gi|194201450|gb|EDX15026.1| GD16991 [Drosophila simulans]
          Length = 542

 Score = 35.8 bits (81), Expect = 6.6,   Method: Compositional matrix adjust.
 Identities = 22/94 (23%), Positives = 44/94 (46%), Gaps = 6/94 (6%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDS--RVRTSSGTF 134
            E++S +P   + H+ +S++E   + N +  HM  S   D D+  ++      RTS   +
Sbjct: 325 TEILSVDPFVLLLHDMISQKESTLIRNSSKEHMLPSATTDPDSSDTETQVDTYRTSKSVW 384

Query: 135 LARGRDKIIRDIEKRIADFTFFPLGMQFKFSLFF 168
            +   +   + I +R+ D T    G+   F+ F+
Sbjct: 385 YSSDFNDTTKKITERLGDAT----GLDTNFTEFY 414


>gi|347835670|emb|CCD50242.1| similar to oxidoreductase [Botryotinia fuckeliana]
          Length = 254

 Score = 35.8 bits (81), Expect = 6.7,   Method: Compositional matrix adjust.
 Identities = 21/80 (26%), Positives = 43/80 (53%), Gaps = 2/80 (2%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           V  +S++P      NF++ +E ++L+ +A    R S  V +D G++  +  RTSS  +L 
Sbjct: 45  VRTLSYDPLIQHIENFITSQEADHLLKIAKSKFRPSRAVRTD-GQAVATHERTSSTAYLP 103

Query: 137 RGRDKIIRDIEKRIADFTFF 156
              D +++ I  R +++  +
Sbjct: 104 -SDDPVVQCIRSRASEYQGY 122


>gi|361126579|gb|EHK98573.1| putative prolyl 4-hydroxylase subunit alpha-3 [Glarea lozoyensis
           74030]
          Length = 181

 Score = 35.8 bits (81), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 43/86 (50%), Gaps = 5/86 (5%)

Query: 79  VISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTV-VDSDTGKSKDSRVRTSSGTFLAR 137
           ++S EP      NFLS +E ++LI+++      STV   SDT   KD  VR S    +  
Sbjct: 62  ILSHEPLVIYIENFLSPQESKHLIDISEDKFAPSTVSTGSDTSIRKD--VRLSEVALV-- 117

Query: 138 GRDKIIRDIEKRIADFTFFPLGMQFK 163
            RD  ++ IE+R   F  +   +Q +
Sbjct: 118 DRDDTVQCIEQRARAFQGWRPDLQIE 143


>gi|195444366|ref|XP_002069834.1| GK11733 [Drosophila willistoni]
 gi|194165919|gb|EDW80820.1| GK11733 [Drosophila willistoni]
          Length = 517

 Score = 35.8 bits (81), Expect = 6.9,   Method: Compositional matrix adjust.
 Identities = 20/78 (25%), Positives = 39/78 (50%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE+++  P    Y++ L+  E E L  +++P +R+S + +      +    RTS+  F+ 
Sbjct: 317 VELLNRSPYVAAYYDVLNDSEIEELKLMSSPQIRRSLLYNHTLDIDQADVDRTSNSVFME 376

Query: 137 RGRDKIIRDIEKRIADFT 154
                ++  I +R AD T
Sbjct: 377 ETGITLLETISQRAADMT 394


>gi|341903277|gb|EGT59212.1| hypothetical protein CAEBREN_12309 [Caenorhabditis brenneri]
          Length = 297

 Score = 35.8 bits (81), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 21/62 (33%), Positives = 33/62 (53%), Gaps = 2/62 (3%)

Query: 72  RAEQWVEVISWEPRAFVYHNFLSKEEC-EYLINLATPHMRKSTVVDSDTGKSKDSRVRTS 130
           + E  +EV+SW P   VY  F + ++  +YL       + + TVV +D G S+ S VR +
Sbjct: 83  QEETRIEVLSWSPPLVVYRKFFTDKQVDDYLRIFKKASLEQQTVVSAD-GTSRSSNVRVA 141

Query: 131 SG 132
            G
Sbjct: 142 KG 143


>gi|195172672|ref|XP_002027120.1| GL20071 [Drosophila persimilis]
 gi|194112933|gb|EDW34976.1| GL20071 [Drosophila persimilis]
          Length = 455

 Score = 35.8 bits (81), Expect = 7.0,   Method: Compositional matrix adjust.
 Identities = 24/80 (30%), Positives = 38/80 (47%), Gaps = 1/80 (1%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE +S +P   +YH+ +   E E L + A P M +S V       SK++  RTS   F  
Sbjct: 252 VEPLSQDPYIAMYHDVIYDSEIEELKDNAFPDMERSKVYTYSDEDSKNTG-RTSMSAFQT 310

Query: 137 RGRDKIIRDIEKRIADFTFF 156
             + K +  + +R+   T F
Sbjct: 311 DHQYKAVTKVNRRVMHMTGF 330


>gi|405964866|gb|EKC30308.1| KRR1 small subunit processome component-like protein [Crassostrea
           gigas]
          Length = 885

 Score = 35.8 bits (81), Expect = 7.2,   Method: Compositional matrix adjust.
 Identities = 15/43 (34%), Positives = 29/43 (67%)

Query: 78  EVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTG 120
           EV+++EPR  ++H+ +S    E+L ++A+  + +STV   +TG
Sbjct: 653 EVVNYEPRIAIFHDVISSTSIEHLKSIASKGLTRSTVFLENTG 695


>gi|195341560|ref|XP_002037374.1| GM12888 [Drosophila sechellia]
 gi|194131490|gb|EDW53533.1| GM12888 [Drosophila sechellia]
          Length = 501

 Score = 35.8 bits (81), Expect = 7.4,   Method: Compositional matrix adjust.
 Identities = 20/72 (27%), Positives = 39/72 (54%), Gaps = 2/72 (2%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +P   +YH+ + + E + +  L    + ++T+   +  +S  S VRTS  TF+     K+
Sbjct: 301 DPLLVLYHDVIYQSEIDVIRKLTKNRLMRATITSHN--ESVVSNVRTSQITFIPVTAHKV 358

Query: 143 IRDIEKRIADFT 154
           +  I++R+AD T
Sbjct: 359 LSTIDQRVADMT 370


>gi|195575115|ref|XP_002105525.1| GD21527 [Drosophila simulans]
 gi|194201452|gb|EDX15028.1| GD21527 [Drosophila simulans]
          Length = 495

 Score = 35.4 bits (80), Expect = 8.5,   Method: Compositional matrix adjust.
 Identities = 20/72 (27%), Positives = 39/72 (54%), Gaps = 2/72 (2%)

Query: 83  EPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLARGRDKI 142
           +P   +YH+ + + E + +  L    + ++T+   +  +S  S VRTS  TF+     K+
Sbjct: 295 DPLLVLYHDVIYQSEIDVIRKLTKNRLMRATITSHN--ESVVSNVRTSQFTFIPVTAHKV 352

Query: 143 IRDIEKRIADFT 154
           +  I++R+AD T
Sbjct: 353 LSTIDQRVADMT 364


>gi|219116348|ref|XP_002178969.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217409736|gb|EEC49667.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 302

 Score = 35.4 bits (80), Expect = 8.6,   Method: Compositional matrix adjust.
 Identities = 26/83 (31%), Positives = 44/83 (53%), Gaps = 5/83 (6%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKD-SRVRTSSGTFL 135
             V+S EP   + H+FLS   C+ LI+ AT       ++ S TG  ++ S +RTS+  +L
Sbjct: 89  AHVVSSEPPLVLIHDFLSTSMCKNLIDTAT---STDKMIRSTTGSEQETSTIRTSTTVWL 145

Query: 136 ARGR-DKIIRDIEKRIADFTFFP 157
              +  +  R I ++I+  + FP
Sbjct: 146 NDEQVPETSRIIAEKISSISGFP 168


>gi|429850117|gb|ELA25418.1| prolyl 4-hydroxylase alpha [Colletotrichum gloeosporioides Nara
           gc5]
          Length = 207

 Score = 35.4 bits (80), Expect = 8.8,   Method: Compositional matrix adjust.
 Identities = 27/78 (34%), Positives = 40/78 (51%), Gaps = 3/78 (3%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTV-VDSDTGKSKDSRVRTSSGTFL 135
           + ++S++P       FLS  E  +L+ L  P   +S V VD++     DS  RTSS  FL
Sbjct: 16  IRILSYDPLIIYVVGFLSWAERNHLLQLGQPLFERSAVYVDAENSNLYDSD-RTSSTAFL 74

Query: 136 ARGRDKIIRDIEKRIADF 153
               D I+R I KR ++ 
Sbjct: 75  PD-EDPIVRRIIKRASEI 91


>gi|221119879|ref|XP_002156288.1| PREDICTED: transmembrane prolyl 4-hydroxylase-like [Hydra
           magnipapillata]
          Length = 628

 Score = 35.4 bits (80), Expect = 9.5,   Method: Composition-based stats.
 Identities = 18/48 (37%), Positives = 27/48 (56%)

Query: 58  IVRKSMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLA 105
           +++  +E+     G  E  + VIS  P  F   NF+ +EECE LI+LA
Sbjct: 61  VMKIGLEKRVELPGNVEVTMRVISINPFIFEIENFIDEEECELLIDLA 108


>gi|242775444|ref|XP_002478645.1| DNA replication licensing factor Mcm7, putative [Talaromyces
           stipitatus ATCC 10500]
 gi|218722264|gb|EED21682.1| DNA replication licensing factor Mcm7, putative [Talaromyces
           stipitatus ATCC 10500]
          Length = 807

 Score = 35.4 bits (80), Expect = 9.5,   Method: Composition-based stats.
 Identities = 30/120 (25%), Positives = 49/120 (40%), Gaps = 5/120 (4%)

Query: 2   AKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRK 61
           A P Y R+  R S    + L   ++  F ++ L+    L  PS   D + AN ++ +   
Sbjct: 551 ANPLYGRYNPRISPVENINLPAALLSRFDVMFLM----LDTPSRHDDEQLANHVTYVHMH 606

Query: 62  SMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGK 121
           +      D G      EV  +  RA  Y   + KE  +Y++  A   MRK    D  + +
Sbjct: 607 NKHPETSDTGVVFSPNEVRQFVARARTYRPVVPKEVSDYMVG-AYVRMRKQQKKDEGSKR 665


>gi|194765174|ref|XP_001964702.1| GF23328 [Drosophila ananassae]
 gi|190614974|gb|EDV30498.1| GF23328 [Drosophila ananassae]
          Length = 542

 Score = 35.4 bits (80), Expect = 9.6,   Method: Composition-based stats.
 Identities = 21/78 (26%), Positives = 38/78 (48%), Gaps = 2/78 (2%)

Query: 77  VEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGKSKDSRVRTSSGTFLA 136
           VE ++ +P    +H  ++  E E +I      M +S V  S    +  S +RTS+ T+L 
Sbjct: 332 VEQLNLDPYVAYFHEAINSSEMEQIIEKGLGSMERSRVGQSQNATT--SEIRTSANTWLW 389

Query: 137 RGRDKIIRDIEKRIADFT 154
              +  +  I++R+ D T
Sbjct: 390 YNENPWLSKIKQRLEDIT 407


>gi|212532389|ref|XP_002146351.1| DNA replication licensing factor Mcm7, putative [Talaromyces
           marneffei ATCC 18224]
 gi|210071715|gb|EEA25804.1| DNA replication licensing factor Mcm7, putative [Talaromyces
           marneffei ATCC 18224]
          Length = 807

 Score = 35.0 bits (79), Expect = 10.0,   Method: Composition-based stats.
 Identities = 29/120 (24%), Positives = 48/120 (40%), Gaps = 5/120 (4%)

Query: 2   AKPRYSRFPTRKSSSSTLILTLLIMFTFAILILLAFGILSMPSSSGDSRKANDLSSIVRK 61
           A P Y R+  R S    + L   ++  F ++ L+    L  PS   D   AN ++ +   
Sbjct: 551 ANPLYGRYNPRISPVENINLPAALLSRFDVMFLM----LDTPSRDADEELANHVTYVHMH 606

Query: 62  SMERSEGDEGRAEQWVEVISWEPRAFVYHNFLSKEECEYLINLATPHMRKSTVVDSDTGK 121
           +     GD        EV  +  RA  Y   + KE  +Y++  A   MR+    D  + +
Sbjct: 607 NKHPETGDTAIVFSPNEVRQFVARARTYRPLVPKEVSDYMVG-AYVRMRRQQKKDEGSKR 665


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.323    0.136    0.389 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,511,490,118
Number of Sequences: 23463169
Number of extensions: 92600539
Number of successful extensions: 254120
Number of sequences better than 100.0: 895
Number of HSP's better than 100.0 without gapping: 685
Number of HSP's successfully gapped in prelim test: 210
Number of HSP's that attempted gapping in prelim test: 253215
Number of HSP's gapped (non-prelim): 905
length of query: 174
length of database: 8,064,228,071
effective HSP length: 132
effective length of query: 42
effective length of database: 9,262,057,059
effective search space: 389006396478
effective search space used: 389006396478
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (22.0 bits)
S2: 71 (32.0 bits)