BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 023053
         (288 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|18394842|ref|NP_564109.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|9558598|gb|AAF88161.1|AC026234_12 Contains similarity to a prolyl 4-hydroxylase alpha subunit protein
           from Gallus gallus gi|212530 [Arabidopsis thaliana]
 gi|90962978|gb|ABE02413.1| At1g20270 [Arabidopsis thaliana]
 gi|332191835|gb|AEE29956.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 287

 Score =  489 bits (1260), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 227/287 (79%), Positives = 260/287 (90%), Gaps = 1/287 (0%)

Query: 1   MVKLRHSRLQAKKWSTLTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSPPNDLTSFRRR 60
           M KLRHSR QA+KWSTL LVL MLFMLTIVLLMLLA G+F +PI +D+S P DL+ FRR 
Sbjct: 1   MAKLRHSRFQARKWSTLMLVLFMLFMLTIVLLMLLAFGVFSLPINNDESSPIDLSYFRRA 60

Query: 61  AFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTG 120
           A E+   + + +G+QWTE+++WEPRAFVYHNFLSK ECEYLI LAKP+MVKSTVVDS+TG
Sbjct: 61  ATERSEGLGK-RGDQWTEVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETG 119

Query: 121 QSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYD 180
           +SKDSRVRTSSGTFL+RG+D+II+ IEKRIAD+TFIP +HGEG+QVLHYE GQKY+ HYD
Sbjct: 120 KSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYD 179

Query: 181 YFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVK 240
           YF+DEFNTKNGGQRMAT+LMYLSDVEEGGETVFPAAN NF+SV W+NELSECGK+GLSVK
Sbjct: 180 YFVDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVK 239

Query: 241 PKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           P+ GDALLFWSMRPDATLDP+SLHGGCPVIRGNKWS+TKWMHV EYK
Sbjct: 240 PRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWMHVGEYK 286


>gi|297850430|ref|XP_002893096.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297338938|gb|EFH69355.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 287

 Score =  489 bits (1258), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 226/287 (78%), Positives = 260/287 (90%), Gaps = 1/287 (0%)

Query: 1   MVKLRHSRLQAKKWSTLTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSPPNDLTSFRRR 60
           M KLRHSR QA+KWSTL LVL MLFMLTIVLLMLLA G+F +PI +D+S P DL+ FRR 
Sbjct: 1   MAKLRHSRFQARKWSTLMLVLFMLFMLTIVLLMLLAFGVFSLPINNDESSPIDLSYFRRA 60

Query: 61  AFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTG 120
           A E+   + + +G+QWTE+++WEPRAFVYHNFLSK ECEYLI LAKP+MVKSTVVDS+TG
Sbjct: 61  ATERSEGLGK-RGDQWTEVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETG 119

Query: 121 QSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYD 180
           +SKDSRVRTSSGTFL+RG+D+II+ IEKRIAD+TFIP +HGEG+Q+LHYE GQKY+ HYD
Sbjct: 120 KSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQILHYEAGQKYEPHYD 179

Query: 181 YFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVK 240
           YF+DEFNTKNGGQRMAT+LMYLSDVEEGGETVFPAAN NF+SV W+NELSECGK+GLSVK
Sbjct: 180 YFVDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVK 239

Query: 241 PKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           P+ GDALLFWSMRPDATLDP+SLHGGCPVIRGNKWS+TKWMHV EYK
Sbjct: 240 PRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWMHVGEYK 286


>gi|21537370|gb|AAM61711.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
          Length = 287

 Score =  488 bits (1255), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 226/287 (78%), Positives = 260/287 (90%), Gaps = 1/287 (0%)

Query: 1   MVKLRHSRLQAKKWSTLTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSPPNDLTSFRRR 60
           M KLRHSR QA+KWSTL LVL MLFMLTIVLLMLLA G+F +PI +D+S P DL+ FRR 
Sbjct: 1   MAKLRHSRFQARKWSTLMLVLFMLFMLTIVLLMLLAFGVFSLPINNDESSPIDLSYFRRA 60

Query: 61  AFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTG 120
           A E+   + + +G+QWTE+++WEPRAFVYHNFLSK ECEYLI LAKP+MVKSTVVDS+TG
Sbjct: 61  ATERSEGLGK-RGDQWTEVLSWEPRAFVYHNFLSKEECEYLISLAKPHMVKSTVVDSETG 119

Query: 121 QSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYD 180
           +SKDSRVRTSSGTFL+RG+D+II+ IEKRIAD+TFIP +HGEG+QVLHYE GQKY+ HYD
Sbjct: 120 KSKDSRVRTSSGTFLRRGRDKIIKTIEKRIADYTFIPADHGEGLQVLHYEAGQKYEPHYD 179

Query: 181 YFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVK 240
           YF+DEFNTKNGGQRMAT+LMYLSDVEEGGETVFPAAN NF+SV W+NELSECGK+GLSVK
Sbjct: 180 YFVDEFNTKNGGQRMATMLMYLSDVEEGGETVFPAANMNFSSVPWYNELSECGKKGLSVK 239

Query: 241 PKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           P+ GDALLFWSMRPDATLDP+SLHGGCPVIRGNKWS+TKW+HV EYK
Sbjct: 240 PRMGDALLFWSMRPDATLDPTSLHGGCPVIRGNKWSSTKWIHVGEYK 286


>gi|224085946|ref|XP_002307750.1| predicted protein [Populus trichocarpa]
 gi|222857199|gb|EEE94746.1| predicted protein [Populus trichocarpa]
          Length = 288

 Score =  483 bits (1244), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 224/287 (78%), Positives = 254/287 (88%)

Query: 1   MVKLRHSRLQAKKWSTLTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSPPNDLTSFRRR 60
           MVK+RHSRL  KKWST TLVLSMLFMLT+VL +LL +G F +P+  +DS PNDL S+RR 
Sbjct: 1   MVKVRHSRLHTKKWSTFTLVLSMLFMLTVVLFILLGLGAFSLPVSSEDSSPNDLNSYRRI 60

Query: 61  AFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTG 120
           A E       ++ EQWTEI++WEPRAF+YHNFLSK ECEYLI+LAKP+M+KSTVVDSKTG
Sbjct: 61  ASESDGDGMGKREEQWTEILSWEPRAFLYHNFLSKEECEYLINLAKPHMMKSTVVDSKTG 120

Query: 121 QSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYD 180
           +SKDSRVRTSSG FL+RG+DR+IR IEKRIADF+FIP+EHGEG+QVLHYEVGQKY+AH+D
Sbjct: 121 RSKDSRVRTSSGMFLRRGRDRVIREIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEAHFD 180

Query: 181 YFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVK 240
           YFLDEFNTKNGGQR ATLLMYLSDVEEGGETVFPAAN N ++V WWNELSEC KQGLS+K
Sbjct: 181 YFLDEFNTKNGGQRTATLLMYLSDVEEGGETVFPAANMNISAVPWWNELSECAKQGLSLK 240

Query: 241 PKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           PK G+ALLFWS RPDATLDPSSLHG CPVIRGNKWSATKWMH+ EYK
Sbjct: 241 PKMGNALLFWSTRPDATLDPSSLHGSCPVIRGNKWSATKWMHLGEYK 287


>gi|255539064|ref|XP_002510597.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223551298|gb|EEF52784.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 289

 Score =  461 bits (1186), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 229/287 (79%), Positives = 256/287 (89%), Gaps = 1/287 (0%)

Query: 1   MVKLRHSRLQAKKWSTLTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSPPNDLT-SFRR 59
           M K+RHSRLQA+K STLTLV SMLFMLT+VLLMLLA+GIF +P+  DDSPPNDL  S+RR
Sbjct: 1   MAKMRHSRLQARKMSTLTLVFSMLFMLTVVLLMLLALGIFSLPMSTDDSPPNDLAASYRR 60

Query: 60  RAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKT 119
            A E+      ++ EQWTEI++WEPRAFVYHNFLSK ECEYLI LAKP+MVKSTVVDSKT
Sbjct: 61  MAAERDYDGLGKRVEQWTEIISWEPRAFVYHNFLSKEECEYLIALAKPHMVKSTVVDSKT 120

Query: 120 GQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHY 179
           G+SKDSRVRTSSG FL+RG+D+IIR IEKRIADF+FIP+EHGEG+QVLHYEVGQKY+AHY
Sbjct: 121 GRSKDSRVRTSSGMFLRRGRDKIIRNIEKRIADFSFIPIEHGEGLQVLHYEVGQKYEAHY 180

Query: 180 DYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSV 239
           DYFLDEFNTKNGGQR ATLLMYLSDVEEGGETVFPAA AN ++V  WNELSEC +QGLSV
Sbjct: 181 DYFLDEFNTKNGGQRTATLLMYLSDVEEGGETVFPAAKANISNVPSWNELSECARQGLSV 240

Query: 240 KPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           KPK G+ALLFWS RPDATLDP+SLHG CPVIRGNKWSATKWMH+ EY
Sbjct: 241 KPKMGNALLFWSTRPDATLDPASLHGSCPVIRGNKWSATKWMHLGEY 287


>gi|356517655|ref|XP_003527502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
          Length = 290

 Score =  451 bits (1161), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 213/286 (74%), Positives = 251/286 (87%)

Query: 3   KLRHSRLQAKKWSTLTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSPPNDLTSFRRRAF 62
           K  H+R Q KKWST +LVL  LF LT++L++LLA+GI Y+P  DDD P  DL++FRR+  
Sbjct: 5   KHTHTRAQGKKWSTFSLVLWALFFLTLILVVLLALGIVYLPTTDDDFPTTDLSAFRRKTS 64

Query: 63  EKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQS 122
           +   S+ E   EQWTEI++WEPRAF+YHNFLSK ECEYLI+LAKP MVKS+VVDSKTG+S
Sbjct: 65  QSAESLVENPPEQWTEILSWEPRAFIYHNFLSKEECEYLIELAKPQMVKSSVVDSKTGKS 124

Query: 123 KDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF 182
            +SRVRTSSG FLKRG+D+I++ IEKRIADFTFIP E+GEG+Q+LHYEVGQKY+ HYDYF
Sbjct: 125 TESRVRTSSGMFLKRGKDKIVQNIEKRIADFTFIPEENGEGLQILHYEVGQKYEPHYDYF 184

Query: 183 LDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPK 242
           LDEFNTKNGGQR+AT+LMYLSDVEEGGETVFPAANANF+SV WWN+LS+C ++GLSVKPK
Sbjct: 185 LDEFNTKNGGQRIATVLMYLSDVEEGGETVFPAANANFSSVPWWNDLSQCARKGLSVKPK 244

Query: 243 RGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
            GDALLFWSMRPDATLDPSSLHGGCPVI+GNKWS+TKWMH+ EYK 
Sbjct: 245 MGDALLFWSMRPDATLDPSSLHGGCPVIKGNKWSSTKWMHLREYKV 290


>gi|225459748|ref|XP_002285898.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Vitis vinifera]
 gi|302141716|emb|CBI18919.3| unnamed protein product [Vitis vinifera]
          Length = 288

 Score =  444 bits (1143), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 231/289 (79%), Positives = 258/289 (89%), Gaps = 2/289 (0%)

Query: 1   MVKLRHSRLQAKKWSTLTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDS-PPNDLTSFRR 59
           M K R+SR   K+WSTL LVLS+L MLT+VLLMLLA+GI  +PIG  DS   NDL+SFRR
Sbjct: 1   MAKGRYSRGHGKRWSTLALVLSLLLMLTVVLLMLLALGIVSLPIGTVDSDAANDLSSFRR 60

Query: 60  RAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKT 119
           + F+    + + +GEQWTEIV+WEPRAF+YHNFLSK ECEY+I LAKPYM KSTVVDS+T
Sbjct: 61  KTFDGGEGLGK-RGEQWTEIVSWEPRAFIYHNFLSKEECEYMISLAKPYMKKSTVVDSET 119

Query: 120 GQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHY 179
           G+SKDSRVRTSSG FL+RG+D+IIR IEKRIADFTFIP+EHGEG+QVLHYEVGQKYDAHY
Sbjct: 120 GRSKDSRVRTSSGMFLRRGRDKIIRDIEKRIADFTFIPVEHGEGLQVLHYEVGQKYDAHY 179

Query: 180 DYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSV 239
           DYFLDEFNTKNGGQR+ATLLMYLSDVEEGGETVFPA  ANF+SV WWNELSECGK+GLSV
Sbjct: 180 DYFLDEFNTKNGGQRIATLLMYLSDVEEGGETVFPATKANFSSVPWWNELSECGKKGLSV 239

Query: 240 KPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           KPK GDALLFWSMRPDATLDPSSLHGGCPVI+GNKWS+TKWMHV EYKA
Sbjct: 240 KPKMGDALLFWSMRPDATLDPSSLHGGCPVIKGNKWSSTKWMHVEEYKA 288


>gi|357483925|ref|XP_003612249.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355513584|gb|AES95207.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 289

 Score =  437 bits (1124), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 205/290 (70%), Positives = 245/290 (84%), Gaps = 3/290 (1%)

Query: 1   MVKLRHSRLQAKKWST-LTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSP-PNDLTSFR 58
           M K R+SRL ++K S+  TL+ S+    T ++L+LL  GI  IP  + + P PNDLTS  
Sbjct: 1   MAKSRYSRLPSRKSSSPYTLIFSLFIAFTFLILILLVFGILSIPSSNQNLPKPNDLTSIV 60

Query: 59  RRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSK 118
               + R+   E KGEQW E+V+WEPRAFVYHNFL+K ECEYLID+AKP M KSTVVDS+
Sbjct: 61  HNTVD-RNDDEEGKGEQWVEVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVVDSE 119

Query: 119 TGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAH 178
           TG+SKDSRVRTSSGTFL RG+D+I+R IEK+IADFTFIP+EHGEG+QVLHYEVGQKY+ H
Sbjct: 120 TGKSKDSRVRTSSGTFLARGRDKIVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQKYEPH 179

Query: 179 YDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLS 238
           YDYFLDEFNTKNGGQR+AT+LMYL+DVEEGGETVFPAA  NF++V W+NELS+CGK+GLS
Sbjct: 180 YDYFLDEFNTKNGGQRIATVLMYLTDVEEGGETVFPAAKGNFSNVPWYNELSDCGKKGLS 239

Query: 239 VKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           +KPKRGDALLFWSM+PDATLD SSLHGGCPVI+GNKWS+TKW+ V+EYK 
Sbjct: 240 IKPKRGDALLFWSMKPDATLDASSLHGGCPVIKGNKWSSTKWIRVNEYKT 289


>gi|363806698|ref|NP_001242522.1| uncharacterized protein LOC100806046 [Glycine max]
 gi|255647110|gb|ACU24023.1| unknown [Glycine max]
          Length = 289

 Score =  434 bits (1117), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 205/291 (70%), Positives = 246/291 (84%), Gaps = 5/291 (1%)

Query: 1   MVKLRHSRLQAKKWSTL-TLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDS--PPNDLTSF 57
           M K R+SRL  +K S+  TL+L++  + T ++L+LLA+GI  IP     +   PNDL S 
Sbjct: 1   MAKPRYSRLPPRKSSSSSTLILTLFLVFTFLVLILLALGILSIPSSSRGNLPKPNDLASI 60

Query: 58  RRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDS 117
            R   E  +S ++E+GEQW E+V+WEPRAFVYHNFL+K ECEYLID+AKP M KSTVVDS
Sbjct: 61  ARNTIE--TSDSDERGEQWVEVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVVDS 118

Query: 118 KTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDA 177
           +TG+SKDSRVRTSSGTFL RG+D+I+R IEK+I+DFTFIP+EHGEG+QVLHYEVGQKY+ 
Sbjct: 119 ETGKSKDSRVRTSSGTFLARGRDKIVRNIEKKISDFTFIPVEHGEGLQVLHYEVGQKYEP 178

Query: 178 HYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGL 237
           HYDYFLD+FNTKNGGQR+AT+LMYL+DVEEGGETVFPAA  NF+ V WWNEL ECGK+GL
Sbjct: 179 HYDYFLDDFNTKNGGQRIATVLMYLTDVEEGGETVFPAAKGNFSFVPWWNELFECGKKGL 238

Query: 238 SVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           S+KPKRGDALLFWSM+PDA+LDPSSLHGGCPVI+GNKWS+TKWM V EY A
Sbjct: 239 SIKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWMRVSEYNA 289


>gi|356540840|ref|XP_003538892.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Glycine max]
          Length = 290

 Score =  431 bits (1108), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 205/291 (70%), Positives = 244/291 (83%), Gaps = 4/291 (1%)

Query: 1   MVKLRHSRLQAKKWSTL-TLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDS--PPNDLTSF 57
           M K R+SRL  +K S+  TL+L++  + T ++L+LLA+GI  IP     +   PNDL S 
Sbjct: 1   MAKPRYSRLPPRKSSSSSTLILTLFLVFTFLVLILLALGILSIPSSSRGNLPKPNDLASI 60

Query: 58  RRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDS 117
            R      S   + +GEQW E+V+WEPRAFVYHNFL+K ECEYLID+AKP M KS+VVDS
Sbjct: 61  ARNTIHT-SDDDDVRGEQWVEVVSWEPRAFVYHNFLTKEECEYLIDIAKPNMHKSSVVDS 119

Query: 118 KTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDA 177
           +TG+SKDSRVRTSSGTFL RG+D+I+R IEKRIA ++FIP+EHGEG+QVLHYEVGQKY+ 
Sbjct: 120 ETGKSKDSRVRTSSGTFLARGRDKIVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQKYEP 179

Query: 178 HYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGL 237
           HYDYFLD+FNTKNGGQR+AT+LMYL+DVEEGGETVFPAA  NF+SV WWNELSECGK+GL
Sbjct: 180 HYDYFLDDFNTKNGGQRIATVLMYLTDVEEGGETVFPAAKGNFSSVPWWNELSECGKKGL 239

Query: 238 SVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           S+KPKRGDALLFWSM+PDATLDPSSLHGGCPVI+GNKWS+TKWM V EYKA
Sbjct: 240 SIKPKRGDALLFWSMKPDATLDPSSLHGGCPVIKGNKWSSTKWMRVSEYKA 290


>gi|357467085|ref|XP_003603827.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492875|gb|AES74078.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 280

 Score =  431 bits (1107), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 207/286 (72%), Positives = 246/286 (86%), Gaps = 10/286 (3%)

Query: 3   KLRHSRLQAKKWSTLTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSPPNDLTSFRRRAF 62
           K  H R Q KK STL  +L  LFMLT+V+++LLA+GI Y+P   DDS    L + RR+ +
Sbjct: 5   KHTHPRSQVKKLSTL--ILLTLFMLTLVIIVLLALGILYLPNTTDDS----LITDRRKIY 58

Query: 63  EKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQS 122
           E   S+AE+K EQWTEI++WEPRAFVYHNFLSK ECE+LI+LAKP++ KS+VVDSKTG+S
Sbjct: 59  E---SLAEKK-EQWTEILSWEPRAFVYHNFLSKEECEHLINLAKPFLAKSSVVDSKTGKS 114

Query: 123 KDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF 182
            +SRVRTSSG FLKRG+D+II+ IE+RIADFTFIP+E+GEG+QVLHY VG+KY+ HYDYF
Sbjct: 115 TESRVRTSSGMFLKRGKDKIIQNIERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYF 174

Query: 183 LDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPK 242
           LDEFNTKNGGQR+AT+LMYLSDVEEGGETVFPAA ANF+SV WWN+LSEC ++GLS+KPK
Sbjct: 175 LDEFNTKNGGQRVATVLMYLSDVEEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPK 234

Query: 243 RGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
            GDALLFWSMRPDATLD SSLHGGCPVI GNKWS+TKWMH+ EYK 
Sbjct: 235 MGDALLFWSMRPDATLDASSLHGGCPVIVGNKWSSTKWMHLEEYKV 280


>gi|225468574|ref|XP_002263060.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296084059|emb|CBI24447.3| unnamed protein product [Vitis vinifera]
          Length = 288

 Score =  429 bits (1103), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 209/288 (72%), Positives = 247/288 (85%), Gaps = 2/288 (0%)

Query: 1   MVKLRHSRLQAKKWSTLTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSP-PNDLTSFRR 59
           M K R+ R   +K S+ T+VLSML M + V+L+LLA+G+  IP    DSP  NDL++   
Sbjct: 1   MAKPRYPRFPPRKSSSSTVVLSMLLMFSFVVLVLLALGLLSIPSHSGDSPRANDLSTIVH 60

Query: 60  RAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKT 119
           R  E RS   + KGE W+E+++WEPRAFVYHNFLSK ECEYLI LAKP+M KSTVVDS T
Sbjct: 61  RTVE-RSDGNDGKGEPWSEVISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTVVDSST 119

Query: 120 GQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHY 179
           G+SKDSRVRTSSGTFL RGQD+IIRGIEKR++DFTF+P+EHGEG+Q+LHYEVGQKY+ HY
Sbjct: 120 GKSKDSRVRTSSGTFLTRGQDKIIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQKYEPHY 179

Query: 180 DYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSV 239
           DYFLD++NTKNGGQRMAT+LMYLSDVEEGGETVFPAA  NF+SV WWNELS+CGK+GLSV
Sbjct: 180 DYFLDDYNTKNGGQRMATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKEGLSV 239

Query: 240 KPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           KPK GDALLFWSM+PDA+LDPSSLHGGCPVI+GNKWS+TKW+ V+EYK
Sbjct: 240 KPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287


>gi|147800995|emb|CAN64470.1| hypothetical protein VITISV_014644 [Vitis vinifera]
          Length = 288

 Score =  428 bits (1101), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 209/288 (72%), Positives = 246/288 (85%), Gaps = 2/288 (0%)

Query: 1   MVKLRHSRLQAKKWSTLTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSP-PNDLTSFRR 59
           M K R+ R   +K S+ T+VLSML M + V+L+LLA+G+  IP    DSP  NDL++   
Sbjct: 1   MAKPRYPRFPPRKSSSSTVVLSMLLMFSFVVLVLLALGLLSIPSHSGDSPRANDLSTIVH 60

Query: 60  RAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKT 119
           R  E RS   + KGE W+E+++WEPRAFVYHNFLSK ECEYLI LAKP+M KSTVVDS T
Sbjct: 61  RTVE-RSDGNDGKGEPWSEVISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTVVDSST 119

Query: 120 GQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHY 179
           G+SKDSRVRTSSGTFL RGQD+IIRGIEKR++DFTF+P+EHGEG+Q+LHYEVGQKY+ HY
Sbjct: 120 GKSKDSRVRTSSGTFLTRGQDKIIRGIEKRLSDFTFLPVEHGEGLQILHYEVGQKYEPHY 179

Query: 180 DYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSV 239
           DYFLD++NTKNGGQRMAT+LMYLSDVEEGGETVFPAA  NF+SV WWNELS CGK+GLSV
Sbjct: 180 DYFLDDYNTKNGGQRMATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSXCGKEGLSV 239

Query: 240 KPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           KPK GDALLFWSM+PDA+LDPSSLHGGCPVI+GNKWS+TKW+ V+EYK
Sbjct: 240 KPKMGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWIRVNEYK 287


>gi|240256489|ref|NP_201407.4| iron ion binding / oxidoreductase/ oxidoreductase protein
           [Arabidopsis thaliana]
 gi|332010770|gb|AED98153.1| iron ion binding / oxidoreductase/ oxidoreductase protein
           [Arabidopsis thaliana]
          Length = 289

 Score =  425 bits (1092), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 200/288 (69%), Positives = 238/288 (82%), Gaps = 2/288 (0%)

Query: 1   MVKLRHSRLQAKKWSTLTLVLSMLFMLTIVLLMLLAMGIFYIPIGD-DDSPPNDLTSFRR 59
           M + R+ R  A+K S  TLV ++L M T V+L+LLA GI  +P  +   S  NDLTS  R
Sbjct: 2   MARPRNHRPSARKSSHSTLVFAVLIMSTFVILILLAFGILSVPSNNAGSSKANDLTSIVR 61

Query: 60  RAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKT 119
           +  + RS   + K E+W EI++WEPRA VYHNFL+K EC+YLI+LAKP+M KSTVVD KT
Sbjct: 62  KTLQ-RSGEDDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKT 120

Query: 120 GQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHY 179
           G+S DSRVRTSSGTFL RG+D+ IR IEKRI+DFTFIP+EHGEG+QVLHYE+GQKY+ HY
Sbjct: 121 GKSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHY 180

Query: 180 DYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSV 239
           DYF+DE+NT+NGGQR+AT+LMYLSDVEEGGETVFPAA  N+++V WWNELSECGK GLSV
Sbjct: 181 DYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSV 240

Query: 240 KPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           KPK GDALLFWSM PDATLDPSSLHGGC VI+GNKWS+TKW+ VHEYK
Sbjct: 241 KPKMGDALLFWSMTPDATLDPSSLHGGCAVIKGNKWSSTKWLRVHEYK 288


>gi|449434114|ref|XP_004134841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 287

 Score =  421 bits (1081), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 208/290 (71%), Positives = 245/290 (84%), Gaps = 5/290 (1%)

Query: 1   MVKLRHSRLQAKKWSTL-TLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSPP-NDLTSFR 58
           M K R SR   +K S+  TLV ++L M T V+L+LLA+GI  IP     S   +DL+S  
Sbjct: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60

Query: 59  RRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSK 118
           R+  +    + EEKGEQW E+++WEPRAFVYHNFL+K ECEYLI LAKP+M KSTVVDS+
Sbjct: 61  RKTSD---DVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSE 117

Query: 119 TGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAH 178
           TGQSKDSRVRTSSGTFL RG+D+ +R IEKR++DF+FIP+EHGEG+QVLHYEVGQKY+ H
Sbjct: 118 TGQSKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPH 177

Query: 179 YDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLS 238
           +DYFLDE+NTKNGGQR+AT+LMYLSDVEEGGETVFPAA  NF+SV WWNELS+CGK+GLS
Sbjct: 178 FDYFLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLS 237

Query: 239 VKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           VKPKRGDALLFWSM+PDA+LDPSSLHGGCPVI+GNKWSATKWM V EYKA
Sbjct: 238 VKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWMRVEEYKA 287


>gi|449491267|ref|XP_004158845.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 287

 Score =  419 bits (1076), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 207/290 (71%), Positives = 245/290 (84%), Gaps = 5/290 (1%)

Query: 1   MVKLRHSRLQAKKWSTL-TLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSPP-NDLTSFR 58
           M K R SR   +K S+  TLV ++L M T V+L+LLA+GI  IP     S   +DL+S  
Sbjct: 1   MAKHRQSRFPTRKSSSSSTLVFTLLIMFTFVILILLALGILSIPGNSGGSTKVHDLSSIV 60

Query: 59  RRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSK 118
           R+  +    + EEKGEQW E+++WEPRAFVYHNFL+K ECEYLI LAKP+M KSTVVDS+
Sbjct: 61  RKTSD---DVDEEKGEQWVEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSE 117

Query: 119 TGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAH 178
           TGQSKDSRVRTSSGTFL RG+D+ +R IEKR++DF+FIP+EHGEG+QVLHYEVGQKY+ H
Sbjct: 118 TGQSKDSRVRTSSGTFLPRGRDKTVRTIEKRLSDFSFIPVEHGEGLQVLHYEVGQKYEPH 177

Query: 179 YDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLS 238
           +DYFLDE+NTKNGGQR+AT+LMYLSDVEEGGETVFPAA  NF+SV WWNELS+CGK+GLS
Sbjct: 178 FDYFLDEYNTKNGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLS 237

Query: 239 VKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           VKPKRGDALLFWSM+PDA+LDPSSLHGGCPVI+GNKWSATKW+ V EYKA
Sbjct: 238 VKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSATKWVRVEEYKA 287


>gi|255579590|ref|XP_002530636.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223529809|gb|EEF31744.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 287

 Score =  414 bits (1064), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 206/288 (71%), Positives = 242/288 (84%), Gaps = 3/288 (1%)

Query: 1   MVKLRHSRLQAKKWST-LTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSPPNDLTSFRR 59
           M K R+SRL A+K S+  T++L+ML M T V+L+LLA+GI  +P    D   +DLT+   
Sbjct: 1   MAKARYSRLPARKSSSPTTMILTMLLMFTFVILILLALGILSVPSNSGDKA-HDLTTIVH 59

Query: 60  RAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKT 119
              E+     + KGE+W E+++WEPRAFVYHNFL+K ECEYLI+LAKP M KSTVVDS+T
Sbjct: 60  NK-EQSFDGDDGKGERWAEVISWEPRAFVYHNFLTKEECEYLINLAKPNMQKSTVVDSET 118

Query: 120 GQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHY 179
           G+SKDSRVRTSSGTFL RG+D+ IR IEKRIADF+FIP+EHGEG+QVLHYEVGQKY+ H+
Sbjct: 119 GRSKDSRVRTSSGTFLSRGRDKKIRDIEKRIADFSFIPVEHGEGLQVLHYEVGQKYEPHF 178

Query: 180 DYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSV 239
           DYF DEFNTKNGGQR+ATLLMYLSDVEEGGETVFPAA  NF++V WWNELSECGK+GLSV
Sbjct: 179 DYFNDEFNTKNGGQRVATLLMYLSDVEEGGETVFPAAKGNFSAVPWWNELSECGKKGLSV 238

Query: 240 KPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           KP  GDALLFWSM+PDATLDPSSLHGGCPVI GNKWSATKWM V+EY+
Sbjct: 239 KPNMGDALLFWSMKPDATLDPSSLHGGCPVINGNKWSATKWMRVNEYR 286


>gi|449529555|ref|XP_004171765.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 284

 Score =  412 bits (1060), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 199/285 (69%), Positives = 229/285 (80%), Gaps = 7/285 (2%)

Query: 3   KLRHSRLQAKKWSTLTLV-LSMLFMLTIVLLMLLAMGIFYIPIGDDDSPPNDLTSFRRRA 61
           K ++ +LQ +KWST  L  + M  +L +   ML+A+  F  P    ++  +  +S R  A
Sbjct: 5   KGKYIKLQGRKWSTFQLSKMIMALVLALGFFMLIALRFFSPP----ETSHHRFSSVRHTA 60

Query: 62  FEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQ 121
           F   S    ++G+QW E ++WEPRAFVYHNFLSK EC YLI LAKP+M KSTVVDSKTG+
Sbjct: 61  F--LSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGE 118

Query: 122 SKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDY 181
           S DSRVRTSSG FL RGQD+IIR IEKRIADFTFIP+EHGEG+Q+LHYEVGQKYDAHYDY
Sbjct: 119 SVDSRVRTSSGMFLNRGQDKIIRNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYDY 178

Query: 182 FLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKP 241
           F+DE+N K GGQRMATLLMYLSDVEEGGETVFPAA  NF+SV WWNELSECGK GLSVKP
Sbjct: 179 FVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKP 238

Query: 242 KRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           K GDALLFWSM+PDATLDP+SLHG CPVIRGNKWS TKWMHV +Y
Sbjct: 239 KMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVDKY 283


>gi|357137804|ref|XP_003570489.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
           distachyon]
          Length = 318

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 191/290 (65%), Positives = 224/290 (77%), Gaps = 17/290 (5%)

Query: 15  STLTLVLSMLFMLTIVLLMLLAMGIFYIPIG-----------------DDDSPPNDLTSF 57
           S   + L  L + +  LL L+A G+F +P+                   ++S  ++  S 
Sbjct: 28  SPYAVALGALLIASAFLLALIAFGVFSLPVSAPNLATTAGGGETESGSTEESGGSESHSA 87

Query: 58  RRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDS 117
           R RA    S    E+G QWTE+++WEPRAFVYHNFLSK ECEYLI LAKP M KSTVVDS
Sbjct: 88  RSRARRDLSEGLGERGAQWTEVISWEPRAFVYHNFLSKEECEYLIGLAKPRMEKSTVVDS 147

Query: 118 KTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDA 177
            TG+SKDSRVRTSSG FL+RG+D++IR IE+RIAD+TFIP EHGEG+QVLHYEVGQKY+ 
Sbjct: 148 TTGKSKDSRVRTSSGMFLRRGRDKVIRAIERRIADYTFIPAEHGEGLQVLHYEVGQKYEP 207

Query: 178 HYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGL 237
           H+DYFLDEFNTKNGGQRMAT+LMYLSDVEEGGET+FP AN N +S+ W NELSEC ++GL
Sbjct: 208 HFDYFLDEFNTKNGGQRMATILMYLSDVEEGGETIFPDANVNSSSLPWHNELSECARKGL 267

Query: 238 SVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           +VKPK GDALLFWSM PDATLDP SLHGGCPVIRGNKWS+TKWMHV EYK
Sbjct: 268 AVKPKMGDALLFWSMNPDATLDPLSLHGGCPVIRGNKWSSTKWMHVGEYK 317


>gi|242063586|ref|XP_002453082.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
 gi|241932913|gb|EES06058.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
          Length = 307

 Score =  400 bits (1027), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 180/218 (82%), Positives = 203/218 (93%)

Query: 71  EKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTS 130
           E+G QWTE+++WEPRAFVYHNFLSK ECEYLI LAKP+MVKSTVVDS TG+SKDSRVRTS
Sbjct: 90  ERGAQWTEVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTS 149

Query: 131 SGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKN 190
           SG FL+RG+D++IR IEKRIAD+TFIP +HGEG+QVLHYEVGQKY+ H+DYFLDEFNTKN
Sbjct: 150 SGMFLQRGRDKVIRAIEKRIADYTFIPADHGEGLQVLHYEVGQKYEPHFDYFLDEFNTKN 209

Query: 191 GGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFW 250
           GGQRMATLLMYLSDVEEGGET+FP AN N +S+ W+NELSEC K+GLSVKPK GDALLFW
Sbjct: 210 GGQRMATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSECAKRGLSVKPKMGDALLFW 269

Query: 251 SMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           SM+PDATLDP SLHGGCPVIRGNKWS+TKWMH+HEYKA
Sbjct: 270 SMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHEYKA 307


>gi|48716447|dbj|BAD23054.1| putative prolyl 4-hydroxylase [Oryza sativa Japonica Group]
          Length = 310

 Score =  399 bits (1026), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 188/266 (70%), Positives = 219/266 (82%), Gaps = 12/266 (4%)

Query: 35  LAMGIFYIPI------------GDDDSPPNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAW 82
           +A G+F +P+               D+ P D    R RA    S    E+G QWTE+++W
Sbjct: 45  IAFGVFSLPVSAPNAATTDSAAAGGDAEPADPRPPRTRARRDLSEGLGERGAQWTEVISW 104

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           EPRAFVYHNFLSK EC+YLI LAKP+MVKSTVVDS TG+SKDSRVRTSSG FL+RG+D++
Sbjct: 105 EPRAFVYHNFLSKEECDYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKV 164

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           IR IEKRIAD+TFIPMEHGEG+QVLHYEVGQKY+ H+DYFLDE+NTKNGGQRMATLLMYL
Sbjct: 165 IRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRMATLLMYL 224

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           SDVEEGGET+FP AN N +S+ W+NELSEC ++GL+VKPK GDALLFWSM+PDATLDP S
Sbjct: 225 SDVEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALLFWSMKPDATLDPLS 284

Query: 263 LHGGCPVIRGNKWSATKWMHVHEYKA 288
           LHGGCPVI+GNKWS+TKWMHV EYKA
Sbjct: 285 LHGGCPVIKGNKWSSTKWMHVREYKA 310


>gi|226529219|ref|NP_001151238.1| LOC100284871 [Zea mays]
 gi|195645242|gb|ACG42089.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
 gi|347978812|gb|AEP37748.1| prolyl 4-hydroxylase 5 [Zea mays]
 gi|413923983|gb|AFW63915.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
          Length = 308

 Score =  399 bits (1026), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 180/218 (82%), Positives = 204/218 (93%)

Query: 71  EKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTS 130
           E+G QWTE+++WEPRAFVYHNFLSK ECEYLI LAKP+MVKSTVVDS TG+SKDSRVRTS
Sbjct: 91  ERGAQWTEVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTS 150

Query: 131 SGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKN 190
           SG FL+RG+D++IR IEKRIAD+TFIP++HGEG+QVLHYEVGQKY+ H+DYFLDEFNTKN
Sbjct: 151 SGMFLQRGRDKVIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNTKN 210

Query: 191 GGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFW 250
           GGQRMATLLMYLSDVEEGGET+FP AN N +S+ W+NELSEC K+GLSVKPK GDALLFW
Sbjct: 211 GGQRMATLLMYLSDVEEGGETIFPDANVNVSSLPWYNELSECAKRGLSVKPKMGDALLFW 270

Query: 251 SMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           SM+PDATLDP SLHGGCPVIRGNKWS+TKWMH+HEYKA
Sbjct: 271 SMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHEYKA 308


>gi|449443243|ref|XP_004139389.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 284

 Score =  397 bits (1020), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 190/286 (66%), Positives = 225/286 (78%), Gaps = 13/286 (4%)

Query: 5   RHSRLQAKKWSTLTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSPP----NDLTSFRRR 60
           ++ +LQ KKWST  L         +++ ++LA+G F +      SPP    +  +S R  
Sbjct: 7   KYIKLQGKKWSTFQLS-------KMIMALVLALGFFMLIALRFLSPPETSHHRFSSVRHT 59

Query: 61  AFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTG 120
           AF   S    ++G+QW E ++WEPRAFVYHNFLSK EC YLI LAKP+M KSTVVD++TG
Sbjct: 60  AF--LSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDNETG 117

Query: 121 QSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYD 180
           ++ +  VRTSSG FL RGQD+I+  IEKRIADFTFIP+EHGEG+Q+LHYEVGQKYDAHYD
Sbjct: 118 KNVEDSVRTSSGMFLNRGQDKIVSNIEKRIADFTFIPIEHGEGLQILHYEVGQKYDAHYD 177

Query: 181 YFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVK 240
           YF+DE+N K GGQRMATLLMYLSDVEEGGETVFPAA  NF+SV WWNELS+CGK GLSVK
Sbjct: 178 YFVDEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSKCGKGGLSVK 237

Query: 241 PKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           PK GDALLFWSM+PDATLDP+SLHG CPVIRGNKWS TKWMHV +Y
Sbjct: 238 PKMGDALLFWSMKPDATLDPTSLHGACPVIRGNKWSCTKWMHVDKY 283


>gi|259490206|ref|NP_001159002.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
 gi|195626402|gb|ACG35031.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
 gi|347978830|gb|AEP37757.1| prolyl 4-hydroxylase 8 [Zea mays]
 gi|347978832|gb|AEP37758.1| prolyl 4-hydroxylase 8-1 [Zea mays]
 gi|413939569|gb|AFW74120.1| prolyl 4-hydroxylase alpha-2 subunit isoform 1 [Zea mays]
 gi|413939570|gb|AFW74121.1| prolyl 4-hydroxylase alpha-2 subunit isoform 2 [Zea mays]
 gi|413939571|gb|AFW74122.1| prolyl 4-hydroxylase alpha-2 subunit isoform 3 [Zea mays]
 gi|413939572|gb|AFW74123.1| prolyl 4-hydroxylase alpha-2 subunit isoform 4 [Zea mays]
          Length = 307

 Score =  395 bits (1016), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 186/265 (70%), Positives = 217/265 (81%), Gaps = 17/265 (6%)

Query: 37  MGIFYIPIGDDDSPPN-------------DLTSFRRRAFEKRSSIAEEKGEQWTEIVAWE 83
            G+F +P+    S PN             +    R RA         E+G QWTE+++WE
Sbjct: 47  FGVFSLPV----SAPNAAATTGTAAGGETESADVRPRARRDLGEGLGERGAQWTEVISWE 102

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PRAFVYHNFLSK ECEYLI LAKP+MVKSTVVDS TG+SKDSRVRTSSG FL+RG+D++I
Sbjct: 103 PRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKVI 162

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLS 203
           R IEKRIAD+TFIP++HGEG+QVLHYEVGQKY+ H+DYFLDEFNTKNGGQR+ATLLMYLS
Sbjct: 163 RAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNTKNGGQRIATLLMYLS 222

Query: 204 DVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSL 263
           DVEEGGET+FP AN N +S+ W+NELS+C K+GLSVKPK GDALLFWSM+PDATLDP SL
Sbjct: 223 DVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWSMKPDATLDPLSL 282

Query: 264 HGGCPVIRGNKWSATKWMHVHEYKA 288
           HGGCPVI+GNKWS+TKWMH+HEYKA
Sbjct: 283 HGGCPVIKGNKWSSTKWMHIHEYKA 307


>gi|363543369|ref|NP_001241694.1| prolyl 4-hydroxylase 8-4 [Zea mays]
 gi|347978838|gb|AEP37761.1| prolyl 4-hydroxylase 8-4 [Zea mays]
          Length = 307

 Score =  394 bits (1012), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 185/265 (69%), Positives = 217/265 (81%), Gaps = 17/265 (6%)

Query: 37  MGIFYIPIGDDDSPPN-------------DLTSFRRRAFEKRSSIAEEKGEQWTEIVAWE 83
            G+F +P+    S PN             +    R RA         E+G QWTE+++WE
Sbjct: 47  FGVFSLPV----SAPNAAATTGTAAGGETESADVRPRARRDLGEGLGERGAQWTEVISWE 102

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PRAFVYHNFLSK ECEYLI LAKP+MVKSTVVDS TG+SKDSRVRTSSG FL+RG++++I
Sbjct: 103 PRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRNKVI 162

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLS 203
           R IEKRIAD+TFIP++HGEG+QVLHYEVGQKY+ H+DYFLDEFNTKNGGQR+ATLLMYLS
Sbjct: 163 RAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNTKNGGQRIATLLMYLS 222

Query: 204 DVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSL 263
           DVEEGGET+FP AN N +S+ W+NELS+C K+GLSVKPK GDALLFWSM+PDATLDP SL
Sbjct: 223 DVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWSMKPDATLDPLSL 282

Query: 264 HGGCPVIRGNKWSATKWMHVHEYKA 288
           HGGCPVI+GNKWS+TKWMH+HEYKA
Sbjct: 283 HGGCPVIKGNKWSSTKWMHIHEYKA 307


>gi|363543371|ref|NP_001241695.1| prolyl 4-hydroxylase 8-5 [Zea mays]
 gi|347978840|gb|AEP37762.1| prolyl 4-hydroxylase 8-5 [Zea mays]
          Length = 307

 Score =  393 bits (1009), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 185/265 (69%), Positives = 216/265 (81%), Gaps = 17/265 (6%)

Query: 37  MGIFYIPIGDDDSPPN-------------DLTSFRRRAFEKRSSIAEEKGEQWTEIVAWE 83
            G+F +P+    S PN             +    R RA         E+G QWTE+++WE
Sbjct: 47  FGVFSLPV----SAPNAAATTGTAAGGETESADVRPRARRDLGEGLGERGAQWTEVISWE 102

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PRAFVYHNFLSK ECEYLI LAKP+MVKSTVVDS TG+SKDSRVRTSSG FL+RG+D++I
Sbjct: 103 PRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKVI 162

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLS 203
           R IEKRIAD+TFIP++HGEG+QVLHYEVGQKY+ H+DYFLDEFNTKNGGQR+ATLLMYLS
Sbjct: 163 RAIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNTKNGGQRIATLLMYLS 222

Query: 204 DVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSL 263
           DVEEGGET+FP AN N +S+ W+NELS+C K+GLSVKPK GDALLFWSM+P ATLDP SL
Sbjct: 223 DVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWSMKPGATLDPLSL 282

Query: 264 HGGCPVIRGNKWSATKWMHVHEYKA 288
           HGGCPVI+GNKWS+TKWMH+HEYKA
Sbjct: 283 HGGCPVIKGNKWSSTKWMHIHEYKA 307


>gi|326489721|dbj|BAK01841.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 315

 Score =  393 bits (1009), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 177/231 (76%), Positives = 206/231 (89%)

Query: 58  RRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDS 117
           R R     S    E+G QWTE+++WEPRAFVYHNFLSK ECEYLI+LAKP MVKSTVVDS
Sbjct: 85  RNRGRRDLSEGLGERGAQWTEVISWEPRAFVYHNFLSKEECEYLIELAKPRMVKSTVVDS 144

Query: 118 KTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDA 177
           +TG+SKDSRVRTSSG FL+RG+D++IR IE+RIAD+TFIP EHGEG+QVLHYEVGQKY+ 
Sbjct: 145 ETGKSKDSRVRTSSGMFLQRGRDKVIRAIERRIADYTFIPAEHGEGLQVLHYEVGQKYEP 204

Query: 178 HYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGL 237
           H+DYFLDEFNTKNGGQRMAT+LMYLSD+EEGGET+FP AN N +S+ W+NELSEC ++GL
Sbjct: 205 HFDYFLDEFNTKNGGQRMATILMYLSDIEEGGETIFPDANVNSSSLPWYNELSECARKGL 264

Query: 238 SVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           +VKPK GDALLFWSM+PDATLDP SLHGGCPVI+GNKWS+TKW+HV EYKA
Sbjct: 265 AVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWLHVGEYKA 315


>gi|326495334|dbj|BAJ85763.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 300

 Score =  392 bits (1007), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 178/231 (77%), Positives = 207/231 (89%)

Query: 57  FRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVD 116
           FRR AFE R     EKGE WTE+++WEPRAF+YHNFLSK ECEYLI LAKP+M KSTVVD
Sbjct: 69  FRRSAFESRLEKGGEKGEPWTEVLSWEPRAFIYHNFLSKEECEYLISLAKPHMKKSTVVD 128

Query: 117 SKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYD 176
           S TG SKDSRVRTSSGTFL+RGQD+I+R IEKRI+DFTFIP+E+GEG+QVLHYEVGQKY+
Sbjct: 129 SATGGSKDSRVRTSSGTFLRRGQDKIVRTIEKRISDFTFIPVENGEGLQVLHYEVGQKYE 188

Query: 177 AHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQG 236
            H+DYF D+FNTKNGGQR+AT+LMYLSDVEEGGETVFP+A  N +S+ ++NELSEC K+G
Sbjct: 189 PHFDYFHDDFNTKNGGQRIATVLMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRG 248

Query: 237 LSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           +SVKPK GDALLFWSMRPD TLDP+SLHGGCPVI+G+KWS+TKW+ VHEYK
Sbjct: 249 ISVKPKMGDALLFWSMRPDGTLDPTSLHGGCPVIKGDKWSSTKWIRVHEYK 299


>gi|357146834|ref|XP_003574128.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
           distachyon]
          Length = 306

 Score =  389 bits (1000), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 178/231 (77%), Positives = 206/231 (89%)

Query: 57  FRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVD 116
           FRR A E R  +  EKGE WTE+++WEPRAF+YHNFLSK ECEYLI LAKP+M KSTVVD
Sbjct: 75  FRRSASESRLEMRGEKGEPWTEVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVD 134

Query: 117 SKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYD 176
           S TG SKDSRVRTSSGTFL+RGQD++IR IEKRI+DFTFIP E+GEG+QVLHYEVGQKY+
Sbjct: 135 SATGGSKDSRVRTSSGTFLRRGQDKVIRTIEKRISDFTFIPAENGEGLQVLHYEVGQKYE 194

Query: 177 AHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQG 236
            H+DYF D+FNTKNGGQR+ATLLMYLSDVEEGGETVFP+A  N +S+ ++NELSEC K+G
Sbjct: 195 PHFDYFHDDFNTKNGGQRIATLLMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRG 254

Query: 237 LSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           +SVKPK GDALLFWSMRPD TLDP+SLHGGCPVI+G+KWS+TKW+ VHEYK
Sbjct: 255 ISVKPKMGDALLFWSMRPDGTLDPTSLHGGCPVIKGDKWSSTKWIRVHEYK 305


>gi|224133600|ref|XP_002327635.1| predicted protein [Populus trichocarpa]
 gi|222836720|gb|EEE75113.1| predicted protein [Populus trichocarpa]
          Length = 291

 Score =  387 bits (995), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 196/290 (67%), Positives = 237/290 (81%), Gaps = 3/290 (1%)

Query: 1   MVKLRHSRLQAKKWSTLTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDS---PPNDLTSF 57
           M K R+SR+ ++K  + TL+L++L M T V+L+LL +GI  IP           NDL+S 
Sbjct: 1   MAKARYSRIPSRKSPSSTLILTLLLMFTFVILILLGLGILSIPSTSSSDSSRQANDLSSI 60

Query: 58  RRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDS 117
              +    S   E K EQW E+++W+PRAFVYHNFL+KAECEYLI+LAKP M KSTVVDS
Sbjct: 61  AHHSRIDGSGDDEGKAEQWAEVISWKPRAFVYHNFLTKAECEYLINLAKPRMQKSTVVDS 120

Query: 118 KTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDA 177
            TG+SKDS+VRTSSGTFL RG+D+I+R IEKRIADF+FIP+EHGEG+Q+LHYEVGQ+Y+ 
Sbjct: 121 STGKSKDSKVRTSSGTFLPRGRDKIVRDIEKRIADFSFIPVEHGEGLQILHYEVGQRYEP 180

Query: 178 HYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGL 237
           H+DYF+DE+NTKNGGQR+AT+LMYLSDVEEGGETVFP+A  N ++V WWNELSECGK GL
Sbjct: 181 HFDYFMDEYNTKNGGQRIATVLMYLSDVEEGGETVFPSAEGNISAVPWWNELSECGKGGL 240

Query: 238 SVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           SVKPK GDALLFWSM PD + DPSSLHGGCPVIRGNKWS+TKWM V+EYK
Sbjct: 241 SVKPKMGDALLFWSMNPDGSPDPSSLHGGCPVIRGNKWSSTKWMRVNEYK 290


>gi|297802350|ref|XP_002869059.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297314895|gb|EFH45318.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 290

 Score =  386 bits (992), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 184/289 (63%), Positives = 227/289 (78%), Gaps = 6/289 (2%)

Query: 3   KLRHSRLQAKKWSTLTLVLSMLFMLTIVLLMLLAMGIFYIP-IGDDDSPPNDLTSFRRRA 61
           K +H R Q +K S  T   ++L +   V+L+L+ +GI  +P      S P DLT+  +  
Sbjct: 4   KPKHLRNQPRK-SFSTQAFTVLILGLFVILILVGLGILSLPNTNKSSSRPMDLTTIVQ-T 61

Query: 62  FEKRSSIAEEK---GEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSK 118
            E+R S  +E+   G++W E+++WEPRAFVYHNFL+  ECE+LI LAKP MVKS VVD K
Sbjct: 62  IEERESYGDEEDGNGDRWLEVISWEPRAFVYHNFLTNEECEHLISLAKPSMVKSKVVDVK 121

Query: 119 TGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAH 178
           TG+S DSRVRTSSGTFLKRG D I+  IE RI+DFTFIP+E+GEG+QVLHYEVGQKY+ H
Sbjct: 122 TGKSIDSRVRTSSGTFLKRGHDEIVEEIENRISDFTFIPIENGEGLQVLHYEVGQKYEPH 181

Query: 179 YDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLS 238
           +DYF DEFN + GGQR+AT+LMYLSDV+EGGETVFPAA  N + V WW+ELS+CGK+GLS
Sbjct: 182 HDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNISDVPWWDELSQCGKEGLS 241

Query: 239 VKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           V PK+ DALLFWSM+PDA+LDPSSLHGGCPVI+GNKWS+TKW HVHEY 
Sbjct: 242 VLPKKRDALLFWSMKPDASLDPSSLHGGCPVIKGNKWSSTKWFHVHEYN 290


>gi|10177121|dbj|BAB10411.1| prolyl 4-hydroxylase, alpha subunit-like protein [Arabidopsis
           thaliana]
          Length = 267

 Score =  385 bits (988), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 183/266 (68%), Positives = 218/266 (81%), Gaps = 2/266 (0%)

Query: 1   MVKLRHSRLQAKKWSTLTLVLSMLFMLTIVLLMLLAMGIFYIPIGD-DDSPPNDLTSFRR 59
           M + R+ R  A+K S  TLV ++L M T V+L+LLA GI  +P  +   S  NDLTS  R
Sbjct: 2   MARPRNHRPSARKSSHSTLVFAVLIMSTFVILILLAFGILSVPSNNAGSSKANDLTSIVR 61

Query: 60  RAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKT 119
           +  + RS   + K E+W EI++WEPRA VYHNFL+K EC+YLI+LAKP+M KSTVVD KT
Sbjct: 62  KTLQ-RSGEDDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKT 120

Query: 120 GQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHY 179
           G+S DSRVRTSSGTFL RG+D+ IR IEKRI+DFTFIP+EHGEG+QVLHYE+GQKY+ HY
Sbjct: 121 GKSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHY 180

Query: 180 DYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSV 239
           DYF+DE+NT+NGGQR+AT+LMYLSDVEEGGETVFPAA  N+++V WWNELSECGK GLSV
Sbjct: 181 DYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSV 240

Query: 240 KPKRGDALLFWSMRPDATLDPSSLHG 265
           KPK GDALLFWSM PDATLDPSSLHG
Sbjct: 241 KPKMGDALLFWSMTPDATLDPSSLHG 266


>gi|15227885|ref|NP_179363.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|25411813|pir||F84555 similar to prolyl 4-hydroxylase alpha subunit [imported] -
           Arabidopsis thaliana
 gi|89274129|gb|ABD65585.1| At2g17720 [Arabidopsis thaliana]
 gi|110738861|dbj|BAF01353.1| similar to prolyl 4-hydroxylase alpha subunit [Arabidopsis
           thaliana]
 gi|330251579|gb|AEC06673.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 291

 Score =  384 bits (987), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 174/242 (71%), Positives = 208/242 (85%), Gaps = 1/242 (0%)

Query: 47  DDSPPNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAK 106
           + S  NDLT+  R++ E  S   E  GE+W E+++WEPRA VYHNFL+  ECE+LI LAK
Sbjct: 51  NSSKTNDLTNIVRKS-ETSSGDEEGNGERWVEVISWEPRAVVYHNFLTNEECEHLISLAK 109

Query: 107 PYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQV 166
           P MVKSTVVD KTG SKDSRVRTSSGTFL+RG D ++  IEKRI+DFTFIP+E+GEG+QV
Sbjct: 110 PSMVKSTVVDEKTGGSKDSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQV 169

Query: 167 LHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWW 226
           LHY+VGQKY+ HYDYFLDEFNTKNGGQR+AT+LMYLSDV++GGETVFPAA  N ++V WW
Sbjct: 170 LHYQVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWW 229

Query: 227 NELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           NELS+CGK+GLSV PK+ DALLFW+MRPDA+LDPSSLHGGCPV++GNKWS+TKW HVHE+
Sbjct: 230 NELSKCGKEGLSVLPKKRDALLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKWFHVHEF 289

Query: 287 KA 288
           K 
Sbjct: 290 KV 291


>gi|21593091|gb|AAM65040.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
          Length = 291

 Score =  383 bits (984), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 174/242 (71%), Positives = 207/242 (85%), Gaps = 1/242 (0%)

Query: 47  DDSPPNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAK 106
           + S  NDLT+  R++ E  S   E  GE+W E+++WEPRA VYHNFL+  ECE+LI LAK
Sbjct: 51  NSSKTNDLTNIVRKS-ETSSGDEEGNGERWVEVISWEPRAVVYHNFLTNEECEHLISLAK 109

Query: 107 PYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQV 166
           P MVKSTVVD KTG SKDSRVRTSSGTFL+RG D ++  IEKRI+DFTFIP+E+GEG+QV
Sbjct: 110 PSMVKSTVVDEKTGGSKDSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQV 169

Query: 167 LHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWW 226
           LHY+VGQKY+ HYDYFLDEFNTKNGGQR+AT+LMYLSDV++GGETVFPAA  N ++V WW
Sbjct: 170 LHYQVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWW 229

Query: 227 NELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           NELS+CGK+GLSV PK  DALLFW+MRPDA+LDPSSLHGGCPV++GNKWS+TKW HVHE+
Sbjct: 230 NELSKCGKEGLSVLPKXRDALLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKWFHVHEF 289

Query: 287 KA 288
           K 
Sbjct: 290 KV 291


>gi|297832394|ref|XP_002884079.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297329919|gb|EFH60338.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 291

 Score =  382 bits (982), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 173/242 (71%), Positives = 208/242 (85%), Gaps = 1/242 (0%)

Query: 47  DDSPPNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAK 106
           + S  NDLT+  R++ E      +  GE+W E+++WEPRA VYHNFLS  ECE+LI+LAK
Sbjct: 51  NSSKTNDLTNIVRKS-ETSYGDEDGNGERWVEVISWEPRAVVYHNFLSNEECEHLINLAK 109

Query: 107 PYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQV 166
           P MVKSTVVD KTG SKDSRVRTSSGTFL+RG D ++  IEKRI+DFTFIP+E+GEG+QV
Sbjct: 110 PSMVKSTVVDEKTGGSKDSRVRTSSGTFLRRGHDEVVEVIEKRISDFTFIPVENGEGLQV 169

Query: 167 LHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWW 226
           LHY+VGQKY+ HYDYFLDEFNTKNGGQR+AT+LMYLSDV++GGETVFPAA  N ++V WW
Sbjct: 170 LHYQVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWW 229

Query: 227 NELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           NELS+CGK+GLSV PK+ DALLFW+MRPDA+LDPSSLHGGCPV++GNKWS+TKW HVHE+
Sbjct: 230 NELSKCGKEGLSVLPKKRDALLFWNMRPDASLDPSSLHGGCPVVKGNKWSSTKWFHVHEF 289

Query: 287 KA 288
           K 
Sbjct: 290 KV 291


>gi|297797785|ref|XP_002866777.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297312612|gb|EFH43036.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 266

 Score =  382 bits (980), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 182/266 (68%), Positives = 216/266 (81%), Gaps = 2/266 (0%)

Query: 1   MVKLRHSRLQAKKWSTLTLVLSMLFMLTIVLLMLLAMGIFYIPIGD-DDSPPNDLTSFRR 59
           M + R  R  A+K S  TLV ++L M T V+L+LLA GI  +P  +   S  NDLTS  R
Sbjct: 1   MARPRSHRPSARKSSRSTLVFAVLIMSTFVILILLAFGILSVPSNNAGSSKANDLTSIVR 60

Query: 60  RAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKT 119
           +  + R    + K E+W EI++WEPRA VYHNFL+K EC+YLI+LAKP+M KSTVVD KT
Sbjct: 61  KTLQ-RGVEDDSKNERWVEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKT 119

Query: 120 GQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHY 179
           G+S DSRVRTSSGTFL RG+D+ IR IEKRI+DFTFIP+EHGEG+QVLHYE+GQKY+ HY
Sbjct: 120 GKSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHY 179

Query: 180 DYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSV 239
           DYF+DE+NT+NGGQR+AT+LMYLSDVEEGGETVFPAA  N+++V WWNELSECGK GLSV
Sbjct: 180 DYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSV 239

Query: 240 KPKRGDALLFWSMRPDATLDPSSLHG 265
           KPK GDALLFWSM PDATLDPSSLHG
Sbjct: 240 KPKMGDALLFWSMTPDATLDPSSLHG 265


>gi|116788056|gb|ABK24739.1| unknown [Picea sitchensis]
          Length = 303

 Score =  380 bits (977), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 191/291 (65%), Positives = 232/291 (79%), Gaps = 20/291 (6%)

Query: 15  STLTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSPPNDLTS-------FRRRAFEKRSS 67
           ST TLVL+ML ML+IVLLMLLA+GI  +P+  +   P+++++          +  ++  S
Sbjct: 14  STFTLVLTMLLMLSIVLLMLLALGIVSLPV--NSRAPDEISNGGVYSEHSGGKKLQETYS 71

Query: 68  IAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR- 126
              ++ +QW E+++WEPRA +YHNFL+K ECEYLI+LAKP+M KSTVVDS TG+SKDSR 
Sbjct: 72  NGMDEPKQWAEVLSWEPRAILYHNFLNKEECEYLINLAKPHMAKSTVVDSATGKSKDSRF 131

Query: 127 ----------VRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYD 176
                     VRTSSG FL RGQD+ IR IEKRIADFTFIP EHGEG+QVLHYEVGQKY+
Sbjct: 132 VHRWKSNDSRVRTSSGMFLNRGQDKTIRSIEKRIADFTFIPAEHGEGLQVLHYEVGQKYE 191

Query: 177 AHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQG 236
            H+DYFLDEFNTKNGGQR+AT+LMYLSDVE+GGETVFPA+  N +SV WW+ELSEC K G
Sbjct: 192 PHFDYFLDEFNTKNGGQRIATVLMYLSDVEKGGETVFPASKVNSSSVPWWDELSECAKAG 251

Query: 237 LSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           +SV+P+ GDALLFWSMRPDA LDPSSLH GCPVI+G+KWSATKW+HV EYK
Sbjct: 252 ISVRPRMGDALLFWSMRPDAELDPSSLHAGCPVIQGDKWSATKWIHVGEYK 302


>gi|224117220|ref|XP_002331751.1| predicted protein [Populus trichocarpa]
 gi|222874448|gb|EEF11579.1| predicted protein [Populus trichocarpa]
          Length = 266

 Score =  379 bits (973), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 187/288 (64%), Positives = 226/288 (78%), Gaps = 22/288 (7%)

Query: 1   MVKLRHSRLQAKKWSTLTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSPPNDLTSFRRR 60
           M K R+SR+  +K  + TL+               ++ + +         PNDL+S    
Sbjct: 1   MAKARYSRISTRKSPSSTLIRK-------------SLNVHF---------PNDLSSIAHN 38

Query: 61  AFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTG 120
           +    S   E K EQW E ++WEPRAF+YHNFL+KAEC+YLI+LAKP+M KS VVDS +G
Sbjct: 39  SKIHESGDDEGKAEQWVEAISWEPRAFIYHNFLTKAECDYLINLAKPHMQKSMVVDSSSG 98

Query: 121 QSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYD 180
           +SKDSRVRTSSGTFL RG+D+IIR IEKRIADF+FIP EHGEG+Q+LHYEVGQKY+ H+D
Sbjct: 99  KSKDSRVRTSSGTFLPRGRDKIIRDIEKRIADFSFIPSEHGEGLQILHYEVGQKYEPHFD 158

Query: 181 YFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVK 240
           YF+D++NT+NGGQR+AT+LMYLSDVEEGGETVFP+A  N +SV WWNELSECGK GLSVK
Sbjct: 159 YFMDDYNTENGGQRIATVLMYLSDVEEGGETVFPSAKGNISSVPWWNELSECGKGGLSVK 218

Query: 241 PKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           PK GDALLFWSM+PDA+LDPSSLHGGCPVIRGNKWS+TKWM V+EYKA
Sbjct: 219 PKMGDALLFWSMKPDASLDPSSLHGGCPVIRGNKWSSTKWMRVNEYKA 266


>gi|242039227|ref|XP_002467008.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
 gi|241920862|gb|EER94006.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
          Length = 307

 Score =  373 bits (958), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 174/238 (73%), Positives = 205/238 (86%)

Query: 51  PNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMV 110
           P   + FRR  F+    +  EKGE WTE+++WEPRAFVYHNFLSK EC++LI LAKP+M 
Sbjct: 70  PRPRSHFRRSTFDSGLEMRGEKGEPWTEVLSWEPRAFVYHNFLSKEECDHLISLAKPHMK 129

Query: 111 KSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYE 170
           KSTVVDS TG SKDSRVRTSSG FL+RGQD+II+ IEKRIADFTFIP+EHGEG+QVLHYE
Sbjct: 130 KSTVVDSATGASKDSRVRTSSGMFLRRGQDKIIQTIEKRIADFTFIPVEHGEGLQVLHYE 189

Query: 171 VGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELS 230
           VGQKY+ H+DYF D++NTKNGGQR+ATLLMYLSDVE+GGETVFP++  N +S  ++NELS
Sbjct: 190 VGQKYEPHFDYFHDDYNTKNGGQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELS 249

Query: 231 ECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           EC K GLSVKPK GDALLFWSM+PD ++D +SLHGGCPVI+GNKWS+TKWM VHEYKA
Sbjct: 250 ECAKGGLSVKPKMGDALLFWSMKPDGSMDSTSLHGGCPVIKGNKWSSTKWMRVHEYKA 307


>gi|449520146|ref|XP_004167095.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 249

 Score =  369 bits (948), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 174/241 (72%), Positives = 200/241 (82%), Gaps = 6/241 (2%)

Query: 49  SPP----NDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDL 104
           SPP    +  +S R  AF   S    ++G+QW E ++WEPRAFVYHNFLSK EC YLI L
Sbjct: 9   SPPETSHHRFSSVRHTAF--LSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISL 66

Query: 105 AKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGI 164
           AKP+M KSTVVD++TG++ +  VRTSSG FL RGQD+I+  IEKRIADFTFIP+EHGEG+
Sbjct: 67  AKPHMEKSTVVDNETGKNVEDSVRTSSGMFLNRGQDKIVSNIEKRIADFTFIPIEHGEGL 126

Query: 165 QVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVR 224
           Q+LHYEVGQKYDAHYD+F DEFN K  GQRMATLLMYLSDVEEGGETVFPAA  NF+SV 
Sbjct: 127 QILHYEVGQKYDAHYDFFDDEFNLKEIGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVP 186

Query: 225 WWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVH 284
           WWNELS+CGK GLSVKPK GDALLFWSM+PD TLDP+SLHG CPVIRGNKWS TKW+HV+
Sbjct: 187 WWNELSKCGKGGLSVKPKMGDALLFWSMKPDTTLDPTSLHGACPVIRGNKWSCTKWIHVN 246

Query: 285 E 285
           +
Sbjct: 247 Q 247


>gi|42567428|ref|NP_195306.2| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|332661174|gb|AEE86574.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 290

 Score =  367 bits (943), Expect = 3e-99,   Method: Compositional matrix adjust.
 Identities = 171/254 (67%), Positives = 206/254 (81%), Gaps = 3/254 (1%)

Query: 36  AMGIFYIPIGDDDSP-PNDLTSFRRRAFEKRSSIAEE--KGEQWTEIVAWEPRAFVYHNF 92
            +GIF +P  +  S  P DLT+  +   E+ S   EE   G++W E+++WEPRAFVYHNF
Sbjct: 36  GLGIFSLPSTNKTSSMPMDLTTIVQTIQERESFGDEEDGNGDRWLEVISWEPRAFVYHNF 95

Query: 93  LSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIAD 152
           L+  ECE+LI LAKP M+KS VVD KTG+S DSRVRTSSGTFL RG D I+  IE RI+D
Sbjct: 96  LTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSRVRTSSGTFLNRGHDEIVEEIENRISD 155

Query: 153 FTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETV 212
           FTFIP E+GEG+QVLHYEVGQ+Y+ H+DYF DEFN + GGQR+AT+LMYLSDV+EGGETV
Sbjct: 156 FTFIPPENGEGLQVLHYEVGQRYEPHHDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETV 215

Query: 213 FPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRG 272
           FPAA  N + V WW+ELS+CGK+GLSV PK+ DALLFWSM+PDA+LDPSSLHGGCPVI+G
Sbjct: 216 FPAAKGNVSDVPWWDELSQCGKEGLSVLPKKRDALLFWSMKPDASLDPSSLHGGCPVIKG 275

Query: 273 NKWSATKWMHVHEY 286
           NKWS+TKW HVHEY
Sbjct: 276 NKWSSTKWFHVHEY 289


>gi|115482738|ref|NP_001064962.1| Os10g0497800 [Oryza sativa Japonica Group]
 gi|78708853|gb|ABB47828.1| prolyl 4-hydroxylase alpha subunit, putative, expressed [Oryza
           sativa Japonica Group]
 gi|113639571|dbj|BAF26876.1| Os10g0497800 [Oryza sativa Japonica Group]
 gi|215767852|dbj|BAH00081.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218184821|gb|EEC67248.1| hypothetical protein OsI_34188 [Oryza sativa Indica Group]
          Length = 321

 Score =  367 bits (942), Expect = 4e-99,   Method: Compositional matrix adjust.
 Identities = 170/217 (78%), Positives = 197/217 (90%)

Query: 71  EKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTS 130
           EKGE WTE+++WEPRAF+YHNFLSK ECEYLI LAKP+M KSTVVD+ TG SKDSRVRTS
Sbjct: 104 EKGEPWTEVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSRVRTS 163

Query: 131 SGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKN 190
           SG FL RGQD+IIR IEKRI+D+TFIP+E+GEG+QVLHYEVGQKY+ H+DYF DEFNTKN
Sbjct: 164 SGMFLGRGQDKIIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNTKN 223

Query: 191 GGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFW 250
           GGQR+ATLLMYLSDVEEGGET+FP++ AN +S  ++NELSEC K+GL+VKPK GDALLFW
Sbjct: 224 GGQRIATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALLFW 283

Query: 251 SMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           SMRPD +LD +SLHGGCPVI+GNKWS+TKWM VHEYK
Sbjct: 284 SMRPDGSLDATSLHGGCPVIKGNKWSSTKWMRVHEYK 320


>gi|414870899|tpg|DAA49456.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
          Length = 364

 Score =  366 bits (939), Expect = 6e-99,   Method: Compositional matrix adjust.
 Identities = 172/228 (75%), Positives = 199/228 (87%), Gaps = 4/228 (1%)

Query: 60  RAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKT 119
           R  E R     EKGE WTE+++WEPRAFVYHNFLSK EC++LI LAKP+M KSTVVDS T
Sbjct: 140 RGLETRG----EKGEPWTEVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSAT 195

Query: 120 GQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHY 179
           G SKDSRVRTSSG FL+RGQD+IIR IEKRIAD+TFIP+E GEG+QVLHYEVGQKY+ H+
Sbjct: 196 GGSKDSRVRTSSGMFLRRGQDKIIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHF 255

Query: 180 DYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSV 239
           DYF D++NTKNGGQR+ATLLMYLSDVE+GGETVFP++  N +S  ++NELSEC K GLSV
Sbjct: 256 DYFHDDYNTKNGGQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSV 315

Query: 240 KPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           KPK GDALLFWSM+PD +LDP+SLHGGCPVI+GNKWS+TKWM VHEYK
Sbjct: 316 KPKMGDALLFWSMKPDGSLDPTSLHGGCPVIKGNKWSSTKWMRVHEYK 363


>gi|212720775|ref|NP_001131953.1| uncharacterized protein LOC100193348 [Zea mays]
 gi|194693016|gb|ACF80592.1| unknown [Zea mays]
 gi|347978798|gb|AEP37741.1| prolyl 4-hydroxylase 1 [Zea mays]
 gi|414870898|tpg|DAA49455.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
          Length = 307

 Score =  366 bits (939), Expect = 7e-99,   Method: Compositional matrix adjust.
 Identities = 171/227 (75%), Positives = 199/227 (87%)

Query: 61  AFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTG 120
           AF+       EKGE WTE+++WEPRAFVYHNFLSK EC++LI LAKP+M KSTVVDS TG
Sbjct: 80  AFDSGLETRGEKGEPWTEVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATG 139

Query: 121 QSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYD 180
            SKDSRVRTSSG FL+RGQD+IIR IEKRIAD+TFIP+E GEG+QVLHYEVGQKY+ H+D
Sbjct: 140 GSKDSRVRTSSGMFLRRGQDKIIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFD 199

Query: 181 YFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVK 240
           YF D++NTKNGGQR+ATLLMYLSDVE+GGETVFP++  N +S  ++NELSEC K GLSVK
Sbjct: 200 YFHDDYNTKNGGQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVK 259

Query: 241 PKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           PK GDALLFWSM+PD +LDP+SLHGGCPVI+GNKWS+TKWM VHEYK
Sbjct: 260 PKMGDALLFWSMKPDGSLDPTSLHGGCPVIKGNKWSSTKWMRVHEYK 306


>gi|357517881|ref|XP_003629229.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523251|gb|AET03705.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 278

 Score =  364 bits (934), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 177/289 (61%), Positives = 219/289 (75%), Gaps = 12/289 (4%)

Query: 1   MVKLRHSRLQAKKWSTLTLVLSMLFMLTIVLLMLLAMGIFY-IPIGDDDSPPNDLTSFRR 59
           MVK RHSRL  +K  +LT   +++F L +   +L+ + +   IP        N + S   
Sbjct: 1   MVKFRHSRLGPRK-PSLTASQTLIFTLFVTFTLLILILLTLRIP------KLNHINSISH 53

Query: 60  RAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKT 119
            A        +   ++W +IV+WEPRAF+YHNFL+K ECE+LI+ AKP M KS+VVD++T
Sbjct: 54  NALRSE----DNDNKRWVQIVSWEPRAFLYHNFLTKKECEHLINTAKPSMQKSSVVDNET 109

Query: 120 GQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHY 179
           G+SKDS VRTSSGTFL RG D I+R IEKRIADFTFIP+E+GE   VL YEVGQKYD H 
Sbjct: 110 GKSKDSSVRTSSGTFLDRGGDEIVRNIEKRIADFTFIPVENGESFNVLRYEVGQKYDPHL 169

Query: 180 DYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSV 239
           DYF D++NT NGGQR+AT+LMYLSDVEEGGETVFPAA  N +SV WWNELS+CGK+GLS+
Sbjct: 170 DYFADDYNTVNGGQRIATMLMYLSDVEEGGETVFPAAKGNISSVPWWNELSDCGKKGLSI 229

Query: 240 KPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           KPK GDALLFWSM+PD TLDPSSLHG CPVI+G+KWS TKWM ++E++A
Sbjct: 230 KPKMGDALLFWSMKPDGTLDPSSLHGACPVIKGDKWSCTKWMRINEFRA 278


>gi|222613083|gb|EEE51215.1| hypothetical protein OsJ_32038 [Oryza sativa Japonica Group]
          Length = 222

 Score =  364 bits (934), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 170/217 (78%), Positives = 197/217 (90%)

Query: 71  EKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTS 130
           EKGE WTE+++WEPRAF+YHNFLSK ECEYLI LAKP+M KSTVVD+ TG SKDSRVRTS
Sbjct: 5   EKGEPWTEVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSRVRTS 64

Query: 131 SGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKN 190
           SG FL RGQD+IIR IEKRI+D+TFIP+E+GEG+QVLHYEVGQKY+ H+DYF DEFNTKN
Sbjct: 65  SGMFLGRGQDKIIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNTKN 124

Query: 191 GGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFW 250
           GGQR+ATLLMYLSDVEEGGET+FP++ AN +S  ++NELSEC K+GL+VKPK GDALLFW
Sbjct: 125 GGQRIATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALLFW 184

Query: 251 SMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           SMRPD +LD +SLHGGCPVI+GNKWS+TKWM VHEYK
Sbjct: 185 SMRPDGSLDATSLHGGCPVIKGNKWSSTKWMRVHEYK 221


>gi|90704797|dbj|BAE92293.1| putative prolyl 4-hydroxylase, alpha subunit [Cryptomeria japonica]
          Length = 302

 Score =  359 bits (922), Expect = 7e-97,   Method: Compositional matrix adjust.
 Identities = 173/274 (63%), Positives = 215/274 (78%), Gaps = 6/274 (2%)

Query: 16  TLTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSP---PNDLTSFRRRAFEKRSSIAEEK 72
           T+  +L    +L++ +L +LA+ +F +P+    +P      L+      + KRS   +E 
Sbjct: 30  TVPPLLFRFTLLSVFVLFVLAIWVFSVPV--KRTPYQISRQLSESIAADYAKRSDGKDEP 87

Query: 73  GEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSG 132
            E+  E+++WEPRAF+YHNFL+K ECEYLI++AKP+MVKS VVDSKTG S DS VRTSSG
Sbjct: 88  KER-VEVLSWEPRAFLYHNFLAKDECEYLINIAKPHMVKSMVVDSKTGGSMDSNVRTSSG 146

Query: 133 TFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGG 192
            FL RGQD+IIR IEKRIADF+ IP+EHGEG+ VLHYEV QKYDAHYDYF D  N KNGG
Sbjct: 147 WFLNRGQDKIIRRIEKRIADFSHIPVEHGEGLHVLHYEVEQKYDAHYDYFSDTINVKNGG 206

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
           QR AT+LMYLSDVE+GGETVFP +  N +SV WW+ELSECG+ GLSV+PK GDALLFWS+
Sbjct: 207 QRGATMLMYLSDVEKGGETVFPQSKVNSSSVPWWDELSECGRSGLSVRPKMGDALLFWSV 266

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           +PDA+LDPSSLHG CPVI+GNKWSATKWM +++Y
Sbjct: 267 KPDASLDPSSLHGSCPVIQGNKWSATKWMRLNKY 300


>gi|302793288|ref|XP_002978409.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
 gi|300153758|gb|EFJ20395.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
          Length = 256

 Score =  356 bits (913), Expect = 8e-96,   Method: Compositional matrix adjust.
 Identities = 161/211 (76%), Positives = 188/211 (89%)

Query: 76  WTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFL 135
           WTE ++W+PRA V+HNFLS  EC++LI LA+P M +S VVD++TG+SKDSRVRTSSGTFL
Sbjct: 45  WTETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSRVRTSSGTFL 104

Query: 136 KRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
           +RGQD II  IE+RIA FTFIP EHGEG+QVLHYEVGQKYDAH+DYF D+ NTKNGGQR+
Sbjct: 105 RRGQDEIISRIEERIAKFTFIPKEHGEGLQVLHYEVGQKYDAHHDYFHDKVNTKNGGQRV 164

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           AT+LMYLSDVEEGGETVFP+A  N +SV WW+ELSECGK+G+SVKP++GDALLFWSM PD
Sbjct: 165 ATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECGKKGVSVKPRKGDALLFWSMSPD 224

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           A LDP SLHGGCPVI+GNKWSATKWMH+ EY
Sbjct: 225 AELDPFSLHGGCPVIKGNKWSATKWMHLREY 255


>gi|302773668|ref|XP_002970251.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
 gi|300161767|gb|EFJ28381.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
          Length = 256

 Score =  354 bits (909), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 160/211 (75%), Positives = 187/211 (88%)

Query: 76  WTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFL 135
           WTE ++W+PRA V+HNFLS  EC++LI LA+P M +S VVD++TG+SKDSRVRTSSGTFL
Sbjct: 45  WTETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSRVRTSSGTFL 104

Query: 136 KRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
           +RGQD II  IE+RIA FTFIP EHGEG+QVLHYEVGQKYDAH+DYF D+ NTKNGGQR+
Sbjct: 105 RRGQDEIISRIEERIAKFTFIPKEHGEGLQVLHYEVGQKYDAHHDYFHDKVNTKNGGQRV 164

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           AT+LMYLSDVEEGGETVFP+A  N +SV WW+ELSEC K+G+SVKP++GDALLFWSM PD
Sbjct: 165 ATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECAKKGVSVKPRKGDALLFWSMSPD 224

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           A LDP SLHGGCPVI+GNKWSATKWMH+ EY
Sbjct: 225 AELDPFSLHGGCPVIKGNKWSATKWMHLREY 255


>gi|222623961|gb|EEE58093.1| hypothetical protein OsJ_08962 [Oryza sativa Japonica Group]
          Length = 387

 Score =  352 bits (902), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 167/242 (69%), Positives = 196/242 (80%), Gaps = 12/242 (4%)

Query: 35  LAMGIFYIPI------------GDDDSPPNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAW 82
           +A G+F +P+               D+ P D    R RA    S    E+G QWTE+++W
Sbjct: 45  IAFGVFSLPVSAPNAATTDSAAAGGDAEPADPRPPRTRARRDLSEGLGERGAQWTEVISW 104

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           EPRAFVYHNFLSK EC+YLI LAKP+MVKSTVVDS TG+SKDSRVRTSSG FL+RG+D++
Sbjct: 105 EPRAFVYHNFLSKEECDYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKV 164

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           IR IEKRIAD+TFIPMEHGEG+QVLHYEVGQKY+ H+DYFLDE+NTKNGGQRMATLLMYL
Sbjct: 165 IRAIEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRMATLLMYL 224

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           SDVEEGGET+FP AN N +S+ W+NELSEC ++GL+VKPK GDALLFWSM+PDATLDP S
Sbjct: 225 SDVEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALLFWSMKPDATLDPLS 284

Query: 263 LH 264
           LH
Sbjct: 285 LH 286


>gi|218191856|gb|EEC74283.1| hypothetical protein OsI_09531 [Oryza sativa Indica Group]
          Length = 376

 Score =  349 bits (895), Expect = 8e-94,   Method: Compositional matrix adjust.
 Identities = 166/239 (69%), Positives = 194/239 (81%), Gaps = 12/239 (5%)

Query: 38  GIFYIPI------------GDDDSPPNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPR 85
           G+F +P+               D+ P D    R RA    S    E+G QWTE+++WEPR
Sbjct: 48  GVFSLPVSAPNAATTDSAAAGGDAEPADPRPPRTRARRDLSEGLGERGAQWTEVISWEPR 107

Query: 86  AFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRG 145
           AFVYHNFLSK EC+YLI LAKP+MVKSTVVDS TG+SKDSRVRTSSG FL+RG+D++IR 
Sbjct: 108 AFVYHNFLSKEECDYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRA 167

Query: 146 IEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDV 205
           IEKRIAD+TFIPMEHGEG+QVLHYEVGQKY+ H+DYFLDE+NTKNGGQRMATLLMYLSDV
Sbjct: 168 IEKRIADYTFIPMEHGEGLQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRMATLLMYLSDV 227

Query: 206 EEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLH 264
           EEGGET+FP AN N +S+ W+NELSEC ++GL+VKPK GDALLFWSM+PDATLDP SLH
Sbjct: 228 EEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALLFWSMKPDATLDPLSLH 286


>gi|357517897|ref|XP_003629237.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523259|gb|AET03713.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|388513409|gb|AFK44766.1| unknown [Medicago truncatula]
 gi|388516345|gb|AFK46234.1| unknown [Medicago truncatula]
          Length = 275

 Score =  349 bits (895), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 167/286 (58%), Positives = 209/286 (73%), Gaps = 11/286 (3%)

Query: 1   MVKLRHSRLQAKKWSTLTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSPPNDLTSFRRR 60
           MVK +HS +  +K S +T     L +      ++L +    IP        N L S    
Sbjct: 1   MVKFKHSNVGLRKPSLITCWTLFLTLFVTFTFLILIILTLRIP------KLNHLNSITHS 54

Query: 61  AFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTG 120
                +++  +  ++W +I++WEPRAF+YHNFL+K ECE+LI++AKP M KS V+D KTG
Sbjct: 55  -----NTLRNDDNKRWVQIISWEPRAFLYHNFLTKEECEHLINIAKPSMHKSEVIDEKTG 109

Query: 121 QSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYD 180
           +S +S +RTSSGTFL R  D I+  IEKRIADFTFIP+EHGE   VLHYEVGQKY+ HYD
Sbjct: 110 KSLNSSIRTSSGTFLDREGDEIVSNIEKRIADFTFIPVEHGESFNVLHYEVGQKYEPHYD 169

Query: 181 YFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVK 240
           YFLD F+T++ GQR+AT+LMYLSDVEEGGETVFP A  NF+SV WWNELS+CGK GLS+K
Sbjct: 170 YFLDTFSTRHAGQRIATMLMYLSDVEEGGETVFPNAKGNFSSVPWWNELSDCGKGGLSIK 229

Query: 241 PKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           PK G+A+LFWSM+PDATLDPSSLHG CPVI+G+KWS  KWMH  EY
Sbjct: 230 PKMGNAILFWSMKPDATLDPSSLHGACPVIKGDKWSCAKWMHADEY 275


>gi|357517895|ref|XP_003629236.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523258|gb|AET03712.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 326

 Score =  345 bits (885), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 152/218 (69%), Positives = 184/218 (84%)

Query: 70  EEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRT 129
           ++  ++W +I++WEPRAF+YHNFL+K ECE+LI++AKP M KS V+D +TG   DSR RT
Sbjct: 108 DDDNKRWVQIISWEPRAFLYHNFLTKEECEHLINIAKPSMHKSAVIDEETGNGVDSRERT 167

Query: 130 SSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTK 189
           SSG FLKRG DRI++ IE+RIADFTFIP+EHGE   VLHYEVGQKY+ HYDYF+D F+T 
Sbjct: 168 SSGAFLKRGSDRIVKNIERRIADFTFIPVEHGENFNVLHYEVGQKYEPHYDYFMDTFSTT 227

Query: 190 NGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLF 249
             GQR+AT+LMYLSDVEEGGETVFP A  NF+SV WWNELS+CGK GLS+KPK G+A+LF
Sbjct: 228 YAGQRIATMLMYLSDVEEGGETVFPNAKGNFSSVPWWNELSDCGKGGLSIKPKMGNAILF 287

Query: 250 WSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           WSM+PDATLDPSSLHG CPVI+G+KW   KWMHV E+K
Sbjct: 288 WSMKPDATLDPSSLHGACPVIKGDKWLCAKWMHVGEFK 325


>gi|356502598|ref|XP_003520105.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 296

 Score =  341 bits (874), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 172/298 (57%), Positives = 221/298 (74%), Gaps = 14/298 (4%)

Query: 1   MVKLRHSRLQAKKWS-------TLTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSPPND 53
           MVK R SRL  +K S        LTL+ +  F++ I+L + +         G     PND
Sbjct: 1   MVKGRQSRLGHRKPSRGSSWPVMLTLLATCSFLILILLALPILSNSNANSSGRLIIKPND 60

Query: 54  LTSFRRRAFEKRSSIAEEK----GEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYM 109
           L S    A    + I+E +    GE+W EI++WEPR F+YHNFL+K ECE+LI++AKP M
Sbjct: 61  LNSI---ALNTTTHISEAEYDQLGERWVEIISWEPRIFLYHNFLTKEECEHLINIAKPNM 117

Query: 110 VKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHY 169
            KSTV++S+TG S +SRVRTSSGTFL RG+D+I+R IE RIADFTFIP+++GE +QVLHY
Sbjct: 118 RKSTVIESETGMSIESRVRTSSGTFLARGRDKIVRNIENRIADFTFIPVDNGEELQVLHY 177

Query: 170 EVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNEL 229
           +VG+KY  H+DYF+D+ NT NGG R+AT+LMYLSDVEEGGETVFP A  NF+S+  WNEL
Sbjct: 178 QVGEKYVPHHDYFMDDINTANGGDRIATMLMYLSDVEEGGETVFPDAKGNFSSMPGWNEL 237

Query: 230 SECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           S CGK+GLS+KPK  +ALLFWS++PDAT DP SLHG CPVI+GNKWS+TKW+ + E+K
Sbjct: 238 SVCGKKGLSIKPKMRNALLFWSIKPDATYDPLSLHGSCPVIKGNKWSSTKWIRIGEHK 295


>gi|357517885|ref|XP_003629231.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523253|gb|AET03707.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 279

 Score =  338 bits (868), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 168/289 (58%), Positives = 217/289 (75%), Gaps = 13/289 (4%)

Query: 1   MVKLRHSRLQAKKW---STLTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSPPNDLTSF 57
           M KL+HSR+  +K    ++ TL+ + LF+  I L++ L +    IP       P  L S 
Sbjct: 1   MGKLKHSRVGPRKPLLPTSRTLIFT-LFVTFIFLIIFLILLSLRIP------KPKHLNSI 53

Query: 58  RRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDS 117
                 +R    ++  ++W EIV+WEPR F+YHNFL+K ECE+LI++AKP + KSTVVD 
Sbjct: 54  THINNLRRD---DDDNKRWVEIVSWEPRVFLYHNFLAKEECEHLINIAKPDVQKSTVVDD 110

Query: 118 KTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDA 177
            TG+S +S  RTSSGTF+ RG D+I+  IEKRIADFTFIP+EHGE + +LHYEVGQKYD 
Sbjct: 111 TTGKSVNSSARTSSGTFIDRGYDKILSDIEKRIADFTFIPVEHGEDVNILHYEVGQKYDF 170

Query: 178 HYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGL 237
           H DYF DE NTK+GG+R+AT+LMYLSDVEEGGETVFP+A  NF+SV WWNELS+CGK+GL
Sbjct: 171 HTDYFEDEVNTKHGGERIATMLMYLSDVEEGGETVFPSAKGNFSSVPWWNELSDCGKKGL 230

Query: 238 SVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           S+KPK G+A+LFW M+PDAT+DP S+HG CPVI+G+KWS TKWM V ++
Sbjct: 231 SIKPKMGNAILFWGMKPDATVDPLSVHGACPVIKGDKWSCTKWMRVGKW 279


>gi|302762452|ref|XP_002964648.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
 gi|300168377|gb|EFJ34981.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
          Length = 225

 Score =  337 bits (863), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 155/223 (69%), Positives = 184/223 (82%)

Query: 65  RSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD 124
           R  + E K E W+EI++W PRA + HNFL+  EC++LI +A P M KSTVVDS+TG S+D
Sbjct: 2   REEVGEGKHEPWSEIISWTPRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRD 61

Query: 125 SRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD 184
           SRVRTSSG FL RGQDR+I  IE +IA  TFIP +HGEGIQVLHYE GQKYDAH+D+F D
Sbjct: 62  SRVRTSSGMFLNRGQDRVISEIEDKIAKLTFIPKDHGEGIQVLHYEPGQKYDAHHDFFYD 121

Query: 185 EFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRG 244
             NT+NGGQR+ATLLMYL+DVEEGGETVFP +  N +S+ W N+LSECG++G+SV+PKRG
Sbjct: 122 TVNTRNGGQRIATLLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRG 181

Query: 245 DALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           DALLFWSM PDA LD SSLHGGCPVI+G+KWSATKWM V EYK
Sbjct: 182 DALLFWSMSPDAQLDHSSLHGGCPVIKGDKWSATKWMRVSEYK 224


>gi|356502610|ref|XP_003520111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 286

 Score =  336 bits (861), Expect = 9e-90,   Method: Compositional matrix adjust.
 Identities = 156/258 (60%), Positives = 196/258 (75%), Gaps = 14/258 (5%)

Query: 33  MLLAMGIFYIPIGDDDSPPNDLTSFRRRAFEKRSSIAEEKGEQWT---EIVAWEPRAFVY 89
           +LLA+ I   P  + +S           +  + + I  E+ +Q     E+++W+PRAF+Y
Sbjct: 38  ILLALHILSTPHANANS-----------SVSRNTHIEAEEDDQVALRMEVISWQPRAFLY 86

Query: 90  HNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKR 149
           HNFL+K ECEYLI++A P+M KSTV D+++GQS    VR S+G FL RGQD I+R IEKR
Sbjct: 87  HNFLTKEECEYLINIATPHMQKSTVADNQSGQSVVHDVRKSTGAFLDRGQDEIVRNIEKR 146

Query: 150 IADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGG 209
           IAD TFIP+E+GE I V+HYEVGQ YD HYDYF+D+FN +NGGQR+AT+LMYLS+VEEGG
Sbjct: 147 IADVTFIPIENGEPIYVIHYEVGQYYDPHYDYFIDDFNIENGGQRIATMLMYLSNVEEGG 206

Query: 210 ETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPV 269
           ET+FP A ANF+SV WWNELS CGK GLS+KPK GDALLFWSM+P+ATLD  +LH  CPV
Sbjct: 207 ETMFPRAKANFSSVPWWNELSNCGKMGLSIKPKMGDALLFWSMKPNATLDALTLHSACPV 266

Query: 270 IRGNKWSATKWMHVHEYK 287
           I+GNKWS TKWMH  E+K
Sbjct: 267 IKGNKWSCTKWMHPTEFK 284


>gi|168002780|ref|XP_001754091.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694645|gb|EDQ80992.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 214

 Score =  335 bits (858), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 150/214 (70%), Positives = 185/214 (86%)

Query: 75  QWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTF 134
           +W E+++WEPRAF+YH+FL++ EC +LI++A+P +VKSTVVDS TG+SKDSR+RTSSGTF
Sbjct: 1   RWVEVLSWEPRAFLYHHFLTEEECNHLIEVARPSLVKSTVVDSDTGKSKDSRLRTSSGTF 60

Query: 135 LKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
           L RGQD +I+ IEKRIADFTFIP E GEG+QVL Y+  +KY+ HYDYF D +NTKNGGQR
Sbjct: 61  LMRGQDPVIKRIEKRIADFTFIPAEQGEGLQVLQYKESEKYEPHYDYFHDAYNTKNGGQR 120

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +AT+LMYLS+VEEGGETVFPAA  N T V  W++LSEC ++GLSV+P+ GDALLFWSM+P
Sbjct: 121 IATVLMYLSNVEEGGETVFPAAQVNKTEVPDWDKLSECAQKGLSVRPRMGDALLFWSMKP 180

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           DATLD +SLHGGCPVI+G KWSATKW+HV  Y A
Sbjct: 181 DATLDSTSLHGGCPVIKGTKWSATKWLHVENYAA 214


>gi|168060785|ref|XP_001782374.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666166|gb|EDQ52828.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 211

 Score =  334 bits (856), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 149/211 (70%), Positives = 182/211 (86%)

Query: 76  WTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFL 135
           W E+++WEPRAF+YH+FL++ EC +LI++AKP +VKSTV+DS TG+SKDSRVRTSSGTFL
Sbjct: 1   WVEVLSWEPRAFLYHHFLTQVECNHLIEVAKPSLVKSTVIDSATGKSKDSRVRTSSGTFL 60

Query: 136 KRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
            RGQD II+ IEKRIADFTFIP+E GEG+QVL Y   +KY+ HYDYF D FNTKNGGQR+
Sbjct: 61  VRGQDHIIKRIEKRIADFTFIPVEQGEGLQVLQYRESEKYEPHYDYFHDAFNTKNGGQRI 120

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           AT+LMYLSDVE+GGETVFPA+  N + V  W++ SEC K+GLSV+P+ GDALLFWSM+PD
Sbjct: 121 ATVLMYLSDVEKGGETVFPASKVNASEVPDWDQRSECAKRGLSVRPRMGDALLFWSMKPD 180

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           A LDP+SLHG CPVI+G KWSATKW+HV +Y
Sbjct: 181 AKLDPTSLHGACPVIQGTKWSATKWLHVEKY 211


>gi|388520325|gb|AFK48224.1| unknown [Lotus japonicus]
          Length = 188

 Score =  333 bits (855), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 151/187 (80%), Positives = 175/187 (93%)

Query: 101 LIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEH 160
           +I+LAKP+M KS+VVDS+TG+S  SRVRTSSG FLKRG+D++I+ IEKRIADF FIP+E+
Sbjct: 1   MINLAKPHMAKSSVVDSQTGKSVGSRVRTSSGMFLKRGKDKVIQTIEKRIADFAFIPVEN 60

Query: 161 GEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANF 220
           GEG+QVLHYEVGQKY+ HYDYFLDEFNTKNGGQR+AT+LMYLSDVEEGGET+FPAA ANF
Sbjct: 61  GEGLQVLHYEVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVEEGGETIFPAAKANF 120

Query: 221 TSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKW 280
           +SV W+N+LS C K+GLSVKPKRGDALLFWS+RPDATLDPSSLHGGCPVIRGNKWS+TKW
Sbjct: 121 SSVPWYNDLSVCAKKGLSVKPKRGDALLFWSIRPDATLDPSSLHGGCPVIRGNKWSSTKW 180

Query: 281 MHVHEYK 287
           MH+ EYK
Sbjct: 181 MHLEEYK 187


>gi|302815629|ref|XP_002989495.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
 gi|300142673|gb|EFJ09371.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
          Length = 213

 Score =  330 bits (847), Expect = 3e-88,   Method: Compositional matrix adjust.
 Identities = 151/212 (71%), Positives = 179/212 (84%)

Query: 76  WTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFL 135
           W+EI++W PRA + HNFL+  EC++LI +A P M KSTVVDS+TG S+DSRVRTSSG FL
Sbjct: 1   WSEIISWTPRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRDSRVRTSSGMFL 60

Query: 136 KRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
            RGQDR+I  IE +IA  TFIP +HGEGIQVLHYE GQKYDAH+D+F D  NT+NGGQR+
Sbjct: 61  NRGQDRVISEIEDKIAKLTFIPKDHGEGIQVLHYEPGQKYDAHHDFFYDTVNTRNGGQRI 120

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           ATLLMYL+DVEEGGETVFP +  N +S+ W N+LSECG++G+SV+PKRGDALLFWSM PD
Sbjct: 121 ATLLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMSPD 180

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           A LD SSLHGGCPVI+G+KWSATKWM V EYK
Sbjct: 181 AQLDHSSLHGGCPVIKGDKWSATKWMRVSEYK 212


>gi|238007346|gb|ACR34708.1| unknown [Zea mays]
          Length = 180

 Score =  330 bits (845), Expect = 6e-88,   Method: Compositional matrix adjust.
 Identities = 150/180 (83%), Positives = 169/180 (93%)

Query: 109 MVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLH 168
           MVKSTVVDS TG+SKDSRVRTSSG FL+RG+D++IR IEKRIAD+TFIP++HGEG+QVLH
Sbjct: 1   MVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRVIEKRIADYTFIPVDHGEGLQVLH 60

Query: 169 YEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNE 228
           YEVGQKY+ H+DYFLDEFNTKNGGQRMATLLMYLSDVEEGGET+FP AN N +S+ W+NE
Sbjct: 61  YEVGQKYEPHFDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETIFPDANVNVSSLPWYNE 120

Query: 229 LSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           LSEC K+GLSVKPK GDALLFWSM+PDATLDP SLHGGCPVIRGNKWS+TKWMH+HEYKA
Sbjct: 121 LSECAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHEYKA 180


>gi|18071415|gb|AAL58274.1|AC068923_16 putative prolyl 4-hydroxylase, alpha subunit [Oryza sativa Japonica
           Group]
          Length = 343

 Score =  327 bits (838), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 152/200 (76%), Positives = 179/200 (89%)

Query: 71  EKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTS 130
           EKGE WTE+++WEPRAF+YHNFLSK ECEYLI LAKP+M KSTVVD+ TG SKDSRVRTS
Sbjct: 104 EKGEPWTEVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSRVRTS 163

Query: 131 SGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKN 190
           SG FL RGQD+IIR IEKRI+D+TFIP+E+GEG+QVLHYEVGQKY+ H+DYF DEFNTKN
Sbjct: 164 SGMFLGRGQDKIIRTIEKRISDYTFIPVENGEGLQVLHYEVGQKYEPHFDYFHDEFNTKN 223

Query: 191 GGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFW 250
           GGQR+ATLLMYLSDVEEGGET+FP++ AN +S  ++NELSEC K+GL+VKPK GDALLFW
Sbjct: 224 GGQRIATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALLFW 283

Query: 251 SMRPDATLDPSSLHGGCPVI 270
           SMRPD +LD +SLHG  P++
Sbjct: 284 SMRPDGSLDATSLHGEIPIL 303


>gi|224033439|gb|ACN35795.1| unknown [Zea mays]
          Length = 180

 Score =  326 bits (835), Expect = 7e-87,   Method: Compositional matrix adjust.
 Identities = 147/180 (81%), Positives = 169/180 (93%)

Query: 109 MVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLH 168
           MVKSTVVDS TG+SKDSRVRTSSG FL+RG+D++IR IEKRIAD+TFIP++HGEG+QVLH
Sbjct: 1   MVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRAIEKRIADYTFIPVDHGEGLQVLH 60

Query: 169 YEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNE 228
           YEVGQKY+ H+DYFLDEFNTKNGGQR+ATLLMYLSDVEEGGET+FP AN N +S+ W+NE
Sbjct: 61  YEVGQKYEPHFDYFLDEFNTKNGGQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNE 120

Query: 229 LSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           LS+C K+GLSVKPK GDALLFWSM+PDATLDP SLHGGCPVI+GNKWS+TKWMH+HEYKA
Sbjct: 121 LSDCAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKWSSTKWMHIHEYKA 180


>gi|363543299|ref|NP_001241865.1| prolyl 4-hydroxylase 5-1 [Zea mays]
 gi|347978814|gb|AEP37749.1| prolyl 4-hydroxylase 5-1 [Zea mays]
          Length = 180

 Score =  325 bits (832), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 148/180 (82%), Positives = 168/180 (93%)

Query: 109 MVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLH 168
           MVKSTVVDS TG+SKDSRVRTSSG FL+RG+D++IR IEKRI D+TFIP++HGEG+QVLH
Sbjct: 1   MVKSTVVDSTTGKSKDSRVRTSSGMFLQRGRDKVIRVIEKRITDYTFIPVDHGEGLQVLH 60

Query: 169 YEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNE 228
           YEVGQKY+ H+DYFLDEFNTKNGGQRMATLLM+LSDVEEGGET+FP AN N +S+ W+NE
Sbjct: 61  YEVGQKYEPHFDYFLDEFNTKNGGQRMATLLMHLSDVEEGGETIFPDANVNDSSLPWYNE 120

Query: 229 LSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           LSEC K+GLSVKPK GDALLFWSM+PDATLDP SLHGGCPVIRGNKWS+TKWMH+HEYKA
Sbjct: 121 LSECAKRGLSVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIRGNKWSSTKWMHIHEYKA 180


>gi|334188665|ref|NP_001190630.1| iron ion binding / oxidoreductase/ oxidoreductase protein
           [Arabidopsis thaliana]
 gi|332010771|gb|AED98154.1| iron ion binding / oxidoreductase/ oxidoreductase protein
           [Arabidopsis thaliana]
          Length = 243

 Score =  321 bits (823), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 157/244 (64%), Positives = 191/244 (78%), Gaps = 4/244 (1%)

Query: 1   MVKLRHSRLQAKKWSTLTLVLSMLFMLTIVLLMLLAMGIFYIPIGD-DDSPPNDLTSFRR 59
           M + R+ R  A+K S  TLV ++L M T V+L+LLA GI  +P  +   S  NDLTS  R
Sbjct: 2   MARPRNHRPSARKSSHSTLVFAVLIMSTFVILILLAFGILSVPSNNAGSSKANDLTSIVR 61

Query: 60  RAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKT 119
           +  + RS   + K E+W EI++WEPRA VYHNFL   EC+YLI+LAKP+M KSTVVD KT
Sbjct: 62  KTLQ-RSGEDDSKNERWVEIISWEPRASVYHNFLE--ECKYLIELAKPHMEKSTVVDEKT 118

Query: 120 GQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHY 179
           G+S DSRVRTSSGTFL RG+D+ IR IEKRI+DFTFIP+EHGEG+QVLHYE+GQKY+ HY
Sbjct: 119 GKSTDSRVRTSSGTFLARGRDKTIREIEKRISDFTFIPVEHGEGLQVLHYEIGQKYEPHY 178

Query: 180 DYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSV 239
           DYF+DE+NT+NGGQR+AT+LMYLSDVEEGGETVFPAA  N+++V WWNELSECGK G   
Sbjct: 179 DYFMDEYNTRNGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGWVC 238

Query: 240 KPKR 243
             KR
Sbjct: 239 CYKR 242


>gi|363807814|ref|NP_001242181.1| uncharacterized protein LOC100782154 [Glycine max]
 gi|255644463|gb|ACU22735.1| unknown [Glycine max]
          Length = 285

 Score =  317 bits (811), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 148/255 (58%), Positives = 192/255 (75%), Gaps = 5/255 (1%)

Query: 28  TIVLLMLLAMGIFYIPIGDDDSP---PNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEP 84
           + ++L+ LA+ I   P  +  S    PNDL S  R      S     + ++W E+++WEP
Sbjct: 33  SFLILIPLALRILSTPHVNSSSALSKPNDLNSVPRNT--HVSEGENNRVKRWVEVMSWEP 90

Query: 85  RAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIR 144
           RAF+YHNFL+K ECEYLI+ A P M+KS V+D+++G+  ++  RTS+   ++RG+D+I+R
Sbjct: 91  RAFLYHNFLTKEECEYLINTATPNMLKSLVIDNESGEGIETSYRTSTEYVVERGKDKIVR 150

Query: 145 GIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSD 204
            IEKRIAD TFIP+EHGE + V+ Y VGQ Y+ H DYF +EF+  NGGQR+AT+LMYLS+
Sbjct: 151 NIEKRIADVTFIPIEHGEPLHVIRYAVGQYYEPHVDYFEEEFSLVNGGQRIATMLMYLSN 210

Query: 205 VEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLH 264
           VE GGETVFP ANANF+SV WWNELSECG+ GLS+KPK GDALLFWSM+PDATLDP +LH
Sbjct: 211 VEGGGETVFPIANANFSSVPWWNELSECGQTGLSIKPKMGDALLFWSMKPDATLDPLTLH 270

Query: 265 GGCPVIRGNKWSATK 279
             CPVI+GNKWS TK
Sbjct: 271 RACPVIKGNKWSCTK 285


>gi|3805847|emb|CAA21467.1| putative protein [Arabidopsis thaliana]
 gi|7270533|emb|CAB81490.1| putative protein [Arabidopsis thaliana]
          Length = 307

 Score =  308 bits (789), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 153/264 (57%), Positives = 186/264 (70%), Gaps = 34/264 (12%)

Query: 36  AMGIFYIPIGDDDSP-PNDLTSFRRRAFEKRSSIAEE--KGEQWTEIVAWEPRAFVYHNF 92
            +GIF +P  +  S  P DLT+  +   E+ S   EE   G++W E+++WEPRAFVYHNF
Sbjct: 36  GLGIFSLPSTNKTSSMPMDLTTIVQTIQERESFGDEEDGNGDRWLEVISWEPRAFVYHNF 95

Query: 93  LSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDS--------------------------- 125
           L+  ECE+LI LAKP M+KS VVD KTG+S DS                           
Sbjct: 96  LTNEECEHLISLAKPSMMKSKVVDVKTGKSIDSRFCTLTSVVVFTFQLNLERFENSKFAN 155

Query: 126 ----RVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDY 181
               RVRTSSGTFL RG D I+  IE RI+DFTFIP E+GEG+QVLHYEVGQ+Y+ H+DY
Sbjct: 156 PSLCRVRTSSGTFLNRGHDEIVEEIENRISDFTFIPPENGEGLQVLHYEVGQRYEPHHDY 215

Query: 182 FLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKP 241
           F DEFN + GGQR+AT+LMYLSDV+EGGETVFPAA  N + V WW+ELS+CGK+GLSV P
Sbjct: 216 FFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGNVSDVPWWDELSQCGKEGLSVLP 275

Query: 242 KRGDALLFWSMRPDATLDPSSLHG 265
           K+ DALLFWSM+PDA+LDPSSLHG
Sbjct: 276 KKRDALLFWSMKPDASLDPSSLHG 299


>gi|357467075|ref|XP_003603822.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492870|gb|AES74073.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 683

 Score =  302 bits (773), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 142/217 (65%), Positives = 172/217 (79%), Gaps = 6/217 (2%)

Query: 71  EKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTS 130
           E+ +QWTEI++  PRA +YHNFLSK ECE+LI+LAKP+M +S VVD  TG+ K+S  RTS
Sbjct: 106 ERKDQWTEILSSVPRASMYHNFLSKEECEHLINLAKPFMARSLVVDGVTGEVKESSSRTS 165

Query: 131 SGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKN 190
           SG FL RG+D+I++ IE+RIAD T +P+E+GEG+ V+HY VGQK + HYDY  D   TKN
Sbjct: 166 SGMFLDRGKDKIVQNIERRIADITSVPIENGEGLHVIHYGVGQKCEPHYDYTSDGVVTKN 225

Query: 191 GGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFW 250
           GG R+AT+LMYLSDVEEGGETVFP A  NFTSV      S+C   GLSVKPK GDALLFW
Sbjct: 226 GGPRVATVLMYLSDVEEGGETVFPDAQPNFTSV------SKCSGDGLSVKPKMGDALLFW 279

Query: 251 SMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           SM+PD TLD SSLHGG PVIRGNKW++TKW+H+ E K
Sbjct: 280 SMKPDGTLDTSSLHGGSPVIRGNKWASTKWLHLRECK 316



 Score =  210 bits (535), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 111/200 (55%), Positives = 135/200 (67%), Gaps = 27/200 (13%)

Query: 92  FLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIA 151
           F SK ECE+LI+LAKP+M +S VVD  TG+ ++S  RTSSG FL+RG+D+I++ IE+RIA
Sbjct: 372 FGSKEECEHLINLAKPFMTRSLVVDGLTGKGRESSARTSSGRFLERGKDKIVQNIEQRIA 431

Query: 152 DFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL---DEFNTKNGGQRMATLLMYLSDVEEG 208
           D T IP                      D+ L       TKNGG R+AT+LMYLSDVEEG
Sbjct: 432 DITSIPR------------------MARDFMLFTAGGVVTKNGGPRVATVLMYLSDVEEG 473

Query: 209 GETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCP 268
           GETVFP A  N  SV  + E      +GLSVKPK GDALLF SM+PD TLD SSLHGG P
Sbjct: 474 GETVFPNAKPNINSVSKYPE------KGLSVKPKMGDALLFRSMKPDGTLDTSSLHGGSP 527

Query: 269 VIRGNKWSATKWMHVHEYKA 288
           VIRGNKW++TKW+H+ E+K 
Sbjct: 528 VIRGNKWASTKWLHLTEFKV 547


>gi|357467077|ref|XP_003603823.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492871|gb|AES74074.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 291

 Score =  286 bits (732), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 136/224 (60%), Positives = 179/224 (79%), Gaps = 6/224 (2%)

Query: 65  RSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD 124
           ++S++E K +QWTE+++ EPRA +YHNFLSK ECE+LI+LAKP+M +S VVD  TGQ   
Sbjct: 74  KTSLSERK-DQWTEVLSSEPRASMYHNFLSKEECEHLINLAKPFMQRSLVVDGVTGQGIL 132

Query: 125 SRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD 184
           + VRTSSGTFL+RG+D+I++ +E+RIAD T IP+E+GEG+Q++HYEVGQK++ HYDY  +
Sbjct: 133 NSVRTSSGTFLERGKDKIVQNVERRIADITSIPIENGEGLQIIHYEVGQKFEPHYDYNFN 192

Query: 185 EFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRG 244
              T NGG R+AT+LMYLSDVEEGGETVFP A  NF SV  ++     GK GL VKPK G
Sbjct: 193 WRITNNGGPRVATVLMYLSDVEEGGETVFPNAKPNFNSVSKYHP----GK-GLVVKPKMG 247

Query: 245 DALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           DALLFWS++PD +LD +SLHGG PVIRG+KW++ K +H+ E+K 
Sbjct: 248 DALLFWSVKPDGSLDTASLHGGSPVIRGSKWASNKLLHLTEFKV 291


>gi|357483927|ref|XP_003612250.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355513585|gb|AES95208.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 204

 Score =  284 bits (726), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 137/204 (67%), Positives = 165/204 (80%), Gaps = 3/204 (1%)

Query: 1   MVKLRHSRLQAKKWST-LTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSP-PNDLTSFR 58
           M K R+SRL ++K S+  TL+ S+    T ++L+LL  GI  IP  + + P PNDLTS  
Sbjct: 1   MAKSRYSRLPSRKSSSPYTLIFSLFIAFTFLILILLVFGILSIPSSNQNLPKPNDLTSIV 60

Query: 59  RRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSK 118
               + R+   E KGEQW E+V+WEPRAFVYHNFL+K ECEYLID+AKP M KSTVVDS+
Sbjct: 61  HNTVD-RNDDEEGKGEQWVEVVSWEPRAFVYHNFLTKEECEYLIDIAKPSMHKSTVVDSE 119

Query: 119 TGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAH 178
           TG+SKDSRVRTSSGTFL RG+D+I+R IEK+IADFTFIP+EHGEG+QVLHYEVGQKY+ H
Sbjct: 120 TGKSKDSRVRTSSGTFLARGRDKIVRNIEKKIADFTFIPVEHGEGLQVLHYEVGQKYEPH 179

Query: 179 YDYFLDEFNTKNGGQRMATLLMYL 202
           YDYFLDEFNTKNGGQR+AT+LMYL
Sbjct: 180 YDYFLDEFNTKNGGQRIATVLMYL 203


>gi|357467087|ref|XP_003603828.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492876|gb|AES74079.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 156

 Score =  280 bits (716), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 126/155 (81%), Positives = 144/155 (92%)

Query: 134 FLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQ 193
           FLKRG+D+II+ IE+RIADFTFIP+E+GEG+QVLHY VG+KY+ HYDYFLDEFNTKNGGQ
Sbjct: 2   FLKRGKDKIIQNIERRIADFTFIPVENGEGLQVLHYGVGEKYEPHYDYFLDEFNTKNGGQ 61

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+AT+LMYLSDVEEGGETVFPAA ANF+SV WWN+LSEC ++GLS+KPK GDALLFWSMR
Sbjct: 62  RVATVLMYLSDVEEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPKMGDALLFWSMR 121

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           PDATLD SSLHGGCPVI GNKWS+TKWMH+ EYK 
Sbjct: 122 PDATLDASSLHGGCPVIVGNKWSSTKWMHLEEYKV 156


>gi|255633460|gb|ACU17088.1| unknown [Glycine max]
          Length = 207

 Score =  272 bits (696), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 133/208 (63%), Positives = 166/208 (79%), Gaps = 4/208 (1%)

Query: 1   MVKLRHSRLQAKKWSTL-TLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDS--PPNDLTSF 57
           M K R+SRL  +K S+  TL+L++  + T ++L+LLA+GI  IP     +   PNDL S 
Sbjct: 1   MAKPRYSRLPPRKSSSSSTLILTLFLVFTFLVLILLALGILSIPSSSRGNLPKPNDLASI 60

Query: 58  RRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDS 117
            R      S   + +GEQW E+V+WEPRAFVYHNFL+K ECEYLID+AKP M KS+VVDS
Sbjct: 61  ARNTIHT-SDDDDVRGEQWVEVVSWEPRAFVYHNFLTKEECEYLIDIAKPNMHKSSVVDS 119

Query: 118 KTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDA 177
           +TG+SKDSRVRTSSGTFL RG+D+I+R IEKRIA ++FIP+EHGEG+QVLHYEVGQKY+ 
Sbjct: 120 ETGKSKDSRVRTSSGTFLARGRDKIVRDIEKRIAHYSFIPVEHGEGLQVLHYEVGQKYEP 179

Query: 178 HYDYFLDEFNTKNGGQRMATLLMYLSDV 205
           HYDYFLD+FNTKNGGQR+AT+LMYL+DV
Sbjct: 180 HYDYFLDDFNTKNGGQRIATVLMYLTDV 207


>gi|384251901|gb|EIE25378.1| hypothetical protein COCSUDRAFT_35772 [Coccomyxa subellipsoidea
           C-169]
          Length = 222

 Score =  268 bits (685), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 119/213 (55%), Positives = 165/213 (77%), Gaps = 6/213 (2%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+++WEPRA++YHNFL++AE +YL+   KP+M KS VVD++TG+S  S+VRTSSG FL R
Sbjct: 2   EVLSWEPRAYLYHNFLTEAEADYLVQKGKPHMEKSEVVDNETGKSAPSKVRTSSGMFLNR 61

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMAT 197
           G+D +I  IE RIA +T IP E+GEG+Q+LHY+  ++Y  H+DYF D FNT+NGGQR+AT
Sbjct: 62  GEDDVIERIEARIAKYTAIPKENGEGLQILHYQASEEYRPHFDYFHDNFNTQNGGQRIAT 121

Query: 198 LLMYLSDVEEGGETVFPAAN--ANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           +LMYLSDVE+GGETVFP ++   N  + ++    S+C + G + KPK+GDAL F+S+ PD
Sbjct: 122 MLMYLSDVEDGGETVFPESSDKPNVGNTKF----SQCAQAGAAAKPKKGDALFFYSLTPD 177

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
             +D  SLH GCPV++G+KWSATKW+ V  ++A
Sbjct: 178 GRMDEKSLHAGCPVMKGDKWSATKWLRVDRFEA 210


>gi|357517893|ref|XP_003629235.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523257|gb|AET03711.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 196

 Score =  267 bits (683), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 124/193 (64%), Positives = 152/193 (78%), Gaps = 14/193 (7%)

Query: 94  SKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADF 153
           +K ECE+LI++AKP M KSTV D +TG+S D+  RTSSGTF+ RG D+I+R IE+RIADF
Sbjct: 14  TKEECEHLINIAKPSMHKSTV-DDETGKSVDNSARTSSGTFINRGHDKILRNIEQRIADF 72

Query: 154 TFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVF 213
           TFIP+E+GE + +LHYEVGQKY+ H D+F DE NTKNGG             E+GGETVF
Sbjct: 73  TFIPVENGESVNILHYEVGQKYEPHPDFFTDEINTKNGG-------------EQGGETVF 119

Query: 214 PAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGN 273
           P A  NF+SV WWNELS+CGK+GLS+KPK GDALLFWSM+PD TLDP S+HG CPVI+G+
Sbjct: 120 PFAEGNFSSVPWWNELSDCGKKGLSIKPKMGDALLFWSMKPDGTLDPLSMHGACPVIKGD 179

Query: 274 KWSATKWMHVHEY 286
           KWS TKWM V ++
Sbjct: 180 KWSCTKWMRVGKW 192


>gi|359477455|ref|XP_002278454.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Vitis
           vinifera]
          Length = 296

 Score =  265 bits (677), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 119/207 (57%), Positives = 162/207 (78%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W+PRAFVY  FLS+ EC++LI LAK  + +S V D+ +G+S+ S VRTSSG F+ +G+
Sbjct: 40  ISWKPRAFVYEGFLSEEECDHLISLAKSELKRSAVADNVSGKSRLSEVRTSSGMFIGKGK 99

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +IA +TF+P ++GE +QVL YE GQKYDAHYDYF+D+ N   GG R+AT+L
Sbjct: 100 DPIVAGIEDKIAAWTFLPKDNGEDMQVLRYEPGQKYDAHYDYFVDKVNIARGGHRIATVL 159

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLSDV +GGETVFP A  + +++   ++LSEC ++G++VKP++GDALLF+S+ P A  D
Sbjct: 160 MYLSDVVKGGETVFPMAEVSSSTLPTNDDLSECARKGIAVKPRKGDALLFFSLHPTAIPD 219

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEY 286
           P SLHGGCPVI G KWSATKW+HV  +
Sbjct: 220 PMSLHGGCPVIEGEKWSATKWIHVDSF 246


>gi|215490183|dbj|BAG86625.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
          Length = 318

 Score =  262 bits (670), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 118/208 (56%), Positives = 160/208 (76%), Gaps = 1/208 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAFVY NFL+  EC++ I LAK  + KS V D+++G+S +S VRTSSG F ++ Q
Sbjct: 65  ISWRPRAFVYRNFLTDEECDHFITLAKHKLEKSMVADNESGKSVESEVRTSSGMFFRKAQ 124

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D+++  +E RIA +TF+P E+GE IQ+LHYE GQKY+ H+DYF D+ N + GG R+AT+L
Sbjct: 125 DQVVANVEARIAAWTFLPEENGESIQILHYEHGQKYEPHFDYFHDKVNQELGGHRVATVL 184

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLSDVE+GGETVFP + A  T  +  ++ S+C K+G +VKP++GDALLF+S+ PDAT D
Sbjct: 185 MYLSDVEKGGETVFPNSEAKKTQAK-GDDWSDCAKKGYAVKPRKGDALLFFSLHPDATTD 243

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           P SLHG CPVI G KWSATKW+HV  ++
Sbjct: 244 PLSLHGSCPVIEGEKWSATKWIHVRSFE 271


>gi|215490181|dbj|BAG86624.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
          Length = 294

 Score =  261 bits (668), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 124/228 (54%), Positives = 167/228 (73%), Gaps = 2/228 (0%)

Query: 61  AFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTG 120
           AF + SS +        + ++W+PRAFVY  FL+  EC +LI LAK  + +S V D+++G
Sbjct: 17  AFVRESSSSAIINPSKAKQISWKPRAFVYEGFLTDEECNHLISLAKSELKRSAVADNESG 76

Query: 121 QSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYD 180
            SK S VRTSSG F+ + +D I+ GIE++IA +TF+P E+GE IQVL YE GQKY+ HYD
Sbjct: 77  NSKTSEVRTSSGMFIPKAKDPIVSGIEEKIATWTFLPKENGEEIQVLRYEEGQKYEPHYD 136

Query: 181 YFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAA--NANFTSVRWWNELSECGKQGLS 238
           YF+D+ N   GG R+AT+LMYL++VE+GGETVFP A  +    S+   + LSEC K+G+ 
Sbjct: 137 YFVDKVNIARGGHRLATVLMYLTNVEKGGETVFPKAEESPRRRSMIADDSLSECAKKGIP 196

Query: 239 VKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           VKP++GDALLF+S+ P+AT DP SLHGGCPVI+G KWSATKW+HV  +
Sbjct: 197 VKPRKGDALLFYSLHPNATPDPLSLHGGCPVIQGEKWSATKWIHVDSF 244


>gi|357125236|ref|XP_003564301.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 293

 Score =  261 bits (666), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 126/208 (60%), Positives = 162/208 (77%), Gaps = 3/208 (1%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAF+Y  FLS AEC++L+ LAK  + KS V D+ +G+S  S+VRTSSGTFL + +
Sbjct: 34  LSWRPRAFLYSGFLSHAECDHLVKLAKGRLQKSMVADNDSGKSVMSQVRTSSGTFLNKHE 93

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D II GIEKR+A +TF+P E+ E IQVLHYEVGQKYDAH+DYF D+ N K GG R+AT+L
Sbjct: 94  DEIISGIEKRVAAWTFLPEENAESIQVLHYEVGQKYDAHFDYFHDKNNQKLGGHRVATVL 153

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNEL-SECGKQGLSVKPKRGDALLFWSMRPDATL 258
           MYL+DV++GGETVFP  NA    ++  +E  SEC + GL+VKP++GDALLF+S+  +AT 
Sbjct: 154 MYLTDVKKGGETVFP--NAEGRHLQHKDETWSECARSGLAVKPRKGDALLFFSLHINATT 211

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           DPSSLHG CPVI G KWSATKW+HV  +
Sbjct: 212 DPSSLHGSCPVIEGEKWSATKWIHVRSF 239


>gi|255551575|ref|XP_002516833.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223543921|gb|EEF45447.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 297

 Score =  261 bits (666), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 123/209 (58%), Positives = 157/209 (75%), Gaps = 2/209 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+W+PRAFVY  FL+  EC++LI LAK  + +S V D+++G+SK S VRTSSG F+ +G+
Sbjct: 39  VSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNESGKSKLSEVRTSSGMFIAKGK 98

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D II GIE++I+ +TF+P E+GE +QVL YE GQKYD HYDYF D+ N   GG RMAT+L
Sbjct: 99  DPIIAGIEEKISTWTFLPKENGEDLQVLRYEHGQKYDPHYDYFADKINIARGGHRMATVL 158

Query: 200 MYLSDVEEGGETVFPAAN--ANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           MYLSDV +GGETVFP A       +     +LSEC K+G+SVKP+RGDALLF+S+ P A 
Sbjct: 159 MYLSDVVKGGETVFPNAEEPPRRKATESHEDLSECAKKGISVKPRRGDALLFFSLHPTAI 218

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            DP+SLH GCPVI G KWSATKW+HV  +
Sbjct: 219 PDPNSLHAGCPVIEGEKWSATKWIHVDSF 247


>gi|384246332|gb|EIE19822.1| hypothetical protein COCSUDRAFT_25518 [Coccomyxa subellipsoidea
           C-169]
          Length = 347

 Score =  261 bits (666), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 122/209 (58%), Positives = 158/209 (75%), Gaps = 6/209 (2%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+W PRAF+   FL +AECE+LI  AKP MVKSTVVD+ TG+S DS VRTS+GTF  R +
Sbjct: 86  VSWSPRAFLLKGFLKEAECEHLISKAKPSMVKSTVVDNDTGKSIDSTVRTSTGTFFGREE 145

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTK--NGGQRMAT 197
           D +I+GIE+RI+  T +P  +GEG+Q+LHYE GQKY+AH+D+F D+FN++  NGGQR+AT
Sbjct: 146 DEVIQGIERRISMITHLPEVNGEGLQILHYEDGQKYEAHHDFFHDKFNSRPENGGQRIAT 205

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           +LMYL+  EEGGETVFP A    T  +W    SEC + G +VK +RGDALLF+S+ P+  
Sbjct: 206 VLMYLTTAEEGGETVFPMAANKVTGPQW----SECARGGAAVKSRRGDALLFYSLLPNGE 261

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            DP+SLHG CP  +G KWSATKW+HV  +
Sbjct: 262 TDPTSLHGSCPTTKGEKWSATKWIHVGPF 290


>gi|218193936|gb|EEC76363.1| hypothetical protein OsI_13952 [Oryza sativa Indica Group]
          Length = 1062

 Score =  259 bits (663), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 121/210 (57%), Positives = 160/210 (76%), Gaps = 7/210 (3%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAF+Y  FLS  EC++L++LAK  M KS V D+ +G+S  S+VRTSSGTFL + +
Sbjct: 40  LSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQVRTSSGTFLSKHE 99

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIEKR+A +TF+P E+ E IQ+LHYE+GQKYDAH+DYF D+ N K GG R+AT+L
Sbjct: 100 DDIVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKRGGHRVATVL 159

Query: 200 MYLSDVEEGGETVFPAANANFTSVR---WWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
           MYL+DV++GGETVFP A      ++   W    S+C + GL+VKPK+GDALLF+S+  +A
Sbjct: 160 MYLTDVKKGGETVFPNAAGRHLQLKDETW----SDCARSGLAVKPKKGDALLFFSLHVNA 215

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           T DP+SLHG CPVI G KWSATKW+HV  +
Sbjct: 216 TTDPASLHGSCPVIEGEKWSATKWIHVRSF 245


>gi|50845214|gb|AAT84604.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
          Length = 316

 Score =  258 bits (660), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 121/208 (58%), Positives = 158/208 (75%), Gaps = 3/208 (1%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W+PRAF+Y  FL+  EC++LID+AK  + KS V D+++G+S  S VRTSSG FL++ Q
Sbjct: 58  LSWKPRAFLYEGFLTHEECDHLIDMAKDKLEKSMVADNESGKSIPSEVRTSSGMFLQKAQ 117

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++  IE RIA +TF+P+E+GE +Q+LHYE GQKY+ H+DYF D+ N + GG R+AT+L
Sbjct: 118 DDVVAAIEARIAAWTFLPIENGEAMQILHYERGQKYEPHFDYFHDKVNQQLGGHRIATVL 177

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNE-LSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           MYLS+VEEGGETVFP A A        NE LS+C K G SVKPK+GDALLF+S+ PDA+ 
Sbjct: 178 MYLSNVEEGGETVFPNAEAKLQLAN--NESLSDCAKGGYSVKPKKGDALLFFSLHPDAST 235

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           D  SLHG CPVI G KWSATKW+HV  +
Sbjct: 236 DSLSLHGSCPVIEGEKWSATKWIHVRSF 263


>gi|159794881|pdb|2JIJ|A Chain A, Crystal Structure Of The Apo Form Of Chlamydomonas
           Reinhardtii Prolyl-4 Hydroxylase Type I
 gi|159794882|pdb|2JIJ|B Chain B, Crystal Structure Of The Apo Form Of Chlamydomonas
           Reinhardtii Prolyl-4 Hydroxylase Type I
 gi|159794883|pdb|2JIJ|C Chain C, Crystal Structure Of The Apo Form Of Chlamydomonas
           Reinhardtii Prolyl-4 Hydroxylase Type I
          Length = 233

 Score =  258 bits (658), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 120/215 (55%), Positives = 163/215 (75%), Gaps = 9/215 (4%)

Query: 74  EQW-TEIV--AWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTS 130
           E+W  E+V  +W PRAF+  NFLS  EC+Y+++ A+P MVKS+VVD+++G+S DS +RTS
Sbjct: 16  EEWRGEVVHLSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTS 75

Query: 131 SGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT-- 188
           +GT+  +G+D +I  IEKR+A  T IP+E+ EG+QVLHY  GQKY+ HYDYF D  N   
Sbjct: 76  TGTWFAKGEDSVISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGP 135

Query: 189 KNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALL 248
           ++GGQR+ T+LMYL+ VEEGGETV P A    T   W    SEC K+GL+VKP +GDAL+
Sbjct: 136 EHGGQRVVTMLMYLTTVEEGGETVLPNAEQKVTGDGW----SECAKRGLAVKPIKGDALM 191

Query: 249 FWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHV 283
           F+S++PD + DP+SLHG CP ++G+KWSATKW+HV
Sbjct: 192 FYSLKPDGSNDPASLHGSCPTLKGDKWSATKWIHV 226


>gi|241913390|pdb|3GZE|A Chain A, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
           (Ser-Pro)5 Peptide Substrate
 gi|241913391|pdb|3GZE|B Chain B, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
           (Ser-Pro)5 Peptide Substrate
 gi|241913392|pdb|3GZE|C Chain C, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
           (Ser-Pro)5 Peptide Substrate
 gi|241913393|pdb|3GZE|D Chain D, Algal Prolyl 4-Hydroxylase Complexed With Zinc And
           (Ser-Pro)5 Peptide Substrate
          Length = 225

 Score =  258 bits (658), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 120/215 (55%), Positives = 163/215 (75%), Gaps = 9/215 (4%)

Query: 74  EQW-TEIV--AWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTS 130
           E+W  E+V  +W PRAF+  NFLS  EC+Y+++ A+P MVKS+VVD+++G+S DS +RTS
Sbjct: 8   EEWRGEVVHLSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTS 67

Query: 131 SGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT-- 188
           +GT+  +G+D +I  IEKR+A  T IP+E+ EG+QVLHY  GQKY+ HYDYF D  N   
Sbjct: 68  TGTWFAKGEDSVISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGP 127

Query: 189 KNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALL 248
           ++GGQR+ T+LMYL+ VEEGGETV P A    T   W    SEC K+GL+VKP +GDAL+
Sbjct: 128 EHGGQRVVTMLMYLTTVEEGGETVLPNAEQKVTGDGW----SECAKRGLAVKPIKGDALM 183

Query: 249 FWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHV 283
           F+S++PD + DP+SLHG CP ++G+KWSATKW+HV
Sbjct: 184 FYSLKPDGSNDPASLHGSCPTLKGDKWSATKWIHV 218


>gi|359477453|ref|XP_003631980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Vitis
           vinifera]
 gi|297736941|emb|CBI26142.3| unnamed protein product [Vitis vinifera]
          Length = 298

 Score =  258 bits (658), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 119/209 (56%), Positives = 160/209 (76%), Gaps = 2/209 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W+PRAFVY  FLS+ EC++LI LAK  + +S V D+ +G+S+ S VRTSSG F+ +G+
Sbjct: 40  ISWKPRAFVYEGFLSEEECDHLISLAKSELKRSAVADNVSGKSRLSEVRTSSGMFIGKGK 99

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +IA +TF+P ++GE +QVL YE GQKYDAHYDYF+D+ N   GG R+AT+L
Sbjct: 100 DPIVAGIEDKIAAWTFLPKDNGEDMQVLRYEPGQKYDAHYDYFVDKVNIARGGHRIATVL 159

Query: 200 MYLSDVEEGGETVFPAAN--ANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           MYLSDV +GGETVFP A   +    +   ++LSEC ++G++VKP++GDALLF+S+ P A 
Sbjct: 160 MYLSDVVKGGETVFPMAEEPSRRKPLPTNDDLSECARKGIAVKPRKGDALLFFSLHPTAI 219

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            DP SLHGGCPVI G KWSATKW+HV  +
Sbjct: 220 PDPMSLHGGCPVIEGEKWSATKWIHVDSF 248


>gi|159478673|ref|XP_001697425.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158274304|gb|EDP00087.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 297

 Score =  258 bits (658), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 120/215 (55%), Positives = 163/215 (75%), Gaps = 9/215 (4%)

Query: 74  EQW-TEIV--AWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTS 130
           E+W  E+V  +W PRAF+  NFLS  EC+Y+++ A+P MVKS+VVD+++G+S DS +RTS
Sbjct: 36  EEWRGEVVHLSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTS 95

Query: 131 SGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT-- 188
           +GT+  +G+D +I  IEKR+A  T IP+E+ EG+QVLHY  GQKY+ HYDYF D  N   
Sbjct: 96  TGTWFAKGEDSVISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGP 155

Query: 189 KNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALL 248
           ++GGQR+ T+LMYL+ VEEGGETV P A    T   W    SEC K+GL+VKP +GDAL+
Sbjct: 156 EHGGQRVVTMLMYLTTVEEGGETVLPNAEQKVTGDGW----SECAKRGLAVKPIKGDALM 211

Query: 249 FWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHV 283
           F+S++PD + DP+SLHG CP ++G+KWSATKW+HV
Sbjct: 212 FYSLKPDGSNDPASLHGSCPTLKGDKWSATKWIHV 246


>gi|159794879|pdb|2JIG|A Chain A, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
           Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
           Dicarboxylate
 gi|159794880|pdb|2JIG|B Chain B, Crystal Structure Of Chlamydomonas Reinhardtii Prolyl-4
           Hydroxylase Type I Complexed With Zinc And Pyridine-2,4-
           Dicarboxylate
          Length = 224

 Score =  257 bits (657), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 120/215 (55%), Positives = 163/215 (75%), Gaps = 9/215 (4%)

Query: 74  EQW-TEIV--AWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTS 130
           E+W  E+V  +W PRAF+  NFLS  EC+Y+++ A+P MVKS+VVD+++G+S DS +RTS
Sbjct: 7   EEWRGEVVHLSWSPRAFLLKNFLSDEECDYIVEKARPKMVKSSVVDNESGKSVDSEIRTS 66

Query: 131 SGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT-- 188
           +GT+  +G+D +I  IEKR+A  T IP+E+ EG+QVLHY  GQKY+ HYDYF D  N   
Sbjct: 67  TGTWFAKGEDSVISKIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGP 126

Query: 189 KNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALL 248
           ++GGQR+ T+LMYL+ VEEGGETV P A    T   W    SEC K+GL+VKP +GDAL+
Sbjct: 127 EHGGQRVVTMLMYLTTVEEGGETVLPNAEQKVTGDGW----SECAKRGLAVKPIKGDALM 182

Query: 249 FWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHV 283
           F+S++PD + DP+SLHG CP ++G+KWSATKW+HV
Sbjct: 183 FYSLKPDGSNDPASLHGSCPTLKGDKWSATKWIHV 217


>gi|20260280|gb|AAM13038.1| unknown protein [Arabidopsis thaliana]
 gi|22136524|gb|AAM91340.1| unknown protein [Arabidopsis thaliana]
          Length = 298

 Score =  257 bits (657), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 121/209 (57%), Positives = 159/209 (76%), Gaps = 2/209 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+ +PRAFVY  FL++ EC++++ LAK  + +S V D+ +G+SK S VRTSSGTF+ +G+
Sbjct: 40  VSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRTSSGTFISKGK 99

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +I+ +TF+P E+GE IQVL YE GQKYDAH+DYF D+ N   GG RMAT+L
Sbjct: 100 DPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRMATIL 159

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNE--LSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           MYLS+V +GGETVFP A      V   NE  LS+C K+G++VKP++GDALLF+++ PDA 
Sbjct: 160 MYLSNVTKGGETVFPDAEIPSRRVLSENEEDLSDCAKRGIAVKPRKGDALLFFNLHPDAI 219

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            DP SLHGGCPVI G KWSATKW+HV  +
Sbjct: 220 PDPLSLHGGCPVIEGEKWSATKWIHVDSF 248


>gi|449432777|ref|XP_004134175.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 303

 Score =  256 bits (655), Expect = 7e-66,   Method: Compositional matrix adjust.
 Identities = 119/209 (56%), Positives = 158/209 (75%), Gaps = 2/209 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAFVY  FL+  EC++LI LAK  + +S+V D+ +G+SK S VRTSSG F+ + +
Sbjct: 44  ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSEVRTSSGAFIHKAK 103

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +IA +TF+P ++GE IQVL YE GQKYDAH+DYF D+ N   GG RMAT+L
Sbjct: 104 DPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVL 163

Query: 200 MYLSDVEEGGETVFPAANAN--FTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           MYLSDVE+GGETVFP+A  +    +     +LS+C K+G++VKP++GDALLF+S+ P+A 
Sbjct: 164 MYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHPNAI 223

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            D SSLHGGCPVI G KWSATKW+ V  +
Sbjct: 224 PDTSSLHGGCPVIEGEKWSATKWIRVDSF 252


>gi|15239594|ref|NP_197391.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|21593296|gb|AAM65245.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
           thaliana]
 gi|332005243|gb|AED92626.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 298

 Score =  256 bits (654), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 120/209 (57%), Positives = 159/209 (76%), Gaps = 2/209 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+ +PRAFVY  FL++ EC++++ LAK  + +S V D+ +G+SK S VRTSSGTF+ +G+
Sbjct: 40  VSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRTSSGTFISKGK 99

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +I+ +TF+P E+GE IQVL YE GQKYDAH+DYF D+ N   GG RMAT+L
Sbjct: 100 DPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRMATIL 159

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWN--ELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           MYLS+V +GGETVFP A      V   N  +LS+C K+G++VKP++GDALLF+++ PDA 
Sbjct: 160 MYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSDCAKRGIAVKPRKGDALLFFNLHPDAI 219

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            DP SLHGGCPVI G KWSATKW+HV  +
Sbjct: 220 PDPLSLHGGCPVIEGEKWSATKWIHVDSF 248


>gi|115456019|ref|NP_001051610.1| Os03g0803500 [Oryza sativa Japonica Group]
 gi|29150365|gb|AAO72374.1| putative oxidoreductase [Oryza sativa Japonica Group]
 gi|108711618|gb|ABF99413.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative,
           expressed [Oryza sativa Japonica Group]
 gi|113550081|dbj|BAF13524.1| Os03g0803500 [Oryza sativa Japonica Group]
 gi|215765410|dbj|BAG87107.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222625993|gb|EEE60125.1| hypothetical protein OsJ_13003 [Oryza sativa Japonica Group]
          Length = 299

 Score =  256 bits (654), Expect = 8e-66,   Method: Compositional matrix adjust.
 Identities = 121/210 (57%), Positives = 160/210 (76%), Gaps = 7/210 (3%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAF+Y  FLS  EC++L++LAK  M KS V D+ +G+S  S+VRTSSGTFL + +
Sbjct: 40  LSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQVRTSSGTFLSKHE 99

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIEKR+A +TF+P E+ E IQ+LHYE+GQKYDAH+DYF D+ N K GG R+AT+L
Sbjct: 100 DDIVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKRGGHRVATVL 159

Query: 200 MYLSDVEEGGETVFPAANANFTSVR---WWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
           MYL+DV++GGETVFP A      ++   W    S+C + GL+VKPK+GDALLF+S+  +A
Sbjct: 160 MYLTDVKKGGETVFPNAAGRHLQLKDETW----SDCARSGLAVKPKKGDALLFFSLHVNA 215

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           T DP+SLHG CPVI G KWSATKW+HV  +
Sbjct: 216 TTDPASLHGSCPVIEGEKWSATKWIHVRSF 245


>gi|307106819|gb|EFN55064.1| hypothetical protein CHLNCDRAFT_35843 [Chlorella variabilis]
          Length = 287

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 123/212 (58%), Positives = 156/212 (73%), Gaps = 6/212 (2%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E V+W PRAFVYHNFLS  ECE+L +LA+  + KSTVVD+KTG+S DS VRTSSGTFL R
Sbjct: 39  EQVSWRPRAFVYHNFLSDEECEHLKELARKRLTKSTVVDNKTGKSMDSTVRTSSGTFLAR 98

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN--TKNGGQRM 195
           G+D ++R IEKRI+  T IP E+GE IQ+L Y  GQKY+ H DYF D++N  T+NGGQR+
Sbjct: 99  GEDEVVRAIEKRISLVTMIPEENGEAIQILKYVDGQKYEPHTDYFHDKYNSRTENGGQRV 158

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           AT+LMYLS  EEGGETVFP A        W    SEC ++GL+VK  +G ALLF+S++P+
Sbjct: 159 ATILMYLSTPEEGGETVFPYAEKKVEGEGW----SECARKGLAVKAVKGSALLFYSLKPN 214

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
              D +S HG CP + G KWSAT+W+HV  ++
Sbjct: 215 GEEDQASTHGSCPTLAGEKWSATRWIHVGAFQ 246


>gi|302845234|ref|XP_002954156.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
           nagariensis]
 gi|300260655|gb|EFJ44873.1| hypothetical protein VOLCADRAFT_82641 [Volvox carteri f.
           nagariensis]
          Length = 309

 Score =  255 bits (651), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 120/215 (55%), Positives = 161/215 (74%), Gaps = 9/215 (4%)

Query: 74  EQW-TEIV--AWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTS 130
           E+W  E++  +W PRAF+   FLS  ECE++I  AKP MVKS+VVD+ +G+S DS +RTS
Sbjct: 48  EEWRGEVIHLSWSPRAFLLKGFLSDEECEHIIAKAKPRMVKSSVVDNASGKSVDSEIRTS 107

Query: 131 SGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT-- 188
           +G +L +G+D II  IEKR+A  T IP+E+ EG+QVLHY  GQKY+ HYDYF D  N   
Sbjct: 108 TGAWLAKGEDEIISRIEKRVAQVTMIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNASP 167

Query: 189 KNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALL 248
           ++GGQR+ T+LMYL+ VEEGGETV P A+   +   W    SEC K+GL+VKP +GDAL+
Sbjct: 168 EHGGQRVVTVLMYLTTVEEGGETVLPHADQKVSGEGW----SECAKRGLAVKPVKGDALM 223

Query: 249 FWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHV 283
           F+S++PD + DP+SLHG CP ++G+KWSATKW+HV
Sbjct: 224 FYSLKPDGSNDPASLHGSCPTLKGDKWSATKWIHV 258


>gi|297812067|ref|XP_002873917.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297319754|gb|EFH50176.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 298

 Score =  255 bits (651), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 119/209 (56%), Positives = 159/209 (76%), Gaps = 2/209 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+ +PRAFVY  FL++ EC++++ LAK  + +S V D+ +G+SK S VRTSSGTF+ +G+
Sbjct: 40  VSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRTSSGTFIPKGK 99

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +I+ +TF+P E+GE IQVL YE GQKYDAH+DYF D+ N   GG R+AT+L
Sbjct: 100 DPIVSGIEDKISTWTFLPKENGEDIQVLRYEHGQKYDAHFDYFHDKVNIVRGGHRIATVL 159

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWN--ELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           MYLS+V +GGETVFP A      V   N  +LS+C K+G++VKP++GDALLF+++ PDA 
Sbjct: 160 MYLSNVTKGGETVFPDAEVPSCRVLSENKEDLSDCAKRGIAVKPRKGDALLFFNLHPDAI 219

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            DP SLHGGCPVI G KWSATKW+HV  +
Sbjct: 220 PDPLSLHGGCPVIEGEKWSATKWIHVDSF 248


>gi|224141327|ref|XP_002324025.1| predicted protein [Populus trichocarpa]
 gi|222867027|gb|EEF04158.1| predicted protein [Populus trichocarpa]
          Length = 239

 Score =  254 bits (650), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 118/207 (57%), Positives = 157/207 (75%), Gaps = 1/207 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W+PRAFVY  FLS  EC++LI+LAK  +VKS V + +TG+S +S+ RTSSG F+ + +
Sbjct: 21  LSWQPRAFVYKGFLSDEECDHLINLAKGKLVKSMVANDETGESMESQERTSSGMFIFKTE 80

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE RIA +TF+P E+GE IQ+L YE GQKY+AH DYF+D+ N + GG R AT+L
Sbjct: 81  DEIVNGIEARIAAWTFLPEENGEPIQILRYEHGQKYEAHIDYFVDKANQEEGGHRAATVL 140

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLSDV++GGETVFP + A  +  +  +  S+C K+G +VKP +GDALLF+S+ PDAT D
Sbjct: 141 MYLSDVKKGGETVFPTSEAEGSQAK-DDSWSDCAKKGYAVKPNKGDALLFFSLHPDATPD 199

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEY 286
           P SLH  CPVI G KWSATKW+HV  +
Sbjct: 200 PGSLHASCPVIEGEKWSATKWIHVRSF 226


>gi|302791635|ref|XP_002977584.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
 gi|300154954|gb|EFJ21588.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
          Length = 296

 Score =  254 bits (649), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 117/204 (57%), Positives = 157/204 (76%), Gaps = 3/204 (1%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W+PRAF+Y  F+S AEC++++ +AK  + KS V D+++G+S  S +RTSSG FL +GQ
Sbjct: 45  LSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVLSNIRTSSGMFLSKGQ 104

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D +I  IE+RIA +TF+P E+GE IQVL YE G+KY+ HYDYF D++N   GG R+AT+L
Sbjct: 105 DEVINRIEERIAAWTFLPKENGEAIQVLRYEFGEKYEPHYDYFHDKYNQALGGHRIATVL 164

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLSDV +GGETVFP++         W   S+C K+G++VKP++GDALLF+S+ PDAT D
Sbjct: 165 MYLSDVVKGGETVFPSSEDTTVKDDSW---SDCAKKGIAVKPRKGDALLFYSLHPDATPD 221

Query: 260 PSSLHGGCPVIRGNKWSATKWMHV 283
            SSLHGGCPVI G KWSATKW+HV
Sbjct: 222 ESSLHGGCPVIEGEKWSATKWIHV 245


>gi|224102545|ref|XP_002312720.1| predicted protein [Populus trichocarpa]
 gi|222852540|gb|EEE90087.1| predicted protein [Populus trichocarpa]
          Length = 300

 Score =  253 bits (647), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 121/210 (57%), Positives = 157/210 (74%), Gaps = 4/210 (1%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+W+PRAFVY  FL+  EC++LI LAK  + +S V D+++G+SK S VRTSSG F+ + +
Sbjct: 42  VSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNESGKSKLSEVRTSSGMFITKAK 101

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +IA +TF+P E+GE IQVL YE GQKYD HYDYF D+ N   GG R+AT+L
Sbjct: 102 DPIVAGIEDKIATWTFLPRENGEDIQVLRYEHGQKYDPHYDYFSDKVNIARGGHRVATVL 161

Query: 200 MYLSDVEEGGETVFPAANA---NFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
           MYL+DVE+GGETVFP+A        SV    +LSEC ++G++VKP+RGDALLF+S+ P A
Sbjct: 162 MYLTDVEKGGETVFPSAEELPRRKASVS-HEDLSECARKGIAVKPRRGDALLFFSLYPTA 220

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
             D SS+H GCPVI G KWSATKW+HV  +
Sbjct: 221 VPDTSSIHAGCPVIEGEKWSATKWIHVDSF 250


>gi|242032633|ref|XP_002463711.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
 gi|241917565|gb|EER90709.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
          Length = 297

 Score =  253 bits (647), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 121/208 (58%), Positives = 160/208 (76%), Gaps = 3/208 (1%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAF+Y  FLS  EC++LI+LAK  M KS V D+ +G+S  S+VRTSSG FL + +
Sbjct: 38  LSWRPRAFLYSGFLSDTECDHLINLAKGSMEKSMVADNDSGKSLMSQVRTSSGAFLAKHE 97

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+  IEKR+A +TF+P E+ E +QVL YE+GQKYDAH+DYF D+ N K+GGQR AT+L
Sbjct: 98  DEIVSAIEKRVAAWTFLPEENAESMQVLRYEIGQKYDAHFDYFHDKNNVKHGGQRFATVL 157

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNEL-SECGKQGLSVKPKRGDALLFWSMRPDATL 258
           MYL+DV++GGETVFP  NA  + +++ +E  SEC + GL+VKPK+GDALLF+ +  +AT 
Sbjct: 158 MYLTDVKKGGETVFP--NAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFGLHLNATT 215

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           D SSLHG CPVI G KWSATKW+HV  +
Sbjct: 216 DTSSLHGSCPVIEGEKWSATKWIHVRSF 243


>gi|302786814|ref|XP_002975178.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
 gi|300157337|gb|EFJ23963.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
          Length = 283

 Score =  253 bits (645), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 116/204 (56%), Positives = 158/204 (77%), Gaps = 2/204 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W+PRAF+Y  F+S AEC++++ +AK  + KS V D+++G+S  S +RTSSG FL +GQ
Sbjct: 31  LSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVLSNIRTSSGMFLSKGQ 90

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D +I  IE+RIA +TF+P E+GE IQVL YE G+KY+ HYDYF D++N   GG R+AT+L
Sbjct: 91  DEVINRIEERIAAWTFLPKENGEAIQVLRYEFGEKYEPHYDYFHDKYNQALGGHRIATVL 150

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLSD  +GGETVFP++  + T     +  S+C K+G++VKP++GDALLF+S+ PDAT D
Sbjct: 151 MYLSDAVKGGETVFPSSEEDTTVKD--DSWSDCAKKGIAVKPRKGDALLFYSLHPDATPD 208

Query: 260 PSSLHGGCPVIRGNKWSATKWMHV 283
            SSLHGGCPVI G KWSATKW+HV
Sbjct: 209 ESSLHGGCPVIEGEKWSATKWIHV 232


>gi|225452614|ref|XP_002281420.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296087745|emb|CBI35001.3| unnamed protein product [Vitis vinifera]
          Length = 316

 Score =  252 bits (644), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 119/207 (57%), Positives = 156/207 (75%), Gaps = 1/207 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAF+Y  FLS+ EC++LI LAK  + KS V D+++G+S  S VRTSSG FL + Q
Sbjct: 57  LSWRPRAFLYKGFLSEEECDHLITLAKDKLEKSMVADNESGKSIMSEVRTSSGMFLLKAQ 116

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+  IE RIA +TF+P+E+GE IQ+LHYE G+KY+ H+DYF D+ N   GG R+AT+L
Sbjct: 117 DEIVADIEARIAAWTFLPVENGESIQILHYENGEKYEPHFDYFHDKVNQLLGGHRIATVL 176

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYL+ VEEGGETVFP +   F+  +  +  S+C K+G +V PK+GDALLF+S+ PDAT D
Sbjct: 177 MYLATVEEGGETVFPNSEGRFSQPK-DDSWSDCAKKGYAVNPKKGDALLFFSLHPDATTD 235

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEY 286
           PSSLHG CPVI G KWSATKW+HV  +
Sbjct: 236 PSSLHGSCPVIAGEKWSATKWIHVRSF 262


>gi|357140446|ref|XP_003571778.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 298

 Score =  252 bits (643), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 116/208 (55%), Positives = 156/208 (75%), Gaps = 1/208 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAF++  FLS+ EC+++I+LAK  + KS V D+++G+S  S VRTSSG FL++ Q
Sbjct: 38  LSWRPRAFLHKGFLSEPECDHMIELAKDKLEKSMVADNESGKSVQSEVRTSSGMFLEKRQ 97

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++  IE+RIA +TF+P E+GE IQ+LHY+ G+KY+ HYDYF D+ N   GG R+AT+L
Sbjct: 98  DEVVARIEERIAAWTFLPSENGESIQILHYKNGEKYEPHYDYFHDKNNQALGGHRIATVL 157

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLS+VE+GGET+FP A    T  +     SEC K G +VKP +GDALLF+S+ PDAT D
Sbjct: 158 MYLSNVEKGGETIFPNAEGKLTQHK-DETASECAKNGYAVKPMKGDALLFFSLHPDATTD 216

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           P SLHG CPVI G KWSATKW+HV  ++
Sbjct: 217 PDSLHGSCPVIEGQKWSATKWIHVRSFE 244


>gi|356555587|ref|XP_003546112.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
           [Glycine max]
          Length = 297

 Score =  251 bits (640), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 116/207 (56%), Positives = 154/207 (74%), Gaps = 2/207 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+W+PRAFVY  FL++ EC++LI +AK  + +S V D+ +G+SK S VRTSSG F+ + +
Sbjct: 43  VSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFIPKNK 102

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ G+E +I+ +T +P E+GE IQVL YE GQKYD HYDYF D+ N   GG R+AT+L
Sbjct: 103 DPIVAGVEDKISSWTLLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGHRVATVL 162

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYL+DV +GGETVFP  NA   S     +LSEC ++G++VKP+RGDALLF+S+ P+A  D
Sbjct: 163 MYLTDVTKGGETVFP--NAELKSSETKEDLSECAQKGIAVKPRRGDALLFFSLYPNAIPD 220

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEY 286
             SLH GCPVI G KWSATKW+HV  +
Sbjct: 221 TMSLHAGCPVIEGEKWSATKWIHVDSF 247


>gi|255552788|ref|XP_002517437.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223543448|gb|EEF44979.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 311

 Score =  251 bits (640), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 116/208 (55%), Positives = 156/208 (75%), Gaps = 1/208 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAF+Y  FLS  EC++LIDLA+  + KS V D+++G+S +S VRTSSG F+ + Q
Sbjct: 52  LSWHPRAFLYKGFLSYEECDHLIDLARDKLEKSMVADNESGKSIESEVRTSSGMFIAKAQ 111

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+  IE RIA +TF+P E+GE +Q+LHYE GQKY+ H+DYF D+ N + GG R+AT+L
Sbjct: 112 DEIVADIEARIAAWTFLPEENGESMQILHYEHGQKYEPHFDYFHDKANQELGGHRVATVL 171

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLS+VE+GGETVFP A    +  +  +  S+C K G +VKP++GDALLF+S+ PDAT D
Sbjct: 172 MYLSNVEKGGETVFPNAEGKLSQPK-EDSWSDCAKGGYAVKPEKGDALLFFSLHPDATTD 230

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEYK 287
             SLHG CPVI G KWSATKW+HV  ++
Sbjct: 231 SDSLHGSCPVIEGEKWSATKWIHVRSFE 258


>gi|449495423|ref|XP_004159836.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 304

 Score =  250 bits (638), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 117/210 (55%), Positives = 156/210 (74%), Gaps = 3/210 (1%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAFVY  FL+  EC++LI LAK  + +S+V D+ +G+SK S VRTSSG F+ + +
Sbjct: 44  ISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSEVRTSSGAFIHKAK 103

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +IA +TF+P ++GE IQVL YE GQKYDAH+DYF D+ N   GG RMAT+L
Sbjct: 104 DPIVSGIEDKIAAWTFLPKDNGEDIQVLRYEYGQKYDAHFDYFADKVNIARGGHRMATVL 163

Query: 200 MYLSDVEEGGETVF---PAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
           MYLSDVE+GGETVF    + +    +     +LS+C K+G++VKP++GDALLF+S+ P+A
Sbjct: 164 MYLSDVEKGGETVFLLRRSESQRRQASETNEDLSDCAKKGIAVKPRKGDALLFFSLHPNA 223

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
             D SSLHGGCPVI G KWSATKW+ V  +
Sbjct: 224 IPDTSSLHGGCPVIEGEKWSATKWIRVDSF 253


>gi|357447555|ref|XP_003594053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
 gi|355483101|gb|AES64304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
          Length = 303

 Score =  250 bits (638), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 118/211 (55%), Positives = 155/211 (73%), Gaps = 4/211 (1%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+W+PRAFVY  FL+  EC++LI +AK  + +S V D+ +G+SK S VRTSSG F+ + +
Sbjct: 43  VSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFISKNK 102

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +I+ +TF+P E+GE IQVL YE GQKYD HYDYF D+ N   GG R+AT+L
Sbjct: 103 DAIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGHRVATVL 162

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNE----LSECGKQGLSVKPKRGDALLFWSMRPD 255
           MYL++V +GGETVFP A    +     +E    LSECGK+G++VKP+RGDALLF+S+ P+
Sbjct: 163 MYLTNVTKGGETVFPNAELQESPRHKLSETDEDLSECGKKGVAVKPRRGDALLFFSLHPN 222

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           A  D  SLH GCPVI G KWSATKW+HV  +
Sbjct: 223 AIPDTLSLHAGCPVIEGEKWSATKWIHVDSF 253


>gi|356550516|ref|XP_003543632.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 318

 Score =  249 bits (637), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 118/211 (55%), Positives = 155/211 (73%), Gaps = 7/211 (3%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAF+Y  FLS  EC++LI LAK  + KS V D+++G+S  S VRTSSG FL + Q
Sbjct: 59  LSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVADNESGKSIMSEVRTSSGMFLNKAQ 118

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE RIA +TF+P+E+GE +Q+LHYE GQKY+ H+DYF D+ N   GG R+AT+L
Sbjct: 119 DEIVAGIEARIAAWTFLPIENGESMQILHYENGQKYEPHFDYFHDKANQVMGGHRIATVL 178

Query: 200 MYLSDVEEGGETVFPAANANFTSVR---WWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
           MYLSDVE+GGET+FP A A     +   W    SEC  +G +VKP++GDALLF+S+  DA
Sbjct: 179 MYLSDVEKGGETIFPNAKAKLLQPKDESW----SECAHKGYAVKPRKGDALLFFSLHLDA 234

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           + D  SLHG CPVI G KWSATKW+HV +++
Sbjct: 235 STDNKSLHGSCPVIEGEKWSATKWIHVSDFQ 265


>gi|159795555|pdb|2V4A|A Chain A, Crystal Structure Of The Semet-Labeled Prolyl-4
           Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
           Reinhardtii.
 gi|159795556|pdb|2V4A|B Chain B, Crystal Structure Of The Semet-Labeled Prolyl-4
           Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
           Reinhardtii.
 gi|159795557|pdb|2V4A|C Chain C, Crystal Structure Of The Semet-Labeled Prolyl-4
           Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
           Reinhardtii.
 gi|159795558|pdb|2V4A|D Chain D, Crystal Structure Of The Semet-Labeled Prolyl-4
           Hydroxylase (P4h) Type I From Green Algae Chlamydomonas
           Reinhardtii
          Length = 233

 Score =  249 bits (636), Expect = 9e-64,   Method: Compositional matrix adjust.
 Identities = 118/215 (54%), Positives = 159/215 (73%), Gaps = 9/215 (4%)

Query: 74  EQW-TEIV--AWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTS 130
           E+W  E+V  +W PRAF+  NFLS  EC+Y+++ A+P  VKS+VVD+++G+S DS +RTS
Sbjct: 16  EEWRGEVVHLSWSPRAFLLKNFLSDEECDYIVEKARPKXVKSSVVDNESGKSVDSEIRTS 75

Query: 131 SGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT-- 188
           +GT+  +G+D +I  IEKR+A  T IP+E+ EG+QVLHY  GQKY+ HYDYF D  N   
Sbjct: 76  TGTWFAKGEDSVISKIEKRVAQVTXIPLENHEGLQVLHYHDGQKYEPHYDYFHDPVNAGP 135

Query: 189 KNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALL 248
           ++GGQR+ T L YL+ VEEGGETV P A    T   W    SEC K+GL+VKP +GDAL 
Sbjct: 136 EHGGQRVVTXLXYLTTVEEGGETVLPNAEQKVTGDGW----SECAKRGLAVKPIKGDALX 191

Query: 249 FWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHV 283
           F+S++PD + DP+SLHG CP ++G+KWSATKW+HV
Sbjct: 192 FYSLKPDGSNDPASLHGSCPTLKGDKWSATKWIHV 226


>gi|242039723|ref|XP_002467256.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
 gi|241921110|gb|EER94254.1| hypothetical protein SORBIDRAFT_01g022150 [Sorghum bicolor]
          Length = 303

 Score =  249 bits (636), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 116/207 (56%), Positives = 155/207 (74%), Gaps = 1/207 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAF++  FLS AEC++LI LAK  + KS V D+++G+S  S VRTSSG FL++ Q
Sbjct: 43  LSWRPRAFLHKGFLSDAECDHLIVLAKDKLEKSMVADNESGKSVQSEVRTSSGMFLEKKQ 102

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++RGIE+RIA +TF+P E+GE IQ+LHY+ G+KY+ HYDYF D+ N   GG R+AT+L
Sbjct: 103 DEVVRGIEERIAAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVL 162

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLS+VE+GGET+FP A       +  +  S+C + G +VKP +GDALLF+S+ PDAT D
Sbjct: 163 MYLSNVEKGGETIFPNAEGKLLQPK-DDTWSDCARNGYAVKPVKGDALLFFSLHPDATTD 221

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEY 286
             SLHG CPVI G KWSATKW+HV  +
Sbjct: 222 SESLHGSCPVIEGQKWSATKWIHVRSF 248


>gi|114796723|gb|ABI79328.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
          Length = 297

 Score =  249 bits (636), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 113/210 (53%), Positives = 160/210 (76%), Gaps = 3/210 (1%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W+PRAFVY  FL+  EC++LI +AK  + +S V D+++G+S+ S VRTSSG F+ + +
Sbjct: 40  ISWKPRAFVYEGFLTDEECDHLISIAKTELKRSAVADNESGKSQVSEVRTSSGAFISKAK 99

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I++ IE+++A +TF+P+E+GE IQVL YE GQKY+ H+D+F D+ N   GG R AT+L
Sbjct: 100 DAIVQRIEEKLATWTFLPIENGEDIQVLRYEEGQKYENHFDFFSDKVNIARGGHRYATVL 159

Query: 200 MYLSDVEEGGETVFPAANAN---FTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
           MYLS+VE+GG+TVFP A  +     ++   ++LSEC K+G+SVKP++GDALLF+S+ P A
Sbjct: 160 MYLSNVEKGGDTVFPNAELSERQKAAIAANDDLSECAKRGISVKPRKGDALLFFSLTPTA 219

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           T D  SLHGGCPVI G KWSATKW+HV  +
Sbjct: 220 TPDQLSLHGGCPVIEGEKWSATKWIHVDSF 249


>gi|449522594|ref|XP_004168311.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Cucumis
           sativus]
          Length = 313

 Score =  249 bits (635), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 115/209 (55%), Positives = 158/209 (75%), Gaps = 3/209 (1%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W+PRAF+Y  FLS AEC++LIDLAK  + KS V D+ +G+S  S VRTSSG FL++ Q
Sbjct: 56  LSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLRKAQ 115

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++ G+E RIA +T +P E+GE IQ+LHYE GQKY+ H+D+F D+ N + GG R+AT+L
Sbjct: 116 DEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVL 175

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNEL-SECGKQGLSVKPKRGDALLFWSMRPDATL 258
           MYLS+VE+GGET+FP  N+ F   +  +E  S+C ++G +VK ++GDALLF+S+  DAT 
Sbjct: 176 MYLSNVEKGGETIFP--NSEFKESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDATT 233

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           D  SLHG CPVI G KWSATKW+HV  ++
Sbjct: 234 DERSLHGSCPVIAGEKWSATKWIHVRSFE 262


>gi|357447553|ref|XP_003594052.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
 gi|355483100|gb|AES64303.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
          Length = 301

 Score =  248 bits (634), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 119/211 (56%), Positives = 156/211 (73%), Gaps = 6/211 (2%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+W+PRAFVY  FL+  EC++LI +AK  + +S V D+ +G+SK S VRTSSG F+ + +
Sbjct: 43  VSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFISKNK 102

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +I+ +TF+P E+GE IQVL YE GQKYD HYDYF D+ N   GG R+AT+L
Sbjct: 103 DAIVSGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGHRVATVL 162

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNE----LSECGKQGLSVKPKRGDALLFWSMRPD 255
           MYL++V +GGETVFP  NA  +     +E    LSECGK+G++VKP+RGDALLF+S+ P+
Sbjct: 163 MYLTNVTKGGETVFP--NAEESPRHKLSETDEDLSECGKKGVAVKPRRGDALLFFSLHPN 220

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           A  D  SLH GCPVI G KWSATKW+HV  +
Sbjct: 221 AIPDTLSLHAGCPVIEGEKWSATKWIHVDSF 251


>gi|255085592|ref|XP_002505227.1| predicted protein [Micromonas sp. RCC299]
 gi|226520496|gb|ACO66485.1| predicted protein [Micromonas sp. RCC299]
          Length = 267

 Score =  248 bits (633), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 121/208 (58%), Positives = 157/208 (75%), Gaps = 8/208 (3%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P+A++Y  FL +AEC+Y+ + AKP + KSTVVD+KTGQS  S +RTS G F  R +D I
Sbjct: 11  KPKAYLYRGFLRQAECDYIKERAKPKLEKSTVVDNKTGQSVPSNIRTSDGMFFDRHEDDI 70

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTK--NGGQRMATLLM 200
           I  IE+RIA++T +P E+GEGIQVL YEVGQKY+ H D F D+FNT+   GGQRMAT+LM
Sbjct: 71  IEDIERRIAEWTNVPWENGEGIQVLRYEVGQKYEPHLDAFSDKFNTEESKGGQRMATVLM 130

Query: 201 YLSDVEEGGETVFPAA--NANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           YLSDVEEGGETVFP +    +    +W    SEC ++G++VK ++GDALLFWS+  D+ +
Sbjct: 131 YLSDVEEGGETVFPRSVDKPHKGDPKW----SECAQRGVAVKARKGDALLFWSLDIDSNV 186

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           D  SLHGGCPVI+G KWSATKWMH+  +
Sbjct: 187 DELSLHGGCPVIKGTKWSATKWMHLKSF 214


>gi|356546462|ref|XP_003541645.1| PREDICTED: uncharacterized protein LOC100818794 [Glycine max]
          Length = 839

 Score =  248 bits (632), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 116/209 (55%), Positives = 153/209 (73%), Gaps = 2/209 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+W+PRAFVY  FL++ EC++LI +AK  + +S V D+ +G+SK S VRTSSG F+ + +
Sbjct: 581 VSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFIPKNK 640

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +I+ +TF+P E+GE IQVL YE GQKYD HYDYF D+ N   GG R+AT+L
Sbjct: 641 DLIVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGHRVATVL 700

Query: 200 MYLSDVEEGGETVFPAANAN--FTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           MYL+DV +GGETVFP A  +           LSEC ++G++VKP+RGDALLF+S+ P+A 
Sbjct: 701 MYLTDVTKGGETVFPDAEESPRHKGSETNENLSECAQKGIAVKPRRGDALLFFSLYPNAI 760

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            D  SLH GCPVI G KWSATKW+HV  +
Sbjct: 761 PDTLSLHAGCPVIEGEKWSATKWIHVDSF 789


>gi|359806348|ref|NP_001241485.1| uncharacterized protein LOC100783075 precursor [Glycine max]
 gi|255645457|gb|ACU23224.1| unknown [Glycine max]
          Length = 298

 Score =  248 bits (632), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 117/209 (55%), Positives = 154/209 (73%), Gaps = 2/209 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W+PRAFVY  FL+  EC++LI LAK  + +S V D+ +G+S+ S VRTSSG F+ + +
Sbjct: 40  ISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLSDVRTSSGMFISKNK 99

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D II GIE +I+ +TF+P E+GE IQVL YE GQKYD HYDYF D+ N   GG R+AT+L
Sbjct: 100 DPIISGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFTDKVNIARGGHRIATVL 159

Query: 200 MYLSDVEEGGETVFPAAN--ANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           MYL++V +GGETVFP+A            ++LSEC K+G++VKP RGDALLF+S+  +AT
Sbjct: 160 MYLTNVTKGGETVFPSAEEPPRRRGTETSSDLSECAKKGIAVKPHRGDALLFFSLHTNAT 219

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            D SSLH GCPVI G KWSATKW+HV  +
Sbjct: 220 PDTSSLHAGCPVIEGEKWSATKWIHVDSF 248


>gi|356572148|ref|XP_003554232.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
          Length = 319

 Score =  247 bits (631), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 117/211 (55%), Positives = 155/211 (73%), Gaps = 7/211 (3%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAF+Y  FLS+ EC++LI LAK  + KS V D+ +G+S  S +RTSSG FL + Q
Sbjct: 60  LSWSPRAFLYKGFLSEEECDHLIVLAKDKLEKSMVADNDSGKSIMSDIRTSSGMFLNKAQ 119

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE RIA +TF+P+E+GE +Q+LHYE GQKY+ H+DYF D+ N   GG R+AT+L
Sbjct: 120 DEIVAGIEARIAAWTFLPVENGESMQILHYENGQKYEPHFDYFHDKANQVMGGHRIATVL 179

Query: 200 MYLSDVEEGGETVFPAANANFTSVR---WWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
           MYLSDVE+GGET+FP A A     +   W    SEC  +G +VKP++GDALLF+S+  DA
Sbjct: 180 MYLSDVEKGGETIFPNAEAKLLQPKDESW----SECAHKGYAVKPQKGDALLFFSLHLDA 235

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           + D  SLHG CPVI G KWSATKW+HV +++
Sbjct: 236 STDTKSLHGSCPVIEGEKWSATKWIHVSDFE 266


>gi|363807286|ref|NP_001242363.1| uncharacterized protein LOC100796794 precursor [Glycine max]
 gi|255641119|gb|ACU20838.1| unknown [Glycine max]
          Length = 297

 Score =  247 bits (631), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 116/209 (55%), Positives = 154/209 (73%), Gaps = 2/209 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W+PRAFVY  FL+  EC++LI LAK  + +S V D+ +G+S+ S VRTSSG F+ + +
Sbjct: 39  ISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLSDVRTSSGMFISKNK 98

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +I+ +TF+P E+GE IQV  YE GQKYD HYDYF D+ N   GG R+AT+L
Sbjct: 99  DPIVAGIEDKISSWTFLPKENGEDIQVSRYEHGQKYDPHYDYFTDKVNIARGGHRIATVL 158

Query: 200 MYLSDVEEGGETVFPAAN--ANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           MYL+DV +GGETVFP+A            ++LSEC K+G++VKP+RGDALLF+S+  +AT
Sbjct: 159 MYLTDVAKGGETVFPSAEEPPRRRGAETSSDLSECAKKGIAVKPRRGDALLFFSLHTNAT 218

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            D SSLH GCPVI G KWSATKW+HV  +
Sbjct: 219 PDTSSLHAGCPVIEGEKWSATKWIHVDSF 247


>gi|449454448|ref|XP_004144967.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
 gi|449474082|ref|XP_004154068.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
 gi|449515181|ref|XP_004164628.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 300

 Score =  247 bits (630), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 113/209 (54%), Positives = 159/209 (76%), Gaps = 2/209 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W+PRAFVY  FL+  EC++L+ +A+  + +S V D+ +G+SK S VRTSSG F+ + +
Sbjct: 42  ISWKPRAFVYEGFLTDLECDHLVSIARSELKRSEVADNDSGKSKLSTVRTSSGMFISKNK 101

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +I+ +TF+P E+GE IQVL YE GQKY++HYDYF+D+ N   GG R+AT+L
Sbjct: 102 DPIVSGIEDKISAWTFLPKENGEDIQVLRYEHGQKYESHYDYFVDKVNIAWGGHRLATVL 161

Query: 200 MYLSDVEEGGETVFPAAN--ANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           MYLS+V +GGETVFP A   ++  +     +LSEC K+G++VKPK+GDALLF+S+ P+A 
Sbjct: 162 MYLSNVTQGGETVFPLAEKPSHRRAYETDEDLSECAKKGVAVKPKKGDALLFFSLEPNAI 221

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            D +SLHGGCPV+ G KWSATKW+HV  +
Sbjct: 222 PDTNSLHGGCPVLEGEKWSATKWIHVDSF 250


>gi|294461211|gb|ADE76168.1| unknown [Picea sitchensis]
          Length = 280

 Score =  247 bits (630), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 117/204 (57%), Positives = 152/204 (74%), Gaps = 1/204 (0%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P  F+Y NFL+ AEC++LI LA+  + KS V D+++G+S  S +RTSSG FL + QD I+
Sbjct: 28  PGLFLYKNFLTDAECDHLIFLARDKLQKSMVADNESGKSVMSEIRTSSGMFLNKAQDEIV 87

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLS 203
             +E RIA +TF+P+E+GE +QVLHYE+GQKY+ H+DYF D+ N   GG R+AT+LMYLS
Sbjct: 88  ASVEDRIAAWTFLPIENGEAMQVLHYELGQKYEPHFDYFHDKINQAMGGHRIATVLMYLS 147

Query: 204 DVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSL 263
           DV +GGETVFP A    +  +  +  SEC K G SVKP +GDALLF+S+RPDAT D SSL
Sbjct: 148 DVVKGGETVFPNAETKDSQPK-DDSWSECAKGGYSVKPNKGDALLFFSLRPDATTDQSSL 206

Query: 264 HGGCPVIRGNKWSATKWMHVHEYK 287
           HG CPVI G KWSATKW+HV  ++
Sbjct: 207 HGSCPVIEGEKWSATKWIHVRSFE 230


>gi|255637501|gb|ACU19077.1| unknown [Glycine max]
          Length = 318

 Score =  246 bits (628), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 117/211 (55%), Positives = 154/211 (72%), Gaps = 7/211 (3%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAF+Y  FLS  EC++LI LAK  + KS V D+++G+S  S VRTSSG FL + Q
Sbjct: 59  LSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVADNESGKSIMSEVRTSSGMFLNKAQ 118

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE RIA +TF+P+E+GE +Q+LHYE GQKY+ H+DYF D+ N   GG R+AT+L
Sbjct: 119 DEIVAGIEARIAAWTFLPIENGESMQILHYENGQKYEPHFDYFHDKANQVMGGHRIATVL 178

Query: 200 MYLSDVEEGGETVFPAANANFTSVR---WWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
           MYLSDVE+GGET+F  A A     +   W    SEC  +G +VKP++GDALLF+S+  DA
Sbjct: 179 MYLSDVEKGGETIFSNAKAKLLQPKDESW----SECAHKGYAVKPRKGDALLFFSLHLDA 234

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           + D  SLHG CPVI G KWSATKW+HV +++
Sbjct: 235 STDNKSLHGSCPVIEGEKWSATKWIHVSDFQ 265


>gi|363543301|ref|NP_001241866.1| prolyl 4-hydroxylase 6 precursor [Zea mays]
 gi|195624808|gb|ACG34234.1| oxidoreductase [Zea mays]
 gi|347978818|gb|AEP37751.1| prolyl 4-hydroxylase 6 [Zea mays]
          Length = 297

 Score =  246 bits (628), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 117/204 (57%), Positives = 154/204 (75%), Gaps = 3/204 (1%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PRAF+Y  FLS  EC++++ LAK  M KS V D+ +G+S  S+ RTSSGTFL + +D I+
Sbjct: 42  PRAFLYSGFLSDTECDHIVSLAKGSMEKSMVADNDSGKSVASQARTSSGTFLAKREDEIV 101

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLS 203
             IEKR+A +TF+P E+ E +QVL YE GQKYDAH+DYF D  N K GGQR+AT+LMYL+
Sbjct: 102 SAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRVATVLMYLT 161

Query: 204 DVEEGGETVFPAANANFTSVRWWNEL-SECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           DV++GGETVFP  NA  + +++ +E  SEC + GL+VKPK+GDALLF+++  +AT D  S
Sbjct: 162 DVKKGGETVFP--NAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVNATADTGS 219

Query: 263 LHGGCPVIRGNKWSATKWMHVHEY 286
           LHG CPVI G KWSATKW+HV  +
Sbjct: 220 LHGSCPVIEGEKWSATKWIHVRSF 243


>gi|293337056|ref|NP_001169835.1| uncharacterized protein LOC100383727 precursor [Zea mays]
 gi|224031897|gb|ACN35024.1| unknown [Zea mays]
 gi|347978800|gb|AEP37742.1| prolyl 4-hydroxylase 2 [Zea mays]
 gi|414871435|tpg|DAA49992.1| TPA: hypothetical protein ZEAMMB73_500506 [Zea mays]
          Length = 299

 Score =  246 bits (627), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 115/207 (55%), Positives = 153/207 (73%), Gaps = 1/207 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAF++  FLS AEC++LI LAK  + KS V D+++G+S  S VRTSSG FL+R Q
Sbjct: 39  LSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSEVRTSSGMFLERKQ 98

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++  IE+RI+ +TF+P E+GE IQ+LHY+ G+KY+ HYDYF D+ N   GG R+AT+L
Sbjct: 99  DEVVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRIATVL 158

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLS+VE+GGET+FP A       +  N  S+C + G +VKP +GDALLF+S+ PDAT D
Sbjct: 159 MYLSNVEKGGETIFPNAEGKLLQPK-DNTWSDCARNGYAVKPVKGDALLFFSLHPDATTD 217

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEY 286
             SLHG CPVI G KWSATKW+HV  +
Sbjct: 218 SDSLHGSCPVIEGQKWSATKWIHVRSF 244


>gi|413932756|gb|AFW67307.1| oxidoreductase [Zea mays]
          Length = 297

 Score =  246 bits (627), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 118/204 (57%), Positives = 153/204 (75%), Gaps = 3/204 (1%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PRAF+Y  FLS  EC++L+ LAK  M KS V D+ +G+S  S+ RTSSGTFL + +D I+
Sbjct: 42  PRAFLYSGFLSDTECDHLVSLAKGSMEKSMVADNDSGKSVASQARTSSGTFLAKREDEIV 101

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLS 203
             IEKR+A +TF+P E+ E +QVL YE GQKYDAH+DYF D  N K GGQR+AT+LMYL+
Sbjct: 102 SAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRVATVLMYLT 161

Query: 204 DVEEGGETVFPAANANFTSVRWWNEL-SECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           DV +GGETVFP  NA  + +++ +E  SEC + GL+VKPK+GDALLF+++  +AT D  S
Sbjct: 162 DVNKGGETVFP--NAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVNATADTGS 219

Query: 263 LHGGCPVIRGNKWSATKWMHVHEY 286
           LHG CPVI G KWSATKW+HV  +
Sbjct: 220 LHGSCPVIEGEKWSATKWIHVRSF 243


>gi|357496283|ref|XP_003618430.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
 gi|217073992|gb|ACJ85356.1| unknown [Medicago truncatula]
 gi|355493445|gb|AES74648.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
 gi|388494436|gb|AFK35284.1| unknown [Medicago truncatula]
          Length = 313

 Score =  246 bits (627), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 114/211 (54%), Positives = 157/211 (74%), Gaps = 7/211 (3%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAF+Y NFL+  EC++LI+L+K  + KS V D+++G+S  S VRTSSG FL + Q
Sbjct: 54  LSWSPRAFLYKNFLTDEECDHLIELSKDKLEKSMVADNESGKSIQSEVRTSSGMFLNKQQ 113

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE RIA +TF+P+E+GE +QVLHY  G+KY+ H+D+F D+ N + GG R+AT+L
Sbjct: 114 DEIVSGIEARIAAWTFLPVENGESMQVLHYMNGEKYEPHFDFFHDKANQRLGGHRVATVL 173

Query: 200 MYLSDVEEGGETVFPAANANFTSVR---WWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
           MYLS+VE+GGET+FP A    +  +   W    SEC  +G +VKP++GDALLF+S+  DA
Sbjct: 174 MYLSNVEKGGETIFPHAEGKLSQPKDESW----SECAHKGYAVKPRKGDALLFFSLHLDA 229

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           T D  SLHG CPVI G KWSATKW+HV +++
Sbjct: 230 TTDSKSLHGSCPVIEGEKWSATKWIHVADFE 260


>gi|18397528|ref|NP_566279.1| P4H isoform 2 [Arabidopsis thaliana]
 gi|332640849|gb|AEE74370.1| P4H isoform 2 [Arabidopsis thaliana]
          Length = 299

 Score =  245 bits (626), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 114/209 (54%), Positives = 158/209 (75%), Gaps = 2/209 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+ +PRAFVY  FL+  EC++LI LAK  + +S V D+  G+S+ S VRTSSGTF+ +G+
Sbjct: 41  VSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFISKGK 100

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +++ +TF+P E+GE +QVL YE GQKYDAH+DYF D+ N   GG R+AT+L
Sbjct: 101 DPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRIATVL 160

Query: 200 MYLSDVEEGGETVFPAAN--ANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           +YLS+V +GGETVFP A   +  +     ++LS+C K+G++VKPK+G+ALLF++++ DA 
Sbjct: 161 LYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALLFFNLQQDAI 220

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            DP SLHGGCPVI G KWSATKW+HV  +
Sbjct: 221 PDPFSLHGGCPVIEGEKWSATKWIHVDSF 249


>gi|21618073|gb|AAM67123.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
           thaliana]
          Length = 297

 Score =  245 bits (626), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 114/209 (54%), Positives = 158/209 (75%), Gaps = 2/209 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+ +PRAFVY  FL+  EC++LI LAK  + +S V D+  G+S+ S VRTSSGTF+ +G+
Sbjct: 39  VSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFISKGK 98

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +++ +TF+P E+GE +QVL YE GQKYDAH+DYF D+ N   GG R+AT+L
Sbjct: 99  DPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRIATVL 158

Query: 200 MYLSDVEEGGETVFPAAN--ANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           +YLS+V +GGETVFP A   +  +     ++LS+C K+G++VKPK+G+ALLF++++ DA 
Sbjct: 159 LYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALLFFNLQQDAI 218

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            DP SLHGGCPVI G KWSATKW+HV  +
Sbjct: 219 PDPFSLHGGCPVIEGEKWSATKWIHVDSF 247


>gi|297829156|ref|XP_002882460.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297328300|gb|EFH58719.1| hypothetical protein ARALYDRAFT_896741 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 299

 Score =  245 bits (626), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 114/209 (54%), Positives = 158/209 (75%), Gaps = 2/209 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+ +PRAFVY  FL+  EC++LI LAK  + +S V D+  G+S+ S VRTSSGTF+ +G+
Sbjct: 41  VSAKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFISKGK 100

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +++ +TF+P E+GE +QVL YE GQKYDAH+DYF D+ N   GG R+AT+L
Sbjct: 101 DPIVSGIEDKLSTWTFLPKENGEDLQVLRYEPGQKYDAHFDYFHDKVNIARGGHRIATVL 160

Query: 200 MYLSDVEEGGETVFPAAN--ANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           +YLS+V +GGETVFP A   +  +     ++LS+C K+G++VKPK+G+ALLF++++ DA 
Sbjct: 161 LYLSNVTKGGETVFPDAQEYSRRSLSENKDDLSDCAKKGIAVKPKKGNALLFFNLQQDAI 220

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            DP SLHGGCPVI G KWSATKW+HV  +
Sbjct: 221 PDPFSLHGGCPVIEGEKWSATKWIHVDSF 249


>gi|449461905|ref|XP_004148682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 295

 Score =  245 bits (625), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 113/210 (53%), Positives = 156/210 (74%), Gaps = 2/210 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W+PRAF+Y  FLS AEC++LIDLAK  + KS V D+ +G+S  S VRTSSG FL++ Q
Sbjct: 35  LSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGMFLRKAQ 94

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++ G+E RIA +T +P E+GE IQ+LHYE GQKY+ H+D+F D+ N + GG R+AT+L
Sbjct: 95  DEVVAGVEARIAAWTLLPAENGESIQILHYENGQKYEPHFDFFHDKVNQELGGHRIATVL 154

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNE--LSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           MYLS+VE+GGET+FP +   + S     +   S+C ++G +VK ++GDALLF+S+  DAT
Sbjct: 155 MYLSNVEKGGETIFPNSEVWYGSESQAKDESWSDCSRKGYAVKAQKGDALLFFSLNLDAT 214

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
            D  SLHG CPVI G KWSATKW+HV  ++
Sbjct: 215 TDERSLHGSCPVIAGEKWSATKWIHVRSFE 244


>gi|388496942|gb|AFK36537.1| unknown [Lotus japonicus]
          Length = 302

 Score =  245 bits (625), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 118/211 (55%), Positives = 154/211 (72%), Gaps = 4/211 (1%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+W+PRAFVY  FL++ EC++LI LAK  + +S V D+ +G SK S VRTSSG F+ + +
Sbjct: 44  VSWKPRAFVYKGFLTELECDHLISLAKSELKRSAVADNLSGDSKLSDVRTSSGMFISKNK 103

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +I+ +TF+P E+GE IQVL YE GQKYD HYD+F D+ N   GG R+AT+L
Sbjct: 104 DPIVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDFFADKVNIARGGHRVATVL 163

Query: 200 MYLSDVEEGGETVFPAANA-NFTSVRW---WNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           MYL++V  GGETVFP A    F   R     ++LSEC K+G++VKP+RGDALLF+S+ P+
Sbjct: 164 MYLTNVTRGGETVFPNAEVEEFPRHRGSETIDDLSECAKKGIAVKPRRGDALLFFSLYPN 223

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           A  D  SLH GCPVI G KWSATKW+HV  +
Sbjct: 224 AVPDTMSLHAGCPVIEGEKWSATKWIHVDSF 254


>gi|110738390|dbj|BAF01121.1| hypothetical protein [Arabidopsis thaliana]
          Length = 299

 Score =  244 bits (624), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 114/209 (54%), Positives = 158/209 (75%), Gaps = 2/209 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+ +PRAFVY  FL+  EC++LI LAK  + +S V D+  G+S+ S VRTSSGTF+ +G+
Sbjct: 41  VSSKPRAFVYGGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFISKGK 100

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +++ +TF+P E+GE +QVL YE GQKYDAH+DYF D+ N   GG R+AT+L
Sbjct: 101 DPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRIATVL 160

Query: 200 MYLSDVEEGGETVFPAAN--ANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           +YLS+V +GGETVFP A   +  +     ++LS+C K+G++VKPK+G+ALLF++++ DA 
Sbjct: 161 LYLSNVTKGGETVFPDAQEFSRRSLSENKDDLSDCAKKGIAVKPKKGNALLFFNLQQDAI 220

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            DP SLHGGCPVI G KWSATKW+HV  +
Sbjct: 221 PDPFSLHGGCPVIEGEKWSATKWIHVDSF 249


>gi|356555585|ref|XP_003546111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
           [Glycine max]
          Length = 301

 Score =  244 bits (624), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 114/209 (54%), Positives = 153/209 (73%), Gaps = 2/209 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+W+PRAFVY  FL++ EC++LI +AK  + +S V D+ +G+SK S VRTSSG F+ + +
Sbjct: 43  VSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFIPKNK 102

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ G+E +I+ +T +P E+GE IQVL YE GQKYD HYDYF D+ N   GG R+AT+L
Sbjct: 103 DPIVAGVEDKISSWTLLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGHRVATVL 162

Query: 200 MYLSDVEEGGETVFPAA--NANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           MYL+DV +GGETVFP A  +          +LSEC ++G++VKP+RGDALLF+S+ P+A 
Sbjct: 163 MYLTDVTKGGETVFPNAEESPRHRGSETKEDLSECAQKGIAVKPRRGDALLFFSLYPNAI 222

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            D  SLH GCPVI G KWSATKW+HV  +
Sbjct: 223 PDTMSLHAGCPVIEGEKWSATKWIHVDSF 251


>gi|255641919|gb|ACU21228.1| unknown [Glycine max]
          Length = 301

 Score =  244 bits (624), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 115/209 (55%), Positives = 153/209 (73%), Gaps = 2/209 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+W+PRAFVY  FL++ EC++LI +AK  + +S V D+ +G+SK S VRTSSG F+ + +
Sbjct: 43  VSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSEVRTSSGMFIPKNK 102

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +I+ +TF+P E+GE IQVL YE GQKYD HYDYF D+ N   GG R+AT+L
Sbjct: 103 DLIVAGIEDKISSWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIARGGHRVATVL 162

Query: 200 MYLSDVEEGGETVFPAANAN--FTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           MYL+DV +GGETVFP A  +           LSEC ++G++VKP+RGDALLF+S+ P+A 
Sbjct: 163 MYLTDVTKGGETVFPDAEESPRHKGSETNENLSECAQKGIAVKPRRGDALLFFSLYPNAI 222

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            D  SLH GCPVI G KWSAT+W+HV  +
Sbjct: 223 PDTLSLHAGCPVIEGEKWSATEWIHVDSF 251


>gi|297818456|ref|XP_002877111.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297322949|gb|EFH53370.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 316

 Score =  244 bits (622), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 111/207 (53%), Positives = 154/207 (74%), Gaps = 1/207 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAF+Y  FLS  EC++ I LAK  + KS V D+ +G+S +S VRTSSG FL + Q
Sbjct: 59  LSWTPRAFLYKGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLSKRQ 118

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+  +E ++A +TFIP E+GE +Q+LHYE GQKY+ H+DYF D+ N + GG R+AT+L
Sbjct: 119 DDIVANVEAKLAAWTFIPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIATVL 178

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLS+VE+GGETVFP      T ++  +  +EC KQG +VKP++GDALLF+++ P+AT D
Sbjct: 179 MYLSNVEKGGETVFPMWKGKTTQLK-DDSWTECAKQGYAVKPRKGDALLFFNLHPNATTD 237

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEY 286
            +SLHG CPV+ G KWSAT+W+HV  +
Sbjct: 238 SNSLHGSCPVVEGEKWSATRWIHVRSF 264


>gi|449443245|ref|XP_004139390.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 295

 Score =  244 bits (622), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 124/216 (57%), Positives = 157/216 (72%), Gaps = 11/216 (5%)

Query: 72  KGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTG-QSKDSRVRTS 130
           +G++W       PRAF+YHNFLS+ EC  LI+LAKP M +S V    T  +   S  RTS
Sbjct: 78  RGDEW-------PRAFIYHNFLSEKECSQLINLAKPRMERSLVSAQNTNWEGVVSSRRTS 130

Query: 131 SGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDY-FLDEFNTK 189
           SG FL +GQ++++R IEKRIA+FTFIP+E+GEG+ +LHYEVGQK++ H+DY   D F+ K
Sbjct: 131 SGRFLAKGQNQLVRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSFSFK 190

Query: 190 NGGQRMATLLMYLSDVEEGGETVFPAANANFTSV-RWWNELSECGK-QGLSVKPKRGDAL 247
           + GQR ATL+MYLS V+EGG TVFP A    +S  RWW +L E GK  GLSVKPK GDAL
Sbjct: 191 SLGQRNATLVMYLSGVKEGGATVFPEAKKCASSARRWWKKLPEYGKDNGLSVKPKMGDAL 250

Query: 248 LFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHV 283
           LFWS++PD TLDP+SLH   PV++G+KW   K MHV
Sbjct: 251 LFWSVKPDGTLDPTSLHASSPVVKGDKWVGVKLMHV 286



 Score =  107 bits (267), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 48/72 (66%), Positives = 57/72 (79%)

Query: 204 DVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSL 263
           ++EEGGETVFPAAN   +SV WW +L   GK GLS+KPK GDAL FWSM+PD TLD +SL
Sbjct: 11  NIEEGGETVFPAANQCVSSVPWWKKLPTHGKDGLSIKPKMGDALFFWSMKPDGTLDYTSL 70

Query: 264 HGGCPVIRGNKW 275
           HG  PVIRG++W
Sbjct: 71  HGSYPVIRGDEW 82


>gi|224141325|ref|XP_002324024.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
           trichocarpa]
 gi|222867026|gb|EEF04157.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Populus
           trichocarpa]
          Length = 308

 Score =  243 bits (620), Expect = 7e-62,   Method: Compositional matrix adjust.
 Identities = 112/208 (53%), Positives = 154/208 (74%), Gaps = 1/208 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAF+Y  FLS  EC++L++LA+  + KS V D+++G+S +S VRTSSG F+ + Q
Sbjct: 49  LSWNPRAFLYKGFLSDEECDHLMNLARDKLEKSMVADNESGKSIESEVRTSSGMFIGKSQ 108

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+  IE RIA +TF+P E+GE IQ+LHYE GQKY+ H+DYF D+ N + GG R+ T+L
Sbjct: 109 DEIVDDIEARIAAWTFLPQENGESIQILHYEHGQKYEPHFDYFHDKANQELGGHRVVTVL 168

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLS+V +GGETVFP +       +  +  S+C K G +VKP++GDALLF+S+ PDAT D
Sbjct: 169 MYLSNVGKGGETVFPNSEGKTIQPK-DDSWSDCAKNGYAVKPQKGDALLFFSLHPDATTD 227

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEYK 287
            +SLHG CPVI G KWSATKW+HV  ++
Sbjct: 228 TNSLHGSCPVIEGEKWSATKWIHVRSFE 255


>gi|218184507|gb|EEC66934.1| hypothetical protein OsI_33548 [Oryza sativa Indica Group]
          Length = 308

 Score =  243 bits (619), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 114/207 (55%), Positives = 152/207 (73%), Gaps = 1/207 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAF++  FL+ AECE+LI LAK  + KS V D+++G+S  S VRTSSG FL++ Q
Sbjct: 48  LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSEVRTSSGMFLEKKQ 107

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++  IE+RIA +TF+P ++GE IQ+LHY+ G+KY+ HYDYF D+ N   GG R+AT+L
Sbjct: 108 DEVVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVL 167

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLSDV +GGET+FP A       +  +  S+C K G +VKP +GDALLF+S+ PDAT D
Sbjct: 168 MYLSDVGKGGETIFPEAEGKLLQPK-DDTWSDCAKNGYAVKPVKGDALLFFSLHPDATTD 226

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEY 286
             SLHG CPVI G KWSATKW+HV  +
Sbjct: 227 SDSLHGSCPVIEGQKWSATKWIHVRSF 253


>gi|449520144|ref|XP_004167094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 323

 Score =  243 bits (619), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 122/204 (59%), Positives = 152/204 (74%), Gaps = 4/204 (1%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTG-QSKDSRVRTSSGTFLKRGQDRI 142
           PRAF+YHNFLS+ EC  LI+LAKP M +S V    T  +   S  RTSSG FL +GQ+++
Sbjct: 74  PRAFIYHNFLSEKECSQLINLAKPRMERSLVSAQNTNWEGVVSSRRTSSGRFLAKGQNQL 133

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDY-FLDEFNTKNGGQRMATLLMY 201
           +R IEKRIA+FTFIP+E+GEG+ +LHYEVGQK++ H+DY   D F+ K+ GQR ATL+MY
Sbjct: 134 VRRIEKRIAEFTFIPVENGEGLSILHYEVGQKFEPHHDYTHPDSFSFKSLGQRNATLVMY 193

Query: 202 LSDVEEGGETVFPAANANFTSV-RWWNELSECGK-QGLSVKPKRGDALLFWSMRPDATLD 259
           LS V+EGG TVFP A    +S  RWW +L E GK  GLSVKPK GDALLFWS++PD TLD
Sbjct: 194 LSGVKEGGATVFPEAKKCASSARRWWKKLPEYGKDNGLSVKPKMGDALLFWSVKPDGTLD 253

Query: 260 PSSLHGGCPVIRGNKWSATKWMHV 283
           P+SLH   PV++G+KW   K MHV
Sbjct: 254 PTSLHASSPVVKGDKWVGVKLMHV 277



 Score = 91.7 bits (226), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 41/62 (66%), Positives = 48/62 (77%)

Query: 204 DVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSL 263
           ++EEGGETVFPAAN   +SV WW +L   GK GLS+KPK GDAL FWSM+PD TLD +SL
Sbjct: 11  NIEEGGETVFPAANKCVSSVPWWKKLPTHGKDGLSIKPKMGDALFFWSMKPDGTLDYTSL 70

Query: 264 HG 265
           H 
Sbjct: 71  HA 72


>gi|115481998|ref|NP_001064592.1| Os10g0413500 [Oryza sativa Japonica Group]
 gi|110289075|gb|ABG66075.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|113639201|dbj|BAF26506.1| Os10g0413500 [Oryza sativa Japonica Group]
 gi|215692577|dbj|BAG87997.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222612821|gb|EEE50953.1| hypothetical protein OsJ_31503 [Oryza sativa Japonica Group]
          Length = 308

 Score =  243 bits (619), Expect = 8e-62,   Method: Compositional matrix adjust.
 Identities = 114/207 (55%), Positives = 152/207 (73%), Gaps = 1/207 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAF++  FL+ AECE+LI LAK  + KS V D+++G+S  S VRTSSG FL++ Q
Sbjct: 48  LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSEVRTSSGMFLEKKQ 107

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++  IE+RIA +TF+P ++GE IQ+LHY+ G+KY+ HYDYF D+ N   GG R+AT+L
Sbjct: 108 DEVVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVL 167

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLSDV +GGET+FP A       +  +  S+C K G +VKP +GDALLF+S+ PDAT D
Sbjct: 168 MYLSDVGKGGETIFPEAEGKLLQPK-DDTWSDCAKNGYAVKPVKGDALLFFSLHPDATTD 226

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEY 286
             SLHG CPVI G KWSATKW+HV  +
Sbjct: 227 SDSLHGSCPVIEGQKWSATKWIHVRSF 253


>gi|242088305|ref|XP_002439985.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
 gi|241945270|gb|EES18415.1| hypothetical protein SORBIDRAFT_09g023860 [Sorghum bicolor]
          Length = 308

 Score =  243 bits (619), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 116/214 (54%), Positives = 154/214 (71%), Gaps = 16/214 (7%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W+PR F+Y +FLS  E  +LI LA+  + +S V D+ +G+S  S VRTSSGTFL++GQ
Sbjct: 54  ISWKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSDVRTSSGTFLRKGQ 113

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +IA +TF+P E+GE IQVL Y+ G+KY+ HYDYF D  NT  GG R AT+L
Sbjct: 114 DPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTIRGGHRYATVL 173

Query: 200 MYLSDVEEGGETVFP-------AANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
           +YL+DV EGGETVFP       A +A F         SEC ++G++VKP++GDALLF+++
Sbjct: 174 LYLTDVAEGGETVFPLAEEVDDAKDATF---------SECAQKGIAVKPRKGDALLFFNL 224

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           +PD T DP SLHGGC VIRG KWSATKW+ V  +
Sbjct: 225 KPDGTTDPVSLHGGCAVIRGEKWSATKWIRVASF 258


>gi|413923982|gb|AFW63914.1| hypothetical protein ZEAMMB73_179176 [Zea mays]
          Length = 222

 Score =  242 bits (618), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 109/132 (82%), Positives = 125/132 (94%)

Query: 71  EKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTS 130
           E+G QWTE+++WEPRAFVYHNFLSK ECEYLI LAKP+MVKSTVVDS TG+SKDSRVRTS
Sbjct: 91  ERGAQWTEVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTS 150

Query: 131 SGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKN 190
           SG FL+RG+D++IR IEKRIAD+TFIP++HGEG+QVLHYEVGQKY+ H+DYFLDEFNTKN
Sbjct: 151 SGMFLQRGRDKVIRVIEKRIADYTFIPVDHGEGLQVLHYEVGQKYEPHFDYFLDEFNTKN 210

Query: 191 GGQRMATLLMYL 202
           GGQRMATLLMYL
Sbjct: 211 GGQRMATLLMYL 222


>gi|168046048|ref|XP_001775487.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673157|gb|EDQ59684.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 263

 Score =  242 bits (618), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 117/208 (56%), Positives = 154/208 (74%), Gaps = 2/208 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W+PRAF+Y NFLS AEC+++I LAK  + KS V D+++G+S  S +RTSSG FL +GQ
Sbjct: 9   LSWKPRAFLYSNFLSDAECDHMISLAKDKLEKSMVADNESGKSVKSEIRTSSGMFLMKGQ 68

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D II  IE RIA +TF+P E+GE IQVL Y+ G+KY+ H+DYF D+ N   GG R+AT+L
Sbjct: 69  DDIISRIEDRIAAWTFLPKENGEAIQVLRYQDGEKYEPHFDYFHDKNNQALGGHRIATVL 128

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLSDV +GGETVFP++          +  S CGK G++VKP++GDALLF+S+ P A  D
Sbjct: 129 MYLSDVVKGGETVFPSSEDRGGPKD--DSWSACGKTGVAVKPRKGDALLFFSLHPSAVPD 186

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEYK 287
            SSLH GCPVI G KWSATKW+HV  ++
Sbjct: 187 ESSLHTGCPVIEGEKWSATKWIHVAAFE 214


>gi|18086437|gb|AAL57673.1| AT3g28480/MFJ20_16 [Arabidopsis thaliana]
 gi|24796986|gb|AAN64505.1| At3g28480/MFJ20_16 [Arabidopsis thaliana]
          Length = 316

 Score =  242 bits (618), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 123/279 (44%), Positives = 178/279 (63%), Gaps = 22/279 (7%)

Query: 15  STLTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSPPNDL----TSFRRRAFEKRSSIAE 70
           S + L  S+ F+ T+ L+                S PN      ++ R  +  K  + A 
Sbjct: 3   SRIFLAFSLCFLFTLPLI---------------SSAPNRFLTRSSNTRDGSVIKMKTSAS 47

Query: 71  EKGEQWTEI--VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVR 128
             G   T +  ++W PR F+Y  FLS  EC++ I LAK  + KS V D+ +G+S +S VR
Sbjct: 48  SFGFDPTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVR 107

Query: 129 TSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT 188
           TSSG FL + QD I+  +E ++A +TF+P E+GE +Q+LHYE GQKY+ H+DYF D+ N 
Sbjct: 108 TSSGMFLSKRQDDIVNNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANL 167

Query: 189 KNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALL 248
           + GG R+AT+LMYLS+VE+GGETVFP      T ++  +  +EC KQG +VKP++GDALL
Sbjct: 168 ELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLK-DDSWTECAKQGYAVKPRKGDALL 226

Query: 249 FWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           F+++ P+AT D +SLHG CPV+ G KWSAT+W+HV  ++
Sbjct: 227 FFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKSFE 265


>gi|357478545|ref|XP_003609558.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355510613|gb|AES91755.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
          Length = 299

 Score =  242 bits (617), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 114/209 (54%), Positives = 152/209 (72%), Gaps = 2/209 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAFVY  FL+  EC++LI LAK  + +S V D+ +G S+ S VRTSSG F+ + +
Sbjct: 40  ISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSDVRTSSGMFISKNK 99

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE RI+ +TF+P E+GE IQVL YE GQKYD HYDYF D+ N   GG R+AT+L
Sbjct: 100 DPIVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIVQGGHRLATVL 159

Query: 200 MYLSDVEEGGETVFPAAN--ANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           MYL++V +GGETVFP A         +  ++LSEC K+G++VKP+RGDALLF+S+  +A 
Sbjct: 160 MYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIAVKPRRGDALLFFSLDTNAI 219

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            D +SLH GCPV+ G KWSATKW+HV  +
Sbjct: 220 PDTNSLHAGCPVLEGEKWSATKWIHVDSF 248


>gi|110289076|gb|ABB47602.2| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
          Length = 309

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 114/207 (55%), Positives = 151/207 (72%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAF++  FL+ AECE+LI LAK  + KS V D+++G+S  S VRTSSG FL++ Q
Sbjct: 48  LSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSEVRTSSGMFLEKKQ 107

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++  IE+RIA +TF+P ++GE IQ+LHY+ G+KY+ HYDYF D+ N   GG R+AT+L
Sbjct: 108 DEVVARIEERIAAWTFLPPDNGESIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVL 167

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLSDV +GGET+FP A          +  S+C K G +VKP +GDALLF+S+ PDAT D
Sbjct: 168 MYLSDVGKGGETIFPEAEVGKLLQPKDDTWSDCAKNGYAVKPVKGDALLFFSLHPDATTD 227

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEY 286
             SLHG CPVI G KWSATKW+HV  +
Sbjct: 228 SDSLHGSCPVIEGQKWSATKWIHVRSF 254


>gi|125552794|gb|EAY98503.1| hypothetical protein OsI_20415 [Oryza sativa Indica Group]
          Length = 319

 Score =  241 bits (616), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 115/210 (54%), Positives = 156/210 (74%), Gaps = 4/210 (1%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W+PR F+Y +FLS  E  +L+ LA+  + +S V D+ +G+S+ S  RTSSGTF+++ Q
Sbjct: 61  ISWKPRVFLYQHFLSDDEANHLVSLARAELKRSAVADNLSGKSELSDARTSSGTFIRKSQ 120

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE++IA +TF+P E+GE IQVL Y+ G+KY+ HYDYF D  NT  GG R+AT+L
Sbjct: 121 DPIVAGIEEKIAAWTFLPKENGEDIQVLRYKHGEKYERHYDYFSDNVNTLRGGHRIATVL 180

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNE---LSECGKQGLSVKPKRGDALLFWSMRPDA 256
           MYL+DV EGGETVFP A   FT     NE   LSEC K+G++VKP++GDALLF+++ PDA
Sbjct: 181 MYLTDVAEGGETVFPLAE-EFTESGTNNEDSTLSECAKKGVAVKPRKGDALLFFNLSPDA 239

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           + D  SLH GCPVI+G KWSATKW+ V  +
Sbjct: 240 SKDSLSLHAGCPVIKGEKWSATKWIRVASF 269


>gi|18405808|ref|NP_566838.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
 gi|21617881|gb|AAM66931.1| prolyl 4-hydroxylase, putative [Arabidopsis thaliana]
 gi|332643929|gb|AEE77450.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
          Length = 316

 Score =  241 bits (616), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 123/279 (44%), Positives = 178/279 (63%), Gaps = 22/279 (7%)

Query: 15  STLTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSPPNDL----TSFRRRAFEKRSSIAE 70
           S + L  S+ F+ T+ L+                S PN      ++ R  +  K  + A 
Sbjct: 3   SRIFLAFSLCFLFTLPLI---------------SSAPNRFLTRSSNTRDGSVIKMKTSAS 47

Query: 71  EKGEQWTEI--VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVR 128
             G   T +  ++W PR F+Y  FLS  EC++ I LAK  + KS V D+ +G+S +S VR
Sbjct: 48  SFGFDPTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVR 107

Query: 129 TSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT 188
           TSSG FL + QD I+  +E ++A +TF+P E+GE +Q+LHYE GQKY+ H+DYF D+ N 
Sbjct: 108 TSSGMFLSKRQDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANL 167

Query: 189 KNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALL 248
           + GG R+AT+LMYLS+VE+GGETVFP      T ++  +  +EC KQG +VKP++GDALL
Sbjct: 168 ELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLK-DDSWTECAKQGYAVKPRKGDALL 226

Query: 249 FWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           F+++ P+AT D +SLHG CPV+ G KWSAT+W+HV  ++
Sbjct: 227 FFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKSFE 265


>gi|326526235|dbj|BAJ97134.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 308

 Score =  241 bits (616), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 112/207 (54%), Positives = 154/207 (74%), Gaps = 2/207 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAF+Y +FLS  E  +L+ LA+  + +S V D  +G+S+ S VRTSSGTF+ +G+
Sbjct: 54  ISWHPRAFLYPHFLSDDEANHLVSLARAELKRSAVADETSGKSQLSEVRTSSGTFISKGK 113

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +IA +TF+P E+GE +QVL Y+ G+KY+ HYD+F D  NT  GG R+AT+L
Sbjct: 114 DPIVAGIEDKIAAWTFLPKENGEDMQVLRYKRGEKYEPHYDFFTDSVNTILGGHRVATVL 173

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           +YL+DV EGGETVFP A     S      LSEC ++G++VKP++GDALLF+++RPDA  D
Sbjct: 174 LYLTDVAEGGETVFPLAKGRKGSHH--KGLSECAQKGIAVKPRKGDALLFFNLRPDAATD 231

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEY 286
           P+SLHGGC VI+G KWSATKW+ V  +
Sbjct: 232 PTSLHGGCEVIKGEKWSATKWIRVASF 258


>gi|115464581|ref|NP_001055890.1| Os05g0489100 [Oryza sativa Japonica Group]
 gi|50511363|gb|AAT77286.1| putative prolyl 4-hydroxylase alpha subunit [Oryza sativa Japonica
           Group]
 gi|113579441|dbj|BAF17804.1| Os05g0489100 [Oryza sativa Japonica Group]
 gi|125587281|gb|EAZ27945.1| hypothetical protein OsJ_11906 [Oryza sativa Japonica Group]
 gi|215737307|dbj|BAG96236.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 319

 Score =  241 bits (616), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 115/210 (54%), Positives = 156/210 (74%), Gaps = 4/210 (1%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W+PR F+Y +FLS  E  +L+ LA+  + +S V D+ +G+S+ S  RTSSGTF+++ Q
Sbjct: 61  ISWKPRVFLYQHFLSDDEANHLVSLARTELKRSAVADNLSGKSELSDARTSSGTFIRKSQ 120

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE++IA +TF+P E+GE IQVL Y+ G+KY+ HYDYF D  NT  GG R+AT+L
Sbjct: 121 DPIVAGIEEKIAAWTFLPKENGEDIQVLRYKHGEKYERHYDYFSDNVNTLRGGHRIATVL 180

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNE---LSECGKQGLSVKPKRGDALLFWSMRPDA 256
           MYL+DV EGGETVFP A   FT     NE   LSEC K+G++VKP++GDALLF+++ PDA
Sbjct: 181 MYLTDVAEGGETVFPLAE-EFTESGTNNEDSTLSECAKKGVAVKPRKGDALLFFNLSPDA 239

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           + D  SLH GCPVI+G KWSATKW+ V  +
Sbjct: 240 SKDSLSLHAGCPVIKGEKWSATKWIRVASF 269


>gi|388492638|gb|AFK34385.1| unknown [Medicago truncatula]
          Length = 299

 Score =  241 bits (615), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 114/206 (55%), Positives = 151/206 (73%), Gaps = 2/206 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAFVY  FL+  EC++LI LAK  + +S V D+ +G S+ S VRTSSG F+ + +
Sbjct: 40  ISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSDVRTSSGMFISKNK 99

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE RI+ +TF+P E+GE IQVL YE GQKYD HYDYF D+ N   GG R+AT+L
Sbjct: 100 DPIVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIVQGGHRLATVL 159

Query: 200 MYLSDVEEGGETVFPAAN--ANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           MYL++V +GGETVFP A         +  ++LSEC K+G++VKP+RGDALLF+S+  +A 
Sbjct: 160 MYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIAVKPRRGDALLFFSLDTNAI 219

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHV 283
            D +SLH GCPV+ G KWSATKW+HV
Sbjct: 220 PDTNSLHAGCPVLEGEKWSATKWIHV 245


>gi|9294583|dbj|BAB02864.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
           thaliana]
          Length = 332

 Score =  241 bits (615), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 109/208 (52%), Positives = 154/208 (74%), Gaps = 1/208 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PR F+Y  FLS  EC++ I LAK  + KS V D+ +G+S +S VRTSSG FL + Q
Sbjct: 75  LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLSKRQ 134

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+  +E ++A +TF+P E+GE +Q+LHYE GQKY+ H+DYF D+ N + GG R+AT+L
Sbjct: 135 DDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIATVL 194

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLS+VE+GGETVFP      T ++  +  +EC KQG +VKP++GDALLF+++ P+AT D
Sbjct: 195 MYLSNVEKGGETVFPMWKGKATQLK-DDSWTECAKQGYAVKPRKGDALLFFNLHPNATTD 253

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEYK 287
            +SLHG CPV+ G KWSAT+W+HV  ++
Sbjct: 254 SNSLHGSCPVVEGEKWSATRWIHVKSFE 281


>gi|297818458|ref|XP_002877112.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322950|gb|EFH53371.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 289

 Score =  241 bits (615), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 113/208 (54%), Positives = 154/208 (74%), Gaps = 2/208 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVV-DSKTGQSKDSRVRTSSGTFLKRG 138
           ++W PRAF+Y+ FLS  EC++LI+LAK  + KS VV D  +G+S DS  RTSSG FL + 
Sbjct: 35  LSWTPRAFLYNGFLSDEECDHLINLAKGKLEKSMVVADDNSGESIDSEERTSSGVFLTKR 94

Query: 139 QDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATL 198
           QD I+  +E ++A +TF+P E+GE +Q+LHYE GQKYD H+DY+ D+   K GG R+AT+
Sbjct: 95  QDDIVANVEAKLATWTFLPEENGEALQILHYENGQKYDPHFDYYYDKETLKLGGHRIATV 154

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           LMYLS+V +GGETVFP        ++  +  SEC KQG +VKP++GDALLF+++ P+AT 
Sbjct: 155 LMYLSNVTKGGETVFPMWKGKTPQLK-DDTWSECAKQGYAVKPRKGDALLFFNLHPNATT 213

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           DP+SLHG CPVI G KWSAT+W+HV  +
Sbjct: 214 DPTSLHGSCPVIEGEKWSATRWIHVRSF 241


>gi|195627276|gb|ACG35468.1| prolyl 4-hydroxylase [Zea mays]
          Length = 298

 Score =  241 bits (614), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 112/207 (54%), Positives = 152/207 (73%), Gaps = 1/207 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAF++  FL  AEC++LI LAK  + KS V D+K+G+S  S VRTSSG FL++ Q
Sbjct: 38  LSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSEVRTSSGMFLEKKQ 97

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++  IE+RI+ +TF+P E+GE IQ+LHY+ G+KY+ HYDYF D+ N   GG R+AT+L
Sbjct: 98  DEVVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVL 157

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLS+VE+GGET+FP A       +  +  S+C + G +VKP +GDALLF+S+ PD+T D
Sbjct: 158 MYLSNVEKGGETIFPNAEGKLLQPK-DDTWSDCARNGYAVKPVKGDALLFFSLHPDSTTD 216

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEY 286
             SLHG CPVI G KWSATKW+HV  +
Sbjct: 217 SDSLHGSCPVIEGQKWSATKWIHVRSF 243


>gi|388500582|gb|AFK38357.1| unknown [Medicago truncatula]
          Length = 299

 Score =  239 bits (611), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 113/209 (54%), Positives = 151/209 (72%), Gaps = 2/209 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAFVY  FL+  EC++LI LAK  + +S V D+ +G S+ S VRTSSG  + + +
Sbjct: 40  ISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSDVRTSSGMLISKNK 99

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE RI+ +TF+P E+GE IQVL YE GQKYD HYDYF D+ N   GG R+AT+L
Sbjct: 100 DPIVSGIEDRISAWTFLPKENGEDIQVLRYEHGQKYDPHYDYFADKVNIVQGGHRLATVL 159

Query: 200 MYLSDVEEGGETVFPAAN--ANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           MYL++V +GGETVFP A         +  ++LSEC K+G++VKP+RGDALLF+S+  +A 
Sbjct: 160 MYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIAVKPRRGDALLFFSLDTNAI 219

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            D +SLH GCPV+ G KWSATKW+HV  +
Sbjct: 220 PDTNSLHAGCPVLEGEKWSATKWIHVDSF 248


>gi|302834449|ref|XP_002948787.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
           nagariensis]
 gi|300265978|gb|EFJ50167.1| hypothetical protein VOLCADRAFT_80309 [Volvox carteri f.
           nagariensis]
          Length = 329

 Score =  239 bits (611), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 112/208 (53%), Positives = 147/208 (70%), Gaps = 2/208 (0%)

Query: 79  IVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRG 138
           +++W+PR F+Y   L++ EC+YLI +A+  + +S V D+ TG+   S +RTSSG F  RG
Sbjct: 52  VLSWQPRVFLYKGILTQEECDYLIKIAQGRLERSGVSDATTGEGGVSDIRTSSGMFYTRG 111

Query: 139 QDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATL 198
           ++ +++ IE R+A +T +P+E+GEGIQVL YE  QKYD H+DYF  E    NGG RMAT+
Sbjct: 112 ENDVVKRIETRLAMWTMLPVENGEGIQVLRYEKTQKYDPHHDYFSFEGRDANGGNRMATV 171

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           LMYL+  EEGGETVFP         R     SECG +GL+VKP +GDA+LFWS+RPD   
Sbjct: 172 LMYLATPEEGGETVFPKIPVPAGQTR--ANFSECGMKGLAVKPVKGDAVLFWSIRPDGRF 229

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           +P SLHG CPVIRG KWSATKW+HV  Y
Sbjct: 230 EPGSLHGSCPVIRGVKWSATKWIHVGPY 257


>gi|212720650|ref|NP_001132477.1| uncharacterized protein LOC100193935 precursor [Zea mays]
 gi|194694488|gb|ACF81328.1| unknown [Zea mays]
 gi|347978828|gb|AEP37756.1| prolyl 4-hydroxylase 7 [Zea mays]
 gi|413934218|gb|AFW68769.1| prolyl 4-hydroxylase [Zea mays]
          Length = 298

 Score =  239 bits (610), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 111/207 (53%), Positives = 151/207 (72%), Gaps = 1/207 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAF++  FL  AEC++LI LAK  + KS V D+K+G+S  S VRTSSG FL++ Q
Sbjct: 38  LSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSEVRTSSGMFLEKKQ 97

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++  IE+RI+ +TF+P E+GE IQ+LHY+ G+KY+ HYDYF D+ N   GG R+AT+L
Sbjct: 98  DEVVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATVL 157

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLS+VE+GGET+FP A       +  +  S+C + G +VKP +GDALLF+S+ PD+T D
Sbjct: 158 MYLSNVEKGGETIFPNAEGKLLQPK-DDTWSDCARNGYAVKPVKGDALLFFSLHPDSTTD 216

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEY 286
             SLHG CP I G KWSATKW+HV  +
Sbjct: 217 SDSLHGSCPAIEGQKWSATKWIHVRSF 243


>gi|363543295|ref|NP_001241863.1| prolyl 4-hydroxylase 4 precursor [Zea mays]
 gi|347978806|gb|AEP37745.1| prolyl 4-hydroxylase 4 [Zea mays]
 gi|414591890|tpg|DAA42461.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
          Length = 274

 Score =  239 bits (609), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 116/208 (55%), Positives = 146/208 (70%), Gaps = 1/208 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+W PR FVY  FLS AEC++L+ LAK  + +S V D+++G+S  S VRTSSG FL + Q
Sbjct: 48  VSWHPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNESGKSVKSEVRTSSGMFLDKRQ 107

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++  IE+RIA +TF+P E+ E +QVL YE GQKY+ H+DYF D  N   GG R AT+L
Sbjct: 108 DPVVSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFHDRVNQARGGHRYATVL 167

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLS V EGGETVFP A   + S       SEC  +GL+VKP +GDA+LF+S+  D T D
Sbjct: 168 MYLSTVREGGETVFPNAKG-WESQPKDATFSECAHKGLAVKPVKGDAVLFFSLHADGTPD 226

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           P SLHG CPVIRG KWSA KW+HV  Y+
Sbjct: 227 PLSLHGSCPVIRGEKWSAPKWIHVRSYE 254


>gi|388495016|gb|AFK35574.1| unknown [Lotus japonicus]
          Length = 297

 Score =  239 bits (609), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 115/209 (55%), Positives = 149/209 (71%), Gaps = 2/209 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+W+PRAFVY  FL+  EC++LI LAK  + +S V D+  G SK S VRTSSG F+ + +
Sbjct: 39  VSWKPRAFVYEGFLTGLECDHLISLAKSELKRSAVADNLPGDSKLSEVRTSSGMFISKKK 98

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +I+ +TF+P E+GE +QVL YE GQKYD HYDYF D+ N   GG RMAT+L
Sbjct: 99  DPIVAGIEDKISAWTFLPKENGEDMQVLRYEHGQKYDPHYDYFTDKVNIVRGGHRMATVL 158

Query: 200 MYLSDVEEGGETVFPAAN--ANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           +YL++V  GGETVFP A        +   ++LSEC K+G++VKP+RGDALLF+S+   A 
Sbjct: 159 LYLTNVTRGGETVFPVAEEPPRRRGLETNSDLSECAKKGIAVKPRRGDALLFFSLHTTAI 218

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            D  SLH GCPVI G KWSATKW+HV  +
Sbjct: 219 PDTDSLHAGCPVIEGEKWSATKWIHVDSF 247


>gi|148537204|dbj|BAF63493.1| prolyl 4-hydroxylase [Potamogeton distinctus]
          Length = 246

 Score =  238 bits (606), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 114/195 (58%), Positives = 145/195 (74%), Gaps = 1/195 (0%)

Query: 92  FLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIA 151
           FLS  EC++LI L K  + KS V D+++G+S  S +RTSSG FL+R QD  I  IEKRIA
Sbjct: 3   FLSHEECDHLIALGKDKLEKSMVADNESGKSVMSEIRTSSGMFLERRQDETITRIEKRIA 62

Query: 152 DFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGET 211
            +TF+P E+GE IQ+LHYE GQKYDAHYDYF D+ N + GG RMAT+LMYLSDV++GGET
Sbjct: 63  AWTFLPEENGEPIQILHYEKGQKYDAHYDYFHDKNNQRVGGHRMATVLMYLSDVKKGGET 122

Query: 212 VFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIR 271
           VFP A      V+  +  S+C + G +VKP++GDALLF+S  P+AT DP+SLH  CPVI 
Sbjct: 123 VFPDAEGKLLQVK-DDTWSDCARSGYAVKPRKGDALLFFSCHPNATTDPNSLHASCPVIE 181

Query: 272 GNKWSATKWMHVHEY 286
           G KWSAT+W+HV  +
Sbjct: 182 GEKWSATRWIHVRSF 196


>gi|168001068|ref|XP_001753237.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162695523|gb|EDQ81866.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 284

 Score =  238 bits (606), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 115/210 (54%), Positives = 152/210 (72%), Gaps = 16/210 (7%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+++W+PR  + HNFLS  EC++LI+LA+P +VKSTVVD+ TG+  +S+VRTS+G FL  
Sbjct: 79  EVISWQPRIILLHNFLSADECDHLINLARPRLVKSTVVDATTGKGIESKVRTSTGMFLN- 137

Query: 138 GQDR---IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
           G DR    I+ IE RIA ++ +P+++GE +QVL YE  Q Y AH+DYF DEFN K GGQR
Sbjct: 138 GNDRRHHTIQAIETRIAAYSMVPVQNGELLQVLRYESDQYYKAHHDYFSDEFNLKRGGQR 197

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECG---KQGLSVKPKRGDALLFWS 251
           +AT+LMYL++  EGGET+FP A     S         CG   K G+ VKPKRGDA+LFWS
Sbjct: 198 VATMLMYLTEGVEGGETIFPQAGDKECS---------CGGEMKIGVCVKPKRGDAVLFWS 248

Query: 252 MRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           ++ D  +DP+SLHGGC V+ G KWS+TKWM
Sbjct: 249 IKLDGQVDPTSLHGGCKVLSGEKWSSTKWM 278


>gi|226495689|ref|NP_001149322.1| LOC100282945 precursor [Zea mays]
 gi|194697650|gb|ACF82909.1| unknown [Zea mays]
 gi|194708468|gb|ACF88318.1| unknown [Zea mays]
 gi|195626376|gb|ACG35018.1| oxidoreductase [Zea mays]
 gi|347978842|gb|AEP37763.1| prolyl 4-hydroxylase 9 [Zea mays]
 gi|413945802|gb|AFW78451.1| oxidoreductase [Zea mays]
          Length = 308

 Score =  237 bits (605), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 112/207 (54%), Positives = 151/207 (72%), Gaps = 2/207 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++ +PR F+Y +FLS  E  +LI LA+  + +S V D+ +G+S  S VRTSSGTFL++GQ
Sbjct: 54  ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSEVRTSSGTFLRKGQ 113

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +IA +TF+P E+GE IQVL Y+ G+KY+ HYDYF D  NT  GG R AT+L
Sbjct: 114 DPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVRGGHRYATVL 173

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           +YL+DV EGGETVFP A     +      LSEC ++G++V+P++GDALLF+++ PD T D
Sbjct: 174 LYLTDVPEGGETVFPLAEEPDDAKD--ATLSECAQKGIAVRPRKGDALLFFNLNPDGTTD 231

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEY 286
             SLHGGCPVI+G KWSATKW+ V  +
Sbjct: 232 SVSLHGGCPVIKGEKWSATKWIRVASF 258


>gi|242047772|ref|XP_002461632.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
 gi|241925009|gb|EER98153.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
          Length = 307

 Score =  235 bits (599), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 113/208 (54%), Positives = 149/208 (71%), Gaps = 1/208 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+W+PR FVY  FLS AEC++L+ LAK  + +S V D+++G+S  S VRTSSG FL + Q
Sbjct: 47  VSWQPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNQSGKSVMSEVRTSSGMFLNKRQ 106

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++  IE+RIA +TF+P E+ E +Q+L YE GQKY+ H+DYF D+ N   GG R AT+L
Sbjct: 107 DPVVSRIEERIAAWTFLPQENAENMQILRYEHGQKYEPHFDYFHDKINQVRGGHRYATVL 166

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLS V++GGETVFP A   + S    +  SEC  QGL+VKP +GDA+LF+S+  D   D
Sbjct: 167 MYLSTVDKGGETVFPNAKG-WESQPKDDTFSECAHQGLAVKPVKGDAVLFFSLHVDGVPD 225

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           P SLHG CPVI+G KWSA KW+HV  Y+
Sbjct: 226 PLSLHGSCPVIQGEKWSAPKWIHVRSYE 253


>gi|29150368|gb|AAO72377.1| putative oxidoreductase [Oryza sativa Japonica Group]
 gi|108711617|gb|ABF99412.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|125546090|gb|EAY92229.1| hypothetical protein OsI_13949 [Oryza sativa Indica Group]
 gi|125588294|gb|EAZ28958.1| hypothetical protein OsJ_13002 [Oryza sativa Japonica Group]
          Length = 310

 Score =  234 bits (598), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 110/208 (52%), Positives = 147/208 (70%), Gaps = 1/208 (0%)

Query: 79  IVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRG 138
           I++W+PR F Y  FLS  EC++L+ L K  + +S V D+++G+S  S VRTSSG FL + 
Sbjct: 50  IISWKPRIFFYKGFLSDDECDHLVKLGKEKLKRSMVADNESGKSVMSEVRTSSGMFLDKQ 109

Query: 139 QDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATL 198
           QD ++ GIE+RIA +T +P E+ E IQ+L YE GQKYD H+DYF D+ N   GG R AT+
Sbjct: 110 QDPVVSGIEERIAAWTLLPQENAENIQILRYENGQKYDPHFDYFQDKVNQLQGGHRYATV 169

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L YLS VE+GGETVFP A   + S    +  S+C K+GL+VK  +GD++LF++++PD T 
Sbjct: 170 LTYLSTVEKGGETVFPNAEG-WESQPKDDSFSDCAKKGLAVKAVKGDSVLFFNLQPDGTP 228

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           DP SLHG CPVI G KWSA KW+HV  Y
Sbjct: 229 DPLSLHGSCPVIEGEKWSAPKWIHVRSY 256


>gi|218199253|gb|EEC81680.1| hypothetical protein OsI_25242 [Oryza sativa Indica Group]
          Length = 487

 Score =  234 bits (596), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 114/213 (53%), Positives = 145/213 (68%), Gaps = 11/213 (5%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+W PR FVY  FLS  EC++L+ L K  M +S V D+K+G+S  S VRTSSG FL + Q
Sbjct: 60  VSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSSGMFLDKRQ 119

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++  IEKRIA +TF+P E+ E IQ+L YE GQKY+ H+DYF D+ N   GG R AT+L
Sbjct: 120 DPVVSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHDKVNQALGGHRYATVL 179

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNE-----LSECGKQGLSVKPKRGDALLFWSMRP 254
           MYLS VE+GGETVFP A        W N+      SEC ++GL+VKP +GDA+LF+S+  
Sbjct: 180 MYLSTVEKGGETVFPNAEG------WENQPKDDTFSECAQKGLAVKPVKGDAVLFFSLHI 233

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           D   DP SLHG CPVI G KWSA KW+ +  Y+
Sbjct: 234 DGVPDPLSLHGSCPVIEGEKWSAPKWIRIRSYE 266


>gi|30689216|ref|NP_189490.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
 gi|332643931|gb|AEE77452.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
          Length = 288

 Score =  234 bits (596), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 111/208 (53%), Positives = 152/208 (73%), Gaps = 2/208 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVV-DSKTGQSKDSRVRTSSGTFLKRG 138
           ++W PRAF+Y  FLS  EC++LI LAK  + KS VV D  +G+S+DS VRTSSG FL + 
Sbjct: 35  LSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSEVRTSSGMFLTKR 94

Query: 139 QDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATL 198
           QD I+  +E ++A +TF+P E+GE +Q+LHYE GQKYD H+DYF D+   + GG R+AT+
Sbjct: 95  QDDIVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHRIATV 154

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           LMYLS+V +GGETVFP        ++  +  S+C KQG +VKP++GDALLF+++  + T 
Sbjct: 155 LMYLSNVTKGGETVFPNWKGKTPQLK-DDSWSKCAKQGYAVKPRKGDALLFFNLHLNGTT 213

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           DP+SLHG CPVI G KWSAT+W+HV  +
Sbjct: 214 DPNSLHGSCPVIEGEKWSATRWIHVRSF 241


>gi|28393447|gb|AAO42145.1| putative prolyl 4-hydroxylase [Arabidopsis thaliana]
          Length = 253

 Score =  233 bits (594), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 111/207 (53%), Positives = 151/207 (72%), Gaps = 2/207 (0%)

Query: 81  AWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVV-DSKTGQSKDSRVRTSSGTFLKRGQ 139
           +W PRAF+Y  FLS  EC++LI LAK  + KS VV D  +G+S+DS VRTSSG FL + Q
Sbjct: 1   SWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSEVRTSSGMFLTKRQ 60

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+  +E ++A +TF+P E+GE +Q+LHYE GQKYD H+DYF D+   + GG R+AT+L
Sbjct: 61  DDIVANVEAKLAAWTFLPEENGEALQILHYENGQKYDPHFDYFYDKKALELGGHRIATVL 120

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLS+V +GGETVFP        ++  +  S+C KQG +VKP++GDALLF+++  + T D
Sbjct: 121 MYLSNVTKGGETVFPNWKGKTPQLK-DDSWSKCAKQGYAVKPRKGDALLFFNLHLNGTTD 179

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEY 286
           P+SLHG CPVI G KWSAT+W+HV  +
Sbjct: 180 PNSLHGSCPVIEGEKWSATRWIHVRSF 206


>gi|145343778|ref|XP_001416487.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576712|gb|ABO94780.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 255

 Score =  233 bits (593), Expect = 8e-59,   Method: Compositional matrix adjust.
 Identities = 112/201 (55%), Positives = 147/201 (73%), Gaps = 4/201 (1%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PRAFVY  FL+  EC++++ L+K ++ KS VVD+KTG S  S +RTS+GTF+ R  D  I
Sbjct: 1   PRAFVYEGFLTDEECDHILALSKGHLHKSGVVDAKTGGSTTSDIRTSTGTFISRAHDPTI 60

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLS 203
             IE+RI  ++ IP++HGE +QVL YE GQ+Y AH+DYF  +   +N   R+AT+L+YLS
Sbjct: 61  TAIEERIELWSQIPVDHGEALQVLRYENGQEYKAHFDYFFHKGGKRN--NRIATVLLYLS 118

Query: 204 DVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSL 263
           DVEEGGETVFP  N +  + R  ++ SECG  G SVK ++GDALLFWSM+P   LDP S 
Sbjct: 119 DVEEGGETVFP--NTDVPTDRDRSQYSECGNGGKSVKARKGDALLFWSMKPGGELDPGSS 176

Query: 264 HGGCPVIRGNKWSATKWMHVH 284
           H GCPVI+G KW+ATKWMHV+
Sbjct: 177 HAGCPVIKGVKWTATKWMHVN 197


>gi|222636605|gb|EEE66737.1| hypothetical protein OsJ_23428 [Oryza sativa Japonica Group]
          Length = 487

 Score =  232 bits (592), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 113/213 (53%), Positives = 144/213 (67%), Gaps = 11/213 (5%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+W PR FVY  FLS  EC++L+ L K  M +S V D+K+G+S  S VRTSSG FL + Q
Sbjct: 60  VSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSSGMFLDKRQ 119

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++  IEKRIA +TF+P E+ E IQ+L YE GQKY+ H+DYF D+ N   GG R AT+L
Sbjct: 120 DPVVSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHDKVNQALGGHRYATVL 179

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNE-----LSECGKQGLSVKPKRGDALLFWSMRP 254
           MYLS VE+GGETVFP A        W N+      SEC ++GL+VKP +GD +LF+S+  
Sbjct: 180 MYLSTVEKGGETVFPNAEG------WENQPKDDTFSECAQKGLAVKPVKGDTVLFFSLHI 233

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           D   DP SLHG CPVI G KWSA KW+ +  Y+
Sbjct: 234 DGVPDPLSLHGSCPVIEGEKWSAPKWIRIRSYE 266


>gi|414870897|tpg|DAA49454.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
          Length = 222

 Score =  231 bits (589), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 107/142 (75%), Positives = 124/142 (87%)

Query: 61  AFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTG 120
           AF+       EKGE WTE+++WEPRAFVYHNFLSK EC++LI LAKP+M KSTVVDS TG
Sbjct: 80  AFDSGLETRGEKGEPWTEVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATG 139

Query: 121 QSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYD 180
            SKDSRVRTSSG FL+RGQD+IIR IEKRIAD+TFIP+E GEG+QVLHYEVGQKY+ H+D
Sbjct: 140 GSKDSRVRTSSGMFLRRGQDKIIRTIEKRIADYTFIPVEQGEGLQVLHYEVGQKYEPHFD 199

Query: 181 YFLDEFNTKNGGQRMATLLMYL 202
           YF D++NTKNGGQR+ATLLMYL
Sbjct: 200 YFHDDYNTKNGGQRIATLLMYL 221


>gi|145345764|ref|XP_001417370.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577597|gb|ABO95663.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 328

 Score =  231 bits (589), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 113/208 (54%), Positives = 145/208 (69%), Gaps = 3/208 (1%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E V+W P A VY  FL++ EC++L  LA P + +STVVD+  G S  S +RTSSG FL R
Sbjct: 57  ERVSWRPHAEVYRGFLTREECDHLKALATPSLGRSTVVDASNGGSVPSDIRTSSGMFLLR 116

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTK--NGGQRM 195
           G+D ++  IE+RIA +T +P  HGEG QVL YE GQ+Y  H+DYF DEFN K   GGQR+
Sbjct: 117 GEDDVVASIERRIASWTHVPESHGEGFQVLRYEFGQEYRPHFDYFQDEFNQKREKGGQRV 176

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           AT+LMYL+DVEEGGET+FP A A   +    ++ S C    L+VKP++GDAL F S+  +
Sbjct: 177 ATVLMYLTDVEEGGETIFPDAEAG-ANPGGGDDASSCAAGKLAVKPRKGDALFFRSLHHN 235

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHV 283
            T D  S H GCPV++G K+SATKWMHV
Sbjct: 236 GTSDAMSSHAGCPVVKGVKFSATKWMHV 263


>gi|218192156|gb|EEC74583.1| hypothetical protein OsI_10158 [Oryza sativa Indica Group]
          Length = 299

 Score =  231 bits (588), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 113/211 (53%), Positives = 148/211 (70%), Gaps = 8/211 (3%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKP-YMVKSTVVDSKTGQSKDSRVRTSSGTFLKRG 138
           V+W PR F+Y  FLS AECE+LI LAK   M +STVV+ K+G+S  S+ RTSSG FL R 
Sbjct: 40  VSWSPRVFLYEGFLSDAECEHLIALAKQGRMERSTVVNGKSGESVMSKTRTSSGMFLIRK 99

Query: 139 QDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATL 198
           QD ++  IE+RIA +T  P E+GE +Q+L Y  G+KY+ H+DY      +  GG R+AT+
Sbjct: 100 QDEVVARIEERIAAWTMFPAENGESMQMLRYGQGEKYEPHFDYIRGRQASARGGHRIATV 159

Query: 199 LMYLSDVEEGGETVFPAANANFTSVR---WWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           LMYLS+V+ GGETVFP A A  +  +   W    S+C +QG +VKP +G A+LF+S+ P+
Sbjct: 160 LMYLSNVKMGGETVFPDAEARLSQPKDETW----SDCAEQGFAVKPTKGSAVLFFSLYPN 215

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           AT DP SLHG CPVI+G KWSATKW+HV  Y
Sbjct: 216 ATFDPGSLHGSCPVIQGEKWSATKWIHVRSY 246


>gi|115471029|ref|NP_001059113.1| Os07g0194500 [Oryza sativa Japonica Group]
 gi|113610649|dbj|BAF21027.1| Os07g0194500 [Oryza sativa Japonica Group]
 gi|215768445|dbj|BAH00674.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 319

 Score =  229 bits (585), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 113/213 (53%), Positives = 144/213 (67%), Gaps = 11/213 (5%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+W PR FVY  FLS  EC++L+ L K  M +S V D+K+G+S  S VRTSSG FL + Q
Sbjct: 60  VSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSSGMFLDKRQ 119

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++  IEKRIA +TF+P E+ E IQ+L YE GQKY+ H+DYF D+ N   GG R AT+L
Sbjct: 120 DPVVSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHDKVNQALGGHRYATVL 179

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNE-----LSECGKQGLSVKPKRGDALLFWSMRP 254
           MYLS VE+GGETVFP A        W N+      SEC ++GL+VKP +GD +LF+S+  
Sbjct: 180 MYLSTVEKGGETVFPNAEG------WENQPKDDTFSECAQKGLAVKPVKGDTVLFFSLHI 233

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           D   DP SLHG CPVI G KWSA KW+ +  Y+
Sbjct: 234 DGVPDPLSLHGSCPVIEGEKWSAPKWIRIRSYE 266


>gi|357128903|ref|XP_003566109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
           distachyon]
          Length = 313

 Score =  229 bits (585), Expect = 9e-58,   Method: Compositional matrix adjust.
 Identities = 107/210 (50%), Positives = 154/210 (73%), Gaps = 4/210 (1%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W+PR F+Y +FLS  E  +L+ LA+  + +S V D+ +G+S  S VRTS GTF+ +G+
Sbjct: 55  ISWKPRVFLYQHFLSDDEANHLLSLARAELKRSAVADNTSGKSTLSEVRTSYGTFISKGK 114

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +IA +TF+P E+GE +QVL Y+ G+K +  +D+F D  NT  GG R+AT+L
Sbjct: 115 DPIVAGIEDKIAAWTFLPKENGEDMQVLRYKRGEKDEPQFDFFTDTVNTVRGGHRVATVL 174

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNE---LSECGKQGLSVKPKRGDALLFWSMRPDA 256
           +YL+DV EGGETVFP A  +FT     ++   LSEC ++G++VKP++GDALLF+++RPDA
Sbjct: 175 LYLTDVAEGGETVFPLAK-DFTDTGLHDKDTTLSECAQKGIAVKPRKGDALLFFNLRPDA 233

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
             DP SLHGGC VI+G KW+ATKW+ V  +
Sbjct: 234 ATDPLSLHGGCTVIKGEKWTATKWIRVASF 263


>gi|34393269|dbj|BAC83179.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
           sativa Japonica Group]
 gi|50509101|dbj|BAD30161.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein [Oryza
           sativa Japonica Group]
          Length = 313

 Score =  229 bits (584), Expect = 9e-58,   Method: Compositional matrix adjust.
 Identities = 113/213 (53%), Positives = 144/213 (67%), Gaps = 11/213 (5%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+W PR FVY  FLS  EC++L+ L K  M +S V D+K+G+S  S VRTSSG FL + Q
Sbjct: 54  VSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSSGMFLDKRQ 113

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++  IEKRIA +TF+P E+ E IQ+L YE GQKY+ H+DYF D+ N   GG R AT+L
Sbjct: 114 DPVVSRIEKRIAAWTFLPEENAENIQILRYEHGQKYEPHFDYFHDKVNQALGGHRYATVL 173

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNE-----LSECGKQGLSVKPKRGDALLFWSMRP 254
           MYLS VE+GGETVFP A        W N+      SEC ++GL+VKP +GD +LF+S+  
Sbjct: 174 MYLSTVEKGGETVFPNAEG------WENQPKDDTFSECAQKGLAVKPVKGDTVLFFSLHI 227

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           D   DP SLHG CPVI G KWSA KW+ +  Y+
Sbjct: 228 DGVPDPLSLHGSCPVIEGEKWSAPKWIRIRSYE 260


>gi|108706361|gb|ABF94156.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|222624253|gb|EEE58385.1| hypothetical protein OsJ_09545 [Oryza sativa Japonica Group]
          Length = 299

 Score =  229 bits (583), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 112/211 (53%), Positives = 147/211 (69%), Gaps = 8/211 (3%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKP-YMVKSTVVDSKTGQSKDSRVRTSSGTFLKRG 138
           V+W PR F+Y  FLS  ECE+LI LAK   M +STVV+ K+G+S  S+ RTSSG FL R 
Sbjct: 40  VSWSPRVFLYEGFLSDVECEHLIALAKQGRMERSTVVNGKSGESVMSKTRTSSGMFLIRK 99

Query: 139 QDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATL 198
           QD ++  IE+RIA +T  P E+GE +Q+L Y  G+KY+ H+DY      +  GG R+AT+
Sbjct: 100 QDEVVARIEERIAAWTMFPAENGESMQMLRYGQGEKYEPHFDYIRGRQASARGGHRIATV 159

Query: 199 LMYLSDVEEGGETVFPAANANFTSVR---WWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           LMYLS+V+ GGETVFP A A  +  +   W    S+C +QG +VKP +G A+LF+S+ P+
Sbjct: 160 LMYLSNVKMGGETVFPDAEARLSQPKDETW----SDCAEQGFAVKPTKGSAVLFFSLYPN 215

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           AT DP SLHG CPVI+G KWSATKW+HV  Y
Sbjct: 216 ATFDPGSLHGSCPVIQGEKWSATKWIHVRSY 246


>gi|308801080|ref|XP_003075321.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
 gi|116061875|emb|CAL52593.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
          Length = 541

 Score =  229 bits (583), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 111/201 (55%), Positives = 144/201 (71%), Gaps = 4/201 (1%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PRAF+Y NFLS+ ECE+L+ L+K  + KS VVD++TG S  S VRTS+GTF+ R  D II
Sbjct: 265 PRAFLYENFLSEKECEHLLALSKGKLHKSGVVDAQTGGSSLSEVRTSTGTFISRKYDDII 324

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLS 203
            G+E+RI  ++ IP  H E  Q+L YE GQ+Y AH+DYF  +   +N   R+AT+L+YLS
Sbjct: 325 AGVEERIELWSQIPQSHHEAFQILRYEPGQEYKAHFDYFFHKSGMRN--NRIATVLLYLS 382

Query: 204 DVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSL 263
           DVEEGGETVFP  N +  + R  +  SECG  G ++K ++GDALLFWSM+P   LD  S 
Sbjct: 383 DVEEGGETVFP--NTDVPTSRNRSMYSECGNGGKALKARKGDALLFWSMKPGGELDAGSS 440

Query: 264 HGGCPVIRGNKWSATKWMHVH 284
           H GCPVI+G KW+ATKWMHV+
Sbjct: 441 HAGCPVIKGEKWTATKWMHVN 461


>gi|308799217|ref|XP_003074389.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
 gi|116000560|emb|CAL50240.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
          Length = 294

 Score =  228 bits (582), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 116/208 (55%), Positives = 141/208 (67%), Gaps = 5/208 (2%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E ++W P A VY  FL++AECE++  LA   +  STVVD+ TG    S +RTSSG FL R
Sbjct: 27  ERLSWAPHAEVYRGFLTEAECEHIERLATAELKPSTVVDASTGGDASSEIRTSSGMFLGR 86

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTK--NGGQRM 195
            +D +I  IE RIA +T +P  HGEG QVL YE  Q+Y AHYDYF D+FN K   GGQRM
Sbjct: 87  AEDDVIEAIEARIAAWTHVPESHGEGFQVLRYEKHQEYRAHYDYFHDKFNVKREKGGQRM 146

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
            T+LMYLSDVEEGGETVFP       +    +E SEC +  L+V+P++GDAL F S+R D
Sbjct: 147 GTVLMYLSDVEEGGETVFPKFEDGTPA---GSEASECARNKLAVRPRKGDALFFRSLRHD 203

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHV 283
              D  S H GCPVIRG K+SATKWMHV
Sbjct: 204 GVPDTFSEHAGCPVIRGVKFSATKWMHV 231


>gi|307111754|gb|EFN59988.1| hypothetical protein CHLNCDRAFT_49444 [Chlorella variabilis]
          Length = 344

 Score =  228 bits (582), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 110/211 (52%), Positives = 146/211 (69%), Gaps = 6/211 (2%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           +++  + R F+YHNFL+  EC+++I LA+P M +S VV++ +G+SK   VRTS GTFL R
Sbjct: 64  QVLHEDARIFLYHNFLTDEECDHIIKLAEPTMARSGVVETDSGKSKIDNVRTSKGTFLNR 123

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMAT 197
           G D +I  IE RIA +T +P  +GEG+QVL YE GQ+Y+ HYDYF  +  T NGG R  T
Sbjct: 124 GHDSVIADIEARIAKWTLMPAGNGEGLQVLKYEHGQEYEGHYDYFFHKAGTANGGNRYLT 183

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWN--ELSECGKQGLSVKPKRGDALLFWSMRPD 255
           +LMYL+DVEEGGET FP    N  S    N  E SEC ++ L+ KPK+G+A+LF S++P 
Sbjct: 184 VLMYLNDVEEGGETCFP----NIPSPNGDNGPEFSECARKVLAAKPKKGNAVLFHSIKPT 239

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
             L+  SLH  CPVI+G KWSA KW+HV  Y
Sbjct: 240 GELERRSLHTACPVIKGVKWSAPKWVHVGHY 270


>gi|302823087|ref|XP_002993198.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
 gi|300138968|gb|EFJ05718.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
          Length = 269

 Score =  228 bits (581), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 112/217 (51%), Positives = 150/217 (69%), Gaps = 17/217 (7%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSK---DSRVRTSSGTF 134
           E++ W PR  + H FLS  EC+YLI +A P + KSTVVD+ TG+++   +S+VRTS+G F
Sbjct: 61  EVLNWSPRIILLHKFLSAEECDYLIAIAGPRLAKSTVVDTSTGKARHGIESKVRTSTGMF 120

Query: 135 LKRGQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGG 192
           L     R  +I+ IE+RIA ++ IP+E+GE +QVL YE  Q Y  H+DYF D+FN K GG
Sbjct: 121 LSNYDRRYPMIQAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYFSDQFNLKRGG 180

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECG---KQGLSVKPKRGDALLF 249
           QR+AT+LMYLSDVEEGGET+FP+           +   ECG   ++GL VKP++GDA+LF
Sbjct: 181 QRVATVLMYLSDVEEGGETIFPSVG---------DGECECGGELRKGLCVKPRKGDAILF 231

Query: 250 WSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           WS   D  +D +SLHGGC V+RG KWSATKW+    +
Sbjct: 232 WSAALDGNVDSNSLHGGCSVLRGEKWSATKWLRQSRF 268


>gi|6437556|gb|AAF08583.1|AC011623_16 unknown protein [Arabidopsis thaliana]
          Length = 278

 Score =  228 bits (580), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 108/207 (52%), Positives = 146/207 (70%), Gaps = 19/207 (9%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+ +PRAFVY  FL+  EC++LI LAK  + +S V D+  G+S+ S VRTSSGTF+ +G+
Sbjct: 41  VSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFISKGK 100

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +++ +TF+P E+GE +QVL YE GQKYDAH+DYF D+ N   GG R+AT+L
Sbjct: 101 DPIVSGIEDKLSTWTFLPKENGEDLQVLRYEHGQKYDAHFDYFHDKVNIARGGHRIATVL 160

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           +YLS+V +GGETVFP A                    + +KPK+G+ALLF++++ DA  D
Sbjct: 161 LYLSNVTKGGETVFPDAQ-------------------VCLKPKKGNALLFFNLQQDAIPD 201

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEY 286
           P SLHGGCPVI G KWSATKW+HV  +
Sbjct: 202 PFSLHGGCPVIEGEKWSATKWIHVDSF 228


>gi|302764100|ref|XP_002965471.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
 gi|300166285|gb|EFJ32891.1| hypothetical protein SELMODRAFT_67344 [Selaginella moellendorffii]
          Length = 264

 Score =  226 bits (576), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 112/212 (52%), Positives = 148/212 (69%), Gaps = 17/212 (8%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSK---DSRVRTSSGTF 134
           E++ W PR  + H FLS  EC+YLI +A P + KSTVVD+ TG+++   +S+VRTS+G F
Sbjct: 60  EVLNWSPRITLLHKFLSAEECDYLIAIAGPRLAKSTVVDTSTGKARHGIESKVRTSTGMF 119

Query: 135 LKRGQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGG 192
           L     R  +I  IE+RIA ++ IP+E+GE +QVL YE  Q Y  H+DYF D+FN K GG
Sbjct: 120 LSNYDRRYPMIEAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYFSDQFNLKRGG 179

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECG---KQGLSVKPKRGDALLF 249
           QR+AT+LMYLSDVEEGGET+FP+           +   ECG   ++GL VKP++GDA+LF
Sbjct: 180 QRVATVLMYLSDVEEGGETIFPSVG---------DGECECGGELRKGLCVKPRKGDAILF 230

Query: 250 WSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           WS   D  +D +SLHGGC V+RG KWSATKW+
Sbjct: 231 WSAALDGNVDSNSLHGGCSVLRGEKWSATKWL 262


>gi|224069056|ref|XP_002302889.1| predicted protein [Populus trichocarpa]
 gi|222844615|gb|EEE82162.1| predicted protein [Populus trichocarpa]
          Length = 287

 Score =  225 bits (574), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 115/209 (55%), Positives = 144/209 (68%), Gaps = 14/209 (6%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           EI++W PR  V H+FLS  EC+YL  LAKP +  STVVD KTG+  +S+VRTSSG FL  
Sbjct: 82  EIISWSPRIIVLHDFLSSEECDYLRALAKPRLRISTVVDVKTGKGIESKVRTSSGMFLSS 141

Query: 138 GQD--RIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
            +   ++++ IEKRI+ ++ +P+E+GE IQVL YE  Q Y  H+DYF D FN K GGQR+
Sbjct: 142 EEKTYQVVQAIEKRISVYSQVPIENGELIQVLRYEKNQYYKPHHDYFSDTFNLKRGGQRV 201

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQ---GLSVKPKRGDALLFWSM 252
           AT+LMYLSD  EGGET FP A +   S         CG +   GLSVKP +G+A+LFWSM
Sbjct: 202 ATMLMYLSDNVEGGETYFPMAGSGKCS---------CGGKVVDGLSVKPIKGNAVLFWSM 252

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWM 281
             D   DPSS+HGGC V+ G KWSATKWM
Sbjct: 253 GLDGQSDPSSIHGGCEVLSGVKWSATKWM 281


>gi|412992163|emb|CCO19876.1| predicted protein [Bathycoccus prasinos]
          Length = 350

 Score =  225 bits (573), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 124/291 (42%), Positives = 175/291 (60%), Gaps = 23/291 (7%)

Query: 17  LTLVLSMLFML-TIVLLMLLAMGIFYIPIGDDDSPPNDLTSFRRRAFEK----------- 64
           L++  S  F++  +VLL         I  G DDS       F   A +            
Sbjct: 7   LSVFSSRKFLVFVVVLLCKRCFCDAKIGFGLDDSNGEKKIDFEHEALDSNRFPGWLGEKH 66

Query: 65  RSSIAEEKG-EQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSK 123
           R SI ++K    WTE ++W+PRAFV H+ LS+ ECE ++ +AKP M +STVVDS TG+ K
Sbjct: 67  RPSINKDKELGTWTEPISWQPRAFVLHSILSEEECEEILRIAKPMMKRSTVVDSITGEIK 126

Query: 124 DSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL 183
              +RTS  TFL RG+  ++  +E+R++ FT +P  +GE +Q+L Y VG+KY AH+D  +
Sbjct: 127 TDPIRTSKQTFLARGKYPVVTRVEERLSRFTMLPWYNGEDMQILSYGVGEKYSAHHD--V 184

Query: 184 DEFNTK-------NGGQRMATLLMYLSDVEEGGETVFPAAN-ANFTSVRWWNELSECGKQ 235
            E NTK       +GGQR+AT+L+YL D EEGGET FP +      S     + SEC K 
Sbjct: 185 GEKNTKSGQQLSADGGQRVATVLLYLQDTEEGGETAFPDSEWIEPESEYAQQKFSECAKN 244

Query: 236 GLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           G++ KPKRGD LLF+S+ P+  +D  S+H GCPV++G KW+ATKW+H   +
Sbjct: 245 GVAFKPKRGDGLLFFSITPEGDIDQKSMHAGCPVVKGTKWTATKWIHARPF 295


>gi|159476104|ref|XP_001696154.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
           [Chlamydomonas reinhardtii]
 gi|158275325|gb|EDP01103.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
           [Chlamydomonas reinhardtii]
          Length = 343

 Score =  225 bits (573), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 117/247 (47%), Positives = 156/247 (63%), Gaps = 14/247 (5%)

Query: 44  IGDDDSPPNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLID 103
           IGDD S   DL  +    F      A++       +++W PR F+Y   L+  EC+ L+D
Sbjct: 38  IGDDGSG-RDLIGWLGETFNAGEHRAQDS---RMVVLSWHPRVFLYKGILTHEECDQLMD 93

Query: 104 LAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEG 163
            ++  + +S V D+ TG    S +RTSSG F +RG+  +++ IE R+A +T +P+E+GEG
Sbjct: 94  NSRSRLERSGVSDATTGAGAVSDIRTSSGMFYERGETELVKRIENRLAMWTMLPVENGEG 153

Query: 164 IQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSV 223
           IQVL YE  QKYD H+DYF  +    NGG RMAT+LMYL+  EEGGETVFP        V
Sbjct: 154 IQVLRYEKTQKYDPHHDYFSFDGADDNGGNRMATVLMYLATPEEGGETVFPKV------V 207

Query: 224 RWWNELSECG----KQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATK 279
            W  +L+       +QGL+VKP +GDA+LFWS+RPD   DP SLHG CPVI+G KWSATK
Sbjct: 208 GWVVQLTTTASAPCRQGLAVKPAKGDAVLFWSIRPDGRFDPGSLHGSCPVIKGVKWSATK 267

Query: 280 WMHVHEY 286
           W+HV  Y
Sbjct: 268 WIHVGHY 274


>gi|449459442|ref|XP_004147455.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
 gi|449515722|ref|XP_004164897.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 319

 Score =  225 bits (573), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 111/201 (55%), Positives = 145/201 (72%), Gaps = 2/201 (0%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PRAF+Y  FLS  EC++LI+ AK  + +S +V + TGQS  S+ RTS+G FL + QD I
Sbjct: 66  KPRAFLYKGFLSAEECQHLINSAKGKLHQS-LVAAGTGQSVTSKERTSTGMFLHKAQDEI 124

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           +  IE RIA +TF+P+++GE IQ+L YE GQKY+ H+D+F D  N   GG R+AT+LMYL
Sbjct: 125 VARIESRIAAWTFLPLDNGEPIQILRYENGQKYEPHFDFFQDPGNIAIGGHRIATILMYL 184

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           S+VE+GGETVFP +     S     +LSECGK G  V+PK GDALLF+SM P+ T D +S
Sbjct: 185 SNVEKGGETVFPNSPVKL-SEEEKADLSECGKVGYGVRPKLGDALLFFSMNPNVTPDTTS 243

Query: 263 LHGGCPVIRGNKWSATKWMHV 283
            HG CPVI G KWSATKW+H+
Sbjct: 244 YHGSCPVIEGEKWSATKWIHM 264


>gi|302765413|ref|XP_002966127.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
 gi|300165547|gb|EFJ32154.1| hypothetical protein SELMODRAFT_86017 [Selaginella moellendorffii]
          Length = 201

 Score =  224 bits (572), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 104/201 (51%), Positives = 138/201 (68%), Gaps = 1/201 (0%)

Query: 87  FVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGI 146
            ++    S  EC++LI LA P + +S+V+D KTG  KDSR RTS G FL+R  D I+ GI
Sbjct: 1   LIFFYLYSDDECDHLIGLALPRLRRSSVIDEKTGLGKDSRNRTSWGAFLRRDHDNIVSGI 60

Query: 147 EKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVE 206
           E RI+  TFIP E+GE +QV+ Y+ GQK++ H DY+    N  NGG R+ TLL+YL++VE
Sbjct: 61  EDRISSITFIPKEYGESLQVVRYKTGQKFEPHQDYYKLTENNNNGGHRIGTLLLYLTNVE 120

Query: 207 EGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGG 266
            GGETVFP A AN  +  +    SEC K+G+ ++P+RGD LLFW  RP   +DP S HGG
Sbjct: 121 NGGETVFPRALANVIN-DYSTNTSECTKKGIVIRPRRGDGLLFWITRPSGEIDPFSFHGG 179

Query: 267 CPVIRGNKWSATKWMHVHEYK 287
           CPV++G KW ATK++H HE K
Sbjct: 180 CPVVKGEKWLATKFLHEHELK 200


>gi|326501992|dbj|BAK06488.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 306

 Score =  224 bits (571), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 112/210 (53%), Positives = 151/210 (71%), Gaps = 4/210 (1%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPY-MVKSTVVDSKTGQSKDSRVRTSSGTFLKRG 138
           V+W PRAF+Y  FL++AEC++L+ LA+   + KS VVD +TG+S  S VRTSSGTFL + 
Sbjct: 41  VSWRPRAFLYKGFLTEAECDHLVALAEEGGLQKSMVVDRQTGKSVMSEVRTSSGTFLAKK 100

Query: 139 QDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF--LDEFNTKNGGQRMA 196
           QD+++  IE RIA +T +P E+GE IQVL YE GQKY+ H D+     + +   GG R+A
Sbjct: 101 QDQVVATIEARIAAWTLLPQENGESIQVLRYENGQKYEPHVDFIRHAAKGHHSRGGHRVA 160

Query: 197 TLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
           T+LMYLSDV+ GGETVFP ++A     +  +  SEC ++G +VKP +GDA+LF+S+ P+ 
Sbjct: 161 TVLMYLSDVKMGGETVFPNSDAKTLQPK-DDTQSECARRGYAVKPVKGDAVLFFSLHPNG 219

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           T D  SLHGGCPVI G KWSATKW+HV  +
Sbjct: 220 TTDRDSLHGGCPVIEGEKWSATKWIHVRPF 249


>gi|255072321|ref|XP_002499835.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
 gi|226515097|gb|ACO61093.1| prolyl 4-hydroxylase [Micromonas sp. RCC299]
          Length = 454

 Score =  223 bits (569), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 109/214 (50%), Positives = 144/214 (67%), Gaps = 14/214 (6%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           EP+A+++ NFL+  ECE+L+ LAK  +  STVV  K   S  S++RTS+G FL RGQD  
Sbjct: 176 EPKAYMFRNFLTPHECEHLMQLAKKQLAPSTVVGDKGSGSMVSKIRTSAGMFLGRGQDPT 235

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLM 200
           +R IE+RIA  + +P  +GEG+Q+L YE GQKYD H+DYF D+ N+  + GGQRMAT+L+
Sbjct: 236 VRAIEERIAAASGLPEPNGEGLQILRYENGQKYDPHFDYFHDQVNSSPRRGGQRMATMLI 295

Query: 201 YLSDVEEGGETVFPAANANFTSVRWW--------NELSECGKQGLSVKPKRGDALLFWSM 252
           YL D  EGGET+FP    N      W        N  S+C K+G+ VK  RGDA+LFWS+
Sbjct: 296 YLEDTTEGGETIFP----NGVRPEDWDADEPGNHNSWSDCAKKGIPVKSHRGDAVLFWSL 351

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           + D TLD  SLHG CPVI G KW+A KW+ V ++
Sbjct: 352 KEDYTLDNGSLHGACPVIAGEKWTAVKWIRVAKF 385


>gi|115457822|ref|NP_001052511.1| Os04g0346000 [Oryza sativa Japonica Group]
 gi|38346023|emb|CAE03962.2| OSJNBb0085H11.11 [Oryza sativa Japonica Group]
 gi|113564082|dbj|BAF14425.1| Os04g0346000 [Oryza sativa Japonica Group]
 gi|125547818|gb|EAY93640.1| hypothetical protein OsI_15426 [Oryza sativa Indica Group]
 gi|125589953|gb|EAZ30303.1| hypothetical protein OsJ_14349 [Oryza sativa Japonica Group]
 gi|215693934|dbj|BAG89133.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 267

 Score =  223 bits (567), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 122/271 (45%), Positives = 165/271 (60%), Gaps = 19/271 (7%)

Query: 20  VLSMLFMLTIVLLMLLAMGIFYIPIGDDDSPPNDLTSFRRRAFEKRSSIAEEKGEQWTEI 79
           +L +L  +T+ +++   + + +    DD S    L + +  AF +   +         E+
Sbjct: 11  LLPLLTFVTLGMILGSLLQLAFFRRIDDHSNVTHLENDQEAAFLRLGLVK-------PEV 63

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PR  V+HNFLS  EC+YL  +A+P +  STVVD  TG+   S VRTSSG F+   +
Sbjct: 64  ISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVKSNVRTSSGMFVSSEE 123

Query: 140 DR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMAT 197
            +  +I+ IEKRI+ ++ IP E+GE IQVL YE  Q Y  H+DYF D FN K GGQR+AT
Sbjct: 124 RKLPVIQSIEKRISVYSQIPEENGELIQVLRYEPSQYYRPHHDYFSDTFNIKRGGQRVAT 183

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGK--QGLSVKPKRGDALLFWSMRPD 255
           +LMYL+D  EGGET FP A           E S  GK  +GL VKP +GDA+LFWSM  D
Sbjct: 184 MLMYLTDGVEGGETHFPQAGD--------GECSCGGKMVKGLCVKPNKGDAVLFWSMGLD 235

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
              D +S+HGGCPV+ G KWSATKWM   E+
Sbjct: 236 GETDSNSIHGGCPVLEGEKWSATKWMRQKEF 266


>gi|116309432|emb|CAH66506.1| OSIGBa0111I14.1 [Oryza sativa Indica Group]
          Length = 267

 Score =  221 bits (564), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 113/213 (53%), Positives = 142/213 (66%), Gaps = 12/213 (5%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+++W PR  V+HNFLS  EC+YL  +A+P +  STVVD  TG+   S VRTSSG F+  
Sbjct: 62  EVISWSPRIIVFHNFLSSEECDYLRSIARPRLQISTVVDVATGKGVKSNVRTSSGMFVSS 121

Query: 138 GQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
            + +  +I+ IEKRI+ ++ IP E+GE IQVL YE  Q Y  H+DYF D FN K GGQR+
Sbjct: 122 EERKLPVIQSIEKRISVYSQIPEENGELIQVLRYEPSQYYRPHHDYFSDTFNIKRGGQRV 181

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGK--QGLSVKPKRGDALLFWSMR 253
           AT+LMYL+D  EGGET FP A           E S  GK  +GL VKP +GDA+LFWSM 
Sbjct: 182 ATMLMYLTDGVEGGETHFPQAGD--------GECSCGGKMVKGLCVKPNKGDAVLFWSMG 233

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            D   D +S+HGGCPV+ G KWSATKWM   E+
Sbjct: 234 LDGETDSNSIHGGCPVLEGEKWSATKWMRQKEF 266


>gi|303285562|ref|XP_003062071.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226456482|gb|EEH53783.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 522

 Score =  221 bits (564), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 107/212 (50%), Positives = 141/212 (66%), Gaps = 4/212 (1%)

Query: 79  IVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRG 138
           + A  P+A+++ NFL++ EC +LI LAK  +  STVV     +S  S +RTS+G FL +G
Sbjct: 231 LSATRPKAYLFRNFLTEEECRHLIALAKAQLAPSTVVADGGKKSTKSGIRTSAGMFLTKG 290

Query: 139 QDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN--TKNGGQRMA 196
           Q   +R +E+R+A    +P E+GEG+Q+L YE GQKYD HYDYF D+ N     GGQRMA
Sbjct: 291 QTPTVRMVEERVAAAVGLPEENGEGMQILRYEHGQKYDPHYDYFHDKINPSPNRGGQRMA 350

Query: 197 TLLMYLSDVEEGGETVFPAANA--NFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           T+L+YL D EEGGET+FP A     F         S+C K+GL VK KRGDA+LFWS+  
Sbjct: 351 TMLIYLKDTEEGGETIFPNAKKPEGFHDGEKDGAFSDCAKRGLPVKSKRGDAVLFWSLTS 410

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           D  LD  SLHG CPV+RG KW+A KW+ V ++
Sbjct: 411 DYKLDEGSLHGACPVLRGEKWTAVKWIRVAKF 442


>gi|297802348|ref|XP_002869058.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297314894|gb|EFH45317.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 245

 Score =  221 bits (563), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 109/200 (54%), Positives = 142/200 (71%), Gaps = 30/200 (15%)

Query: 74  EQWTEIVAWEPRAFVYHNFL--------SKAECEYLIDLAKPYMVKSTVVDSKTGQSKDS 125
           E+W E++A EPRAFVYHNFL        +  ECE+LI LAKP M +S V ++ TG  ++S
Sbjct: 52  ERWLEVIAKEPRAFVYHNFLALFFKFCKTNEECEHLISLAKPSMARSKVRNAITGLGEES 111

Query: 126 RVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE 185
             RTSSGTFL++G D+I++ IEKRI++FTFIP E+GE +QV+HYEVGQK++ H+D F   
Sbjct: 112 SSRTSSGTFLRKGHDKIVKEIEKRISEFTFIPEENGEALQVIHYEVGQKFEPHFDGF--- 168

Query: 186 FNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGD 245
                  QR+AT+LMYLSDV++GGETVFP A    +            K+G+SV+PK+GD
Sbjct: 169 -------QRIATVLMYLSDVDKGGETVFPEAKGIKS------------KKGVSVRPKKGD 209

Query: 246 ALLFWSMRPDATLDPSSLHG 265
           ALLFWSMRPD + DPSS HG
Sbjct: 210 ALLFWSMRPDGSQDPSSKHG 229


>gi|414587756|tpg|DAA38327.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
          Length = 263

 Score =  221 bits (562), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 110/209 (52%), Positives = 140/209 (66%), Gaps = 14/209 (6%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+++W PR  V+HNFLS  EC+YL+ +A+P +  STVVD  TG+   S VRTSSG F+  
Sbjct: 58  EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSDVRTSSGMFVNS 117

Query: 138 GQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
            + +  +++ IEKRI+ F+ IP E+GE IQVL YE  Q Y  H+DYF D FN K GGQR+
Sbjct: 118 EERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTFNLKRGGQRV 177

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGK---QGLSVKPKRGDALLFWSM 252
           AT+LMYL+D   GGET FP A     S         CG    +GL VKP +GDA+LFWSM
Sbjct: 178 ATMLMYLTDGVVGGETHFPQAGDGECS---------CGGNVVKGLCVKPNKGDAVLFWSM 228

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWM 281
             D   DP+S+H GCPV++G KWSATKWM
Sbjct: 229 GLDGNTDPNSIHSGCPVLKGEKWSATKWM 257


>gi|297824279|ref|XP_002880022.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
 gi|297325861|gb|EFH56281.1| AT-P4H-1 [Arabidopsis lyrata subsp. lyrata]
          Length = 283

 Score =  220 bits (561), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 115/210 (54%), Positives = 139/210 (66%), Gaps = 14/210 (6%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+V+W PR  V H+FLS  ECEYL  +A+P +  STVVD KTG+   S VRTSSG FL  
Sbjct: 78  EVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVKSDVRTSSGMFLTH 137

Query: 138 GQ--DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
            +  + II+ IEKRIA F+ +P E+GE IQVL YE  Q Y  H+DYF D FN K GGQR+
Sbjct: 138 VERSNPIIQAIEKRIAVFSQVPAENGELIQVLRYEPKQFYKPHHDYFADTFNLKRGGQRV 197

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGK---QGLSVKPKRGDALLFWSM 252
           AT+LMYL+D  EGGET FP A     +         CG    +G+SVKP +GDA+LFWSM
Sbjct: 198 ATMLMYLTDDVEGGETYFPLAGDGDCT---------CGGKIMKGISVKPTKGDAVLFWSM 248

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
             D   DP S+HGGC V+ G KWSATKWM 
Sbjct: 249 GLDGQSDPRSIHGGCEVLSGEKWSATKWMR 278


>gi|242075290|ref|XP_002447581.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
 gi|241938764|gb|EES11909.1| hypothetical protein SORBIDRAFT_06g004550 [Sorghum bicolor]
          Length = 263

 Score =  220 bits (561), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 110/209 (52%), Positives = 140/209 (66%), Gaps = 14/209 (6%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+++W PR  ++HNFLS  EC+YL+ +A+P +  STVVD  TG+   S VRTSSG F+  
Sbjct: 58  EVISWTPRIIIFHNFLSSEECDYLMAIARPRLQMSTVVDVATGKGVKSDVRTSSGMFVNS 117

Query: 138 GQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
            + +  +I+ IEKRI+ F+ IP E+GE IQVL YE  Q Y  H+DYF D FN K GGQR+
Sbjct: 118 EERKSPVIQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTFNLKRGGQRV 177

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGK---QGLSVKPKRGDALLFWSM 252
           AT+LMYL+D  EGGET F  A     S         CG    +GL VKP +GDA+LFWSM
Sbjct: 178 ATMLMYLTDGVEGGETHFLQAGDGECS---------CGGNVVKGLCVKPNKGDAVLFWSM 228

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWM 281
             D   DP+S+H GCPV++G KWSATKWM
Sbjct: 229 GLDGNTDPNSIHSGCPVLKGEKWSATKWM 257


>gi|15224220|ref|NP_181836.1| P4H isoform 1 [Arabidopsis thaliana]
 gi|3763917|gb|AAC64297.1| hypothetical protein [Arabidopsis thaliana]
 gi|20197628|gb|AAM15158.1| hypothetical protein [Arabidopsis thaliana]
 gi|26450452|dbj|BAC42340.1| unknown protein [Arabidopsis thaliana]
 gi|29824245|gb|AAP04083.1| unknown protein [Arabidopsis thaliana]
 gi|330255112|gb|AEC10206.1| P4H isoform 1 [Arabidopsis thaliana]
          Length = 283

 Score =  219 bits (559), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 115/210 (54%), Positives = 138/210 (65%), Gaps = 14/210 (6%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+V+W PR  V H+FLS  ECEYL  +A+P +  STVVD KTG+   S VRTSSG FL  
Sbjct: 78  EVVSWSPRIIVLHDFLSPEECEYLKAIARPRLQVSTVVDVKTGKGVKSDVRTSSGMFLTH 137

Query: 138 GQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
            +    II+ IEKRIA F+ +P E+GE IQVL YE  Q Y  H+DYF D FN K GGQR+
Sbjct: 138 VERSYPIIQAIEKRIAVFSQVPAENGELIQVLRYEPQQFYKPHHDYFADTFNLKRGGQRV 197

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGK---QGLSVKPKRGDALLFWSM 252
           AT+LMYL+D  EGGET FP A     +         CG    +G+SVKP +GDA+LFWSM
Sbjct: 198 ATMLMYLTDDVEGGETYFPLAGDGDCT---------CGGKIMKGISVKPTKGDAVLFWSM 248

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
             D   DP S+HGGC V+ G KWSATKWM 
Sbjct: 249 GLDGQSDPRSIHGGCEVLSGEKWSATKWMR 278


>gi|125542543|gb|EAY88682.1| hypothetical protein OsI_10157 [Oryza sativa Indica Group]
          Length = 321

 Score =  218 bits (556), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 111/228 (48%), Positives = 148/228 (64%), Gaps = 25/228 (10%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKP-YMVKSTVVDSKTGQSKDSRVRTSSGTFLKRG 138
           V+W PRAF+Y  FLS AEC++LI LAK   M KSTVVD ++G+S  S+VRTSSG FL + 
Sbjct: 45  VSWRPRAFLYEGFLSDAECDHLISLAKQGKMEKSTVVDGESGESVTSKVRTSSGMFLDKK 104

Query: 139 QDRIIRGIEKRIADFTFIPME-----------------HGEGIQVLHYEVGQKYDAHYDY 181
           QD ++  IE+RIA +T +P E                 +GE +Q+L Y  G+KY+ H+DY
Sbjct: 105 QDEVVARIEERIAAWTMLPTECIIFYCFANFAILKLSENGESMQILRYGQGEKYEPHFDY 164

Query: 182 FLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVR---WWNELSECGKQGLS 238
                 +   G R+AT+LMYLS+V+ GGET+FP   A  +  +   W    S+C +QG +
Sbjct: 165 ISGRQGSTREGDRVATVLMYLSNVKMGGETIFPDCEARLSQPKDETW----SDCAEQGFA 220

Query: 239 VKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           VKP +G A+LF+S+ P+ATLD  SLHG CPVI G KWSATKW+HV  Y
Sbjct: 221 VKPAKGSAVLFFSLHPNATLDTDSLHGSCPVIEGEKWSATKWIHVRSY 268


>gi|307102962|gb|EFN51227.1| hypothetical protein CHLNCDRAFT_28161 [Chlorella variabilis]
          Length = 300

 Score =  218 bits (555), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 99/210 (47%), Positives = 144/210 (68%), Gaps = 2/210 (0%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVD-SKTGQSKDSRVRTSSGTFLK 136
           ++++W+PR F+Y   L++ EC++++  A P + +S VVD    G    S +RTS G F  
Sbjct: 17  KVLSWDPRIFLYQRLLTEEECDHMMTKAGPRLTRSGVVDVDNPGGESVSDIRTSYGMFFD 76

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMA 196
           RG+D ++R +E+R+++++ IP  HGEGIQVL YE G++Y  H+DYF D  + +NGG R+A
Sbjct: 77  RGEDEVVREVERRLSEWSLIPPGHGEGIQVLRYENGEEYKPHFDYFFDNLSVQNGGNRLA 136

Query: 197 TLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
           T+LMYL++ E GGETVFP   A           SEC  QGL+VKP++GDA+LF+S+R + 
Sbjct: 137 TILMYLAEPEFGGETVFPNVKAPPEQTLEAG-YSECATQGLAVKPRKGDAVLFFSLRTEG 195

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           TLD  SLHG CP ++G K++ATKW HV  Y
Sbjct: 196 TLDKGSLHGSCPTLKGFKFAATKWYHVAHY 225


>gi|303282201|ref|XP_003060392.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226457863|gb|EEH55161.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 369

 Score =  218 bits (554), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 111/211 (52%), Positives = 146/211 (69%), Gaps = 12/211 (5%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
           T + A  PRA+VY  FL+ AEC++ I  A P + KS VVD+ TG+   S +RTS G F  
Sbjct: 76  TTLSASSPRAYVYRGFLTDAECDHFIARASPKLAKSNVVDTDTGEGVPSAIRTSDGMFFD 135

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKN--GGQR 194
           RG+D ++  +E+RI+ +T +P E+GEG+QVL Y  GQKYDAH D F+D+FN  +  GGQR
Sbjct: 136 RGEDDVVDAVERRISAWTRLPTENGEGMQVLRYAGGQKYDAHLDAFVDKFNADDAHGGQR 195

Query: 195 MATLLMYLSDVEEGGETVFP--AANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
           +AT+LMYL+DV++GGETVFP   A  +    R+    S C ++G++VKP+RGDALLFWSM
Sbjct: 196 VATVLMYLNDVDDGGETVFPETTAKPHVGDERY----SACARRGVAVKPRRGDALLFWSM 251

Query: 253 RPDATLDPSSLHGGCPVIRGN-KWSATKWMH 282
             D T    SLHGGCPV  G  KWS TKW+H
Sbjct: 252 --DETFT-RSLHGGCPVGAGGVKWSMTKWIH 279


>gi|255083627|ref|XP_002508388.1| predicted protein [Micromonas sp. RCC299]
 gi|226523665|gb|ACO69646.1| predicted protein [Micromonas sp. RCC299]
          Length = 253

 Score =  217 bits (552), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 105/218 (48%), Positives = 148/218 (67%), Gaps = 12/218 (5%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+W PRAF  HNF+S  EC+ ++++A+P + +STV+DS TGQSK   +RTS  TFL RG 
Sbjct: 1   VSWYPRAFHLHNFMSHEECDRILEIARPRVRRSTVIDSVTGQSKVDPIRTSEQTFLNRGT 60

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKN-------GG 192
             I+  +E+R+A  T +P  HGE +Q+L Y +GQKYDAH+D  + E  + +       GG
Sbjct: 61  WDIVTKVEERLAVVTQLPAYHGEDMQILKYGLGQKYDAHHD--VGELTSASGKQLAAEGG 118

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNE---LSECGKQGLSVKPKRGDALLF 249
            R+AT+L+YLSDVEEGGET FP +      +R W E    S+C +  ++VKP++GD LLF
Sbjct: 119 HRVATVLLYLSDVEEGGETAFPDSEWMTPELRKWAEGQKWSDCAEGNVAVKPRKGDGLLF 178

Query: 250 WSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           WS+  +  +DP S+H GCPVIRG KW+ATKW+H   ++
Sbjct: 179 WSVNNENAIDPHSMHAGCPVIRGEKWTATKWIHARPFR 216


>gi|326503458|dbj|BAJ86235.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326516134|dbj|BAJ88090.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 266

 Score =  217 bits (552), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 118/241 (48%), Positives = 151/241 (62%), Gaps = 22/241 (9%)

Query: 54  LTSFRR-------RAFEKRSSIAEEK-GEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLA 105
           L  FRR       R F+     A+ + G    E+++W PR  V+HNFLS  EC+YL ++A
Sbjct: 29  LAFFRRLDDHSHTRHFDNDQGAADLRLGYVKPEVISWTPRIIVFHNFLSSEECDYLREIA 88

Query: 106 KPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDR--IIRGIEKRIADFTFIPMEHGEG 163
           +P +  STVVD  TG+   S VRTSSG F+   + +  +I+ IEKRI+ F+ IP+E+GE 
Sbjct: 89  RPRLEISTVVDVATGKGVKSDVRTSSGMFVNSEERKLPVIKAIEKRISVFSQIPVENGEL 148

Query: 164 IQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSV 223
           IQVL YE  Q Y  H+DYF D FN K GGQR+AT+LMYL+D  EGGET FP A       
Sbjct: 149 IQVLRYEPNQYYRPHHDYFSDTFNLKRGGQRVATMLMYLTDGVEGGETHFPQAG------ 202

Query: 224 RWWNELSECGK---QGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKW 280
              +    CG    +GL VKP +GDA+LFWSM  D   D +SLH GC V++G KWSATKW
Sbjct: 203 ---DGECICGGRLVRGLCVKPNKGDAVLFWSMGLDGNTDSNSLHSGCAVVKGEKWSATKW 259

Query: 281 M 281
           M
Sbjct: 260 M 260


>gi|449468746|ref|XP_004152082.1| PREDICTED: putative prolyl 4-hydroxylase-like [Cucumis sativus]
          Length = 290

 Score =  216 bits (551), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 122/275 (44%), Positives = 161/275 (58%), Gaps = 18/275 (6%)

Query: 16  TLTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSPPNDLTSFRRRAFEKRSSIAEEK--- 72
           T+ +++  L  L  +  +  ++G  ++P G       D      R F    +  E +   
Sbjct: 17  TVGMIIGALLQLAFLRRLEDSIGTEFLPAGRLHKAQYDSQHQLPRGFPNWINDKEAEILR 76

Query: 73  -GEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSS 131
            G    E+V+W PR  V HNFLS  EC+YL  +A   +  STVVD+KTG+   S  RTSS
Sbjct: 77  LGYVKPEVVSWSPRIIVLHNFLSTKECDYLKGIALARLEISTVVDTKTGKGVKSDFRTSS 136

Query: 132 GTFLKRGQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTK 189
           G FL   +    +++ IEKRI+ ++ +P+E+GE IQVL YE  Q Y  H+DYF D FN K
Sbjct: 137 GMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYFSDTFNLK 196

Query: 190 NGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQ---GLSVKPKRGDA 246
            GGQR+AT+LMYLS+  EGGET FP A +   S         CG +   GLSVKP +GDA
Sbjct: 197 RGGQRIATMLMYLSENIEGGETYFPKAGSGECS---------CGGKTVPGLSVKPAKGDA 247

Query: 247 LLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           +LFWSM  D   DP S+HGGC V+ G KWSATKWM
Sbjct: 248 VLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWM 282


>gi|357162904|ref|XP_003579560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 266

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 109/209 (52%), Positives = 140/209 (66%), Gaps = 14/209 (6%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+++W PR  V+HNFLS  EC++L ++A+P +  STVVD  TG+   S VRTSSG F+  
Sbjct: 61  EVISWTPRIIVFHNFLSSEECDFLKEIARPRLEISTVVDVATGKGVKSDVRTSSGMFVNS 120

Query: 138 GQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
            + +  +I+ IEKRI+ F+ IP+E+GE IQVL YE  Q Y  H+DYF D FN K GGQR+
Sbjct: 121 EERKFPVIQAIEKRISVFSQIPVENGELIQVLRYEPSQYYRPHHDYFSDTFNLKRGGQRV 180

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGK---QGLSVKPKRGDALLFWSM 252
           AT+LMYL+D  EGGET FP A     S         CG    +GL VKP +GDA+LFWSM
Sbjct: 181 ATMLMYLTDGVEGGETHFPQAGDGECS---------CGGRIVRGLCVKPNKGDAVLFWSM 231

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWM 281
             D   D +S+H GC V++G KWSATKWM
Sbjct: 232 GLDGNTDSNSIHSGCAVLKGEKWSATKWM 260


>gi|145345836|ref|XP_001417405.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577632|gb|ABO95698.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 330

 Score =  216 bits (549), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 106/218 (48%), Positives = 150/218 (68%), Gaps = 11/218 (5%)

Query: 79  IVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRG 138
           + A +P+A++  NFLS  EC++L+ LAK  +  STVV  + G S  S +RTS+G FL++G
Sbjct: 44  LSATQPKAYLLRNFLSAEECDHLMKLAKRELAPSTVV-GEAGDSVPSDIRTSAGMFLRKG 102

Query: 139 QDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN--TKNGGQRMA 196
           QD+I++ IE+RIA  +  P+++GEG+Q+L Y+VGQKYD H+DYF D+ N   K GGQR+A
Sbjct: 103 QDKIVKAIEERIARLSGTPVDNGEGMQILRYDVGQKYDPHFDYFHDKVNPAPKRGGQRLA 162

Query: 197 TLLMYLSDVEEGGETVFPAAN--ANFTSVRWWN------ELSECGKQGLSVKPKRGDALL 248
           T+L+YL D ++GGET FP A    +F +    N      E ++C K+G+ VK  RGDA+L
Sbjct: 163 TMLIYLVDTDKGGETTFPNAKLPQSFEADEPENPFASHIEHTDCAKKGIPVKSVRGDAIL 222

Query: 249 FWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           F+SM  D  LD  SLHG CPVI G KW+A KW+ V ++
Sbjct: 223 FFSMTQDGVLDRGSLHGACPVIEGQKWTAVKWIRVGKF 260


>gi|357445147|ref|XP_003592851.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355481899|gb|AES63102.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
          Length = 281

 Score =  215 bits (547), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 110/209 (52%), Positives = 140/209 (66%), Gaps = 14/209 (6%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+++W PR  + HNFLS  EC+YL  +A P +  STVVD+ TG+   S VRTSSG FL  
Sbjct: 76  EVLSWSPRIILLHNFLSYEECDYLRGVALPRLKISTVVDANTGKGIKSDVRTSSGMFLSH 135

Query: 138 GQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
            + +  +I  IEKRI+ ++ IP+E+GE +QVL YE  Q Y  H+DYF D FN K GGQR+
Sbjct: 136 EERKYPMIHAIEKRISVYSQIPIENGELMQVLRYEKNQYYRPHHDYFSDTFNLKRGGQRI 195

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECG---KQGLSVKPKRGDALLFWSM 252
           AT+LMYL D  EGGET FP+A ++  S         CG    +GL VKP +G+A+LFWSM
Sbjct: 196 ATMLMYLGDNVEGGETHFPSAGSDECS---------CGGKLTKGLCVKPVKGNAVLFWSM 246

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWM 281
             D   DP S+HGGCPV+ G KWSATKWM
Sbjct: 247 GLDGQSDPDSVHGGCPVLAGEKWSATKWM 275


>gi|302831512|ref|XP_002947321.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
           nagariensis]
 gi|300267185|gb|EFJ51369.1| hypothetical protein VOLCADRAFT_120451 [Volvox carteri f.
           nagariensis]
          Length = 797

 Score =  214 bits (546), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 106/218 (48%), Positives = 146/218 (66%), Gaps = 5/218 (2%)

Query: 74  EQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGT 133
           E W E ++W PRAFVYHNFL+ AEC++L+ +    + +S VVDS+TGQSK   +RTS G 
Sbjct: 490 EPWIETISWSPRAFVYHNFLTSAECDHLVQIGTQRVSRSLVVDSQTGQSKLDDIRTSYGA 549

Query: 134 FLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKN--- 190
              RG+D +I  IE+RIA++T +P EHGE +Q+L Y  GQKYDAH+D+F D  + ++   
Sbjct: 550 AFGRGEDPVIAEIEERIAEWTHLPPEHGEPMQILRYVDGQKYDAHWDWFDDPVHHRSYLV 609

Query: 191 GGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSEC-GKQGLSVKPKRGDALLF 249
            G R AT+L+YLS+VE GGET  P A+    SV+     S C  K GLS++P++GDALLF
Sbjct: 610 DGNRYATVLLYLSEVEAGGETNLPLADPIDMSVQAIENPSPCAAKMGLSIRPRKGDALLF 669

Query: 250 WSMRPDATL-DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           + M  +    D  +LH  CP ++G KW+ATKW+H   Y
Sbjct: 670 YDMDIEGQKGDRKALHASCPTLKGMKWTATKWIHSKPY 707


>gi|356576923|ref|XP_003556579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 287

 Score =  214 bits (545), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 112/208 (53%), Positives = 141/208 (67%), Gaps = 12/208 (5%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++ W PR  + HNFLS  EC+YL  +A P +  S VVD+KTG+   S VRTSSG FL  
Sbjct: 82  EVLNWSPRIILLHNFLSMEECDYLRAIALPRLHISNVVDTKTGKGIKSDVRTSSGMFLNP 141

Query: 138 GQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
            + +  +++ IEKRI+ ++ IP+E+GE +QVL YE  Q Y  H+DYF D FN K GGQR+
Sbjct: 142 QERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPHHDYFSDTFNLKRGGQRI 201

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGK--QGLSVKPKRGDALLFWSMR 253
           AT+LMYLSD  EGGET FP A +         E S  GK  +GLSVKP +G+A+LFWSM 
Sbjct: 202 ATMLMYLSDNIEGGETYFPLAGS--------GECSCGGKLVKGLSVKPIKGNAVLFWSMG 253

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWM 281
            D   DP+S+HGGC VI G KWSATKWM
Sbjct: 254 LDGQSDPNSVHGGCEVISGEKWSATKWM 281


>gi|15233345|ref|NP_195307.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|3805848|emb|CAA21468.1| putative protein [Arabidopsis thaliana]
 gi|7270534|emb|CAB81491.1| putative protein [Arabidopsis thaliana]
 gi|332661175|gb|AEE86575.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 272

 Score =  214 bits (545), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 105/201 (52%), Positives = 140/201 (69%), Gaps = 30/201 (14%)

Query: 73  GEQWTEIVAWEPRAFVYHNFL--------SKAECEYLIDLAKPYMVKSTVVDSKTGQSKD 124
            E+W E++  EPRAFVYHNFL        +  EC++LI LAKP M +S V ++ TG  ++
Sbjct: 84  NERWLEVITKEPRAFVYHNFLALFFKICKTNEECDHLISLAKPSMARSKVRNALTGLGEE 143

Query: 125 SRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD 184
           S  RTSSGTF++ G D+I++ IEKRI++FTFIP E+GE +QV++YEVGQK++ H+D F  
Sbjct: 144 SSSRTSSGTFIRSGHDKIVKEIEKRISEFTFIPQENGETLQVINYEVGQKFEPHFDGF-- 201

Query: 185 EFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRG 244
                   QR+AT+LMYLSDV++GGETVFP A    +            K+G+SV+PK+G
Sbjct: 202 --------QRIATVLMYLSDVDKGGETVFPEAKGIKS------------KKGVSVRPKKG 241

Query: 245 DALLFWSMRPDATLDPSSLHG 265
           DALLFWSMRPD + DPSS HG
Sbjct: 242 DALLFWSMRPDGSRDPSSKHG 262


>gi|225433714|ref|XP_002268409.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296089634|emb|CBI39453.3| unnamed protein product [Vitis vinifera]
          Length = 287

 Score =  214 bits (544), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 109/209 (52%), Positives = 139/209 (66%), Gaps = 14/209 (6%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           EI+ W PR  + H+FLS  EC+YL  +A+P +  STVVD++TG+   S VRTSSG FL  
Sbjct: 82  EILNWSPRIILLHSFLSSEECDYLRAMAEPLLQISTVVDAQTGKGIQSDVRTSSGMFLSP 141

Query: 138 GQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
                 I+R IEKRI+ ++ +P+E+GE IQVL Y+  Q Y  H+DYF D FN K GGQR+
Sbjct: 142 DDSTYPIVRAIEKRISVYSQVPVENGELIQVLRYKKSQFYKPHHDYFSDSFNLKRGGQRV 201

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQ---GLSVKPKRGDALLFWSM 252
           AT+L+YLSD  EGGET FP A + F           CG +   GLSV P +G+A+LFWSM
Sbjct: 202 ATMLIYLSDNVEGGETYFPMAGSGFC---------RCGGKSVRGLSVAPVKGNAVLFWSM 252

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWM 281
             D   DP+S+HGGC V+ G KWSATKWM
Sbjct: 253 GLDGQSDPNSIHGGCEVLAGEKWSATKWM 281


>gi|334185677|ref|NP_001189994.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
 gi|332643930|gb|AEE77451.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
          Length = 324

 Score =  213 bits (542), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 117/287 (40%), Positives = 173/287 (60%), Gaps = 30/287 (10%)

Query: 15  STLTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSPPNDL----TSFRRRAFEKRSSIAE 70
           S + L  S+ F+ T+ L+                S PN      ++ R  +  K  + A 
Sbjct: 3   SRIFLAFSLCFLFTLPLI---------------SSAPNRFLTRSSNTRDGSVIKMKTSAS 47

Query: 71  EKGEQWTEI--VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR-- 126
             G   T +  ++W PR F+Y  FLS  EC++ I LAK  + KS V D+ +G+S +S   
Sbjct: 48  SFGFDPTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEDS 107

Query: 127 --VRTSSGTFLKRGQ----DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYD 180
             V   S +F+        D I+  +E ++A +TF+P E+GE +Q+LHYE GQKY+ H+D
Sbjct: 108 VSVVRQSSSFIANMDSLEIDDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFD 167

Query: 181 YFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVK 240
           YF D+ N + GG R+AT+LMYLS+VE+GGETVFP      T ++  +  +EC KQG +VK
Sbjct: 168 YFHDQANLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLK-DDSWTECAKQGYAVK 226

Query: 241 PKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           P++GDALLF+++ P+AT D +SLHG CPV+ G KWSAT+W+HV  ++
Sbjct: 227 PRKGDALLFFNLHPNATTDSNSLHGSCPVVEGEKWSATRWIHVKSFE 273


>gi|412985583|emb|CCO19029.1| predicted protein [Bathycoccus prasinos]
          Length = 458

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 98/203 (48%), Positives = 136/203 (66%), Gaps = 4/203 (1%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PRAF+Y  F++  EC++LID +K  M KS VVD++TG +  S +RTS+G+F+  G + ++
Sbjct: 185 PRAFLYKRFMTDEECDFLIDHSKSRMSKSGVVDAETGGTAKSDIRTSTGSFVGIGANDLM 244

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLS 203
           + +EKR+A F+ +P++H E  QVL YEV Q+Y AHYDYF  +    N   R+ T+LMYL 
Sbjct: 245 KKLEKRVATFSMLPVKHQEATQVLRYEVKQEYRAHYDYFFHKGGMAN--NRIVTILMYLH 302

Query: 204 DVEEGGETVFPAANANFTSVR--WWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           + E GGETVFP            W    SECG +G +   ++GDAL+FWSM+P   LDP 
Sbjct: 303 EPEFGGETVFPNTEVPLERAEKGWGKNFSECGNRGRAAVVRKGDALIFWSMKPGGELDPG 362

Query: 262 SLHGGCPVIRGNKWSATKWMHVH 284
           S H GCPV+RG KW+ATKW+HV+
Sbjct: 363 SSHAGCPVVRGEKWTATKWIHVN 385


>gi|255637879|gb|ACU19258.1| unknown [Glycine max]
          Length = 287

 Score =  211 bits (537), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 111/208 (53%), Positives = 140/208 (67%), Gaps = 12/208 (5%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++ W PR  + HNFLS  EC+YL  LA P +  STVVD+KTG+   S VRTSSG FL  
Sbjct: 82  EVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTKTGKGIKSDVRTSSGMFLNS 141

Query: 138 GQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
            + +  +++ IEKRI+ ++ IP+E+GE +QVL YE  Q Y   +DYF D FN K GGQ +
Sbjct: 142 KERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPRHDYFFDTFNLKRGGQGI 201

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGK--QGLSVKPKRGDALLFWSMR 253
           AT+LMYLSD  EGGET FP A +         E S  GK  +GLSVKP +G+A+LFWSM 
Sbjct: 202 ATMLMYLSDNIEGGETYFPLAGS--------GECSCGGKLVKGLSVKPIKGNAVLFWSMG 253

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWM 281
            D   DP+S+HGGC VI G KWSATKW+
Sbjct: 254 LDGQSDPNSVHGGCEVISGEKWSATKWL 281


>gi|412993142|emb|CCO16675.1| predicted protein [Bathycoccus prasinos]
          Length = 564

 Score =  211 bits (536), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 105/214 (49%), Positives = 143/214 (66%), Gaps = 7/214 (3%)

Query: 79  IVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRG 138
           + A +P+A+++ NFLS  EC++L+ LAK  +  STVV +  G S  S +RTS+G FL++ 
Sbjct: 281 LSAVKPKAYLFRNFLSAEECDHLMKLAKAELAPSTVVGA-GGTSVPSTIRTSAGMFLRKA 339

Query: 139 QDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMA 196
            D+ +  IE RIA  +  P  +GEG+Q+L Y+VGQKYD H+DYF D  N   K GGQRMA
Sbjct: 340 ADKTLENIEYRIAAASGTPEPNGEGMQILRYDVGQKYDPHFDYFHDAVNPSPKRGGQRMA 399

Query: 197 TLLMYLSDVEEGGETVFP----AANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
           T+L+YL + +EGGET+FP    A   + T     +E SEC K GL VK  +GDALLFWS+
Sbjct: 400 TMLIYLENTKEGGETIFPRGTRAETFDLTEEGNPHEWSECTKHGLPVKSVKGDALLFWSL 459

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
             D  LD  SLHG CPV++G KW+A KW+ V ++
Sbjct: 460 TDDYKLDMGSLHGACPVVKGQKWTAVKWIRVAKF 493


>gi|255071007|ref|XP_002507585.1| predicted protein [Micromonas sp. RCC299]
 gi|226522860|gb|ACO68843.1| predicted protein [Micromonas sp. RCC299]
          Length = 433

 Score =  210 bits (535), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 103/204 (50%), Positives = 141/204 (69%), Gaps = 8/204 (3%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR----GQ 139
           PRAF++  FLS+ EC+ L++ A+P M KS VVD+  G S  S +RTS+G+F+      G 
Sbjct: 166 PRAFMHIGFLSERECDLLVEYARPNMYKSGVVDASNGGSSFSNIRTSTGSFVPTVFPLGM 225

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           + ++R IE+RIA +T IP  HGE IQVL Y++GQ+Y +H+DYF  E   KN   R+AT+L
Sbjct: 226 NDVVRRIERRIAAWTQIPAAHGEPIQVLRYQIGQEYQSHFDYFFHEGGMKN--NRIATVL 283

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLSDV++GGETVFP+A +    V+       C K G++V PK+GDA+LFW+M+    LD
Sbjct: 284 MYLSDVKDGGETVFPSAES--LQVKPEPIHHACAKNGITVIPKKGDAILFWNMKVGGDLD 341

Query: 260 PSSLHGGCPVIRGNKWSATKWMHV 283
             S H GCPV+ G KW+ATKW+HV
Sbjct: 342 GGSTHAGCPVVLGEKWTATKWLHV 365


>gi|255085784|ref|XP_002505323.1| predicted protein [Micromonas sp. RCC299]
 gi|226520592|gb|ACO66581.1| predicted protein [Micromonas sp. RCC299]
          Length = 215

 Score =  210 bits (535), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 110/217 (50%), Positives = 145/217 (66%), Gaps = 16/217 (7%)

Query: 76  WTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPY---MVKSTVVDSKTGQSKDSRVRTSSG 132
           W E ++WEPRAFVYHNFL+  EC +L++LAK     + ++TV D++TG +        SG
Sbjct: 1   WIEQISWEPRAFVYHNFLTPEECAHLVNLAKATDGGLKRATVADARTGGTF-----PGSG 55

Query: 133 TFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD-EFNTKNG 191
            FL R  D I+  IE+RI+ F  IP +HGEG+++L Y  G+KYD H+DYF D + N +  
Sbjct: 56  AFLLRNHDPIVTRIEERISAFAMIPADHGEGMRILRYGRGEKYDPHHDYFDDGDKNLRFY 115

Query: 192 GQRMATLLMYLSDVEEGGETVFPAANA----NFTSVRWWN---ELSECGKQGLSVKPKRG 244
           GQR+AT+LMYLSDVE GGETVFP   A    +   VR  +   + S+C K  L VKP+RG
Sbjct: 116 GQRVATVLMYLSDVESGGETVFPKHGAWIEPDEMDVRGRSSSKDSSKCAKGALHVKPRRG 175

Query: 245 DALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           DALLF +   +   DP+SLH GCPV+RG KW+ATKWM
Sbjct: 176 DALLFHNCHLNGREDPTSLHAGCPVLRGEKWTATKWM 212


>gi|307110744|gb|EFN58979.1| hypothetical protein CHLNCDRAFT_137600 [Chlorella variabilis]
          Length = 327

 Score =  209 bits (532), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 106/219 (48%), Positives = 144/219 (65%), Gaps = 9/219 (4%)

Query: 76  WTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFL 135
           W E+VAW+PRA + H FLS AEC+++I +A P + +STVV  + G   D  +RTSSG F+
Sbjct: 41  WVEVVAWKPRALLLHGFLSHAECDHIIRVADPSLERSTVVSPEGGSMLD-EIRTSSGMFI 99

Query: 136 KRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-----LDEFNTKN 190
            +G D +I G+E+R+A  T +P+ H E +QVL YE+GQKY AH+D         +   K 
Sbjct: 100 LKGHDAVISGLEERVAALTHLPVSHQEDLQVLRYELGQKYSAHWDINDSPERAQQMRAKG 159

Query: 191 --GGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALL 248
             GG R ATLLMYLSDVEEGGET FP        V+     +EC  +G+ VKP++GDA+L
Sbjct: 160 VLGGLRTATLLMYLSDVEEGGETAFPHGRWLDEGVQAAPPYTECASKGVVVKPRKGDAIL 219

Query: 249 FWSMRPDAT-LDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           F+S++ +    D  SLH GCPV+RG K+SATKW+HV  +
Sbjct: 220 FFSLKLNGQKKDVYSLHAGCPVVRGVKYSATKWVHVEPF 258


>gi|307102963|gb|EFN51228.1| hypothetical protein CHLNCDRAFT_141231 [Chlorella variabilis]
          Length = 313

 Score =  207 bits (526), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 103/219 (47%), Positives = 147/219 (67%), Gaps = 8/219 (3%)

Query: 69  AEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVR 128
           A++    W +++  E R F+  NFL++ EC++++ LAKP++ +S VVD+ TG S+ S +R
Sbjct: 26  AQDPKRPWMQVLDAEARIFI--NFLTEEECDHIVALAKPHLERSGVVDTATGGSEISDIR 83

Query: 129 TSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT 188
           TS G FL+RG D  +  IE+RIA +T +P+ +GEG+QVL+Y  G+KYD   DYF D+ N 
Sbjct: 84  TSKGMFLERGHDDTVAAIEERIARWTLLPVGNGEGLQVLNYHPGEKYD---DYFFDKVNG 140

Query: 189 K-NGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDAL 247
           + NGG R AT+LMYL+ VEEGGETVFP  N            +EC ++ L+ KP +G A+
Sbjct: 141 ESNGGNRYATVLMYLNTVEEGGETVFP--NIPAPGGDNGPTFTECARRHLAAKPTKGSAV 198

Query: 248 LFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           LF S++P   L+  SLH  CPV++G KWSA KW+HV  Y
Sbjct: 199 LFHSIKPSGDLERRSLHTACPVVKGEKWSAPKWIHVGHY 237


>gi|224034451|gb|ACN36301.1| unknown [Zea mays]
 gi|413945801|gb|AFW78450.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
          Length = 295

 Score =  206 bits (523), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 101/207 (48%), Positives = 139/207 (67%), Gaps = 15/207 (7%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++ +PR F+Y +FLS  E  +LI LA+  + +S V D+ +G+S  S             +
Sbjct: 54  ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLS-------------E 100

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +IA +TF+P E+GE IQVL Y+ G+KY+ HYDYF D  NT  GG R AT+L
Sbjct: 101 DPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVRGGHRYATVL 160

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           +YL+DV EGGETVFP A     +      LSEC ++G++V+P++GDALLF+++ PD T D
Sbjct: 161 LYLTDVPEGGETVFPLAEEPDDAKD--ATLSECAQKGIAVRPRKGDALLFFNLNPDGTTD 218

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEY 286
             SLHGGCPVI+G KWSATKW+ V  +
Sbjct: 219 SVSLHGGCPVIKGEKWSATKWIRVASF 245


>gi|308802438|ref|XP_003078532.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
           tauri]
 gi|116056985|emb|CAL51412.1| prolyl 4-hydroxylase alpha-1 subunit precursor (IC) [Ostreococcus
           tauri]
          Length = 369

 Score =  204 bits (520), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 104/214 (48%), Positives = 140/214 (65%), Gaps = 11/214 (5%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P+A++  NFLS  EC++L+ LAK  +  STVV    G S  S +RTS+G FL++ QD  
Sbjct: 89  KPKAYLMRNFLSPQECDHLMMLAKRELAPSTVV-GDGGSSVASEIRTSAGMFLRKSQDDT 147

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN--TKNGGQRMATLLM 200
           +R IE+RIA  + +P+++GEG+Q+L Y+ GQKYD H+DYF D+ N   K GGQR+AT+L+
Sbjct: 148 VREIEERIARLSGVPVDNGEGMQILRYDKGQKYDPHFDYFHDKVNPAPKRGGQRVATVLI 207

Query: 201 YLSDVEEGGETVFPAAN--ANFTSVRWWNEL------SECGKQGLSVKPKRGDALLFWSM 252
           YL D EEGGET FP      NF      N        ++C K G+ VK  RGDA+LF+SM
Sbjct: 208 YLVDTEEGGETTFPNGRLPENFEEDEPDNPFAAHIKHTDCAKNGIPVKSVRGDAILFFSM 267

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
             D  LD  SLHG CPVI G KW+A KW+ V ++
Sbjct: 268 TKDGELDHGSLHGACPVIAGQKWTAVKWLRVAKF 301


>gi|297727581|ref|NP_001176154.1| Os10g0415128 [Oryza sativa Japonica Group]
 gi|255679404|dbj|BAH94882.1| Os10g0415128 [Oryza sativa Japonica Group]
          Length = 241

 Score =  204 bits (519), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 96/173 (55%), Positives = 127/173 (73%), Gaps = 1/173 (0%)

Query: 114 VVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQ 173
           V D+++G+S  S VRTSSG FL++ QD ++  IE+RIA +TF+P ++GE IQ+LHY+ G+
Sbjct: 2   VADNESGKSVMSEVRTSSGMFLEKKQDEVVARIEERIAAWTFLPPDNGESIQILHYQNGE 61

Query: 174 KYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECG 233
           KY+ HYDYF D+ N   GG R+AT+LMYLSDV +GGET+FP A       +  +  S+C 
Sbjct: 62  KYEPHYDYFHDKNNQALGGHRIATVLMYLSDVGKGGETIFPEAEGKLLQPK-DDTWSDCA 120

Query: 234 KQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           K G +VKP +GDALLF+S+ PDAT D  SLHG CPVI G KWSATKW+HV  +
Sbjct: 121 KNGYAVKPVKGDALLFFSLHPDATTDSDSLHGSCPVIEGQKWSATKWIHVRSF 173


>gi|242085722|ref|XP_002443286.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
 gi|241943979|gb|EES17124.1| hypothetical protein SORBIDRAFT_08g016950 [Sorghum bicolor]
          Length = 147

 Score =  202 bits (515), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 96/153 (62%), Positives = 116/153 (75%), Gaps = 8/153 (5%)

Query: 134 FLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQ 193
           FLKRGQD I+R IE+RIAD+T +P+E+GE +QVLHY VGQK++ H+DY      TK GG 
Sbjct: 2   FLKRGQDTIVRTIEQRIADYTSVPIENGEPLQVLHYAVGQKFEPHFDYTDGTSVTKIGGP 61

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R AT LMYLSDVEEGGETVFP A A  +        +   K G+SVKPK GDALLFWSM+
Sbjct: 62  RKATFLMYLSDVEEGGETVFPNATAKGS--------APSAKSGISVKPKMGDALLFWSMK 113

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           PD +LDP SLHG  PVI+G+KWSATKW+HV++Y
Sbjct: 114 PDGSLDPKSLHGASPVIKGDKWSATKWIHVNKY 146


>gi|242047774|ref|XP_002461633.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
 gi|241925010|gb|EER98154.1| hypothetical protein SORBIDRAFT_02g005760 [Sorghum bicolor]
          Length = 275

 Score =  201 bits (510), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 101/208 (48%), Positives = 140/208 (67%), Gaps = 6/208 (2%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W+PR FVY  FLS  EC++L+ LAK    K T+V +    S   + RTSSG FL++ Q
Sbjct: 51  LSWQPRIFVYKGFLSDDECDHLVTLAK----KGTMV-AHNRSSYYRQTRTSSGMFLRKRQ 105

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++  IE+RIA +T +P E+ E +Q+  Y+ GQKYD H+DYF D+ +   GG R AT+L
Sbjct: 106 DPVVSRIEERIAAWTLLPRENVEKMQIQRYQHGQKYDPHFDYFDDKIHHTRGGPRYATVL 165

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLS V++GGETVFP A   + S    +  SEC  +GL+VKP +GDA+LF+S+  D   D
Sbjct: 166 MYLSTVDKGGETVFPKAKG-WESQPKDDTFSECAHKGLAVKPVKGDAVLFFSLHVDGGPD 224

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           P +LHG CPVI+G KWSA  W+HV  ++
Sbjct: 225 PLTLHGSCPVIQGEKWSAPNWIHVRSFE 252


>gi|159469311|ref|XP_001692811.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158278064|gb|EDP03830.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 273

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 103/212 (48%), Positives = 139/212 (65%), Gaps = 7/212 (3%)

Query: 78  EIVAWEPRAFVY--HNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFL 135
           +IV  +P A +Y    FL+  EC+Y+   A+  + +S VVD+ +G S  S +RTS G F 
Sbjct: 35  KIVVLDPDARIYLWKGFLTPEECDYIRMKAEKRLERSGVVDTGSGGSVVSDIRTSDGMFF 94

Query: 136 KRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
           +RG+D II  +E+R+AD+T  P+  GE +QVL Y   QKYD+H+DYF  +  + NGG R 
Sbjct: 95  ERGEDAIIEAVEQRLADWTMTPIWGGESLQVLRYRKDQKYDSHWDYFFHKDGSSNGGNRW 154

Query: 196 ATLLMYLSDVEEGGETVFPAANA-NFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           AT+L+YL++ EEGGETVFP   A N  +V      SEC K  L+VKP +GDALLF SM+P
Sbjct: 155 ATVLLYLTETEEGGETVFPKIPAPNGINV----GFSECAKYNLAVKPHKGDALLFHSMKP 210

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
              L+  S+HG CPVIRG K+S TKW+H   Y
Sbjct: 211 TGELEERSMHGACPVIRGEKFSMTKWIHAGHY 242


>gi|9294584|dbj|BAB02865.1| unnamed protein product [Arabidopsis thaliana]
          Length = 328

 Score =  199 bits (505), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 92/173 (53%), Positives = 127/173 (73%), Gaps = 1/173 (0%)

Query: 114 VVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQ 173
           V D  +G+S+DS VRTSSG FL + QD I+  +E ++A +TF+P E+GE +Q+LHYE GQ
Sbjct: 3   VADVDSGESEDSEVRTSSGMFLTKRQDDIVANVEAKLAAWTFLPEENGEALQILHYENGQ 62

Query: 174 KYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECG 233
           KYD H+DYF D+   + GG R+AT+LMYLS+V +GGETVFP        ++  +  S+C 
Sbjct: 63  KYDPHFDYFYDKKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTPQLK-DDSWSKCA 121

Query: 234 KQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           KQG +VKP++GDALLF+++  + T DP+SLHG CPVI G KWSAT+W+HV  +
Sbjct: 122 KQGYAVKPRKGDALLFFNLHLNGTTDPNSLHGSCPVIEGEKWSATRWIHVRSF 174


>gi|308812133|ref|XP_003083374.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
           [Ostreococcus tauri]
 gi|116055254|emb|CAL57650.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
           [Ostreococcus tauri]
          Length = 311

 Score =  198 bits (503), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 99/217 (45%), Positives = 133/217 (61%), Gaps = 9/217 (4%)

Query: 76  WTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFL 135
           W E ++  PRA+V+  FL+ AEC+ +I+ A P M  S V D  +G+++    R+S G ++
Sbjct: 67  WIEKISDSPRAYVFREFLTDAECDRVIERAYPTMEASEVTDDDSGEARPDDARSSIGGWV 126

Query: 136 KRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
               D +IR IE R + +  +PM  GE +QVL YE GQKYDAH D+F DE N KNGGQR+
Sbjct: 127 SGDDDEVIRNIELRASTWAMLPMNRGETMQVLRYEKGQKYDAHDDFFHDEHNVKNGGQRV 186

Query: 196 ATLLMYLSDVEEGGETVFPAAN---------ANFTSVRWWNELSECGKQGLSVKPKRGDA 246
           AT+LMYLSDVEEGGETVFP            +  T        S+   + L+VKP+RGDA
Sbjct: 187 ATILMYLSDVEEGGETVFPLGTPLGGRDPEKSGVTGDNACELASQNDPRVLAVKPRRGDA 246

Query: 247 LLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHV 283
           LLF++      +D  + H GCPV RG KW+ T+W  V
Sbjct: 247 LLFFNAHLSGEMDEKANHAGCPVNRGTKWTMTRWHRV 283


>gi|307102975|gb|EFN51240.1| hypothetical protein CHLNCDRAFT_28187 [Chlorella variabilis]
          Length = 322

 Score =  197 bits (501), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 108/229 (47%), Positives = 143/229 (62%), Gaps = 28/229 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRT-----SSG 132
           E+V+W+PRA + H FL+ +EC+++I LA+  +  S VV S+ G  K   VRT     SSG
Sbjct: 16  ELVSWKPRALLLHGFLAHSECDHMISLAEARLEPSKVV-SRDGSGKLDSVRTRQGLSSSG 74

Query: 133 TFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-------LDE 185
           TFL + QD ++ G+E RI   T +P  H E +QVL YE+GQKY AHYD         L  
Sbjct: 75  TFLTKRQDSVVAGVEDRIELATHLPFSHSEQLQVLKYELGQKYSAHYDVHGSNEQAQLAI 134

Query: 186 FNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNE-------LSECGKQGLS 238
              + GG R AT+LMYLSDVEEGGET FP         RW +E        SECG +G++
Sbjct: 135 RRGEQGGSRYATMLMYLSDVEEGGETSFPHG-------RWIDEGAQAQPPYSECGSRGVA 187

Query: 239 VKPKRGDALLFWSMRPDA-TLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           VKP++GDA+LF+S++ D  + D  SLH GCPV +G K+SAT W+HV  Y
Sbjct: 188 VKPRKGDAILFYSLKSDGQSKDFFSLHAGCPVAKGVKYSATAWIHVEPY 236


>gi|168006299|ref|XP_001755847.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693166|gb|EDQ79520.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 299

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 117/301 (38%), Positives = 172/301 (57%), Gaps = 31/301 (10%)

Query: 4   LRHSRLQAKKWSTLTLVLSM---LFMLTIVLLMLLAMGIFYIPIGDDDSPPNDLTSFRRR 60
           ++ S+ + +KW+  T  LS+   L   TI LL   A   F+    ++++       +R+ 
Sbjct: 1   MKSSKNKGRKWNPFTYKLSLPQVLLWCTICLLAGYAASNFFPQKIEEEA---IYQPYRKS 57

Query: 61  AFEKRSSIAEEKGEQWT----------------EIVAWEPRAFVYHNFLSKAECEYLIDL 104
           A ++      E  E+                  ++++W+PRA +Y  F SK +CE ++ L
Sbjct: 58  AQQEGEFPFGEFSEKVVLDHGSTGDNFIADIPFQVLSWKPRALLYPRFASKEQCEAIMKL 117

Query: 105 AKPYMVKSTVVDSKTGQSKDSR--VRTSSGTFLKRGQD--RIIRGIEKRIADFTFIPMEH 160
           A+  +  S +   K G+S+DS   +RTSSGTFL+  +D  R +  +E+++A  T IP E+
Sbjct: 118 ARTRLAPSALALRK-GESEDSTKDIRTSSGTFLRADEDTTRSLEQVEEKMAKATMIPREN 176

Query: 161 GEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANF 220
           GE   VL Y VGQKYD HYD F          QRMA+ L+YLSDVEEGGET+FP    NF
Sbjct: 177 GEAFNVLKYNVGQKYDCHYDVFDPAEYGPQPSQRMASFLLYLSDVEEGGETMFPF--ENF 234

Query: 221 TSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKW 280
            ++    +  +C   G+ VKP++GDALLF+SM P+ T D S+LHG CPVI+G KW ATKW
Sbjct: 235 QNMNIGFDYKKC--IGMKVKPRQGDALLFYSMHPNGTFDKSALHGSCPVIKGEKWVATKW 292

Query: 281 M 281
           +
Sbjct: 293 I 293


>gi|302844247|ref|XP_002953664.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
           nagariensis]
 gi|300261073|gb|EFJ45288.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
           nagariensis]
          Length = 364

 Score =  196 bits (497), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 106/217 (48%), Positives = 140/217 (64%), Gaps = 8/217 (3%)

Query: 70  EEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRT 129
           +E+   W E V   PRA+++HNFL+KAE  +++ LA P + +STVV SK G+     +RT
Sbjct: 41  DEEATPWVEQVGLHPRAYLFHNFLTKAERAHMVRLAAPKLKRSTVVGSK-GEGVVDNIRT 99

Query: 130 SSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL--DEFN 187
           S G F++R  D II  IEKRI+ +T +P+EH E IQVL Y  GQ Y AHYD     D   
Sbjct: 100 SFGMFIRRLSDPIIARIEKRISLWTHLPIEHQEDIQVLRYAHGQTYGAHYDSGASSDHVG 159

Query: 188 TKNGGQRMATLLMYLSDVEEGGETVFPAANANF--TSVRWWNELSECGKQGLSVKPKRGD 245
            K    R+AT LMYLSDVEEGGET FP  +  +  T       +SEC K  ++ KPK GD
Sbjct: 160 PK---WRLATFLMYLSDVEEGGETAFPQNSVWYDPTIPERIGPVSECAKGHVAAKPKAGD 216

Query: 246 ALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           A+LF+S  P+ T+DP+++H GCPVI+G KW+A  WMH
Sbjct: 217 AVLFYSFLPNNTMDPAAMHTGCPVIKGIKWAAPVWMH 253


>gi|159487763|ref|XP_001701892.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158281111|gb|EDP06867.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 259

 Score =  195 bits (496), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 101/214 (47%), Positives = 138/214 (64%), Gaps = 10/214 (4%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E VAW+PR F+YHNF+++ E ++LI+LA P M +STVV +  G+S +   RTS GTFLKR
Sbjct: 2   EHVAWKPRVFIYHNFITEVEAKHLIELAAPQMKRSTVVGAG-GKSVEDNYRTSYGTFLKR 60

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMAT 197
            QD I+  IE R+A +T IP+ H E  Q+L Y +GQ+Y  H D   DE      G R+AT
Sbjct: 61  YQDEIVERIENRVAAWTQIPVAHQEDTQILRYGLGQQYKVHADTLRDE----EAGVRVAT 116

Query: 198 LLMYLSDVEEGGETVFPAA---NANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +L+YL++ + GGET FP++   N         N  S+C K  ++  PKRGDALLFWS+ P
Sbjct: 117 VLIYLNEPDGGGETAFPSSEWVNPQLAKTLGAN-FSDCAKNHVAFAPKRGDALLFWSINP 175

Query: 255 DA-TLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           D  T D  + H GCPV+ G KW+ATKW+H   ++
Sbjct: 176 DGNTEDTHASHTGCPVLSGVKWTATKWIHARPFR 209


>gi|145354086|ref|XP_001421326.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144581563|gb|ABO99619.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 309

 Score =  195 bits (496), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 104/218 (47%), Positives = 136/218 (62%), Gaps = 9/218 (4%)

Query: 75  QWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTF 134
           QW E ++  PRA+VY NFL++ E E  I  A+  M +S VV+   G SK S  RTSSG +
Sbjct: 76  QWIERISESPRAYVYRNFLTREEAEATIAAARRTMRRSEVVNEADGTSKTSDERTSSGGW 135

Query: 135 LKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
           +      ++  IE+R+A +T +P   GE  QV+ YE GQ+Y AH DYF DE N KNGGQR
Sbjct: 136 VSGEDSEVMANIERRVAAWTMLPRNRGETTQVMRYEAGQEYAAHDDYFHDEVNVKNGGQR 195

Query: 195 MATLLMYLSDVEEGGETVFP------AANANFTSVRWWNELSECGKQG---LSVKPKRGD 245
            AT+LMYLSDVEEGGETVFP       A    + V   N      +     L+VKP+RGD
Sbjct: 196 AATVLMYLSDVEEGGETVFPRGTPLGGAAPEKSGVTQGNACERALRGDPNVLAVKPRRGD 255

Query: 246 ALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHV 283
           ALLF+++  +  +D  + H GCPV+RG KW+AT+W HV
Sbjct: 256 ALLFFNVHLNGEVDERARHAGCPVVRGTKWTATRWQHV 293


>gi|302841711|ref|XP_002952400.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
           nagariensis]
 gi|300262336|gb|EFJ46543.1| hypothetical protein VOLCADRAFT_81799 [Volvox carteri f.
           nagariensis]
          Length = 269

 Score =  193 bits (491), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 95/210 (45%), Positives = 133/210 (63%), Gaps = 15/210 (7%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           + R +++  FL+  EC+Y+   A+  + +S VVD+ +G S  S +RTS G F +RG+D I
Sbjct: 42  DARIYLWRGFLTPEECDYIRMKAEKRLERSGVVDTASGSSVVSDIRTSDGMFFERGEDAI 101

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           +  +E+R+AD+T  P+  GE +QVL Y   QKYD+H +YF  +  + NGG R AT+L YL
Sbjct: 102 LEAVEQRLADWTMTPIWAGEALQVLRYRKDQKYDSHVNYFFHKEGSANGGNRWATVLTYL 161

Query: 203 SDVEEGGETVFP------AANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
           +D EEGGETVFP        N  F         SEC K  L+VKP++GDA+LF SM+ + 
Sbjct: 162 TDTEEGGETVFPKIPAPGGVNVGF---------SECAKYNLAVKPRKGDAILFHSMKTNG 212

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            L+  SLHG CPVI+G K+S TKW+H   Y
Sbjct: 213 QLEERSLHGACPVIKGEKFSMTKWIHAGHY 242


>gi|384250156|gb|EIE23636.1| hypothetical protein COCSUDRAFT_53414 [Coccomyxa subellipsoidea
           C-169]
          Length = 285

 Score =  193 bits (491), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 101/271 (37%), Positives = 157/271 (57%), Gaps = 27/271 (9%)

Query: 25  FMLTIVLLMLLAMGIFYIPIGDDDSPPNDLTSFRRRAFEKRSSI----------AEEKGE 74
           ++    L +L+A+G+  I               R+ AFE+   +             K  
Sbjct: 6   WVCMTALALLMAVGVLGI---------------RQGAFEREDKVIGWSESYQHKTPAKSS 50

Query: 75  QWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTF 134
              E ++W PRAF+Y   LS+ EC+Y+I+ A+P MVK+TV+D+KT +   +++R +   +
Sbjct: 51  LMVERISWNPRAFLYRGLLSQDECDYIINAARPNMVKATVLDAKTKKQVPNKLRNNKEAY 110

Query: 135 LKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
           +    D +I  IE+RIA +TF+P  HGE   ++ Y  GQ Y  H D+  D ++ + G +R
Sbjct: 111 IDGSADDVIDQIERRIARYTFLPAAHGEPFHIMQYLPGQGYAPHTDWLDDWWHPRLGNER 170

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +AT+++YLSDV EGGETVFP  N+           S+C +QG++VKP +GDALL +++  
Sbjct: 171 IATMIIYLSDVVEGGETVFP--NSTMQPHVGDAAYSKCAQQGIAVKPVKGDALLLYNLLE 228

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
           +   D  SLH GCPVIRG KW+ATK + V++
Sbjct: 229 NGRNDGESLHQGCPVIRGVKWTATKRILVNQ 259


>gi|159486447|ref|XP_001701251.1| hypothetical protein CHLREDRAFT_122372 [Chlamydomonas reinhardtii]
 gi|158271833|gb|EDO97644.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 251

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 96/212 (45%), Positives = 133/212 (62%), Gaps = 9/212 (4%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E V+W PR F+YHNFLS AEC ++   A P M +S+VV +  G S    +RTS GTF++R
Sbjct: 3   ETVSWNPRVFIYHNFLSDAECRHIKRTAAPMMKRSSVVGTN-GSSVLDTIRTSYGTFIRR 61

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMAT 197
             D ++  + +R+A +T  P E+ E +QVL Y  GQKY AH D  +D+        RMAT
Sbjct: 62  RHDPVVERVLRRVAAWTKAPPENQEDLQVLRYGPGQKYGAHMDSLIDD------SPRMAT 115

Query: 198 LLMYLSDVEEGGETVFPAANA--NFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           +L+YL D E GGET FP +    + +  +     SEC +  ++ +PK+GDAL+FWS++PD
Sbjct: 116 VLLYLHDTEYGGETAFPDSGHWLDPSLAQSMGPFSECAQGHVAFRPKKGDALMFWSIKPD 175

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
            T DP SLH GCPV+ G KW+AT W+H   Y 
Sbjct: 176 GTHDPLSLHTGCPVVTGVKWTATSWVHSMPYN 207


>gi|159464219|ref|XP_001690339.1| hypothetical protein CHLREDRAFT_114525 [Chlamydomonas reinhardtii]
 gi|158279839|gb|EDP05598.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 244

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 94/200 (47%), Positives = 137/200 (68%), Gaps = 6/200 (3%)

Query: 85  RAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIR 144
           R F+  +FL+  E ++++ +++  + +S VV +  G S++S++RTS G FL+RG+D +++
Sbjct: 1   RIFLIEHFLTDEEADHIVQVSERRLERSGVV-ATNGGSEESQIRTSFGVFLERGEDPVVK 59

Query: 145 GIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSD 204
           G+E+RI+  T +P+ +GEG+QVL Y+  QKYDAH+DYF  +    NGG R AT+LMYL D
Sbjct: 60  GVEERISALTLMPVGNGEGLQVLRYQKEQKYDAHWDYFFHKDGIANGGNRYATVLMYLVD 119

Query: 205 VEEGGETVFPAANANFTSVRWWN-ELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSL 263
            EEGGETVFP    N  +    N   SEC +  L+ KPK+G A+LF S++P   L+  SL
Sbjct: 120 TEEGGETVFP----NIAAPGGENVGFSECARYHLAAKPKKGTAILFHSIKPTGELERKSL 175

Query: 264 HGGCPVIRGNKWSATKWMHV 283
           H  CPVI+G KWSA KW+HV
Sbjct: 176 HTACPVIKGIKWSAAKWIHV 195


>gi|302838815|ref|XP_002950965.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
           nagariensis]
 gi|300263660|gb|EFJ47859.1| hypothetical protein VOLCADRAFT_60971 [Volvox carteri f.
           nagariensis]
          Length = 298

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 97/223 (43%), Positives = 135/223 (60%), Gaps = 21/223 (9%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E V+W PR F+YHNFL+  EC ++   A P M +S+VV  + G S    +RTS GTF++R
Sbjct: 3   EAVSWNPRVFIYHNFLTDGECRHIKRTAAPMMKRSSVV-GQNGSSVTDNIRTSYGTFIRR 61

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQ------------VLHYEVGQKYDAHYDYFLDE 185
             D +I  I +R+A +T  P E+ E +Q            VL Y +GQKY AH D  +D+
Sbjct: 62  RHDPVIERILRRVAAWTKAPPENQEDLQAGRGEGGREKERVLRYGIGQKYGAHMDSLIDD 121

Query: 186 FNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTS--VRWWNELSECGKQGLSVKPKR 243
                   RMAT+L+YL D EEGGET FP +++  T          SEC +  ++ +PK+
Sbjct: 122 ------SPRMATVLLYLHDTEEGGETAFPDSSSWLTPDLATRMGPFSECAQGHVAFRPKK 175

Query: 244 GDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           GDAL+FWS++PD T DP S+H GCPV++G KW+AT W+H   Y
Sbjct: 176 GDALMFWSIKPDGTHDPLSMHTGCPVVKGVKWTATSWVHSMPY 218


>gi|159487419|ref|XP_001701720.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280939|gb|EDP06695.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 274

 Score =  191 bits (485), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 100/207 (48%), Positives = 136/207 (65%), Gaps = 7/207 (3%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V   PRA+ +HNFL+KAE  +L+ LA P + +STVV    G+     +RTS G F++R Q
Sbjct: 4   VGLHPRAYYFHNFLTKAERGHLVKLAAPKLKRSTVV-GNDGEGVVDNIRTSYGMFIRRLQ 62

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++  IEKRI+ +T +P+EH E IQVL Y  GQ Y AHYD   D+ N      R+AT L
Sbjct: 63  DPVVARIEKRISLWTHLPVEHQEDIQVLRYAHGQTYGAHYDSG-DKSNEPGPKWRLATFL 121

Query: 200 MYLSDVEEGGETVFPA----ANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           MYLSDVEEGGET FP     A+ +    +  ++ S+C K  ++ KPK GDA+LF+S  P+
Sbjct: 122 MYLSDVEEGGETAFPHNSVWADPSIPE-KVGDKFSDCAKGNVAAKPKAGDAVLFYSFYPN 180

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMH 282
            T+DP+++H GCPVI+G KW+A  WMH
Sbjct: 181 MTMDPAAMHTGCPVIKGVKWAAPVWMH 207


>gi|168043388|ref|XP_001774167.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162674574|gb|EDQ61081.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 284

 Score =  190 bits (483), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 95/207 (45%), Positives = 133/207 (64%), Gaps = 7/207 (3%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSK-TGQSKDSRVRTSSGTFLK 136
           ++++W+PRA +Y NF SK +CE +I LA+  +  S +   K   ++    +RTSSGTFL+
Sbjct: 76  QVLSWKPRALLYPNFASKEQCEAIIKLARTRLAPSGLALRKGESEATTKEIRTSSGTFLR 135

Query: 137 RGQDRI--IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
             +D+   +  +E+++A  T IP ++GE   VL Y  GQKYD HYD F          QR
Sbjct: 136 ASEDKTQSLAEVEEKMARATMIPRQNGEAFNVLRYNPGQKYDCHYDVFDPAEYGPQPSQR 195

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           MA+ L+YLSDVEEGGET+FP  N  F ++       +C   GL VKP++GDALLF+SM P
Sbjct: 196 MASFLLYLSDVEEGGETMFPFEN--FQNMNTGYNYKDC--IGLKVKPRQGDALLFYSMHP 251

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWM 281
           + T D ++LHG CPVI+G KW ATKW+
Sbjct: 252 NGTFDKTALHGSCPVIKGEKWVATKWI 278


>gi|255584898|ref|XP_002533164.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223527036|gb|EEF29223.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 290

 Score =  190 bits (483), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 92/211 (43%), Positives = 138/211 (65%), Gaps = 7/211 (3%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR-VRTSSGTFLK 136
           ++++W+PRA  + NF +  +C+ +I++AKP +  ST+   K    ++++ +RTSSG FL 
Sbjct: 81  QVLSWKPRALYFPNFATAEQCQSVINMAKPNLTPSTLALRKGETEENTKGIRTSSGMFLS 140

Query: 137 RGQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
             +D+  ++  IE++IA  T +P  +GE   +L YE+GQKY++HYD F          QR
Sbjct: 141 ASEDKTGVLDAIEEKIARATMLPRANGEAFNILRYEIGQKYNSHYDAFNPAEYGPQKSQR 200

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +A+ L+YLSDVEEGGET+FP  N     V    +  +C   GL V+P+RGD LLF+S+ P
Sbjct: 201 VASFLLYLSDVEEGGETMFPFEND--LDVDESYDFEKC--IGLQVRPRRGDGLLFYSLFP 256

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
           + T+DP+SLHG CPVI+G KW ATKW+   E
Sbjct: 257 NNTIDPTSLHGSCPVIKGEKWVATKWIRDQE 287


>gi|302802700|ref|XP_002983104.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
 gi|300149257|gb|EFJ15913.1| hypothetical protein SELMODRAFT_234144 [Selaginella moellendorffii]
          Length = 292

 Score =  189 bits (480), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 96/207 (46%), Positives = 133/207 (64%), Gaps = 7/207 (3%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR-VRTSSGTFLK 136
           ++++W PRA ++  F S A+CE +I LAK  +  S++   K   + +++ VRTS G FL 
Sbjct: 83  QVLSWTPRALLFPKFASPAQCEAIISLAKTKLTPSSLALRKGETATETQDVRTSHGCFLS 142

Query: 137 RGQDRI--IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
             QD+   +  +E+++A  T IP  HGE   VL YE+GQKY++HYD F          QR
Sbjct: 143 SRQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEIGQKYNSHYDVFNPAEYGPQKSQR 202

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           MA+ L+YLSDVEEGGET+FP  N    +  +  +  EC   GL VKPK+GDALLF+SM P
Sbjct: 203 MASFLLYLSDVEEGGETMFPFENYEHMNENY--DYKEC--IGLKVKPKQGDALLFYSMFP 258

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWM 281
           + T D ++LHG CPVI+G KW ATKW+
Sbjct: 259 NGTFDKTALHGSCPVIKGEKWVATKWI 285


>gi|303287328|ref|XP_003062953.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455589|gb|EEH52892.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 259

 Score =  189 bits (480), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 100/222 (45%), Positives = 142/222 (63%), Gaps = 13/222 (5%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E ++W PRAF  HN ++ AEC+ +++LA+  + +STVVDS TG+SK   +RTS   FL R
Sbjct: 2   EPISWHPRAFHLHNIMTDAECDEVLELARTRVRRSTVVDSTTGESKVDPIRTSEQCFLNR 61

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQ-----VLHYEVGQKYDAHYDYF-LDEFNTKN- 190
           G   I+  IEKR+  +T +P  +GE +Q     VL Y  GQKYDAH+D   LD  + K  
Sbjct: 62  GHFPIVSVIEKRLERYTMLPWYNGEDLQARPSRVLKYSNGQKYDAHHDVGELDTASGKQL 121

Query: 191 ---GGQRMATLLMYLSDVEE--GGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGD 245
              GG R+AT+L+YLSDV++  GGET FP +     +    +  SEC +  ++VKPK+GD
Sbjct: 122 AAEGGHRVATVLLYLSDVDDDGGGETAFPDSEWIDPTADRGSGWSECAEDHVAVKPKKGD 181

Query: 246 ALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
            LLFWS+ P+  +D  S+H GCPV+ G  W+ATKW+H   ++
Sbjct: 182 GLLFWSITPEGVIDQQSMHAGCPVL-GKSWTATKWIHARPFR 222


>gi|40809925|dbj|BAD07294.1| prolyl 4-hydroxylase [Nicotiana tabacum]
          Length = 286

 Score =  189 bits (480), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 97/208 (46%), Positives = 136/208 (65%), Gaps = 9/208 (4%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR--VRTSSGTFL 135
           ++++W PRA  + NF S  +C+ +I +AK  M  S++   +TG+++++   +RTSSGTF+
Sbjct: 77  QVLSWFPRALYFPNFASIEQCQSIIKMAKANMEPSSLA-LRTGETEETTKGIRTSSGTFI 135

Query: 136 KRGQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQ 193
              +D+  I+  IE++IA  T IP  HGE   VL YE+GQ+Y +HYD F          Q
Sbjct: 136 SASEDKTGILDLIEEKIAKATMIPKTHGEAFNVLRYEIGQRYQSHYDAFDPAQYGPQKSQ 195

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R A+ L+YLSDVEEGGETVFP  N       +  + S+C   GL VKP+RGD LLF+S+ 
Sbjct: 196 RAASFLLYLSDVEEGGETVFPYENGQNMDASY--DFSKC--IGLKVKPRRGDGLLFYSLF 251

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWM 281
           P+ T+D +SLHG CPVIRG KW ATKW+
Sbjct: 252 PNGTIDLTSLHGSCPVIRGEKWVATKWI 279


>gi|307110383|gb|EFN58619.1| hypothetical protein CHLNCDRAFT_19485 [Chlorella variabilis]
          Length = 328

 Score =  189 bits (480), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 96/212 (45%), Positives = 139/212 (65%), Gaps = 27/212 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E V+W+PRAFV+HNF+++ E ++++ LAKP+M +STVV +  G S + ++RTS GTFLKR
Sbjct: 33  EPVSWKPRAFVFHNFMTEEEADHIVALAKPFMKRSTVVGAG-GASVEDQIRTSYGTFLKR 91

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMAT 197
            QD I+  +E+R+A +T + + H E +Q+L Y +GQKY AHYD      +  N   R+ T
Sbjct: 92  LQDPIVTAVEQRLATWTKLNVSHQEDMQILRYGIGQKYGAHYD------SLDNDSPRVCT 145

Query: 198 LLMYLSDV--EEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           +L+YLSDV  + GGET FP                  G +  ++ PK+GDALLF+S++PD
Sbjct: 146 VLLYLSDVPADGGGETAFP------------------GVRRQALYPKKGDALLFYSLKPD 187

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
            T D  SLH GCP+I G KW+ATKW+H   ++
Sbjct: 188 GTSDAYSLHTGCPIISGVKWTATKWIHTLPFR 219


>gi|302764866|ref|XP_002965854.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
 gi|300166668|gb|EFJ33274.1| hypothetical protein SELMODRAFT_84512 [Selaginella moellendorffii]
          Length = 231

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 97/211 (45%), Positives = 134/211 (63%), Gaps = 7/211 (3%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR-VRTSSGTFLK 136
           ++++W PRA ++  F S A+CE +I LAK  +  S++   K   + +++ VRTS G FL 
Sbjct: 22  QVLSWTPRALLFPKFASPAQCEAIISLAKTKLTPSSLALRKGETATETQDVRTSHGCFLS 81

Query: 137 RGQDRI--IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
             QD+   +  +E+++A  T IP  HGE   VL YE+GQKY++HYD F          QR
Sbjct: 82  SRQDKTGTLAWVEEKMAKATMIPKSHGEAFNVLRYEIGQKYNSHYDVFNPAEYGPQKSQR 141

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           MA+ L+YLSDVEEGGET+FP  N    +  +  +  EC   GL VKPK+GDALLF+SM P
Sbjct: 142 MASFLLYLSDVEEGGETMFPFENYEHMNENY--DYKEC--IGLKVKPKQGDALLFYSMFP 197

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
           + T D ++LHG CPVI+G KW ATKW+   E
Sbjct: 198 NGTFDKTALHGSCPVIKGEKWVATKWIRDKE 228


>gi|412988743|emb|CCO15334.1| predicted protein [Bathycoccus prasinos]
          Length = 352

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 95/214 (44%), Positives = 134/214 (62%), Gaps = 13/214 (6%)

Query: 76  WTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFL 135
           W E ++W+PRAF+YHNFLSK E ++L+DL +P + +STVV  +TG+  D  +RTS GTF+
Sbjct: 67  WIEALSWDPRAFLYHNFLSKEEAKHLVDLGEPRVTRSTVVGGQTGRVSD--IRTSFGTFI 124

Query: 136 KRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
            +  D ++  IE R A F+ IP+ H E +Q+L Y  GQKY  H D  + E    NGG+R+
Sbjct: 125 PKKYDEVLEKIEDRCAVFSGIPVAHQEQMQLLRYRDGQKYSDHTDGLISE----NGGKRI 180

Query: 196 ATLLMYLSDVEEGGETVF----PAANANFTSVRWWNELSECG---KQGLSVKPKRGDALL 248
           AT+LM+L +  EGGET F    P         R  ++ S+CG    +G +VKPK GDA+L
Sbjct: 181 ATILMFLHEPTEGGETSFVLGNPLGKVKERIERTKDQFSDCGYRSGKGFAVKPKVGDAIL 240

Query: 249 FWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           F+S       D +S+H  CP + G KW+AT W+H
Sbjct: 241 FFSFSEAGITDNNSMHASCPTLGGTKWTATMWIH 274


>gi|108706360|gb|ABF94155.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative [Oryza
           sativa Japonica Group]
 gi|125585047|gb|EAZ25711.1| hypothetical protein OsJ_09544 [Oryza sativa Japonica Group]
          Length = 277

 Score =  186 bits (473), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 98/213 (46%), Positives = 135/213 (63%), Gaps = 22/213 (10%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKP-YMVKSTVVDSKTGQSKDSRVRTSSGTFLKRG 138
           V+W PRAF+Y  FLS AEC++LI LAK   M KSTVVD ++G+S  S+VRTSSG FL + 
Sbjct: 45  VSWRPRAFLYEGFLSDAECDHLISLAKQGKMEKSTVVDGESGESVTSKVRTSSGMFLDKK 104

Query: 139 QDRIIRGIEKRIADFTFIPME-----------------HGEGIQVLHYEVGQKYDAHYDY 181
           QD ++  IE+RIA +T +P E                 +GE +Q+L Y  G+KY+ H+DY
Sbjct: 105 QDEVVARIEERIAAWTMLPTECIIFYCFANFAILKLSENGESMQILRYGQGEKYEPHFDY 164

Query: 182 FLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKP 241
                 +   G R+AT+LMYLS+V+ G +++ P A  +      W   S+C +QG +VKP
Sbjct: 165 ISGRQGSTREGDRVATVLMYLSNVKMG-DSLLPQARLSQPKDETW---SDCAEQGFAVKP 220

Query: 242 KRGDALLFWSMRPDATLDPSSLHGGCPVIRGNK 274
            +G A+LF+S+ P+ATLD  SLHG CPVI G K
Sbjct: 221 AKGSAVLFFSLHPNATLDTDSLHGSCPVIEGEK 253


>gi|224071291|ref|XP_002303388.1| predicted protein [Populus trichocarpa]
 gi|222840820|gb|EEE78367.1| predicted protein [Populus trichocarpa]
          Length = 297

 Score =  185 bits (470), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 93/208 (44%), Positives = 135/208 (64%), Gaps = 10/208 (4%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR-VRTSSGTFLK 136
           ++++W PRA  Y  F++  +C+++I++AKP +  ST+   K   ++ ++ +RTSSG F+ 
Sbjct: 90  QVLSWRPRALYYPGFITAEQCQHIINMAKPSLQPSTLALRKGETAETTKGIRTSSGMFVF 149

Query: 137 RGQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
             +D+  +++ IE++IA  T IP  HGE   VL YE+GQKYDAHYD F          QR
Sbjct: 150 SSEDQAGVLQVIEEKIARATMIPSTHGEAFNVLRYEIGQKYDAHYDAFNPAEYGPQTSQR 209

Query: 195 MATLLMYLSDVEEGGETVFPAAN-ANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           +AT L+YLS+ EEGGET FP  N  NF       +  +C   GL VKP +GDA+LF+S+ 
Sbjct: 210 VATFLLYLSNFEEGGETTFPIENDENFEGY----DAQKC--NGLRVKPHQGDAILFYSIF 263

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWM 281
           P+ T+DP+SLH  C VI+G KW ATKW+
Sbjct: 264 PNNTIDPASLHASCHVIKGEKWVATKWI 291


>gi|255647903|gb|ACU24410.1| unknown [Glycine max]
          Length = 293

 Score =  184 bits (468), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 94/212 (44%), Positives = 134/212 (63%), Gaps = 9/212 (4%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR--VRTSSGTFL 135
           ++++W PRA  + NF +  +CE +ID+AK  +  ST+   + G+++D+   +RTSSG F+
Sbjct: 84  QVLSWRPRAVYFPNFATAEQCESIIDVAKDGLKPSTLA-LRQGETEDNTKGIRTSSGVFV 142

Query: 136 KRGQD--RIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQ 193
              +D  R +  IE++IA  T IP  HGE   +L YEV Q+Y++HYD F          Q
Sbjct: 143 SASEDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQRYNSHYDAFNPAEYGPQKSQ 202

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           RMA+ L+YL+DVEEGGET+FP  N       +  E    G  GL VKP++GD LLF+S+ 
Sbjct: 203 RMASFLLYLTDVEEGGETMFPFENGLNMDGNYGYE----GCIGLKVKPRQGDGLLFYSLL 258

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
            + T+DP+SLHG CPVI+G KW ATKW+   E
Sbjct: 259 TNGTIDPTSLHGSCPVIKGEKWVATKWIRDQE 290


>gi|226494249|ref|NP_001141909.1| uncharacterized protein LOC100274058 [Zea mays]
 gi|194706408|gb|ACF87288.1| unknown [Zea mays]
 gi|413932757|gb|AFW67308.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
 gi|413932758|gb|AFW67309.1| hypothetical protein ZEAMMB73_919439 [Zea mays]
          Length = 217

 Score =  184 bits (468), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 88/149 (59%), Positives = 114/149 (76%), Gaps = 3/149 (2%)

Query: 139 QDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATL 198
           +D I+  IEKR+A +TF+P E+ E +QVL YE GQKYDAH+DYF D  N K GGQR+AT+
Sbjct: 17  KDEIVSAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRVATV 76

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNEL-SECGKQGLSVKPKRGDALLFWSMRPDAT 257
           LMYL+DV +GGETVFP  NA  + +++ +E  SEC + GL+VKPK+GDALLF+++  +AT
Sbjct: 77  LMYLTDVNKGGETVFP--NAEGSHLQYKDETWSECSRSGLAVKPKKGDALLFFNLHVNAT 134

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            D  SLHG CPVI G KWSATKW+HV  +
Sbjct: 135 ADTGSLHGSCPVIEGEKWSATKWIHVRSF 163


>gi|356563543|ref|XP_003550021.1| PREDICTED: putative prolyl 4-hydroxylase-like [Glycine max]
          Length = 293

 Score =  184 bits (466), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 94/212 (44%), Positives = 135/212 (63%), Gaps = 9/212 (4%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR--VRTSSGTFL 135
           ++++W PRA  + NF +  +CE +ID+AK  +  ST+   + G+++D+   +RTSSG F+
Sbjct: 84  QVLSWRPRAVYFPNFATAEQCESIIDVAKDGLKPSTLA-LRQGETEDNTKGIRTSSGVFV 142

Query: 136 KRGQD--RIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQ 193
              +D  R +  IE++IA  T IP  HGE   +L YEV Q+Y++HYD F          Q
Sbjct: 143 SASEDKTRTLDVIEEKIARATMIPRSHGEAFNILRYEVNQRYNSHYDAFNPAEYGPQKSQ 202

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           RMA+ L+YL+DVEEGGET+FP  N       +  E  +C   GL VKP++GD LLF+S+ 
Sbjct: 203 RMASFLLYLTDVEEGGETMFPFENGLNMDGNYGYE--DC--IGLKVKPRQGDGLLFYSLL 258

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
            + T+DP+SLHG CPVI+G KW ATKW+   E
Sbjct: 259 TNGTIDPTSLHGSCPVIKGEKWVATKWIRDQE 290


>gi|159487421|ref|XP_001701721.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280940|gb|EDP06696.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 336

 Score =  183 bits (465), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 102/219 (46%), Positives = 138/219 (63%), Gaps = 11/219 (5%)

Query: 70  EEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRT 129
           EE    W + V   PRA+ +HNFL+KAE  +L+ +A P + +STVV  K     D  +RT
Sbjct: 12  EEDATPWVQQVGLHPRAYYFHNFLTKAERAHLVRVAAPKLKRSTVVGGKGEGVVDD-IRT 70

Query: 130 SSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL--DEFN 187
           S G F++R  D ++  IEKRI+ +T +P+EH E IQ+L Y  GQ Y AHYD     D   
Sbjct: 71  SYGMFIRRLSDPVVTRIEKRISLWTHLPVEHQEDIQILRYAHGQTYGAHYDSGASSDHVG 130

Query: 188 TKNGGQRMATLLMYLSDVEEGGETVFPA----ANANFTSVRWWNELSECGKQGLSVKPKR 243
            K    R+AT LMYLSDVEEGGET FP     A+ +    +  ++ S+C K  ++ KPK 
Sbjct: 131 PK---WRLATFLMYLSDVEEGGETAFPHNSVWADPSIPE-QVGDKFSDCAKGHVAAKPKA 186

Query: 244 GDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           GDA+LF+S  P+ T+DP+S+H GCPVI+G KW+A  WMH
Sbjct: 187 GDAVLFYSFYPNNTMDPASMHTGCPVIKGVKWAAPVWMH 225


>gi|449448264|ref|XP_004141886.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 294

 Score =  183 bits (464), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 100/252 (39%), Positives = 144/252 (57%), Gaps = 26/252 (10%)

Query: 46  DDDSPPNDLTSFRRRAFEKRSSIAE-------EKGEQWT-----EIVAWEPRAFVYHNFL 93
           DDD P       R R  +  S + E       E G+        ++++W PRA  +  F 
Sbjct: 45  DDDRP-------RARLLQSASDVTEFDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFA 97

Query: 94  SKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR-VRTSSGTFLKRGQDR--IIRGIEKRI 150
           +  +C+ +++LAKP +  ST+   K   ++ ++ VRTSSG F    +D    +  IE++I
Sbjct: 98  TAEQCQSIVNLAKPKLRPSTLALRKGETAESTKGVRTSSGVFFSASEDESGTLGVIEEKI 157

Query: 151 ADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGE 210
           A  T IP  HGE   +L YE+GQKY++HYD F          QR+A+ L+YL+DVEEGGE
Sbjct: 158 ARATMIPRTHGEAYNILRYEIGQKYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGE 217

Query: 211 TVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVI 270
           T+FP  N    ++        C   GL VKP++GD LLF+S+ P+ T+DP+SLHG CPVI
Sbjct: 218 TMFPFENG--LNMDGTYNFQTC--IGLKVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVI 273

Query: 271 RGNKWSATKWMH 282
           +G KW ATKW+ 
Sbjct: 274 KGQKWVATKWIR 285


>gi|302845026|ref|XP_002954052.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
           nagariensis]
 gi|300260551|gb|EFJ44769.1| hypothetical protein VOLCADRAFT_64430 [Volvox carteri f.
           nagariensis]
          Length = 311

 Score =  183 bits (464), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 97/212 (45%), Positives = 135/212 (63%), Gaps = 10/212 (4%)

Query: 79  IVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRG 138
           +++W+PRAFV  NFL++ EC ++ DLA+ +M +STVV +  G S     RTS GTF+ R 
Sbjct: 3   VISWQPRAFVIRNFLTEHECTHIADLAQVHMRRSTVV-ADNGSSVLDDYRTSYGTFINRY 61

Query: 139 QDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATL 198
           Q  +I  +E R+A  T  P+ + E +QVL Y +GQ Y  H D      + +N   RMAT+
Sbjct: 62  QTPVIAAVEDRVALLTRTPVVYQEDMQVLRYGLGQYYHRHTD------SLENDSPRMATV 115

Query: 199 LMYLSDVEEGGETVFPAAN--ANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
           L+YLS+ E GGET FP A   A+    + +   S+C K  ++ KP+RGDALLFWS++PD 
Sbjct: 116 LLYLSEPELGGETAFPQAASWAHPAMAQLFGPFSDCVKGNVAFKPRRGDALLFWSVKPDG 175

Query: 257 -TLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
            T DP S H GCPVIRG KW+AT W+H   ++
Sbjct: 176 RTEDPYSEHEGCPVIRGVKWTATVWVHTQPFR 207


>gi|224103711|ref|XP_002313164.1| predicted protein [Populus trichocarpa]
 gi|222849572|gb|EEE87119.1| predicted protein [Populus trichocarpa]
          Length = 294

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 90/207 (43%), Positives = 134/207 (64%), Gaps = 7/207 (3%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR-VRTSSGTFLK 136
           ++++W+PRA  +  F +  +CE +I + +  +  ST+   K   ++ ++  RTSSG+F+ 
Sbjct: 83  QVLSWKPRALYFPKFATPEQCESIIKMVESKLKPSTLALRKGETAESTKDTRTSSGSFVS 142

Query: 137 RGQDRI--IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
             +D    +  IEK+IA  T IP  HGE   +L YE+GQKYD+HYD F  +   +   QR
Sbjct: 143 GSEDETGTLDFIEKKIAKATMIPQSHGEAFNILRYEIGQKYDSHYDAFNPDEYGQQSSQR 202

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
            A+ L+YLS+VEEGGET+FP  N   ++V    +  +C   GL VKP++GD LLF+S+ P
Sbjct: 203 TASFLLYLSNVEEGGETMFPFENG--SAVIPGFDYKQC--VGLKVKPRQGDGLLFYSLFP 258

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWM 281
           + T+DP+SLHG CPVI+G KW ATKW+
Sbjct: 259 NGTIDPTSLHGSCPVIKGVKWVATKWI 285


>gi|55741082|gb|AAV64222.1| unknown [Zea mays]
          Length = 369

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 83/148 (56%), Positives = 109/148 (73%), Gaps = 1/148 (0%)

Query: 139 QDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATL 198
           QD ++  IE+RI+ +TF+P E+GE IQ+LHY+ G+KY+ HYDYF D+ N   GG R+AT+
Sbjct: 193 QDEVVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRIATV 252

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           LMYLS+VE+GGET+FP A       +  N  S+C + G +VKP +GDALLF+S+ PDAT 
Sbjct: 253 LMYLSNVEKGGETIFPNAEGKLLQPK-DNTWSDCARNGYAVKPVKGDALLFFSLHPDATT 311

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           D  SLHG CPVI G KWSATKW+HV  +
Sbjct: 312 DSDSLHGSCPVIEGQKWSATKWIHVRSF 339


>gi|55741040|gb|AAV64184.1| unknown [Zea mays]
          Length = 394

 Score =  182 bits (463), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 83/148 (56%), Positives = 109/148 (73%), Gaps = 1/148 (0%)

Query: 139 QDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATL 198
           QD ++  IE+RI+ +TF+P E+GE IQ+LHY+ G+KY+ HYDYF D+ N   GG R+AT+
Sbjct: 193 QDEVVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRIATV 252

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           LMYLS+VE+GGET+FP A       +  N  S+C + G +VKP +GDALLF+S+ PDAT 
Sbjct: 253 LMYLSNVEKGGETIFPNAEGKLLQPK-DNTWSDCARNGYAVKPVKGDALLFFSLHPDATT 311

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           D  SLHG CPVI G KWSATKW+HV  +
Sbjct: 312 DSDSLHGSCPVIEGQKWSATKWIHVRSF 339


>gi|115455509|ref|NP_001051355.1| Os03g0761900 [Oryza sativa Japonica Group]
 gi|14488368|gb|AAK63935.1|AC084282_16 putative dioxygenase [Oryza sativa Japonica Group]
 gi|17027263|gb|AAL34117.1|AC090713_4 putative hydroxylase subunit [Oryza sativa Japonica Group]
 gi|108711218|gb|ABF99013.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|113549826|dbj|BAF13269.1| Os03g0761900 [Oryza sativa Japonica Group]
 gi|125545807|gb|EAY91946.1| hypothetical protein OsI_13633 [Oryza sativa Indica Group]
          Length = 310

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 91/209 (43%), Positives = 134/209 (64%), Gaps = 9/209 (4%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR--VRTSSGTFL 135
           +I++W+PRA  +  F +  +CE ++  AK  ++ ST+   K G++++S   +RTSSGTFL
Sbjct: 101 QILSWQPRALYFPQFATSQQCENIVKTAKQRLMPSTLALRK-GETEESTKGIRTSSGTFL 159

Query: 136 KRGQDRI--IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQ 193
              +D    +  +EK+IA  T IP  HGE   +L YE+GQ+Y +HYD F          Q
Sbjct: 160 SSDEDPTGTLAEVEKKIAKATMIPRHHGEPFNILRYEIGQRYASHYDAFDPAQYGPQKSQ 219

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+A+ L+YL+DVEEGGET+FP  N     + +  +  +C   GL VKP++GD LLF+S+ 
Sbjct: 220 RVASFLLYLTDVEEGGETMFPYENGENMDIGY--DYEKC--IGLKVKPRKGDGLLFYSLM 275

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWMH 282
            + T+DP+SLHG CPVI+G KW ATKW+ 
Sbjct: 276 VNGTIDPTSLHGSCPVIKGEKWVATKWIR 304


>gi|357476355|ref|XP_003608463.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355509518|gb|AES90660.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
          Length = 297

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 91/208 (43%), Positives = 138/208 (66%), Gaps = 9/208 (4%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR-VRTSSGTFLK 136
           ++++W+PRA  + NF +  +CE ++ +AK  +  S++   K   +++++ +RTSSG FL 
Sbjct: 86  QVLSWKPRALYFPNFATAEQCENIVSVAKAGLKPSSLALRKGETTENTKGIRTSSGVFLS 145

Query: 137 RGQDRI--IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNTKNGGQ 193
             +D+   +  IE++IA  T IP  HGE   +L YEVGQ+Y++HYD F  DE+  +   Q
Sbjct: 146 ASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYEVGQRYNSHYDAFNPDEYGPQKS-Q 204

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+A+ L+YL+DVEEGGET+FP  N       +  E  +C   GL VKP++GD LLF+S+ 
Sbjct: 205 RVASFLLYLTDVEEGGETMFPFENGLNMDGTYGYE--DC--VGLRVKPRQGDGLLFYSLL 260

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWM 281
           P+ T+D +SLHG CPVI+G KW ATKW+
Sbjct: 261 PNGTIDQTSLHGSCPVIKGEKWVATKWI 288


>gi|125588006|gb|EAZ28670.1| hypothetical protein OsJ_12681 [Oryza sativa Japonica Group]
          Length = 280

 Score =  182 bits (461), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 91/209 (43%), Positives = 134/209 (64%), Gaps = 9/209 (4%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR--VRTSSGTFL 135
           +I++W+PRA  +  F +  +CE ++  AK  ++ ST+   K G++++S   +RTSSGTFL
Sbjct: 71  QILSWQPRALYFPQFATSQQCENIVKTAKQRLMPSTLALRK-GETEESTKGIRTSSGTFL 129

Query: 136 KRGQDRI--IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQ 193
              +D    +  +EK+IA  T IP  HGE   +L YE+GQ+Y +HYD F          Q
Sbjct: 130 SSDEDPTGTLAEVEKKIAKATMIPRHHGEPFNILRYEIGQRYASHYDAFDPAQYGPQKSQ 189

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+A+ L+YL+DVEEGGET+FP  N     + +  +  +C   GL VKP++GD LLF+S+ 
Sbjct: 190 RVASFLLYLTDVEEGGETMFPYENGENMDIGY--DYEKC--IGLKVKPRKGDGLLFYSLM 245

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWMH 282
            + T+DP+SLHG CPVI+G KW ATKW+ 
Sbjct: 246 VNGTIDPTSLHGSCPVIKGEKWVATKWIR 274


>gi|356536125|ref|XP_003536590.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 286

 Score =  181 bits (460), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 111/287 (38%), Positives = 165/287 (57%), Gaps = 24/287 (8%)

Query: 13  KWSTLTLVLSMLFMLTIVLLMLLAMGIFYIPI---GDDDSPPNDLTSFRRRAFEKRSSIA 69
           K S L L +  LF+L  +   +   G F  P+     DD  P   +   + + +K     
Sbjct: 7   KSSKLKLGVPTLFILCALFFFV---GFFVSPLLFQDLDDVGPR--SRILQESVKKEYEPL 61

Query: 70  E--EKGEQW-----TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQS 122
           E  E GE +     ++I++W PRA  + NF S   C+ +I++AKP +  S +   K   +
Sbjct: 62  EHGESGEPFVDSIPSQILSWRPRAVFFPNFTSVEVCQQIIEMAKPKLEPSKLALRKGETA 121

Query: 123 KDSR-VRTSSGTFLKRGQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHY 179
           + ++  RTSSGTF+   +D+  I+  +E++IA  T IP  HGE   +L YEVGQKYD+HY
Sbjct: 122 ESTKDTRTSSGTFISASEDKSGILDLVERKIAKVTMIPRTHGEIFNILKYEVGQKYDSHY 181

Query: 180 DYF-LDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLS 238
           D F  DE+ +    QR+A+ L+YLS+VE GGET+FP       ++    +  +C   GL 
Sbjct: 182 DAFNPDEYGSVE-SQRIASFLLYLSNVEAGGETMFPYEGG--LNIDRGYDYQKC--IGLK 236

Query: 239 VKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
           VKP++GD LLF+S+ P+  +D +SLHG CPVI+G KW ATKW+   E
Sbjct: 237 VKPRQGDGLLFYSLLPNGKIDKTSLHGSCPVIKGEKWVATKWIDDRE 283


>gi|308799555|ref|XP_003074558.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
 gi|116000729|emb|CAL50409.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
          Length = 274

 Score =  181 bits (459), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 92/218 (42%), Positives = 135/218 (61%), Gaps = 12/218 (5%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E ++W PRAF   N L + E   ++ LA+  + +STV+DS++G+S  + +RTS  TFL R
Sbjct: 10  EPLSWYPRAFALRNALDETEMRAILALARTRVARSTVIDSESGKSVVNPIRTSKQTFLSR 69

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT-----KNGG 192
             D ++R + +R++  T +P  H E +QVL Y  G+KYDAH D   +   +     KNGG
Sbjct: 70  -NDPVVRKVLERMSSVTHLPWYHCEDLQVLEYSAGEKYDAHEDVGEEGTKSGDQLSKNGG 128

Query: 193 QRMATLLMYLSDVEEGGETVFPAA---NANFTSVRWWNELSECGKQGLSVKPKRGDALLF 249
           +R+AT+L+YL + EEGGET FP +   +        W   S+C  + +++KP RGD L+F
Sbjct: 129 KRVATILLYLEEPEEGGETAFPDSEWIDPERAKTETW---SKCAHRRVAMKPTRGDGLMF 185

Query: 250 WSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           WS+RPD T+D  +LH GCP  RG KW+AT W+H   Y 
Sbjct: 186 WSVRPDGTIDHRALHVGCPPTRGTKWTATIWVHADPYN 223


>gi|326492085|dbj|BAJ98267.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 347

 Score =  181 bits (459), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 92/208 (44%), Positives = 134/208 (64%), Gaps = 9/208 (4%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR--VRTSSGTFL 135
           +I++W+PRA  +  F +  +CE ++  AK  +  ST+   K G+S+++   +RTSSGTFL
Sbjct: 140 QILSWQPRALYFPQFATAEQCENVVKTAKARLRPSTLALRK-GESEETTKGIRTSSGTFL 198

Query: 136 KRGQDRI--IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQ 193
              +D    +  IE +IA  T +P  HGE   VL YE+GQKY +HYD F          Q
Sbjct: 199 SAEEDPTGALAEIETKIAKATMMPRSHGEPFNVLRYEIGQKYASHYDAFDPAQYGPQKSQ 258

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+A+ L+YL+DVEEGGET+FP  N +  ++ +  +  +C   GL VKP++GD LLF+S+ 
Sbjct: 259 RVASFLLYLTDVEEGGETMFPYENGDNMNIGY--DYEQC--IGLKVKPRKGDGLLFYSLM 314

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWM 281
            + T+DP+SLHG CPV+RG KW ATKW+
Sbjct: 315 VNGTIDPTSLHGSCPVVRGEKWVATKWI 342


>gi|116784858|gb|ABK23496.1| unknown [Picea sitchensis]
          Length = 208

 Score =  181 bits (458), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 85/157 (54%), Positives = 113/157 (71%), Gaps = 4/157 (2%)

Query: 134 FLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQ 193
           F+ +G+D II  IE +IA +TF+P E+GE +QVL YE G+KYD H+D+F D+ N   GG 
Sbjct: 2   FIPKGKDAIISRIEDKIAAWTFLPKENGEDMQVLRYEPGEKYDPHFDFFQDKVNIVRGGH 61

Query: 194 RMATLLMYLSDVEEGGETVFPAANAN----FTSVRWWNELSECGKQGLSVKPKRGDALLF 249
           R+AT+LMYL+DV +GGETVFP+A  +     +S+   + LS+C K+G +VKPKRGDALLF
Sbjct: 62  RVATVLMYLTDVSKGGETVFPSAEEDTHRRISSIIKDDTLSDCAKRGTAVKPKRGDALLF 121

Query: 250 WSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           +S+   A  D  SLH GCPVI G KWS TKW+HV  +
Sbjct: 122 FSLTTQAKPDTRSLHAGCPVIEGEKWSVTKWIHVESF 158


>gi|449511009|ref|XP_004163837.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-1-like [Cucumis sativus]
          Length = 294

 Score =  181 bits (458), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 99/252 (39%), Positives = 143/252 (56%), Gaps = 26/252 (10%)

Query: 46  DDDSPPNDLTSFRRRAFEKRSSIAE-------EKGEQWT-----EIVAWEPRAFVYHNFL 93
           DDD P       R R  +  S + E       E G+        ++++W PRA  +  F 
Sbjct: 45  DDDRP-------RARLLQSASDVTEFDLMSSGENGDDSISSIPFQVLSWRPRALYFPKFA 97

Query: 94  SKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR-VRTSSGTFLKRGQDR--IIRGIEKRI 150
           +  +C+ +++LAKP +  ST+   K   ++ ++ VRTSSG F    +D    +  IE++ 
Sbjct: 98  TAEQCQSIVNLAKPKLRPSTLALRKGETAESTKGVRTSSGVFFSASEDESGTLGVIEEKX 157

Query: 151 ADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGE 210
           A  T IP  HGE   +L YE+GQKY++HYD F          QR+A+ L+YL+DVEEGGE
Sbjct: 158 ARATMIPRTHGEAYNILRYEIGQKYNSHYDAFKPSEYGPQKSQRVASFLLYLTDVEEGGE 217

Query: 211 TVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVI 270
           T+FP  N    ++        C   GL VKP++GD LLF+S+ P+ T+DP+SLHG CPVI
Sbjct: 218 TMFPFENG--LNMDGTYNFQTC--IGLKVKPRQGDGLLFYSVFPNGTIDPTSLHGSCPVI 273

Query: 271 RGNKWSATKWMH 282
           +G KW ATKW+ 
Sbjct: 274 KGQKWVATKWIR 285


>gi|388505024|gb|AFK40578.1| unknown [Medicago truncatula]
          Length = 297

 Score =  180 bits (457), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 90/210 (42%), Positives = 137/210 (65%), Gaps = 11/210 (5%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR-VRTSSGTFLK 136
           ++++W+PRA  + NF +  +CE ++ +AK  +  S++   K   +++++ +RTSSG FL 
Sbjct: 86  QVLSWKPRALYFPNFATAEQCENIVSVAKAGLKPSSLALRKGETTENTKGIRTSSGVFLS 145

Query: 137 RGQDRI--IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNTKNGGQ 193
             +D+   +  IE++IA  T IP  HGE   +L YEVGQ+Y +HYD F  DE+  +   Q
Sbjct: 146 ASRDKTKTLEAIEEKIARATMIPRSHGEAFNILRYEVGQRYYSHYDAFNPDEYGPQKS-Q 204

Query: 194 RMATLLMYLSDVEEGGETVFPAANA-NFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
           R+A+ L+YL+DVEEGGET+FP  N  N      + +     + GL VKP++GD LLF+S+
Sbjct: 205 RVASFLLYLTDVEEGGETMFPFENGLNMDGTYGYED-----RVGLRVKPRQGDGLLFYSL 259

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
            P+ T+D +SLHG CPVI+G KW ATKW+ 
Sbjct: 260 LPNGTIDQTSLHGSCPVIKGEKWVATKWIR 289


>gi|225438938|ref|XP_002279411.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296087348|emb|CBI33722.3| unnamed protein product [Vitis vinifera]
          Length = 285

 Score =  180 bits (457), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 92/212 (43%), Positives = 131/212 (61%), Gaps = 9/212 (4%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR--VRTSSGTFL 135
           ++++W PRA  + NF +  +C+ +I++AK  +  STV   + G+ + +   +RTSSG F+
Sbjct: 76  QVLSWRPRALYFPNFATSEQCQSIINMAKSNLTPSTVA-LRVGEIRGNTEGIRTSSGVFI 134

Query: 136 KRGQDRI--IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQ 193
              +D+   +  IE++IA    IP  HGE   VL YE+GQ+Y++HYD F           
Sbjct: 135 SASEDKTGTLDLIEQKIARVIMIPRTHGEAFNVLRYEIGQRYNSHYDAFDPAEYGPQKSH 194

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+AT L+YLSDVEEGGET+FP  N       +  +   C   GL VKP +GD LLF+SM 
Sbjct: 195 RIATFLVYLSDVEEGGETMFPFENGLNMDKDY--DFQRC--IGLKVKPHQGDGLLFYSMF 250

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
           P+ T+DP+SLHG CPVI+G KW ATKW+   E
Sbjct: 251 PNGTIDPTSLHGSCPVIKGEKWVATKWIRDQE 282


>gi|388523073|gb|AFK49598.1| unknown [Lotus japonicus]
          Length = 318

 Score =  180 bits (457), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 93/208 (44%), Positives = 133/208 (63%), Gaps = 9/208 (4%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR--VRTSSGTFL 135
           ++++W P A  + NF +  +CE +I+ AK  +  ST+V  + G++ +S   +RTSSG F+
Sbjct: 93  QVLSWNPHALYFPNFATAEQCESIIETAKEGLKPSTLV-LRVGETDESTTGIRTSSGVFI 151

Query: 136 KRGQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQ 193
              +D+  ++  IE++IA  T IP  HGE   VL Y+VGQKY +HYD    +       Q
Sbjct: 152 SAFEDKTGVLDVIEEKIARATKIPRTHGEAFNVLRYKVGQKYSSHYDALHPDIYGPQKSQ 211

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           RMA+ L+YLSDV EGGET+FP  N       ++ E  +C   GL VKP++GD LLF+S+ 
Sbjct: 212 RMASFLLYLSDVPEGGETMFPFENGLNMDGSYYYE--KC--IGLKVKPRKGDGLLFYSLF 267

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWM 281
           P+ T+DP SLHG CPVI+G KW ATKW+
Sbjct: 268 PNGTIDPMSLHGSCPVIKGEKWVATKWI 295


>gi|412994121|emb|CCO14632.1| predicted protein [Bathycoccus prasinos]
          Length = 341

 Score =  180 bits (457), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 98/234 (41%), Positives = 144/234 (61%), Gaps = 7/234 (2%)

Query: 54  LTSFRRRAFEKRSSIAEEK-GEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKS 112
           L S    A +K  ++ +E   E   ++++  PR+ +Y NF S A+C+ +++ A+  + KS
Sbjct: 104 LNSEDSNALDKVGALGDEYLTELKFQLLSTAPRSVMYRNFASDADCDAIVEAARSRLHKS 163

Query: 113 TVVDSKTGQSKDS--RVRTSSGTFL--KRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLH 168
            +   K G++ ++   +RTSSGTFL  K  Q   ++ +E+++A  T IP  HGE   +L 
Sbjct: 164 GLA-LKRGETLETTKNIRTSSGTFLTSKMEQSGALKRVEEKMARATHIPATHGEAYNILR 222

Query: 169 YEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNE 228
           YE+GQKYD+HYD F          QR+A+ L+YL+  +EGGETVFP    N        +
Sbjct: 223 YEIGQKYDSHYDMFDPSQYGPQRSQRVASFLLYLTTPDEGGETVFPLEGQNGLYRLRGID 282

Query: 229 LSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
            + C + GL VKP++GDALLFWS+ P+ T D SSLHGGCPVI G K+ ATKW+H
Sbjct: 283 YTSC-EAGLKVKPRKGDALLFWSVHPNNTFDRSSLHGGCPVISGTKFVATKWIH 335


>gi|357114580|ref|XP_003559078.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 295

 Score =  180 bits (457), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 92/209 (44%), Positives = 133/209 (63%), Gaps = 9/209 (4%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR--VRTSSGTFL 135
           +I++W+PRA  +  F +  +CE ++  AK  +  ST+   K G+++++   +RTSSGTFL
Sbjct: 88  QILSWQPRALYFPQFATSEQCENVVKTAKARLRPSTLALRK-GETEETTKGIRTSSGTFL 146

Query: 136 KRGQD--RIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQ 193
              +D  R +  +EK+IA  T IP  HGE   VL YE+GQKY +HYD F          Q
Sbjct: 147 SADEDPTRTLAEVEKKIAKATMIPRSHGEPFNVLRYEIGQKYASHYDAFDPAQYGPQKSQ 206

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+A+ L+YL+DVEEGGET+FP  N     + +  +  +C   GL VKP++GD LLF+S+ 
Sbjct: 207 RVASFLLYLTDVEEGGETMFPYENGENMDIGY--DYEQC--IGLKVKPRKGDGLLFYSLM 262

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWMH 282
            + T+D +SLHG CPVI+G KW ATKW+ 
Sbjct: 263 VNGTIDLTSLHGSCPVIKGEKWVATKWIR 291


>gi|125546091|gb|EAY92230.1| hypothetical protein OsI_13950 [Oryza sativa Indica Group]
          Length = 178

 Score =  180 bits (456), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 83/137 (60%), Positives = 110/137 (80%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAF+Y  FLS  EC++L++LAK  M KS V D+ +G+S  S+VRTSSGTFL + +
Sbjct: 40  LSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQVRTSSGTFLSKHE 99

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIEKR+A +TF+P E+ E IQ+LHYE+GQKYDAH+DYF D+ N K GG R+AT+L
Sbjct: 100 DDIVSGIEKRVAAWTFLPEENAESIQILHYELGQKYDAHFDYFHDKNNLKRGGHRVATVL 159

Query: 200 MYLSDVEEGGETVFPAA 216
           MYL+DV++GGETVFP A
Sbjct: 160 MYLTDVKKGGETVFPNA 176


>gi|356574299|ref|XP_003555286.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 290

 Score =  180 bits (456), Expect = 8e-43,   Method: Compositional matrix adjust.
 Identities = 93/212 (43%), Positives = 135/212 (63%), Gaps = 8/212 (3%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR-VRTSSGTFLK 136
           +I++W PRA  + NF S   C+ +I++AKP +  S +   K   ++ ++  RTSSGTF+ 
Sbjct: 76  QILSWRPRAVYFPNFTSVEVCQQIIEMAKPKLEPSKLALRKGETAESTKDTRTSSGTFIS 135

Query: 137 RGQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNTKNGGQ 193
             +D+  I+  +E++IA  T IP  HGE   +L YEV QKYD+HYD F  DE+ T    Q
Sbjct: 136 ASEDKSGILDFVERKIAKVTMIPRTHGEKFNILKYEVAQKYDSHYDAFNPDEYGTVE-SQ 194

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+A+ L+YLS+VE GGET+FP         + + +  +C   GL VKP++GD LLF+S+ 
Sbjct: 195 RIASFLLYLSNVEAGGETMFP-YEGGLNIDKGYYDYKKC--IGLKVKPRQGDGLLFYSLL 251

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
           P+  +D +SLHG CPVI+G KW ATKW+   E
Sbjct: 252 PNGKIDKTSLHGSCPVIKGEKWVATKWIDDRE 283


>gi|302844249|ref|XP_002953665.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
 gi|300261074|gb|EFJ45289.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
          Length = 245

 Score =  179 bits (455), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 94/201 (46%), Positives = 129/201 (64%), Gaps = 4/201 (1%)

Query: 70  EEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRT 129
           +E+   W E V   PRA+++HNFL+KAE  +++ LA P + +STVV +  G+     +RT
Sbjct: 47  DEEATPWVEQVGLHPRAYLFHNFLTKAERAHMVRLAAPKLKRSTVVGND-GEGVVDEIRT 105

Query: 130 SSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTK 189
           S G F++R  D +I  IEKRI+ +T +P+EH E IQVL Y  GQ Y AHYD   D+ N  
Sbjct: 106 SYGMFIRRLADPVITRIEKRISLWTHLPIEHQEDIQVLRYAHGQTYGAHYDSG-DKSNEP 164

Query: 190 NGGQRMATLLMYLSDVEEGGETVFPAANA--NFTSVRWWNELSECGKQGLSVKPKRGDAL 247
               R+AT LMYLSDVEEGGET FP  +   + T       +SEC K  ++ KPK GDA+
Sbjct: 165 GPKWRLATFLMYLSDVEEGGETAFPQNSVWYDPTIPERIGPVSECAKGHVAAKPKAGDAV 224

Query: 248 LFWSMRPDATLDPSSLHGGCP 268
           LF+S  P+ T+DP+++H GCP
Sbjct: 225 LFYSFYPNLTMDPAAMHTGCP 245


>gi|297798522|ref|XP_002867145.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297312981|gb|EFH43404.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 288

 Score =  179 bits (454), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 88/208 (42%), Positives = 131/208 (62%), Gaps = 7/208 (3%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR-VRTSSGTFLK 136
           ++++W PRA  + NF +  +C+ +I+ AK  +  S +   K   +++++  RTSSGTF+ 
Sbjct: 79  QVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTRTSSGTFIS 138

Query: 137 RGQDRI--IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
             +D    +  +E++IA  T IP  HGE   +L YE+GQKYD+HYD F          QR
Sbjct: 139 ASEDSTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTEYGPQSSQR 198

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +A+ L+YLSDVEEGGET+FP  N +     +  +  +C   GL VKP++GD LLF+S+ P
Sbjct: 199 IASFLLYLSDVEEGGETMFPFENGSNMGTGY--DYKQC--IGLKVKPRKGDGLLFYSVFP 254

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMH 282
           + T+D +SLHG CPV +G KW ATKW+ 
Sbjct: 255 NGTIDQTSLHGSCPVTKGEKWVATKWIR 282


>gi|159489450|ref|XP_001702710.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280732|gb|EDP06489.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 252

 Score =  179 bits (454), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 94/210 (44%), Positives = 132/210 (62%), Gaps = 8/210 (3%)

Query: 79  IVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRG 138
           +++WEPRAFV  NFL+  E  ++ D+A+ +M +STVV +  G S     RTS GTF+ R 
Sbjct: 3   VISWEPRAFVIRNFLTDQEATHIADVAQVHMRRSTVV-ADNGSSVLDDYRTSYGTFINRY 61

Query: 139 QDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATL 198
              ++  +E R+A  T +P+ + E +QVL Y  GQ Y  H D      + +N   R+AT+
Sbjct: 62  ATPVVARVEDRVAVLTRVPVHYQEDMQVLRYGNGQYYHRHTD------SLENDSPRLATV 115

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDA-T 257
           L+YLSD E GGET FP A A+    + +   SEC K  ++ KP++GDALLFWS++PD  T
Sbjct: 116 LLYLSDPELGGETAFPLAWAHPDMPKVFGPFSECVKNNVAFKPRKGDALLFWSVKPDGKT 175

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
            DP S H GCPVIRG KW+AT W+H   ++
Sbjct: 176 EDPLSEHEGCPVIRGVKWTATVWVHTKPFR 205


>gi|326518408|dbj|BAJ88233.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 276

 Score =  179 bits (453), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 93/180 (51%), Positives = 120/180 (66%), Gaps = 14/180 (7%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+++W PR  V+HNFLS  EC+YL ++A+P +  STVVD  TG+   S VRTSSG F+  
Sbjct: 61  EVISWTPRIIVFHNFLSSEECDYLREIARPRLEISTVVDVATGKGVKSDVRTSSGMFVNS 120

Query: 138 GQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
            + +  +I+ IEKRI+ F+ IP+E+GE IQVL YE  Q Y  H+DYF D FN K GGQR+
Sbjct: 121 EERKLPVIKAIEKRISVFSQIPVENGELIQVLRYEPNQYYRPHHDYFSDTFNLKRGGQRV 180

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGK---QGLSVKPKRGDALLFWSM 252
           AT+LMYL+D  EGGET FP A          +    CG    +GL VKP +GDA+LFWSM
Sbjct: 181 ATMLMYLTDGVEGGETHFPQAG---------DGECICGGRLVRGLCVKPNKGDAVLFWSM 231


>gi|363807682|ref|NP_001242420.1| uncharacterized protein LOC100775302 [Glycine max]
 gi|255641811|gb|ACU21174.1| unknown [Glycine max]
          Length = 293

 Score =  179 bits (453), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 91/211 (43%), Positives = 132/211 (62%), Gaps = 7/211 (3%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR-VRTSSGTFLK 136
           ++++W PRA  + NF +  +CE +ID+AK  +  ST+   +    ++++ +RTSSG F+ 
Sbjct: 84  QVLSWRPRALYFPNFATAEQCENIIDVAKDGLKPSTLALRQGETEENTKGIRTSSGVFVS 143

Query: 137 RGQDRI--IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
              D+   +  IE++IA  T IP  HGE   +L YEV Q+Y++HYD F          QR
Sbjct: 144 ASGDKTGTLAVIEEKIARATMIPRSHGEAFNILRYEVDQRYNSHYDAFNPAEYGPQKSQR 203

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           MA+ L+YL+DVEEGGET+FP  N       +  E  +C   GL VKP++GD LLF+S+  
Sbjct: 204 MASFLLYLTDVEEGGETMFPFENGLNMDGNYGYE--DC--IGLKVKPRQGDGLLFYSLLT 259

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
           + T+DP+SLHG CPVI+G KW ATKW+   E
Sbjct: 260 NGTIDPTSLHGSCPVIKGEKWVATKWIRDQE 290


>gi|18418321|ref|NP_567941.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|17381226|gb|AAL36425.1| unknown protein [Arabidopsis thaliana]
 gi|20465827|gb|AAM20018.1| unknown protein [Arabidopsis thaliana]
 gi|21592377|gb|AAM64328.1| putative dioxygenase [Arabidopsis thaliana]
 gi|332660892|gb|AEE86292.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 288

 Score =  178 bits (452), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 132/208 (63%), Gaps = 7/208 (3%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR-VRTSSGTFLK 136
           ++++W PRA  + NF +  +C+ +I+ AK  +  S +   K   +++++  RTSSGTF+ 
Sbjct: 79  QVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTRTSSGTFIS 138

Query: 137 RGQDRI--IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
             ++    +  +E++IA  T IP  HGE   +L YE+GQKYD+HYD F          QR
Sbjct: 139 ASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTEYGPQSSQR 198

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +A+ L+YLSDVEEGGET+FP  N +   + +  +  +C   GL VKP++GD LLF+S+ P
Sbjct: 199 IASFLLYLSDVEEGGETMFPFENGSNMGIGY--DYKQC--IGLKVKPRKGDGLLFYSVFP 254

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMH 282
           + T+D +SLHG CPV +G KW ATKW+ 
Sbjct: 255 NGTIDQTSLHGSCPVTKGEKWVATKWIR 282


>gi|356541677|ref|XP_003539300.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 297

 Score =  178 bits (452), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 90/208 (43%), Positives = 135/208 (64%), Gaps = 11/208 (5%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR--VRTSSGTFL 135
           ++++W PRA  + NF S  +CE +I++A+  + KS+ +  + G++++S   +RTSSG F+
Sbjct: 90  QVLSWYPRALYFPNFASAEQCESIIEMARGGL-KSSTLALRKGETEESTKGIRTSSGVFM 148

Query: 136 KRGQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQ 193
              +D   I+  IE++IA  T IP  HGE   +L YEVGQKY++HYD F +        Q
Sbjct: 149 SASEDETGILDAIEEKIAKATKIPRTHGEAFNILRYEVGQKYNSHYDAFDEAEYGPLQSQ 208

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+A+ L+YL+DV EGGET+FP  N           + +C   GL V+P++GDALLF+S+ 
Sbjct: 209 RVASFLLYLTDVPEGGETMFPYENG----FNRDGNVEDC--IGLRVRPRKGDALLFYSLL 262

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWM 281
           P+ T+D +S HG CPVI+G KW ATKW+
Sbjct: 263 PNGTIDQTSAHGSCPVIKGEKWVATKWI 290


>gi|145341735|ref|XP_001415959.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576182|gb|ABO94251.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 254

 Score =  178 bits (452), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 88/215 (40%), Positives = 136/215 (63%), Gaps = 6/215 (2%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E ++W PRAF   + L++A+CE ++   +  + +STVVDS TG+SK   +RTS  TFL R
Sbjct: 4   EPLSWYPRAFALRDALTEAQCEAVLRATRARVRRSTVVDSVTGESKVDPIRTSKQTFLNR 63

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT-----KNGG 192
            ++ ++R I   ++  T +P  H E +QVL Y VG+KYDAH D   ++  +     K+GG
Sbjct: 64  DEE-VVREIYDALSAVTMLPWTHNEDMQVLEYRVGEKYDAHEDVGAEDSLSGRELSKDGG 122

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
           +R+AT+L+YL + E GGET FP +      +      S+C +  +++KP+RGD L+FWS+
Sbjct: 123 KRVATVLLYLEEPEAGGETAFPDSEWIDPKMAEGTSWSKCAEHRVAMKPRRGDGLIFWSV 182

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
            P+  +D  +LH GCPV+ G KW+AT W+H   Y+
Sbjct: 183 DPNGKIDHRALHVGCPVVAGVKWTATVWVHAEPYR 217


>gi|385137888|gb|AFI41205.1| oxygenase protein, partial [Arabidopsis thaliana]
          Length = 288

 Score =  178 bits (452), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 132/208 (63%), Gaps = 7/208 (3%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR-VRTSSGTFLK 136
           ++++W PRA  + NF +  +C+ +I+ AK  +  S +   K   +++++  RTSSGTF+ 
Sbjct: 79  QVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTRTSSGTFIS 138

Query: 137 RGQDRI--IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
             ++    +  +E++IA  T IP  HGE   +L YE+GQKYD+HYD F          QR
Sbjct: 139 ASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTEYGPQSSQR 198

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +A+ L+YLSDVEEGGET+FP  N +   + +  +  +C   GL VKP++GD LLF+S+ P
Sbjct: 199 IASFLLYLSDVEEGGETMFPFENGSNMGIGY--DYKQC--IGLKVKPRKGDGLLFYSVFP 254

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMH 282
           + T+D +SLHG CPV +G KW ATKW+ 
Sbjct: 255 NGTIDQTSLHGSCPVTKGEKWVATKWIR 282


>gi|413934216|gb|AFW68767.1| hypothetical protein ZEAMMB73_452923 [Zea mays]
          Length = 210

 Score =  177 bits (450), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 80/148 (54%), Positives = 108/148 (72%), Gaps = 1/148 (0%)

Query: 139 QDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATL 198
           QD ++  IE+RI+ +TF+P E+GE IQ+LHY+ G+KY+ HYDYF D+ N   GG R+AT+
Sbjct: 9   QDEVVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATV 68

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           LMYLS+VE+GGET+FP A       +  +  S+C + G +VKP +GDALLF+S+ PD+T 
Sbjct: 69  LMYLSNVEKGGETIFPNAEGKLLQPK-DDTWSDCARNGYAVKPVKGDALLFFSLHPDSTT 127

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           D  SLHG CP I G KWSATKW+HV  +
Sbjct: 128 DSDSLHGSCPAIEGQKWSATKWIHVRSF 155


>gi|225428938|ref|XP_002262952.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296083079|emb|CBI22483.3| unnamed protein product [Vitis vinifera]
          Length = 284

 Score =  177 bits (449), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 89/208 (42%), Positives = 136/208 (65%), Gaps = 9/208 (4%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR--VRTSSGTFL 135
           ++++W+PRA  +  F +  +C+ +I++AK ++  ST+   + G++ +S    RTSSGTF+
Sbjct: 75  QVLSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLA-LRQGETDESTKGTRTSSGTFI 133

Query: 136 KRGQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQ 193
              +D+  I+  +E++IA  T IP  HGE   +L YE+GQ+Y++HYD F          Q
Sbjct: 134 SASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQRYNSHYDAFNPAEYGPQTSQ 193

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+A+ L+YLSDVEEGGET+FP  +    ++    +  +C   GL VKP+RGD LLF+S+ 
Sbjct: 194 RVASFLLYLSDVEEGGETMFPFEHD--LNIGTGYDYKKC--IGLKVKPQRGDGLLFYSVF 249

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWM 281
           P+ T+D +SLHG CPVI G KW ATKW+
Sbjct: 250 PNGTIDRTSLHGSCPVIAGEKWVATKWI 277


>gi|147823227|emb|CAN70872.1| hypothetical protein VITISV_009065 [Vitis vinifera]
          Length = 276

 Score =  177 bits (449), Expect = 5e-42,   Method: Compositional matrix adjust.
 Identities = 89/208 (42%), Positives = 136/208 (65%), Gaps = 9/208 (4%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR--VRTSSGTFL 135
           ++++W+PRA  +  F +  +C+ +I++AK ++  ST+   + G++ +S    RTSSGTF+
Sbjct: 67  QVLSWKPRALYFPRFATAEQCQSIIEMAKSHLRPSTLA-LRQGETDESTKGTRTSSGTFI 125

Query: 136 KRGQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQ 193
              +D+  I+  +E++IA  T IP  HGE   +L YE+GQ+Y++HYD F          Q
Sbjct: 126 SASEDKTGILDFVERKIAKATMIPRSHGEAFNILRYEIGQRYNSHYDAFNPAEYGPQTSQ 185

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+A+ L+YLSDVEEGGET+FP  +    ++    +  +C   GL VKP+RGD LLF+S+ 
Sbjct: 186 RVASFLLYLSDVEEGGETMFPFEHD--LNIGTGYDYKKC--IGLKVKPQRGDGLLFYSVF 241

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWM 281
           P+ T+D +SLHG CPVI G KW ATKW+
Sbjct: 242 PNGTIDRTSLHGSCPVIAGEKWVATKWI 269


>gi|225428943|ref|XP_002263094.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296083076|emb|CBI22480.3| unnamed protein product [Vitis vinifera]
          Length = 282

 Score =  176 bits (447), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 111/278 (39%), Positives = 162/278 (58%), Gaps = 28/278 (10%)

Query: 19  LVLSMLFML--TIVLLMLLAMGIFYIPIGDDDSPPNDLTSFRRRAFEKRSSIAE-EKGEQ 75
           L L +LF+      L  L    +F   +    SPP  L S      E+ SS+   E GE 
Sbjct: 11  LELPLLFLSWSLFFLAGLFGSMLFSQDVNGVRSPPRLLESVE----EEYSSMPHGETGES 66

Query: 76  WTEIV-----AWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR--VR 128
             +++     +W+PRA  + +F +  +C+ +I++AK  +  ST+V  K G++++S   +R
Sbjct: 67  SVDLIPFQVLSWKPRARYFPHFATAEQCQSIIEMAKSGLSPSTLVLRK-GETEESTKGIR 125

Query: 129 TSSGTFLKRGQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEF 186
           TSSGTF+   +D+  I+  IE++IA  T IP  HGE   +L YE+GQ+Y++HYD      
Sbjct: 126 TSSGTFISASEDKTGILDFIERKIAKATMIPRNHGEVFNILRYEIGQRYNSHYDAISPAE 185

Query: 187 NTKNGGQRMATLLMYLSDVEEGGETVFPAA---NANFTSVRWWNELSECGKQGLSVKPKR 243
                 QR+A+ L+YLSDVEEGGET+FP     N N  + R      +C   GL VKP+R
Sbjct: 186 YGLQTSQRIASFLLYLSDVEEGGETMFPFEHDLNINTFNSR------KC--IGLKVKPRR 237

Query: 244 GDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           GD LLF+S+ P+ T+D +S+HG CPVI G KW ATKW+
Sbjct: 238 GDGLLFYSVFPNGTIDWTSMHGSCPVIEGEKWVATKWI 275


>gi|242038031|ref|XP_002466410.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
 gi|241920264|gb|EER93408.1| hypothetical protein SORBIDRAFT_01g007280 [Sorghum bicolor]
          Length = 294

 Score =  176 bits (446), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 89/208 (42%), Positives = 130/208 (62%), Gaps = 7/208 (3%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR-VRTSSGTFLK 136
           +I++W+PRA  +  F +  +CE ++  AK  +  ST+   K   ++ ++ +RTSSGTFL 
Sbjct: 87  QILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKGIRTSSGTFLS 146

Query: 137 RGQD--RIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
             +D  R +  IEK+IA  T IP  HGE   VL Y +GQ+Y +HYD F          QR
Sbjct: 147 ANEDPTRTLAEIEKKIARATMIPRNHGEPFNVLRYNIGQRYASHYDAFDPVQYGPQKSQR 206

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +A+ L+YL++VEEGGET+FP  N     + +  +  +C   GL VKP++GD LLF+S+  
Sbjct: 207 VASFLLYLTNVEEGGETMFPYENGENMDIGY--DYEKC--IGLKVKPRKGDGLLFYSLMV 262

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMH 282
           + T+D +SLHG CPVI+G KW ATKW+ 
Sbjct: 263 NGTIDRTSLHGSCPVIKGEKWVATKWIR 290


>gi|255573113|ref|XP_002527486.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223533126|gb|EEF34884.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 286

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 93/218 (42%), Positives = 136/218 (62%), Gaps = 14/218 (6%)

Query: 73  GEQWTE-----IVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR- 126
           GE + E     +++W+PRA  + +F +  +C+ +I++AK   +K + +  + G++ +S  
Sbjct: 67  GESYIESIPFQVLSWKPRAVYFPDFATPEQCKNIIEMAK-LRLKPSGLALRKGETAESTK 125

Query: 127 -VRTSSGTFLKRGQDRI--IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL 183
             RTSSGTFL   +D    +  IE +IA  T IP  HGE   +L YE+GQKYD+HYD F 
Sbjct: 126 GTRTSSGTFLSASEDGTGTLDFIEHKIARATMIPRSHGEAFNILRYEIGQKYDSHYDSFN 185

Query: 184 DEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKR 243
                    QR+A+ L+YLSDVE+GGET+FP  N    S  +  +  +C   GL VKP++
Sbjct: 186 PAEYGPQMSQRVASFLLYLSDVEKGGETMFPFENGVKISSVY--DYKKCA--GLKVKPRQ 241

Query: 244 GDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           GD +LF+S+ P+ T+D +SLHG CPVI G KW ATKW+
Sbjct: 242 GDGILFYSLLPNGTIDQTSLHGSCPVIEGEKWVATKWI 279


>gi|226499492|ref|NP_001150030.1| LOC100283657 [Zea mays]
 gi|195636206|gb|ACG37571.1| prolyl 4-hydroxylase [Zea mays]
 gi|347978804|gb|AEP37744.1| prolyl 4-hydroxylase 3 [Zea mays]
          Length = 294

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 88/208 (42%), Positives = 130/208 (62%), Gaps = 7/208 (3%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR-VRTSSGTFLK 136
           +I++W+PRA  +  F +  +CE ++  AK  +  ST+   K   ++ ++ +RTSSGTFL 
Sbjct: 87  QILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKGIRTSSGTFLS 146

Query: 137 RGQD--RIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
             +D    +  IEK+IA  T +P  HGE   VL Y +GQ+Y +HYD F          QR
Sbjct: 147 ANEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQRYASHYDAFDPAQYGPQKNQR 206

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +A+ L+YL+DVEEGGET+FP  N+    + +  +  +C   GL VKP++GD LLF+S+  
Sbjct: 207 VASFLLYLTDVEEGGETMFPYENSENMDIGY--DYEKC--IGLKVKPRKGDGLLFYSLMV 262

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMH 282
           + T+D +SLHG CPVI+G KW ATKW+ 
Sbjct: 263 NGTIDRTSLHGSCPVIKGEKWVATKWIR 290


>gi|223945827|gb|ACN26997.1| unknown [Zea mays]
 gi|414872966|tpg|DAA51523.1| TPA: prolyl 4-hydroxylase [Zea mays]
          Length = 294

 Score =  176 bits (445), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 88/208 (42%), Positives = 130/208 (62%), Gaps = 7/208 (3%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR-VRTSSGTFLK 136
           +I++W+PRA  +  F +  +CE ++  AK  +  ST+   K   ++ ++ +RTSSGTFL 
Sbjct: 87  QILSWQPRALYFPQFATSEQCENIVKTAKERLKPSTLALRKGETAESTKGIRTSSGTFLS 146

Query: 137 RGQD--RIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
             +D    +  IEK+IA  T +P  HGE   VL Y +GQ+Y +HYD F          QR
Sbjct: 147 ANEDPTETLAEIEKKIARATMLPRNHGEPFNVLRYNIGQRYASHYDAFDPAQYGPQKNQR 206

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +A+ L+YL+DVEEGGET+FP  N+    + +  +  +C   GL VKP++GD LLF+S+  
Sbjct: 207 VASFLLYLTDVEEGGETMFPYENSENMDIGY--DYEKC--IGLKVKPRKGDGLLFYSLMV 262

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMH 282
           + T+D +SLHG CPVI+G KW ATKW+ 
Sbjct: 263 NGTIDRTSLHGSCPVIKGEKWVATKWIR 290


>gi|413934217|gb|AFW68768.1| hypothetical protein ZEAMMB73_452923 [Zea mays]
          Length = 204

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 79/148 (53%), Positives = 107/148 (72%), Gaps = 1/148 (0%)

Query: 139 QDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATL 198
            D ++  IE+RI+ +TF+P E+GE IQ+LHY+ G+KY+ HYDYF D+ N   GG R+AT+
Sbjct: 3   NDEVVTRIEERISAWTFLPPENGEAIQILHYQNGEKYEPHYDYFHDKNNQALGGHRIATV 62

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           LMYLS+VE+GGET+FP A       +  +  S+C + G +VKP +GDALLF+S+ PD+T 
Sbjct: 63  LMYLSNVEKGGETIFPNAEGKLLQPK-DDTWSDCARNGYAVKPVKGDALLFFSLHPDSTT 121

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           D  SLHG CP I G KWSATKW+HV  +
Sbjct: 122 DSDSLHGSCPAIEGQKWSATKWIHVRSF 149


>gi|255083957|ref|XP_002508553.1| predicted protein [Micromonas sp. RCC299]
 gi|226523830|gb|ACO69811.1| predicted protein [Micromonas sp. RCC299]
          Length = 262

 Score =  175 bits (444), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 93/214 (43%), Positives = 134/214 (62%), Gaps = 15/214 (7%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E ++ EP+AF+YH FLS  EC++LI +  P++ +STVV  K        VRTS GTFL +
Sbjct: 2   EKLSDEPKAFLYHGFLSAEECDHLIKIGTPHLKRSTVVGGKDDTGVLDDVRTSFGTFLPK 61

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMAT 197
             D ++ GIE+R+ DF+ I  E+ E +Q+L Y  GQ+Y  H     D   + NGG+R+AT
Sbjct: 62  KYDDVLYGIERRVEDFSQISYENQEQLQLLKYHDGQEYKDHQ----DGLTSPNGGRRIAT 117

Query: 198 LLMYLSDVEEGGETVF------PAANANFTSVRWWNELSECG---KQGLSVKPKRGDALL 248
           +LM+L + E+GGET F      PA       +R  +ELS+C     +GL+VKP+RGDA+L
Sbjct: 118 VLMFLHEPEKGGETSFPQGKPLPAVAQRLRGMR--DELSDCAWRDGRGLAVKPRRGDAVL 175

Query: 249 FWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           F+S + +   D +S H  CP + G KW+ATKW+H
Sbjct: 176 FFSFKKNGGSDIASTHASCPTVGGVKWTATKWIH 209


>gi|302844281|ref|XP_002953681.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
           nagariensis]
 gi|300261090|gb|EFJ45305.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
           nagariensis]
          Length = 304

 Score =  175 bits (443), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 93/214 (43%), Positives = 131/214 (61%), Gaps = 10/214 (4%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E VAW+PR F+YHNF++  E +++I+LA P M +STVV +  GQS +   RT     ++R
Sbjct: 2   EHVAWKPRVFIYHNFITDMEAKHMIELAAPQMKRSTVVGAG-GQSVEDSYRTLYTAGVRR 60

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMAT 197
            QD ++  IE R+A +T I + H E +Q+L Y +GQ+Y  H D   D+      G R+AT
Sbjct: 61  YQDDVVERIENRVAAWTQISVLHQEDMQILRYGIGQQYKVHADTLRDD----EAGVRVAT 116

Query: 198 LLMYLSDVEEGGETVFPAA---NANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +L+YL++ E GGET FP +   N         N  S C K  ++  PKRGDALLFWS+ P
Sbjct: 117 VLIYLNEPEAGGETAFPDSQWVNPKLAETIGAN-FSACAKNHVAFAPKRGDALLFWSIGP 175

Query: 255 DATL-DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           D T  D  + H GCPV+ G KW+ATKW+H   ++
Sbjct: 176 DGTTEDYHASHTGCPVLSGVKWTATKWIHAKPFR 209


>gi|356496957|ref|XP_003517331.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 299

 Score =  174 bits (442), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 90/208 (43%), Positives = 133/208 (63%), Gaps = 9/208 (4%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR--VRTSSGTFL 135
           ++++W PRA  + NF+S  +CE +I++A+  +  ST+V  K G++++S   +RTS G F+
Sbjct: 90  QVLSWYPRALYFPNFVSAEQCETIIEMARGGLKPSTLVLRK-GETEESTKGIRTSYGVFM 148

Query: 136 KRGQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQ 193
              +D   I+  IE++IA  T IP  HGE   +L YEVGQKY  HYD F +        Q
Sbjct: 149 SASEDETGILDSIEEKIAKATKIPRTHGEAFNILRYEVGQKYSPHYDAFDEAEFGPLQSQ 208

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R A+ L+YL+DV EGGET+FP  N       +  +  +C   GL V+P++GD LLF+S+ 
Sbjct: 209 RAASFLLYLTDVPEGGETLFPYENGFNRDGSY--DFEDC--IGLRVRPRKGDGLLFYSLL 264

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWM 281
           P+ T+D +S+HG CPVI+G KW ATKW+
Sbjct: 265 PNGTIDQTSVHGSCPVIKGEKWVATKWI 292


>gi|340787855|ref|YP_004753320.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
           [Collimonas fungivorans Ter331]
 gi|340553122|gb|AEK62497.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit
           [Collimonas fungivorans Ter331]
          Length = 289

 Score =  173 bits (438), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 90/210 (42%), Positives = 124/210 (59%), Gaps = 24/210 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PRA ++ N LS  EC+ LI L+K  +++S VVD +TG +K    RTSSGTF  RG    
Sbjct: 99  KPRAILFGNVLSHDECDQLIALSKTKLLRSGVVDHQTGNTKLHEHRTSSGTFFHRGTTPF 158

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEF-----NTKNGGQRMAT 197
           I  I+KR+A    +P  HGEG+Q+L+Y++G +Y  HYDYF  +      +   GGQR AT
Sbjct: 159 IAMIDKRLAALMQVPESHGEGLQILNYQMGGEYRPHYDYFRPDAPGSAKHLARGGQRTAT 218

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           L++YL+DV+ GGET+FP                   + GLS+ P +G A+ F     +  
Sbjct: 219 LIIYLNDVDGGGETIFP-------------------RNGLSIVPAKGSAIYFSYTNAENQ 259

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           LD  S HGG PVI G KW ATKW+  +EY+
Sbjct: 260 LDSLSFHGGSPVIEGEKWIATKWVRQNEYR 289


>gi|302842389|ref|XP_002952738.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
 gi|300262082|gb|EFJ46291.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
          Length = 281

 Score =  173 bits (438), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 85/159 (53%), Positives = 109/159 (68%), Gaps = 5/159 (3%)

Query: 125 SRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD 184
           S +RTS G FL RG+D I++ +E+RIA +T +P+ +GEG+QVL Y+  QKYDAH+DYF  
Sbjct: 36  SNIRTSYGVFLDRGEDEIVKRVEERIAAWTLMPVGNGEGLQVLRYQKEQKYDAHWDYFFH 95

Query: 185 EFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWN-ELSECGKQGLSVKPKR 243
           +    NGG R AT+LMYL D EEGGETVFP    N  +    N   SEC +  L+ KPK+
Sbjct: 96  KDGITNGGNRYATVLMYLVDTEEGGETVFP----NVAAPGGENVGFSECARYHLAAKPKK 151

Query: 244 GDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           G A+LF S++P   L+  SLH  CPVIRG KWSA KW+H
Sbjct: 152 GTAILFHSIKPTGELERKSLHTACPVIRGIKWSAAKWIH 190


>gi|363543293|ref|NP_001241862.1| prolyl 4-hydroxylase 2-1 precursor [Zea mays]
 gi|347978802|gb|AEP37743.1| prolyl 4-hydroxylase 2-1 [Zea mays]
          Length = 204

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 82/159 (51%), Positives = 115/159 (72%), Gaps = 1/159 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PRAF++  FLS AEC++LI LAK  + KS V D+++G+S  S VRTSSG FL+R Q
Sbjct: 39  LSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSEVRTSSGMFLERKQ 98

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++  IE+RI+ +TF+P E+GE IQ+LHY+ G+KY+ HYDYF D+ N   GG R+AT+L
Sbjct: 99  DEVVTRIEERISAWTFLPPENGESIQILHYQNGEKYEPHYDYFHDKKNQALGGHRIATVL 158

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLS 238
           MYLS+VE+GGET+FP A       +  N  S+C + G +
Sbjct: 159 MYLSNVEKGGETIFPNAEGKLLQPK-DNTWSDCARNGYA 196


>gi|159485424|ref|XP_001700744.1| hypothetical protein CHLREDRAFT_187378 [Chlamydomonas reinhardtii]
 gi|158281243|gb|EDP06998.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 253

 Score =  173 bits (438), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 103/241 (42%), Positives = 139/241 (57%), Gaps = 12/241 (4%)

Query: 56  SFRRRAFEKRSSIAEEKGE---QWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKS 112
           ++   A  K +++  + G+    W E ++W PRAF+YH FLS AEC++LI LA P + +S
Sbjct: 14  TYNAGAHTKSTTVPADSGDVGAPWIETISWVPRAFIYHGFLSHAECDHLIGLALPKLERS 73

Query: 113 TVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVG 172
            VV +K+ +     +RTS    +   +  ++  IE RIA +T +P  H E ++VL Y  G
Sbjct: 74  LVVGNKSDEVDP--IRTSYSASIGYNETDVVADIEGRIARWTHLPRSHQEPMEVLRYING 131

Query: 173 QKYDAHYDYF-LDEFNTKNGGQRMATLLMYLSDVEE--GGETVFPAANANFTSVRW--WN 227
           QKYDAH+D+F   E     GG RMAT LMYLSD+E   GGET  P A      V+     
Sbjct: 132 QKYDAHWDWFDETETGGTGGGNRMATALMYLSDMEPAAGGETALPLAQPLDWEVQGVEGR 191

Query: 228 ELSECG-KQGLSVKPKRGDALLFWSMRPDA-TLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
             SEC  K G+SV+PK+GD LLFW M P     D  +LH  CP   G KW+ATKW+H   
Sbjct: 192 GYSECASKMGISVRPKKGDVLLFWDMEPGGREPDRHALHASCPTFSGTKWTATKWIHNTP 251

Query: 286 Y 286
           Y
Sbjct: 252 Y 252


>gi|363543309|ref|NP_001241870.1| prolyl 4-hydroxylase 6-3 precursor [Zea mays]
 gi|347978824|gb|AEP37754.1| prolyl 4-hydroxylase 6-3 [Zea mays]
          Length = 208

 Score =  172 bits (437), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 84/156 (53%), Positives = 115/156 (73%), Gaps = 3/156 (1%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PRAF+Y  FLS  EC++L+ LAK  M KS V D+ +G+S  S+ RTSSGTFL + +D I+
Sbjct: 42  PRAFLYSGFLSDTECDHLVSLAKGSMEKSMVADNDSGKSVASQARTSSGTFLAKREDEIV 101

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLS 203
             IEKR+A +TF+P E+ E +QVL YE GQKYDAH+DYF D  N K GGQR+AT+LMYL+
Sbjct: 102 SAIEKRVAAWTFLPEENAESLQVLRYETGQKYDAHFDYFHDRNNLKLGGQRVATVLMYLT 161

Query: 204 DVEEGGETVFPAANANFTSVRWWNEL-SECGKQGLS 238
           DV++GGE VFP  +A  + +++ +E  S+C + GL+
Sbjct: 162 DVKKGGEAVFP--DAEGSHLQYKDETWSDCSRSGLA 195


>gi|307109700|gb|EFN57937.1| hypothetical protein CHLNCDRAFT_142031 [Chlorella variabilis]
          Length = 325

 Score =  172 bits (436), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 95/229 (41%), Positives = 134/229 (58%), Gaps = 9/229 (3%)

Query: 61  AFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTG 120
           A     +I       W E V+W PRAFV HNF SK E +++I LA+P + +STVV S+ G
Sbjct: 16  AASSAGAIDTAAAHPWFEPVSWYPRAFVAHNFASKEETDHMIKLAQPQLRRSTVVGSR-G 74

Query: 121 QSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYD 180
           +S     RTS G F++R  D ++  +EKR+A +T   + H E IQVL Y   Q+Y AH+D
Sbjct: 75  ESVVDNYRTSYGMFIRRHHDEVVSTLEKRVATWTKYNVTHQEDIQVLRYGTTQEYKAHFD 134

Query: 181 YFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAAN-ANFTSVRWWNELSECGKQGLSV 239
              D+        R AT+L+YLSDVE GGET FP +   +    +     SEC +  +++
Sbjct: 135 SLDDD------SPRTATVLIYLSDVESGGETTFPNSEWIDPALPKALGPFSECAQGHVAM 188

Query: 240 KPKRGDALLFWSMRPDA-TLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           KPKRGDA++F S+ PD  + D  +LH  CPVI G K+ A  W+H   ++
Sbjct: 189 KPKRGDAIVFHSLNPDGRSHDQHALHTACPVIVGVKYVAIFWIHTKPFR 237


>gi|255577610|ref|XP_002529682.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223530830|gb|EEF32693.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 165

 Score =  172 bits (436), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 91/165 (55%), Positives = 109/165 (66%), Gaps = 10/165 (6%)

Query: 119 TGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAH 178
           T Q   S VRTSSG FL   + +    IEKRI+ ++ +P+E+GE +QVL YE  Q Y  H
Sbjct: 3   TNQGMKSNVRTSSGMFLSSEERKSPMAIEKRISVYSQVPIENGELVQVLRYEKSQFYRPH 62

Query: 179 YDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGK--QG 236
           +DYF D FN K GGQR+AT+LMYLSD  EGGET FP A +         E S  GK  +G
Sbjct: 63  HDYFSDTFNLKRGGQRVATMLMYLSDNVEGGETYFPMAGS--------GECSCGGKIVKG 114

Query: 237 LSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           LSVKP +GDA+LFWSM  D   DP S+HGGC V+ G KWSATKWM
Sbjct: 115 LSVKPIKGDAVLFWSMGLDGQSDPKSIHGGCEVLAGEKWSATKWM 159


>gi|384250599|gb|EIE24078.1| hypothetical protein COCSUDRAFT_47131 [Coccomyxa subellipsoidea
           C-169]
          Length = 327

 Score =  171 bits (433), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 89/213 (41%), Positives = 124/213 (58%), Gaps = 8/213 (3%)

Query: 75  QWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKT-GQSKDSRVRTSSGT 133
           Q  ++++W PR  +Y  F+    C++ + +AK  +  S +    T G  +   VRTS GT
Sbjct: 107 QPQQLISWYPRIILYPGFIDPERCKHFVKVAKARLAPSGLALRTTEGPQETENVRTSQGT 166

Query: 134 FLKRGQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNG 191
           F+ R  D   +I  +E++ A  T +P+ HGE   VL Y+ GQ YD+HYD F  E      
Sbjct: 167 FMSRKDDPAGVIAWVEEKAAQVTGLPVSHGEPFNVLRYQDGQHYDSHYDIFEPESYGPQP 226

Query: 192 GQRMATLLMYLSDVEEGGETVFPAA---NANFTSVRWWNELSECGKQGLSVKPKRGDALL 248
            QRMAT+L YL+DVEEGGET+FP       +   +  +N  S C   G   KP+ GDAL+
Sbjct: 227 SQRMATILFYLTDVEEGGETIFPLEGRYGPDLLKMTGFNYKS-C-TTGFKYKPRMGDALM 284

Query: 249 FWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           F+SM P+ T D  +LHGGCPV+ G KW ATKW+
Sbjct: 285 FYSMHPNGTFDKHALHGGCPVMAGEKWVATKWI 317


>gi|302835042|ref|XP_002949083.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
           nagariensis]
 gi|300265828|gb|EFJ50018.1| hypothetical protein VOLCADRAFT_89416 [Volvox carteri f.
           nagariensis]
          Length = 263

 Score =  171 bits (432), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 98/220 (44%), Positives = 131/220 (59%), Gaps = 22/220 (10%)

Query: 76  WTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFL 135
           W E V+W PRAFVYH FL+ AEC++LI+LA P + +S VV + +    D  +RTS    +
Sbjct: 58  WVETVSWMPRAFVYHQFLTPAECDHLIELATPKLERSMVVGTDSDLIDD--IRTSFSASI 115

Query: 136 KRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGG-QR 194
             G+  I+  IE+RIA +T           VL Y  GQKYDAH+D+F D    K GG  R
Sbjct: 116 MYGETSIVSSIEERIARWT-----------VLRYVNGQKYDAHWDWFDDNEVAKAGGSNR 164

Query: 195 MATLLMYLSDVEE--GGETVFPAA---NANFTSVRWWNELSEC-GKQGLSVKPKRGDALL 248
           MAT+LMYLSDV+   GGET  P A   + +  SV      S+C  + G+S++P++GD LL
Sbjct: 165 MATVLMYLSDVDPAAGGETALPLAEPLDPHKQSVD-GQGYSQCAARMGISIRPRKGDVLL 223

Query: 249 FWSMRPDATL-DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           FW M P   + D  +LH  CP   G KW+ATKW+H   Y+
Sbjct: 224 FWDMDPAGLIPDRHALHASCPTFSGTKWTATKWIHNKPYR 263


>gi|357453665|ref|XP_003597113.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|357482683|ref|XP_003611628.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355486161|gb|AES67364.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355512963|gb|AES94586.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
          Length = 294

 Score =  170 bits (431), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 84/208 (40%), Positives = 132/208 (63%), Gaps = 9/208 (4%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR--VRTSSGTFL 135
           ++++W PRA  + NF S  +C+ +I++AK  +  S ++  + G++++    +RTSSG F+
Sbjct: 84  QVLSWNPRALYFPNFASAEQCDRIIEMAKAELSPSRLM-LREGETEEGTKGIRTSSGMFI 142

Query: 136 KRGQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQ 193
              +D+  ++  I+++IA    IP  HG    +L Y+VGQKY++HYD F          Q
Sbjct: 143 SASEDKTGLLEVIDEKIARAAKIPKTHGGAYNILRYKVGQKYNSHYDAFNPAEYGPQESQ 202

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+A+ L+YL+DV EGGET+FP  N +     +     +C   GL +KP +GD LLF+S+ 
Sbjct: 203 RVASFLLYLTDVPEGGETMFPFENGSNMDSSY--NFEDC--IGLKIKPLKGDGLLFYSLF 258

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWM 281
           P+ T+DP+SLHG CPVI+G KW ATKW+
Sbjct: 259 PNGTIDPTSLHGSCPVIKGEKWVATKWI 286


>gi|302845120|ref|XP_002954099.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
           nagariensis]
 gi|300260598|gb|EFJ44816.1| hypothetical protein VOLCADRAFT_64439 [Volvox carteri f.
           nagariensis]
          Length = 231

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 89/208 (42%), Positives = 124/208 (59%), Gaps = 6/208 (2%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD--SRVRTSSGTFL 135
           +I++W PR  V+  F+ KA  EY+I LA  +M  S +   + G++ D   + RTS+GTFL
Sbjct: 18  QILSWYPRVVVFPGFIDKARAEYVIKLASKFMYPSGLA-YRPGETVDPSQQTRTSTGTFL 76

Query: 136 KRGQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQ 193
               D   ++  +E+RIA  T +P E+GE   VLHYE  Q YD+HYD F  +       Q
Sbjct: 77  AAAMDPEGVLGWVEQRIAAATLLPAENGEAFNVLHYEKEQHYDSHYDTFDPKEFGPQPSQ 136

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+AT+L+YLS+V EGGETVF     +  + R   +   C        P+ GDA+LFW  +
Sbjct: 137 RIATVLLYLSEVLEGGETVFKREGVDGEN-RVIGDWRNCDDGSFKYMPRMGDAVLFWGTK 195

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWM 281
           P+  +DP +LHGGCPV RG KW ATKW+
Sbjct: 196 PNGDIDPHALHGGCPVKRGEKWVATKWI 223


>gi|413945803|gb|AFW78452.1| hypothetical protein ZEAMMB73_588774 [Zea mays]
          Length = 239

 Score =  169 bits (427), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 87/174 (50%), Positives = 117/174 (67%), Gaps = 9/174 (5%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++ +PR F+Y +FLS  E  +LI LA+  + +S V D+ +G+S  S VRTSSGTFL++GQ
Sbjct: 54  ISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSEVRTSSGTFLRKGQ 113

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+ GIE +IA +TF+P E+GE IQVL Y+ G+KY+ HYDYF D  NT  GG R AT+L
Sbjct: 114 DPIVEGIEDKIAAWTFLPKENGEDIQVLRYKHGEKYEPHYDYFTDNVNTVRGGHRYATVL 173

Query: 200 MYLSDVEEGGETVFPAANANF--TSVRW-WNELSECGKQGLSVKPKRGDALLFW 250
           +YL+DV EGGETVFP A  NF   SV + + E+ E G +   +        LFW
Sbjct: 174 LYLTDVPEGGETVFPLAEVNFFIFSVTFVFKEMVESGSEVFLI------FFLFW 221


>gi|145347188|ref|XP_001418057.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144578285|gb|ABO96350.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 317

 Score =  168 bits (426), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 91/219 (41%), Positives = 135/219 (61%), Gaps = 16/219 (7%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E ++W PR F+  NFLS  ECE+LI+L +  + +STVV+S    +  S  RTS GTF+ R
Sbjct: 37  ETLSWSPRVFLLKNFLSDEECEHLIELGEKKLERSTVVNSDESGAV-STARTSFGTFVTR 95

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMAT 197
                ++ +E R+A ++ IP EH E +Q+L Y  GQ+Y AH+D  + E    NGG+R+AT
Sbjct: 96  RLTETLQRVEDRVAKYSGIPWEHQEQLQLLRYRDGQEYVAHHDGIISE----NGGKRIAT 151

Query: 198 LLMYLSDVEEGGETVFPAAN------ANFTSVRWWNELSECG---KQGLSVKPKRGDALL 248
           +LM+L +   GGET FP         A F + +  ++LSECG     G SV PK+G+A+L
Sbjct: 152 VLMFLREPTSGGETSFPQGTPLPETKAAFLANK--DKLSECGWNDGNGFSVIPKKGEAVL 209

Query: 249 FWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           F+S   + T DP + H  CP + G K++ATKW+H + ++
Sbjct: 210 FFSFHINGTNDPFANHASCPTLGGTKYTATKWIHENPFE 248


>gi|414591891|tpg|DAA42462.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
          Length = 207

 Score =  168 bits (425), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 84/159 (52%), Positives = 108/159 (67%), Gaps = 1/159 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+W PR FVY  FLS AEC++L+ LAK  + +S V D+++G+S  S VRTSSG FL + Q
Sbjct: 48  VSWHPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNESGKSVKSEVRTSSGMFLDKRQ 107

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++  IE+RIA +TF+P E+ E +QVL YE GQKY+ H+DYF D  N   GG R AT+L
Sbjct: 108 DPVVSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFHDRVNQARGGHRYATVL 167

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLS 238
           MYLS V EGGETVFP A   + S       SEC  +GL+
Sbjct: 168 MYLSTVREGGETVFPNAKG-WESQPKDATFSECAHKGLA 205


>gi|295699617|ref|YP_003607510.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
 gi|295438830|gb|ADG17999.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
          Length = 286

 Score =  168 bits (425), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 88/208 (42%), Positives = 124/208 (59%), Gaps = 24/208 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P+  V+ + LS AEC  LI+ ++  + +ST V+  TG+    R RTS G + +RG+D++I
Sbjct: 97  PQLVVFADVLSAAECAELIERSRHRLKRSTTVNPLTGREDVIRNRTSEGVWYRRGEDQLI 156

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE-----FNTKNGGQRMATL 198
             +E+RIA  T  P+E+GEG+QVLHY    +Y  H+D+F  +      +T  GGQR+ATL
Sbjct: 157 ARVERRIASLTNWPLENGEGLQVLHYGTSGEYSPHFDFFAPDQPGSAVHTTQGGQRVATL 216

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+DV +GGETVFP A                   GLSV  + G A+ F  M  +  L
Sbjct: 217 IIYLNDVADGGETVFPTA-------------------GLSVAAQAGGAVYFRYMNAERQL 257

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           DPS+LHGG PV+ G+KW  TKWM    Y
Sbjct: 258 DPSTLHGGAPVLAGDKWIMTKWMRERAY 285


>gi|363543297|ref|NP_001241864.1| prolyl 4-hydroxylase 4-2 precursor [Zea mays]
 gi|194704960|gb|ACF86564.1| unknown [Zea mays]
 gi|347978810|gb|AEP37747.1| prolyl 4-hydroxylase 4-2 [Zea mays]
          Length = 207

 Score =  167 bits (423), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 84/159 (52%), Positives = 107/159 (67%), Gaps = 1/159 (0%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+W PR FVY  FLS AEC++L+ LAK    +S V D+++G+S  S VRTSSG FL + Q
Sbjct: 48  VSWHPRIFVYKGFLSDAECDHLVTLAKKKTQRSMVADNESGKSVKSEVRTSSGMFLDKRQ 107

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++  IE+RIA +TF+P E+ E +QVL YE GQKY+ H+DYF D  N   GG R AT+L
Sbjct: 108 DPVVSRIEERIAAWTFLPQENAENMQVLRYEPGQKYEPHFDYFHDRVNQARGGHRYATVL 167

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLS 238
           MYLS V EGGETVFP A   + S       SEC  +GL+
Sbjct: 168 MYLSTVREGGETVFPNAKG-WESQPKDATFSECAHKGLA 205


>gi|333981907|ref|YP_004511117.1| procollagen-proline dioxygenase [Methylomonas methanica MC09]
 gi|333805948|gb|AEF98617.1| Procollagen-proline dioxygenase [Methylomonas methanica MC09]
          Length = 286

 Score =  167 bits (422), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 84/208 (40%), Positives = 120/208 (57%), Gaps = 24/208 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P   V   F+S  ECE LI+ ++  +  S +VD +TG+ +    R+S GT+ +RG+  +I
Sbjct: 96  PDIVVVDEFMSGEECEQLIEQSRRKLTPSAIVDPQTGKFQVIADRSSEGTYFQRGESPLI 155

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE-----FNTKNGGQRMATL 198
             +++RI++    P +HGEGIQ+LHY VG +Y  H+DYFL+            GQR+ATL
Sbjct: 156 SRLDRRISELMNWPEDHGEGIQILHYGVGAQYKPHFDYFLENESGGALQMTQSGQRVATL 215

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL++V EGGETVFP                     G+S+ PKRG A  F        +
Sbjct: 216 VMYLNEVTEGGETVFPDV-------------------GISITPKRGSAAYFAYCNSLGQV 256

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           DP++LHGG PV+ G KW ATKWM  ++Y
Sbjct: 257 DPATLHGGAPVLTGEKWIATKWMRQYKY 284


>gi|449520827|ref|XP_004167434.1| PREDICTED: putative prolyl 4-hydroxylase-like, partial [Cucumis
           sativus]
          Length = 164

 Score =  166 bits (420), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 87/162 (53%), Positives = 107/162 (66%), Gaps = 14/162 (8%)

Query: 125 SRVRTSSGTFLKRGQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF 182
           S  RTSSG FL   +    +++ IEKRI+ ++ +P+E+GE IQVL YE  Q Y  H+DYF
Sbjct: 4   SDFRTSSGMFLSHHEKNFPMVQAIEKRISVYSQVPVENGELIQVLRYEKNQFYKPHHDYF 63

Query: 183 LDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQ---GLSV 239
            D FN K GGQR+AT+LMYLS+  EGGET FP A +   S         CG +   GLSV
Sbjct: 64  SDTFNLKRGGQRIATMLMYLSENIEGGETYFPKAGSGECS---------CGGKTVPGLSV 114

Query: 240 KPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           KP +GDA+LFWSM  D   DP S+HGGC V+ G KWSATKWM
Sbjct: 115 KPAKGDAVLFWSMGLDGQSDPKSIHGGCEVLSGEKWSATKWM 156


>gi|159489502|ref|XP_001702736.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280758|gb|EDP06515.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 231

 Score =  165 bits (417), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 87/207 (42%), Positives = 121/207 (58%), Gaps = 4/207 (1%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR-VRTSSGTFLK 136
           +I++W PR  V+  F+ KA  E+++ LA  +M  S +      Q + S+  RTS+GTFL 
Sbjct: 18  QILSWYPRIVVFPGFIDKARAEHIVKLAGKFMYPSGLAYRPGEQVESSQQTRTSTGTFLS 77

Query: 137 RGQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
            G D   ++  +E+RIA  T +P ++GE   VLHYE  Q YD+H D F  +       QR
Sbjct: 78  SGMDTEGVLGWVEQRIAAATLLPADNGEAFNVLHYEHMQHYDSHMDSFDPKDFGPQPSQR 137

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +AT+L+YLS+V EGGETVF     +    R   +   C        P+ GDA+LFW  RP
Sbjct: 138 IATVLLYLSEVLEGGETVFKKEGVDGAD-RPIQDWRNCDDGSFKYAPRMGDAVLFWGTRP 196

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWM 281
           +  +DP SLHGGCPV +G KW ATKW+
Sbjct: 197 NGEIDPHSLHGGCPVKKGEKWVATKWI 223


>gi|344172475|emb|CCA85118.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
          Length = 289

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 87/209 (41%), Positives = 119/209 (56%), Gaps = 24/209 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  ++ +FLS  EC+ LI L +  + +S VV+ +TG+      RTS G   + G+  +I
Sbjct: 97  PRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLI 156

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-----DEFNTKNGGQRMATL 198
             IE RIA  T +P+EHGEG QVLHY+ G +Y  H+DYF      +    + GGQR+ATL
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+ V+ GG T FP                   K GL V P +G+A+ F   RPD TL
Sbjct: 217 VIYLNSVQAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGTL 257

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           D ++LH G PV RG KW ATKW+    Y+
Sbjct: 258 DDNTLHAGLPVERGEKWIATKWLRERPYR 286


>gi|344169181|emb|CCA81504.1| putative Prolyl 4-hydroxylase alpha subunit [blood disease
           bacterium R229]
          Length = 289

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 87/209 (41%), Positives = 119/209 (56%), Gaps = 24/209 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  ++ +FLS  EC+ LI L +  + +S VV+ +TG+      RTS G   + G+  +I
Sbjct: 97  PRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLI 156

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-----DEFNTKNGGQRMATL 198
             IE RIA  T +P+EHGEG QVLHY+ G +Y  H+DYF      +    + GGQR+ATL
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+ V+ GG T FP                   K GL V P +G+A+ F   RPD TL
Sbjct: 217 VIYLNSVQAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGTL 257

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           D ++LH G PV RG KW ATKW+    Y+
Sbjct: 258 DDNTLHAGLPVERGEKWIATKWLRERPYR 286


>gi|300690371|ref|YP_003751366.1| prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum PSI07]
 gi|299077431|emb|CBJ50057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           PSI07]
          Length = 289

 Score =  165 bits (417), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 87/209 (41%), Positives = 119/209 (56%), Gaps = 24/209 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  ++ +FLS  EC+ LI L +  + +S VV+ +TG+      RTS G   + G+  +I
Sbjct: 97  PRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLI 156

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-----DEFNTKNGGQRMATL 198
             IE RIA  T +P+EHGEG QVLHY+ G +Y  H+DYF      +    + GGQR+ATL
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+ V+ GG T FP                   K GL V P +G+A+ F   RPD TL
Sbjct: 217 VIYLNSVQAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGTL 257

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           D ++LH G PV RG KW ATKW+    Y+
Sbjct: 258 DDNTLHAGLPVERGEKWIATKWLRERPYR 286


>gi|299532490|ref|ZP_07045880.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
 gi|298719437|gb|EFI60404.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni S44]
          Length = 299

 Score =  164 bits (416), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 88/209 (42%), Positives = 120/209 (57%), Gaps = 26/209 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V+ N LS  EC+ +I  A+P M +S  VD+++G    +  RTS+G F +RG++ +I
Sbjct: 112 PRVVVFGNLLSDEECDAIIAAARPRMRRSLTVDNQSGGEAVNDDRTSNGMFFQRGENELI 171

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-DEFNT----KNGGQRMATL 198
             +E+RIA     P+E+GEG+QVLHY  G +Y  HYDYF  +E  T    K GGQR+ TL
Sbjct: 172 SLVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTL 231

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL++   GG T FP                     GL V P+RG+A+ F   RPD   
Sbjct: 232 VMYLNEPARGGATTFPDV-------------------GLQVVPRRGNAVFFSYNRPDPAT 272

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
              +LHGG PV+ G KW ATKW+   E+K
Sbjct: 273 --KTLHGGAPVLEGEKWIATKWLREREFK 299


>gi|30681957|ref|NP_850038.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|330252315|gb|AEC07409.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 274

 Score =  164 bits (414), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 92/272 (33%), Positives = 148/272 (54%), Gaps = 12/272 (4%)

Query: 19  LVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSPPNDLTSFRRRAFEKRSSIAEEKGEQWTE 78
           LV   +F+     L +   G  +  +       +++ + RR   ++  S+  + G   + 
Sbjct: 10  LVFPYVFIACCFFLAIF--GFCFFNLFSQGISFSEIPTTRRSVNDETDSL--DHGSSVSN 65

Query: 79  I----VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTF 134
           I    ++W PR F   NF +K +CE +ID+AKP +  ST+   K   ++ ++   S    
Sbjct: 66  IPFHGLSWNPRVFYLPNFATKQQCEAVIDMAKPKLKPSTLALRKGETAETTQNYRSLHQH 125

Query: 135 LKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
               +  ++  IE++IA  T  P ++ E   +L Y++GQKYD+HYD F          QR
Sbjct: 126 TDEDESGVLAAIEEKIALATRFPKDYYESFNILRYQLGQKYDSHYDAFHSAEYGPLISQR 185

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           + T L++LS VEEGGET+FP  N    + R+  +  +C   GL VKP++GDA+ F+++ P
Sbjct: 186 VVTFLLFLSSVEEGGETMFPFENGRNMNGRY--DYEKC--VGLKVKPRQGDAIFFYNLFP 241

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           + T+D +SLHG CPVI+G KW ATKW+    Y
Sbjct: 242 NGTIDQTSLHGSCPVIKGEKWVATKWIRDQTY 273


>gi|418530659|ref|ZP_13096582.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
 gi|371452378|gb|EHN65407.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni ATCC 11996]
          Length = 299

 Score =  163 bits (413), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 87/209 (41%), Positives = 120/209 (57%), Gaps = 26/209 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V+ N LS  EC+ +I  A+P M +S  VD+++G    +  RTS+G F +RG++ +I
Sbjct: 112 PRVVVFGNLLSNEECDAIIAAARPRMQRSLTVDNQSGGEAVNDDRTSNGMFFQRGENDLI 171

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-DEFNT----KNGGQRMATL 198
             +E+RIA     P+E+GEG+QVLHY  G +Y  HYDYF  +E  T    K GGQR+ TL
Sbjct: 172 SRVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTL 231

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL++   GG T FP                     GL V P+RG+A+ F   RP+   
Sbjct: 232 VMYLNEPARGGATTFPDV-------------------GLQVVPRRGNAVFFSYNRPEPAT 272

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
              +LHGG PV+ G KW ATKW+   E+K
Sbjct: 273 --KTLHGGAPVLEGEKWIATKWLREREFK 299


>gi|264677094|ref|YP_003277000.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
 gi|262207606|gb|ACY31704.1| 2OG-Fe(II) oxygenase [Comamonas testosteroni CNB-2]
          Length = 306

 Score =  163 bits (413), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 87/209 (41%), Positives = 120/209 (57%), Gaps = 26/209 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V+ N LS  EC+ +I  A+P M +S  VD+++G    +  RTS+G F +RG++ +I
Sbjct: 119 PRVVVFGNLLSDEECDAIIAAARPRMRRSLTVDNQSGGEAVNDDRTSNGMFFQRGENDLI 178

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-DEFNT----KNGGQRMATL 198
             +E+RIA     P+E+GEG+QVLHY  G +Y  HYDYF  +E  T    K GGQR+ TL
Sbjct: 179 SLVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTL 238

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL++   GG T FP                     GL + P+RG+A+ F   RPD   
Sbjct: 239 VMYLNEPARGGATTFPDV-------------------GLQIVPRRGNAVFFSYNRPDPAT 279

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
              +LHGG PV+ G KW ATKW+   E+K
Sbjct: 280 --KTLHGGAPVLEGEKWIATKWLREREFK 306


>gi|297825201|ref|XP_002880483.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297326322|gb|EFH56742.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 272

 Score =  163 bits (413), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 86/211 (40%), Positives = 131/211 (62%), Gaps = 12/211 (5%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDS--RVRTSSGTFLKR 137
           ++W PR F   NF +K +CE +ID+AKP + K +++  + G++ ++   VRT      K 
Sbjct: 69  LSWNPRVFYLPNFATKQQCEAVIDMAKPKL-KPSLLALRKGETAETTQNVRTR---LKKT 124

Query: 138 GQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
            +D   I+  IE++IA  T IP+++ E   +L Y++GQKYD+HYD F          QR+
Sbjct: 125 DEDESGILAAIEEKIALATRIPIDYYESFNILRYQLGQKYDSHYDAFHPAEYGPQISQRV 184

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
            T +++LS VEEGGET+FP  N    + R+  +   C   GL VKP++GDA+ F+++ P+
Sbjct: 185 VTFILFLSSVEEGGETMFPFENGRNMNGRY--DYETC--IGLRVKPRQGDAIFFYNLLPN 240

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            T+D +SLHG CPVI+G KW ATKW+    Y
Sbjct: 241 RTIDQTSLHGSCPVIKGEKWVATKWIRDQTY 271


>gi|17547533|ref|NP_520935.1| hypothetical protein RSc2814 [Ralstonia solanacearum GMI1000]
 gi|17429837|emb|CAD16521.1| putative prolyl 4-hydroxylase alpha subunit homologue
           oxidoreductase protein [Ralstonia solanacearum GMI1000]
          Length = 289

 Score =  163 bits (413), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 86/209 (41%), Positives = 118/209 (56%), Gaps = 24/209 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  ++ +FLS  EC+ LI L +  + +S VV+ +TG+      RTS G   + G+  ++
Sbjct: 97  PRIVLFQHFLSDEECDQLIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLV 156

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-----DEFNTKNGGQRMATL 198
             IE RIA  T +P+EHGEG QVLHY+ G +Y  H+DYF      +    + GGQR+ATL
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+ V  GG T FP                   K GL V P +G+A+ F   RPD TL
Sbjct: 217 VIYLNSVPAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGTL 257

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           D ++LH G PV RG KW ATKW+    Y+
Sbjct: 258 DDNTLHAGLPVERGEKWIATKWLRERPYR 286


>gi|385206010|ref|ZP_10032880.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
 gi|385185901|gb|EIF35175.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
          Length = 296

 Score =  163 bits (413), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 90/221 (40%), Positives = 126/221 (57%), Gaps = 26/221 (11%)

Query: 73  GEQWTEIVA--WEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTS 130
           GE+ T +++    P A +  +FLS  ECE LI LA+P + +STVVD  TG++  +  R+S
Sbjct: 89  GERKTRVISRMQRPAAILLDDFLSANECEQLISLARPRLSRSTVVDPVTGRNVVAGHRSS 148

Query: 131 SGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-----DE 185
            G F + G+  +I  +E RIA+ T +P+E+GEG+Q+LHYEVG +   H DY +     ++
Sbjct: 149 DGMFFRLGETPLIARLEARIAELTGLPVENGEGLQLLHYEVGAESTPHVDYLIAGNPANQ 208

Query: 186 FNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGD 245
            +    GQR+ TLLMYL+DVE GGET+FP                   + G SV P+RG 
Sbjct: 209 ESIARSGQRVGTLLMYLNDVEGGGETMFP-------------------QTGWSVVPRRGQ 249

Query: 246 ALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           AL F         DPSSLH   P+  G KW ATKW+    +
Sbjct: 250 ALYFEYGNRFGLADPSSLHTSTPLRVGEKWVATKWIRTRRF 290


>gi|302850293|ref|XP_002956674.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
           nagariensis]
 gi|300258035|gb|EFJ42276.1| hypothetical protein VOLCADRAFT_67269 [Volvox carteri f.
           nagariensis]
          Length = 325

 Score =  163 bits (413), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 96/236 (40%), Positives = 135/236 (57%), Gaps = 22/236 (9%)

Query: 48  DSPPNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKP 107
           ++PP    S+R+  F+  S +   +     + ++W+PRA VYHNFLS  E  ++IDLA  
Sbjct: 20  ETPP----SYRQSHFQSLSQLPTCR----IQTISWKPRAVVYHNFLSDQEARHIIDLAHE 71

Query: 108 YMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVL 167
            M +STVV +K     D  +RTS GTFL+R QD +I  IE+R+A ++ +P  H E +QVL
Sbjct: 72  QMKRSTVVGNKNEGVVDD-IRTSYGTFLRRAQDPVIMAIEERLALWSHMPPSHQEDMQVL 130

Query: 168 HYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWN 227
            Y    KY  H D          G +R+AT+LMYL   E  G  + P +          N
Sbjct: 131 RYGRTNKYGPHID----------GLERVATVLMYLVG-ESPGPDLAPVSACECMYAEQSN 179

Query: 228 ELSECGKQGLSVKPKRGDALLFWSMRPD-ATLDPSSLHGGCPVIRGNKWSATKWMH 282
             S C K  ++ KPKRGDAL+F+ ++PD  T D  S+H GCPV+ G KW+A KW+H
Sbjct: 180 P-SACAKGHVAYKPKRGDALMFFDVKPDYTTTDGHSMHTGCPVVAGVKWNAVKWIH 234


>gi|386332363|ref|YP_006028532.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
 gi|334194811|gb|AEG67996.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
          Length = 292

 Score =  163 bits (413), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 86/209 (41%), Positives = 118/209 (56%), Gaps = 24/209 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  ++ +FLS  EC+ LI L +  + +S VV+ +TG+      RTS G   + G+  ++
Sbjct: 100 PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 159

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-----DEFNTKNGGQRMATL 198
             IE RIA  T +P+EHGEG QVLHY  G +Y  H+DYF      +    + GGQR+ATL
Sbjct: 160 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 219

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+ V+ GG T FP                   K GL V P +G+A+ F   RPD TL
Sbjct: 220 VIYLNSVQAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGTL 260

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           D ++LH G PV RG KW ATKW+    Y+
Sbjct: 261 DDNTLHAGLPVERGEKWIATKWLRERPYR 289


>gi|299065638|emb|CBJ36810.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           CMR15]
          Length = 289

 Score =  163 bits (413), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 87/209 (41%), Positives = 117/209 (55%), Gaps = 24/209 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  ++ +FLS  EC+ LI L +  + +S VV+ +TG+      RTS G   + G+  +I
Sbjct: 97  PRIVLFQHFLSDEECDQLITLGRHRLKRSPVVNPETGEENLISARTSQGAMFQVGEHPLI 156

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-----DEFNTKNGGQRMATL 198
             IE RIA  T +P+EHGEG QVLHY+ G +Y  H+DYF      +    + GGQR+ATL
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYQPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+ V  GG T FP                   K GL V P +G+A+ F   RPD TL
Sbjct: 217 VIYLNSVPAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGTL 257

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           D  +LH G PV RG KW ATKW+    Y+
Sbjct: 258 DDKTLHAGLPVERGEKWIATKWLRERPYR 286


>gi|207744371|ref|YP_002260763.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum IPO1609]
 gi|206595776|emb|CAQ62703.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
           solanacearum IPO1609]
          Length = 280

 Score =  163 bits (412), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 86/209 (41%), Positives = 118/209 (56%), Gaps = 24/209 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  ++ +FLS  EC+ LI L +  + +S VV+ +TG+      RTS G   + G+  ++
Sbjct: 88  PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 147

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-----DEFNTKNGGQRMATL 198
             IE RIA  T +P+EHGEG QVLHY  G +Y  H+DYF      +    + GGQR+ATL
Sbjct: 148 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 207

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+ V+ GG T FP                   K GL V P +G+A+ F   RPD TL
Sbjct: 208 VIYLNSVQAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGTL 248

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           D ++LH G PV RG KW ATKW+    Y+
Sbjct: 249 DDNTLHAGLPVERGEKWIATKWLRERPYR 277


>gi|83746819|ref|ZP_00943867.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
 gi|83726588|gb|EAP73718.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
          Length = 289

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 86/209 (41%), Positives = 118/209 (56%), Gaps = 24/209 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  ++ +FLS  EC+ LI L +  + +S VV+ +TG+      RTS G   + G+  ++
Sbjct: 97  PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 156

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-----DEFNTKNGGQRMATL 198
             IE RIA  T +P+EHGEG QVLHY  G +Y  H+DYF      +    + GGQR+ATL
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+ V+ GG T FP                   K GL V P +G+A+ F   RPD TL
Sbjct: 217 VIYLNSVQAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGTL 257

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           D ++LH G PV RG KW ATKW+    Y+
Sbjct: 258 DDNTLHAGLPVERGEKWIATKWLRERPYR 286


>gi|388567209|ref|ZP_10153646.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
 gi|388265592|gb|EIK91145.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
          Length = 296

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 92/224 (41%), Positives = 122/224 (54%), Gaps = 28/224 (12%)

Query: 71  EKGEQWTEIVAW--EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVR 128
           + G++W +I+     PR  V  N LS  EC+ +I+ AKP + +S  V + TG  + +  R
Sbjct: 94  DAGDRWVDIITHMNHPRVVVLGNLLSAEECDAIIESAKPKLARSLTVQTATGGEELNADR 153

Query: 129 TSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFN 187
           TSSG F  RGQ   +  +E+RIA     P+E+GEG+QVLHY  G +Y  HYDYF   E  
Sbjct: 154 TSSGMFFTRGQTPEVTAVERRIARLVGWPVENGEGLQVLHYRPGAEYKPHYDYFDPKEAG 213

Query: 188 T----KNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKR 243
           T    K GGQR+ATL+MYL++   GG T FP                     GL V P +
Sbjct: 214 TPTILKRGGQRVATLVMYLNEPARGGGTTFPDV-------------------GLEVAPVK 254

Query: 244 GDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           G A+ F   RP  T    SLHGG PV+ G KW ATKW+   E++
Sbjct: 255 GSAVFFSYDRPHPTT--RSLHGGAPVLEGEKWVATKWLREREFQ 296


>gi|221068712|ref|ZP_03544817.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
 gi|220713735|gb|EED69103.1| Procollagen-proline dioxygenase [Comamonas testosteroni KF-1]
          Length = 299

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 88/209 (42%), Positives = 119/209 (56%), Gaps = 26/209 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V+ N LS  EC+ +I  A P M +S  VD+++G    +  RTS+G F +RG++ +I
Sbjct: 112 PRVVVFGNLLSDEECDAIIAAAGPRMQRSLTVDNQSGGEAVNDDRTSNGMFFQRGENDLI 171

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-DEFNT----KNGGQRMATL 198
             +E+RIA     P+E+GEG+QVLHY  G +Y  HYDYF  +E  T    K GGQR+ TL
Sbjct: 172 CRVEQRIARLLNWPLENGEGMQVLHYRPGAEYKPHYDYFAPNEPGTPTILKRGGQRVGTL 231

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL++   GG T FP                     GL V P+RG+A+ F   RPD   
Sbjct: 232 VMYLNEPARGGATTFPDV-------------------GLQVVPRRGNAVFFSYNRPDPAT 272

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
              +LHGG PV+ G KW ATKW+   E+K
Sbjct: 273 --KTLHGGAPVLEGEKWIATKWLREREFK 299


>gi|300702992|ref|YP_003744594.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum
           CFBP2957]
 gi|299070655|emb|CBJ41950.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           CFBP2957]
          Length = 289

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 86/209 (41%), Positives = 118/209 (56%), Gaps = 24/209 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  ++ +FLS  EC+ LI L +  + +S VV+ +TG+      RTS G   + G+  ++
Sbjct: 97  PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 156

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-----DEFNTKNGGQRMATL 198
             IE RIA  T +P+EHGEG QVLHY  G +Y  H+DYF      +    + GGQR+ATL
Sbjct: 157 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATL 216

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+ V+ GG T FP                   K GL V P +G+A+ F   RPD TL
Sbjct: 217 VIYLNSVQAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGTL 257

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           D ++LH G PV RG KW ATKW+    Y+
Sbjct: 258 DDNTLHAGLPVERGEKWIATKWLRERPYR 286


>gi|299115886|emb|CBN75895.1| prolyl 4-hydroxylase alpha-1 subunit precursor-like protein
           [Ectocarpus siliculosus]
          Length = 404

 Score =  163 bits (412), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 90/210 (42%), Positives = 128/210 (60%), Gaps = 13/210 (6%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTV--VDSKTGQSKDSRVRTSSGTFL 135
           + ++ EP  F   NFL   EC+++ + A P+M  S V  +D   G+  D+  RTS+  F+
Sbjct: 194 KTLSMEPLVFEARNFLLDEECKHIREKADPHMKPSPVSLMDHDKGKP-DTNWRTSTTYFM 252

Query: 136 KRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKN--GGQ 193
              +D +++GI++R+ +FT +P  H E +QVL Y+ GQ+Y AH+D FLDE   +N  GG+
Sbjct: 253 PSTRDPLLQGIDRRVEEFTRVPKSHQEQVQVLKYDKGQRYTAHHD-FLDERTMRNMDGGR 311

Query: 194 --RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWS 251
             RM T+  YLSDVEEGGET+FP        V    + S+C   GL VKP  G   +F+S
Sbjct: 312 KNRMITVFWYLSDVEEGGETIFPRYGGRTGRV----DFSDC-TTGLKVKPVEGKVAMFYS 366

Query: 252 MRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           ++PD   D  SLHG CPVI G KW+A KW+
Sbjct: 367 LKPDGQFDDFSLHGACPVITGQKWAANKWV 396


>gi|407708877|ref|YP_006792741.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
 gi|407237560|gb|AFT87758.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
          Length = 300

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 121/208 (58%), Gaps = 24/208 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P+  V+ N LS  EC+ +I+ ++  + +ST+VD  TGQ    R RTS G + +RG+D  I
Sbjct: 111 PQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEGVIRNRTSEGIWYQRGEDAFI 170

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE-----FNTKNGGQRMATL 198
             +++RIA     P+E+GEG+Q+LHY    +Y  H+DYF  +      +T  GGQR+ATL
Sbjct: 171 ERLDRRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSAVHTARGGQRVATL 230

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+DV +GGET+FPAA                   GLSV  K+G A+ F  M     L
Sbjct: 231 VVYLNDVADGGETIFPAA-------------------GLSVAAKQGGAVYFRYMNGQRQL 271

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           DP +LHGG PV  G+KW  TKWM    Y
Sbjct: 272 DPLTLHGGAPVRAGDKWIMTKWMRERAY 299


>gi|372266874|ref|ZP_09502922.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
           [Alteromonas sp. S89]
          Length = 294

 Score =  162 bits (411), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 86/211 (40%), Positives = 118/211 (55%), Gaps = 24/211 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P   ++ NFL++ EC+ L+++++P +  S VV+++ G  +    RTS GT   RG+  +
Sbjct: 102 QPNIVLFANFLAEWECDALVEMSRPNLSPSRVVNTQHGAFELKPSRTSGGTHFARGETPL 161

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKN-----GGQRMAT 197
           I  IE RIA    +P  HGE +Q+LHY V  +Y  HYD+F  E          GGQR+ T
Sbjct: 162 IADIEARIASLLKVPEAHGEPLQILHYPVSGEYRPHYDFFDPEKPGNQEVLAAGGQRVGT 221

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           L+MYLSDVE GG TVFP                   + GL V+P++G AL F  +     
Sbjct: 222 LIMYLSDVESGGATVFP-------------------RVGLEVQPQKGAALFFSYVGEHGK 262

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           LD  SLHGG PV+ G KW ATKW+   EY A
Sbjct: 263 LDLQSLHGGSPVLAGEKWIATKWLRAAEYPA 293


>gi|407938132|ref|YP_006853773.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
 gi|407895926|gb|AFU45135.1| 2OG-Fe(II) oxygenase [Acidovorax sp. KKS102]
          Length = 303

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 90/210 (42%), Positives = 121/210 (57%), Gaps = 26/210 (12%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR  V+ N LS  EC+ LI  A+P M +S  V +KTG  + +  RTS G F +RGQ  +
Sbjct: 115 QPRIVVFGNLLSPEECDALIAAAEPRMARSLTVATKTGGEEINADRTSDGMFFQRGQSPL 174

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMAT 197
           I+ IE+RIA     P+E+GEG+QVLHY  G +Y  HYDYF   E  T    K GGQR+ T
Sbjct: 175 IQRIEERIARLLQWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPSIIKRGGQRVGT 234

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           L+MYL+  ++GG T FP  +                   L V P+RG+A+ F   RP  +
Sbjct: 235 LVMYLNTPDKGGGTTFPDVH-------------------LEVAPQRGNAVFFSYERPHPS 275

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
               +LHGG PVI G+KW ATKW+   E++
Sbjct: 276 T--RTLHGGAPVIAGDKWIATKWLREREFQ 303


>gi|308804269|ref|XP_003079447.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
 gi|116057902|emb|CAL54105.1| oxidoreductase, 2OG-Fe (ISS) [Ostreococcus tauri]
          Length = 363

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 93/235 (39%), Positives = 136/235 (57%), Gaps = 25/235 (10%)

Query: 61  AFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTG 120
           A +  S I  +    WT  ++W PRAF+Y NFL++ ECE+LI L +  + +STVV SK  
Sbjct: 76  AVKGASEIGAKARGTWT-TLSWSPRAFLYQNFLTEDECEHLIALGEKKLERSTVVGSKGK 134

Query: 121 QSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYD 180
           +      RTS GTF+ R     +  +E R+A+++ IP  H E +Q+L YE GQ+Y     
Sbjct: 135 EGDVHSARTSFGTFITRRLTPTLSAVEDRVAEYSGIPWRHQEQLQLLRYEKGQEYG---- 190

Query: 181 YFLDEFNTKNGGQRMATLLMYLSDVEEGGETVF------PAANANFTSVRWWNELSECG- 233
                    NG +R+AT+LM+L + E GGET F      PA  + F   R   +LS+CG 
Sbjct: 191 ---------NGEKRIATVLMFLREPEFGGETHFPDATPLPATRSEFLGSR--AKLSDCGW 239

Query: 234 --KQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
              +G SV P++GDA+LF+S   + T D ++ H  CP +RG K++ATKW+H  E+
Sbjct: 240 NEGRGFSVIPRKGDAILFFSHHINGTSDDAASHASCPTLRGIKYTATKWIHEKEF 294


>gi|323528042|ref|YP_004230194.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
 gi|323385044|gb|ADX57134.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
          Length = 300

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 87/208 (41%), Positives = 121/208 (58%), Gaps = 24/208 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P+  V+ N LS  EC+ +I+ ++  + +ST+VD  TGQ    R RTS G + +RG+D  I
Sbjct: 111 PQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEGVIRNRTSEGIWYQRGEDAFI 170

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE-----FNTKNGGQRMATL 198
             +++RIA     P+E+GEG+Q+LHY    +Y  H+DYF  +      +T  GGQR+ATL
Sbjct: 171 ERLDQRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSAVHTARGGQRVATL 230

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+DV +GGET+FPAA                   GLSV  K+G A+ F  M     L
Sbjct: 231 VVYLNDVADGGETIFPAA-------------------GLSVAAKQGGAVYFRYMNGQRQL 271

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           DP +LHGG PV  G+KW  TKWM    Y
Sbjct: 272 DPLTLHGGAPVHAGDKWIMTKWMRERAY 299


>gi|229086310|ref|ZP_04218488.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
 gi|228697005|gb|EEL49812.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
          Length = 220

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 90/200 (45%), Positives = 117/200 (58%), Gaps = 28/200 (14%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  ECE LI+L+K  M +S     K G S++   +RTSSGTFL+  +  
Sbjct: 42  EPLIVVLENVLSDEECESLIELSKDSMKRS-----KIGASREVDNIRTSSGTFLEENETV 96

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
            I  IEKR++    IP+EHGEG+ +L Y  GQ+Y AHYDYF  E +      R++TL+MY
Sbjct: 97  AI--IEKRVSSIMNIPVEHGEGLHILKYTPGQEYKAHYDYFA-EHSRAAENNRISTLVMY 153

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LS+ PK+G A+ F     D +L+  
Sbjct: 154 LNDVEEGGETFFPKLN-------------------LSIAPKKGSAVYFEYFYNDKSLNEL 194

Query: 262 SLHGGCPVIRGNKWSATKWM 281
           +LHGG PVI+G KW AT+WM
Sbjct: 195 TLHGGAPVIKGEKWVATQWM 214


>gi|421890664|ref|ZP_16321519.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           K60-1]
 gi|378964031|emb|CCF98267.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           K60-1]
          Length = 288

 Score =  162 bits (410), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 86/209 (41%), Positives = 117/209 (55%), Gaps = 24/209 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  ++ +FLS  EC+ LI L +  + +S VV+ +TG+      RTS G   + G+  ++
Sbjct: 96  PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 155

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-----DEFNTKNGGQRMATL 198
             IE RIA  T +P+EHGEG QVLHY  G +Y  H+DYF      +      GGQR+ATL
Sbjct: 156 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRSGEARQLDVGGQRVATL 215

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+ V+ GG T FP                   K GL V P +G+A+ F   RPD TL
Sbjct: 216 VIYLNSVQAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGTL 256

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           D ++LH G PV RG KW ATKW+    Y+
Sbjct: 257 DDNTLHAGLPVERGEKWIATKWLRERPYR 285


>gi|241767624|ref|ZP_04765273.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
 gi|241361463|gb|EER57922.1| Procollagen-proline dioxygenase [Acidovorax delafieldii 2AN]
          Length = 318

 Score =  162 bits (409), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 89/209 (42%), Positives = 118/209 (56%), Gaps = 26/209 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V+ N LS  ECE LI  A P M +S  V ++TG  + +  RTS G F +RG+  ++
Sbjct: 131 PRVVVFGNLLSPEECEALIAAAAPRMARSLTVATQTGGEEVNDDRTSHGMFFQRGESPLV 190

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMATL 198
           + IE+RIA     P+E+GEG+QVLHY  G +Y  HYDYF   E  T    + GGQR+ TL
Sbjct: 191 QRIEERIASLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTVIQRGGQRVGTL 250

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL+  E+GG T FP A                    + V P+RG+A  F   RP  T 
Sbjct: 251 VMYLNTPEQGGGTTFPDAQ-------------------IEVAPQRGNAAFFSYERP--TP 289

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
              +LHGG PV+ G+KW ATKW+   E+K
Sbjct: 290 STRTLHGGAPVLAGDKWIATKWLREREFK 318


>gi|337280547|ref|YP_004620019.1| hypothetical protein Rta_28970 [Ramlibacter tataouinensis TTB310]
 gi|334731624|gb|AEG94000.1| conserved hypothetical protein [Ramlibacter tataouinensis TTB310]
          Length = 286

 Score =  161 bits (407), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 89/205 (43%), Positives = 116/205 (56%), Gaps = 26/205 (12%)

Query: 82  WEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDR 141
           + PR  V+ + LS  ECE LI LAKP + +S  V +KTG  + +  RTSSG F +RG++ 
Sbjct: 97  YNPRVVVFGSLLSDQECEQLIGLAKPRLARSLTVATKTGGEEVNEDRTSSGMFFQRGENE 156

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMA 196
           ++  IE RIA     P+E+GEG+QVLHY  G +Y  HYDYF   E  T    K GGQR+ 
Sbjct: 157 LVARIEARIARLVNWPVENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILKRGGQRVG 216

Query: 197 TLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
           TL+MYL + E+GG T FP  +                   L V PKRG  + F   RP  
Sbjct: 217 TLVMYLGEPEKGGGTTFPDVH-------------------LEVAPKRGHGVFFSYERPHP 257

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWM 281
           +    +LHGG PV+ G KW ATKW+
Sbjct: 258 ST--RTLHGGAPVLAGEKWIATKWL 280


>gi|219121927|ref|XP_002181308.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217407294|gb|EEC47231.1| proly 4-hydroxylase [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 226

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 94/217 (43%), Positives = 125/217 (57%), Gaps = 21/217 (9%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKS--TVVDSKTGQSKDSRVRTSSGTFL 135
           E ++  P       FLS  EC Y+ + A+P+M  S  T++D   G+   S  RTS   F+
Sbjct: 8   ETLSLVPLVLSVEGFLSDDECTYIQETAEPHMEYSEVTLMDKDQGRPA-SDFRTSQSAFI 66

Query: 136 KRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTK------ 189
           +   D I+  I+ R A    IP  H E +QVL Y+V +KYD+H DYF     TK      
Sbjct: 67  RAHDDAILTDIDYRTASLVRIPRRHQEDVQVLRYDVTEKYDSHADYFDPALYTKDKRTLA 126

Query: 190 ---NGGQ-RMATLLMYLSDVEEGGETVFPAAN-ANFTSVRWWNELSECGKQGLSVKPKRG 244
              NG + RMAT+  YLSDVE+GGETVFP  N A  TS++      +C K GL VKP++G
Sbjct: 127 LIRNGHRNRMATVFWYLSDVEKGGETVFPRFNGAQETSMK------DC-KTGLKVKPEKG 179

Query: 245 DALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
             ++F+SM PD  LD  SLHG CPV +G KW+A KW+
Sbjct: 180 KVIIFYSMTPDGALDEYSLHGACPVQKGTKWAANKWV 216


>gi|351731158|ref|ZP_08948849.1| 2OG-Fe(II) oxygenase [Acidovorax radicis N35]
          Length = 303

 Score =  161 bits (407), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 90/210 (42%), Positives = 119/210 (56%), Gaps = 26/210 (12%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR  V+ N LS  EC+ LI  A P M +S  V +KTG  + +  RTS G F +RGQ  +
Sbjct: 115 QPRVVVFGNLLSPEECDALIADAAPRMARSLTVATKTGGEEINDDRTSDGMFFQRGQSPL 174

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMAT 197
           I+ IE+RIA     P+E+GEG+QVLHY  G +Y  HYDYF   E  T    K GGQR+ T
Sbjct: 175 IQRIEERIARLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTIVKRGGQRVGT 234

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           L+MYL+  E+GG T FP  +                   + V P+RG+A+ F   RP  +
Sbjct: 235 LVMYLNTPEKGGGTTFPDVH-------------------VEVAPQRGNAVFFSYERPHPS 275

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
               +LHGG PV+ G KW ATKW+   E+K
Sbjct: 276 T--RTLHGGAPVLAGEKWIATKWLREREFK 303


>gi|423389445|ref|ZP_17366671.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
 gi|401641536|gb|EJS59253.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
          Length = 216

 Score =  160 bits (406), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 90/206 (43%), Positives = 121/206 (58%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  ECE LI+L+K  M +S     K G S+D + +RTSSG FL+  ++ 
Sbjct: 38  EPLIVVLANVLSDEECEELIELSKNKMKRS-----KVGSSRDVNDIRTSSGAFLE--ENE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+  T +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|421895470|ref|ZP_16325871.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
           solanacearum MolK2]
 gi|206586635|emb|CAQ17221.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
           solanacearum MolK2]
          Length = 283

 Score =  160 bits (406), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 85/209 (40%), Positives = 117/209 (55%), Gaps = 24/209 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  ++ +FLS  EC+ LI L +  + +S VV+ +TG+      RTS G   + G+  ++
Sbjct: 91  PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQVGEHPLV 150

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-----DEFNTKNGGQRMATL 198
             IE RIA  T +P+EHGEG QVLHY  G +Y  H+DYF      +    + GGQR+ATL
Sbjct: 151 ARIEARIAQATGVPVEHGEGFQVLHYHPGGEYQPHFDYFNPGRGGEARQLEVGGQRVATL 210

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+ V+ GG T FP                   K GL V P +G+A+ F   RPD  L
Sbjct: 211 VIYLNSVQAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGML 251

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           D ++LH G PV RG KW ATKW+    Y+
Sbjct: 252 DDNTLHAGLPVERGEKWIATKWLRERPYR 280


>gi|91779740|ref|YP_554948.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
 gi|91692400|gb|ABE35598.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
          Length = 296

 Score =  160 bits (406), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 89/221 (40%), Positives = 124/221 (56%), Gaps = 26/221 (11%)

Query: 73  GEQWTEIVA--WEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTS 130
           GE+ T +++    P A +  +FLS  ECE LI LA+P + +STVVD  TG++  +  R+S
Sbjct: 89  GERKTRVISRMQRPAAVLLDDFLSANECEQLIALARPRLSRSTVVDPVTGRNVVAGHRSS 148

Query: 131 SGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-----DE 185
            G F + G+  +I  +E RIA+ T +P+E+GEG+Q+LHYE G +   H DY +     + 
Sbjct: 149 DGMFFRLGETPLIARLEARIAELTGLPVENGEGLQLLHYEAGAESTPHVDYLIAGNPANR 208

Query: 186 FNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGD 245
            +    GQR+ TLLMYL+DVE GGET+FP                   + G SV P+RG 
Sbjct: 209 ESIARSGQRVGTLLMYLNDVEGGGETMFP-------------------QTGWSVVPRRGQ 249

Query: 246 ALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           AL F         DPSSLH   P+  G KW ATKW+    +
Sbjct: 250 ALYFEYGNRFGLADPSSLHTSTPLRAGEKWVATKWIRTRRF 290


>gi|170690448|ref|ZP_02881615.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
 gi|170144883|gb|EDT13044.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
          Length = 307

 Score =  160 bits (405), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 86/208 (41%), Positives = 120/208 (57%), Gaps = 24/208 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P+  V+ N LS  EC+ +I+ ++  + +ST+VD  TGQ    R RTS G + +RG+D  I
Sbjct: 118 PQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEDVIRNRTSEGIWYQRGEDAFI 177

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE-----FNTKNGGQRMATL 198
             +++RIA     P+E+GEG+Q+LHY    +Y  H+DYF  +      +T  GGQR+ATL
Sbjct: 178 ERLDQRIASLMNWPVENGEGLQILHYGPTGEYRPHFDYFPPDQPGSMVHTARGGQRVATL 237

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+DV +GGET+FP A                   GLSV  K+G A+ F  M     L
Sbjct: 238 VIYLNDVPDGGETIFPEA-------------------GLSVAAKQGGAVYFRYMNGQRQL 278

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           DP +LHGG PV  G+KW  TKWM    Y
Sbjct: 279 DPLTLHGGAPVRAGDKWIMTKWMRERAY 306


>gi|423489423|ref|ZP_17466105.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
 gi|402431659|gb|EJV63723.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
          Length = 216

 Score =  160 bits (404), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 89/206 (43%), Positives = 121/206 (58%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  M +S     K G S+D + +RTSSG FL+  ++ 
Sbjct: 38  EPLIVVLANVLSDEECDELIELSKSKMERS-----KVGSSRDVNDIRTSSGAFLE--ENE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+  T +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSITNVPVSHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|163941996|ref|YP_001646880.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
 gi|229013455|ref|ZP_04170592.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
 gi|423495146|ref|ZP_17471790.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
 gi|423498060|ref|ZP_17474677.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
 gi|163864193|gb|ABY45252.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
 gi|228747867|gb|EEL97733.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
 gi|401151239|gb|EJQ58691.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
 gi|401161347|gb|EJQ68714.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
          Length = 216

 Score =  160 bits (404), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 89/206 (43%), Positives = 121/206 (58%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  M +S     K G S+D + +RTSSG FL+  ++ 
Sbjct: 38  EPLIVVLANVLSDEECDELIELSKSKMERS-----KVGSSRDVNDIRTSSGAFLE--ENE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+  T +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|365090417|ref|ZP_09328465.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
 gi|363416516|gb|EHL23626.1| 2OG-Fe(II) oxygenase [Acidovorax sp. NO-1]
          Length = 302

 Score =  159 bits (403), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 89/210 (42%), Positives = 120/210 (57%), Gaps = 26/210 (12%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR  V+ N LS  EC+ LI  A+P + +S  V +KTG  + +  RTS G F +RGQ  +
Sbjct: 114 QPRIVVFGNLLSPEECDALIADAQPRLARSLTVATKTGGEEINDDRTSDGMFFQRGQSPL 173

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNTKN----GGQRMAT 197
           I+ IE+RIA     P+E+GEG+QVLHY  G +Y  HYDYF   E  T +    GGQR+ T
Sbjct: 174 IQRIEERIARLLNWPIENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPSIVNRGGQRVGT 233

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           L+MYL+  E+GG T FP  +                   L V P+RG+A+ F   RP  +
Sbjct: 234 LVMYLNTPEKGGGTTFPDVH-------------------LEVAPQRGNAVFFSYERPHPS 274

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
               +LHGG PVI G KW ATKW+   E++
Sbjct: 275 T--RTLHGGAPVIAGEKWIATKWLREREFR 302


>gi|187930127|ref|YP_001900614.1| procollagen-proline dioxygenase [Ralstonia pickettii 12J]
 gi|187727017|gb|ACD28182.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12J]
          Length = 288

 Score =  159 bits (403), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 88/238 (36%), Positives = 127/238 (53%), Gaps = 24/238 (10%)

Query: 55  TSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTV 114
           TS   +A E  +++    G+         PR  ++ +FLS AEC+ LI + +  + +S V
Sbjct: 67  TSAADQAVENSNAVHTADGDIPILFAIETPRIVLFQHFLSDAECDELIAIGRNRLKRSPV 126

Query: 115 VDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQK 174
           V+  TG+      RTS G   + G+  +I  IE RIA    +P+EHGEG QVL+Y+ G +
Sbjct: 127 VNPDTGEENLISARTSQGGMFQVGEHPLIAKIEVRIAQAVGVPVEHGEGFQVLNYQPGGE 186

Query: 175 YDAHYDYFL-----DEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNEL 229
           Y  H+D+F      +    + GGQR+AT+++YL+ V+ GG T FP               
Sbjct: 187 YQPHFDFFNPGRSGEARQLEVGGQRVATMVIYLNSVQAGGATGFP--------------- 231

Query: 230 SECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
               K GL V P +G+A+ F   RPD TLD  +LH G PV RG KW ATKW+    Y+
Sbjct: 232 ----KLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVERGEKWIATKWLRERPYR 285


>gi|423598444|ref|ZP_17574444.1| hypothetical protein III_01246 [Bacillus cereus VD078]
 gi|423660914|ref|ZP_17636083.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
 gi|401236714|gb|EJR43171.1| hypothetical protein III_01246 [Bacillus cereus VD078]
 gi|401300955|gb|EJS06544.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
          Length = 216

 Score =  159 bits (403), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 89/206 (43%), Positives = 121/206 (58%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  M +S     K G S+D + +RTSSG FL+  ++ 
Sbjct: 38  EPLIVVLANVLSDEECDELIELSKSKMKRS-----KVGSSRDVNDIRTSSGAFLE--ENE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+  T +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|423521903|ref|ZP_17498376.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
 gi|401176565|gb|EJQ83760.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
          Length = 216

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 89/206 (43%), Positives = 121/206 (58%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  M +S     K G S+D + +RTSSG FL+  ++ 
Sbjct: 38  EPLIVVLANVLSDEECDKLIELSKNNMKRS-----KVGSSRDVNDIRTSSGAFLE--ENE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+  T +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGAYK 216


>gi|229019457|ref|ZP_04176278.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
 gi|229025700|ref|ZP_04182104.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
 gi|423417837|ref|ZP_17394926.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
 gi|228735575|gb|EEL86166.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
 gi|228741812|gb|EEL91991.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
 gi|401107008|gb|EJQ14965.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
          Length = 216

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 89/206 (43%), Positives = 121/206 (58%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  M +S     K G S+D + +RTSSG FL+  ++ 
Sbjct: 38  EPLIVVLANVLSDEECDELIELSKNKMKRS-----KVGSSRDVNDIRTSSGAFLE--ENE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+  T +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|229002593|ref|ZP_04160640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
 gi|229003816|ref|ZP_04161625.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
 gi|228757417|gb|EEM06653.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
 gi|228758520|gb|EEM07660.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
          Length = 219

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 87/199 (43%), Positives = 116/199 (58%), Gaps = 26/199 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           EP   V  N LS  ECE LI+++K  M +S +  S+    K + +RTSSG FL+  +  I
Sbjct: 41  EPLIVVLANVLSDEECETLIEMSKNKMKRSKIGISR----KTNDIRTSSGAFLE--ESEI 94

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
              IE+RIA    +P  HGEG+Q+L Y VGQ+Y AHYD+F+ E +      RM+TL+MYL
Sbjct: 95  TTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFFV-ENSAAASNNRMSTLVMYL 153

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           + VEEGGET FP  N                   LSV PK+G A+ F     D +++  +
Sbjct: 154 NHVEEGGETFFPKLN-------------------LSVSPKKGMAVYFEYFYQDESINKLT 194

Query: 263 LHGGCPVIRGNKWSATKWM 281
           LHGG PVI+G KW AT+WM
Sbjct: 195 LHGGAPVIKGEKWVATQWM 213


>gi|228990015|ref|ZP_04149988.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
           12442]
 gi|228769681|gb|EEM18271.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
           12442]
          Length = 219

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 87/199 (43%), Positives = 116/199 (58%), Gaps = 26/199 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           EP   V  N LS  ECE LI+++K  M +S +  S+    K + +RTSSG FL+  +  I
Sbjct: 41  EPLIVVLANVLSDEECETLIEMSKNKMKRSKIGVSR----KTNDIRTSSGAFLE--ESEI 94

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
              IE+RIA    +P  HGEG+Q+L Y VGQ+Y AHYD+F+ E +      RM+TL+MYL
Sbjct: 95  TTRIERRIASIMNVPAPHGEGLQILKYTVGQEYQAHYDFFV-ENSAAASNNRMSTLVMYL 153

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           + VEEGGET FP  N                   LSV PK+G A+ F     D +++  +
Sbjct: 154 NHVEEGGETFFPKLN-------------------LSVSPKKGMAVYFEYFYQDESINKLT 194

Query: 263 LHGGCPVIRGNKWSATKWM 281
           LHGG PVI+G KW AT+WM
Sbjct: 195 LHGGAPVIKGEKWVATQWM 213


>gi|254254263|ref|ZP_04947580.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
 gi|124898908|gb|EAY70751.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
          Length = 285

 Score =  159 bits (402), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 86/208 (41%), Positives = 120/208 (57%), Gaps = 24/208 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P+  V+ N L + EC+ +I  +   + +ST V+++TG  +  R RTS GT+ + G+D +I
Sbjct: 96  PQIVVFGNVLDQDECDEMIQRSMHKLEQSTTVNAETGTQEVIRHRTSHGTWFQNGEDALI 155

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEF-----NTKNGGQRMATL 198
           R IE R+A     P+E+GEG+QVL Y  G +Y +HYDYF         + + GGQR+ATL
Sbjct: 156 RRIETRLAALMNCPVENGEGLQVLRYTPGGEYRSHYDYFQPTAAGSLTHVRTGGQRVATL 215

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+DV  GGETVFP A                   G+SV P+RGDA+ F  M     L
Sbjct: 216 IVYLNDVPSGGETVFPEA-------------------GISVVPRRGDAVYFRYMNRLRQL 256

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           DP++LH G PV  G KW  TKW+    Y
Sbjct: 257 DPATLHAGAPVRDGEKWIMTKWVRERPY 284


>gi|423669823|ref|ZP_17644852.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
 gi|423673973|ref|ZP_17648912.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
 gi|401298950|gb|EJS04550.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
 gi|401309524|gb|EJS14857.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
          Length = 216

 Score =  159 bits (401), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 89/206 (43%), Positives = 120/206 (58%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  M +S     K G S+D + +RTSSG FL+  ++ 
Sbjct: 38  EPLIVVLANVLSDEECDELIELSKSKMERS-----KVGSSRDVNDIRTSSGAFLE--ENE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+  T +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D  L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQLLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|307725787|ref|YP_003909000.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
 gi|307586312|gb|ADN59709.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
          Length = 313

 Score =  158 bits (400), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 83/208 (39%), Positives = 121/208 (58%), Gaps = 24/208 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P+  V+ N LS  EC  +I+ ++  + +ST+VD  TG+    R RTS G + +RG+D +I
Sbjct: 124 PQVIVFGNVLSPDECAEMIERSRHRLKRSTIVDPATGREDVIRNRTSEGIWYQRGEDALI 183

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE-----FNTKNGGQRMATL 198
             +++RIA     P+E+GEG+Q+LHY    +Y  H+DYF  +      +T  GGQR+ATL
Sbjct: 184 ERLDQRIASLMNWPLENGEGLQILHYGPSGEYRPHFDYFPPDQPGSAVHTARGGQRVATL 243

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+DV +GGET+FP A                   GLSV  ++G A+ F  M     L
Sbjct: 244 VVYLNDVPDGGETIFPEA-------------------GLSVAAQQGGAVYFRYMNGRRQL 284

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           DP +LHGG PV+ G+KW  TKW+    Y
Sbjct: 285 DPLTLHGGAPVLSGDKWIMTKWVRERPY 312


>gi|295700439|ref|YP_003608332.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
 gi|295439652|gb|ADG18821.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
          Length = 296

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 90/208 (43%), Positives = 116/208 (55%), Gaps = 24/208 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P A    NFLS  ECE LI LA+P + +S VVD  TG+   +  R+S G F + G+  +I
Sbjct: 102 PAAVHLANFLSADECEQLIALAQPRLDRSAVVDPVTGRDVIATHRSSHGMFFRLGETPLI 161

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL--DEFNTKN---GGQRMATL 198
             IE RIA+ T  P+E+GEG+Q+LHYE G +   H DY +  +E N ++    GQRM TL
Sbjct: 162 ARIEARIAELTATPVENGEGLQMLHYEEGAESTPHVDYLMTGNEANRESIARSGQRMGTL 221

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           LMYL DVE GGETVFP                   + G S+ P+RG AL F         
Sbjct: 222 LMYLKDVEGGGETVFP-------------------QVGWSIVPQRGHALYFEYGNRYGMC 262

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           DPSSLH   P+  G+KW ATKW+    +
Sbjct: 263 DPSSLHASTPLRTGDKWVATKWIRTRRF 290


>gi|395003644|ref|ZP_10387769.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
 gi|394318439|gb|EJE54870.1| 2OG-Fe(II) oxygenase superfamily enzyme [Acidovorax sp. CF316]
          Length = 299

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 89/210 (42%), Positives = 119/210 (56%), Gaps = 26/210 (12%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR  V+ N LS  EC+ LI  A P M +S  V +KTG  + +  RTS G F +RG++ +
Sbjct: 111 KPRIVVFGNLLSAEECDALIAAAAPRMARSLTVATKTGGEEVNDDRTSDGMFFQRGENPV 170

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMAT 197
           ++ IE+RIA     P+E+GEG+QVLHY  G +Y  HYDYF   E  T    K GGQR+ T
Sbjct: 171 VQRIEERIARLLDWPIENGEGLQVLHYRPGAEYKPHYDYFDPGEPGTPTILKRGGQRVGT 230

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           L+MYL+  E+GG T FP  +                   + V P+RG+A+ F   R  A 
Sbjct: 231 LVMYLNTPEKGGGTTFPDVH-------------------VEVAPQRGNAVFFSYER--AH 269

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
               +LHGG PVI G KW ATKW+   E+K
Sbjct: 270 PATRTLHGGAPVIAGEKWIATKWLREREFK 299


>gi|229135058|ref|ZP_04263863.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
 gi|228648443|gb|EEL04473.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
          Length = 216

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 89/206 (43%), Positives = 119/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC  LI+L+K  M +S     K G S+D + +RTSSG FL+  ++ 
Sbjct: 38  EPLIVVLANVLSDEECAELIELSKSNMKRS-----KVGSSRDVNDIRTSSGAFLE--ENE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+  T +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D  L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQLLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|423512354|ref|ZP_17488885.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
 gi|402449325|gb|EJV81162.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
          Length = 216

 Score =  158 bits (399), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 89/206 (43%), Positives = 119/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC  LI+L+K  M +S     K G S+D + +RTSSG FL+  ++ 
Sbjct: 38  EPLIVVLANVLSDEECAELIELSKSNMKRS-----KVGSSRDVNDIRTSSGAFLE--ENE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+  T +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D  L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQLLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|160900716|ref|YP_001566298.1| procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
 gi|160366300|gb|ABX37913.1| Procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
          Length = 294

 Score =  157 bits (398), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 84/208 (40%), Positives = 119/208 (57%), Gaps = 26/208 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V+ N LS  EC+ +I  A+P M +S  V +++G  + +  RTS+G F +RG+  I+
Sbjct: 107 PRIVVFGNLLSHEECDAIIAAARPRMARSLTVATQSGGEEINDDRTSNGMFFQRGETGIV 166

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD-EFNT----KNGGQRMATL 198
             +E+RIA     P++HGEG+QVLHY  G +Y  H+DYF   E  T    K GGQR+ TL
Sbjct: 167 SQLEERIARLLRWPLDHGEGLQVLHYGPGAEYKPHHDYFAPGEPGTPTILKRGGQRVGTL 226

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL++ E GG T+FP                      L V P+RG+A+ F   RPD + 
Sbjct: 227 VIYLNEPERGGATIFPEVP-------------------LQVVPRRGNAVFFSYERPDPST 267

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
              +LHGG PV+ G KW ATKW+   E+
Sbjct: 268 --RTLHGGAPVLAGEKWIATKWLREREF 293


>gi|24417248|gb|AAN60234.1| unknown [Arabidopsis thaliana]
          Length = 190

 Score =  157 bits (398), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 71/132 (53%), Positives = 99/132 (75%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PR F+Y  FLS  EC++ I LAK  + KS V D+ +G+S +S VRTSSG FL + Q
Sbjct: 59  LSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGMFLSKRQ 118

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+  +E ++A +TF+P E+GE +Q+LHYE GQKY+ H+DYF D+ N + GG R+AT+L
Sbjct: 119 DDIVSNVEAKLAAWTFLPEENGESMQILHYENGQKYEPHFDYFHDQANLELGGHRIATVL 178

Query: 200 MYLSDVEEGGET 211
           MYLS+VE+GGET
Sbjct: 179 MYLSNVEKGGET 190


>gi|423518940|ref|ZP_17495421.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
 gi|401159995|gb|EJQ67374.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
          Length = 216

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 89/206 (43%), Positives = 119/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC  LI+L+K  M +S     K G S+D + +RTSSG FL+  ++ 
Sbjct: 38  EPLIVVLANVLSDEECAELIELSKNNMKRS-----KVGSSRDVNDIRTSSGAFLE--ENE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+  T +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D  L+  
Sbjct: 150 LNDVEEGGETFFPQLN-------------------LSVHPRKGMAVYFEYFYQDQLLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|330821584|ref|YP_004350446.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           gladioli BSR3]
 gi|327373579|gb|AEA64934.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           gladioli BSR3]
          Length = 302

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 84/208 (40%), Positives = 115/208 (55%), Gaps = 24/208 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P A +   FLS  EC  LI+LA+P + +STVVD  TG++  +  R+S G F + G+  +I
Sbjct: 102 PAAVLLDGFLSAGECRQLIELARPRLNRSTVVDPVTGRNIVAGHRSSDGMFFRLGETPLI 161

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-----DEFNTKNGGQRMATL 198
             IE+RIA  T  P+E+GEG+Q+LHYE G +   H DY +     +  +    GQR+ TL
Sbjct: 162 SRIEQRIAALTGFPVENGEGLQMLHYEAGAESTPHVDYLVPGNPANAESIARSGQRVGTL 221

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           LMYL+DVE GGET+FP                   + G SV P+RG A  F         
Sbjct: 222 LMYLNDVESGGETLFP-------------------QVGCSVVPRRGQAFYFEYGNGSGRS 262

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           DP+SLH   P+  G+KW ATKW+    +
Sbjct: 263 DPASLHASSPIGSGDKWVATKWIRTRRF 290


>gi|423368291|ref|ZP_17345723.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
 gi|401081042|gb|EJP89322.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
          Length = 216

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 89/206 (43%), Positives = 119/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC  LI+L+K  M +S     K G S+D + +RTSSG FL+  ++ 
Sbjct: 38  EPLIVVLANVLSDEECAELIELSKNNMKRS-----KVGSSRDVNDIRTSSGAFLE--ENE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+  T +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D  L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQLLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|388520887|gb|AFK48505.1| unknown [Lotus japonicus]
          Length = 187

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 76/136 (55%), Positives = 100/136 (73%), Gaps = 8/136 (5%)

Query: 156 IP-MEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFP 214
           IP +E+GE IQ+LHYE G+KY+ HYDYF D  N   GG R+AT+LMYLSDV +GGET+FP
Sbjct: 3   IPSIENGESIQILHYENGRKYEPHYDYFHDRANQFMGGHRIATVLMYLSDVGKGGETIFP 62

Query: 215 AANANFTSVR---WWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIR 271
            A +  +  +   W    SEC  +G +VKP++GDALLF+S+  +AT D +SLHG CPVI 
Sbjct: 63  NAESKLSQPKDESW----SECAHKGYAVKPRKGDALLFFSLHLNATTDSNSLHGSCPVIE 118

Query: 272 GNKWSATKWMHVHEYK 287
           G KWSATKW+HV +++
Sbjct: 119 GEKWSATKWIHVSDFE 134


>gi|120609859|ref|YP_969537.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
 gi|120588323|gb|ABM31763.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
          Length = 309

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 82/210 (39%), Positives = 122/210 (58%), Gaps = 26/210 (12%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR  ++ N LS  EC+ +ID A+P M +S  V ++TG  + +  RTS+G F +R ++ +
Sbjct: 121 QPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGGEEVNDDRTSNGMFFQREENPV 180

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMAT 197
           +  +E RIA     P+E+GEG+QVLHY  G +Y  HYDYF   E  T    + GGQR+AT
Sbjct: 181 VARLEARIARLVNWPLENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPTILRRGGQRVAT 240

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           +++YL+D E+GG T FP  +                   L V P+RG+A+ F   RP  +
Sbjct: 241 IVIYLNDPEKGGGTTFPDVH-------------------LEVAPRRGNAVFFSYERPHPS 281

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
               +LHGG PV+ G+KW ATKW+    ++
Sbjct: 282 T--RTLHGGAPVVAGDKWIATKWLRERRFE 309


>gi|333912984|ref|YP_004486716.1| procollagen-proline dioxygenase [Delftia sp. Cs1-4]
 gi|333743184|gb|AEF88361.1| Procollagen-proline dioxygenase [Delftia sp. Cs1-4]
          Length = 294

 Score =  157 bits (398), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 84/208 (40%), Positives = 119/208 (57%), Gaps = 26/208 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V+ N LS  EC+ +I  A+P M +S  V +++G  + +  RTS+G F +RG+  I+
Sbjct: 107 PRIVVFGNLLSHEECDAIIAAARPRMARSLTVATQSGGEEINDDRTSNGMFFQRGETGIV 166

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD-EFNT----KNGGQRMATL 198
             +E+RIA     P++HGEG+QVLHY  G +Y  H+DYF   E  T    K GGQR+ TL
Sbjct: 167 SQLEERIARLLRWPLDHGEGLQVLHYGPGAEYKPHHDYFAPGEPGTPTILKRGGQRVGTL 226

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL++ E GG T+FP                      L V P+RG+A+ F   RPD + 
Sbjct: 227 VIYLNEPERGGATIFPEVP-------------------LQVVPRRGNAVFFSYERPDPST 267

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
              +LHGG PV+ G KW ATKW+   E+
Sbjct: 268 --RTLHGGAPVLAGEKWIATKWLREREF 293


>gi|229061929|ref|ZP_04199257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
 gi|228717372|gb|EEL69042.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
          Length = 216

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 89/206 (43%), Positives = 119/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC  LI+L+K  M +S     K G S+D + +RTSSG FL+  ++ 
Sbjct: 38  EPLIVVLANVLSDEECAELIELSKSNMKRS-----KVGSSRDVNDIRTSSGAFLE--ENE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+  T +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSITNVPVVHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D  L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQLLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|91789558|ref|YP_550510.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
           JS666]
 gi|91698783|gb|ABE45612.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
           JS666]
          Length = 277

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 91/230 (39%), Positives = 133/230 (57%), Gaps = 32/230 (13%)

Query: 66  SSIAEEKGEQWTEIVA--WEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSK 123
           +S++ + G++W +++     P   V+ N LS +ECE L+++A+P + +S  V+ KTG  +
Sbjct: 70  ASMSMDAGDRWVDVLQRLQLPDLVVFGNLLSDSECEALMEVAQPRLARSLTVNIKTGGEE 129

Query: 124 DSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF- 182
            +R RTS G F  RG++ +++ +E RIA     P++ GEG+QVL Y  G +Y  HYDYF 
Sbjct: 130 RNRDRTSQGMFFARGENPLVQRVEARIARLVGWPVDRGEGLQVLRYRQGAQYKPHYDYFD 189

Query: 183 LDEFNT----KNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLS 238
             E  T    + GGQR+ATL+MYL++ E+GG TVFP                     GL 
Sbjct: 190 PAEPGTPAILQRGGQRVATLIMYLNEPEQGGATVFP-------------------DIGLQ 230

Query: 239 VKPKRGDALLFWSMRPDATLDPSSL--HGGCPVIRGNKWSATKWMHVHEY 286
           V P+RG A+ F    P A  +P+SL  HGG PV  G KW ATKW+   E+
Sbjct: 231 VTPRRGTAVFF--SYPAA--NPASLTRHGGEPVKAGEKWIATKWLREREF 276


>gi|326316001|ref|YP_004233673.1| procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
           ATCC 19860]
 gi|323372837|gb|ADX45106.1| Procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
           ATCC 19860]
          Length = 298

 Score =  157 bits (397), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 82/210 (39%), Positives = 122/210 (58%), Gaps = 26/210 (12%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR  ++ N LS  EC+ +ID A+P M +S  V ++TG  + +  RTS+G F +R ++ +
Sbjct: 110 QPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGGEEVNDDRTSNGMFFQREENPM 169

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMAT 197
           +  +E RIA     P+E+GEG+QVLHY  G +Y  HYDYF   E  T    + GGQR+AT
Sbjct: 170 VAKLEARIARLVNWPLENGEGLQVLHYRPGAEYKPHYDYFDPTEPGTPTILRRGGQRVAT 229

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           +++YL+D E+GG T FP  +                   L V P+RG+A+ F   RP  +
Sbjct: 230 IVIYLNDPEKGGGTTFPDVH-------------------LEVAPRRGNAVFFSYERPHPS 270

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
               +LHGG PV+ G+KW ATKW+    ++
Sbjct: 271 T--RTLHGGAPVVAGDKWIATKWLRERRFE 298


>gi|423470454|ref|ZP_17447198.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
 gi|402436583|gb|EJV68613.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
          Length = 216

 Score =  157 bits (396), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 88/206 (42%), Positives = 120/206 (58%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K     + +  SK G S+D + +RTSSG FL+  ++ 
Sbjct: 38  EPLIVVLANVLSDEECDGLIELSK-----NKIERSKIGSSRDVNDIRTSSGAFLE--ENE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+WM    YK
Sbjct: 191 TLHGGAPVTKGEKWVATQWMRRGTYK 216


>gi|171059332|ref|YP_001791681.1| procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
 gi|170776777|gb|ACB34916.1| Procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
          Length = 287

 Score =  157 bits (396), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 88/205 (42%), Positives = 112/205 (54%), Gaps = 28/205 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR  V+  FLS  EC+ L+ LA+P + +S  VD+ TG S+ +  RTS G F  RG+  +
Sbjct: 99  DPRVVVFGGFLSHDECDALVALAQPRLARSETVDNDTGGSEVNEARTSQGMFFMRGEGEL 158

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT------KNGGQRMA 196
           I  IE RIA     P+E+GEG+QVLHY  G +Y  HYDYF D          K GGQR+ 
Sbjct: 159 ISRIEARIAALLDWPLENGEGVQVLHYRPGAEYKPHYDYF-DPAQPGTPTILKRGGQRVG 217

Query: 197 TLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
           TL+MYL+  E GG T FP  N                   L V P +G+A+ F   R  A
Sbjct: 218 TLVMYLNTPERGGGTTFPDVN-------------------LEVAPIKGNAVFFSYER--A 256

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWM 281
                SLHGG PV+ G KW ATKW+
Sbjct: 257 HPSTRSLHGGAPVLAGEKWVATKWL 281


>gi|229071739|ref|ZP_04204954.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
 gi|228711334|gb|EEL63294.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
          Length = 232

 Score =  156 bits (395), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 88/206 (42%), Positives = 119/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+++K  M +S     K G S+D + +RTSSG FL+   + 
Sbjct: 54  EPLIVVLANVLSDEECDELIEMSKNKMERS-----KIGSSRDVNDIRTSSGAFLE--DNE 106

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 107 LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 165

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 166 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 206

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+WM    YK
Sbjct: 207 TLHGGAPVTKGEKWIATQWMRRGTYK 232


>gi|241664232|ref|YP_002982592.1| procollagen-proline dioxygenase [Ralstonia pickettii 12D]
 gi|309783051|ref|ZP_07677770.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
 gi|404397139|ref|ZP_10988932.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
 gi|240866259|gb|ACS63920.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12D]
 gi|308918159|gb|EFP63837.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
 gi|348610674|gb|EGY60360.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
          Length = 288

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 82/209 (39%), Positives = 116/209 (55%), Gaps = 24/209 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  ++ +FLS  EC+ LI + +  + +S VV+  TG+      RTS G   + G+  +I
Sbjct: 96  PRIVLFQHFLSDQECDELIAIGRNRLKRSPVVNPDTGEENLISARTSQGGMFQVGEHPLI 155

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-----DEFNTKNGGQRMATL 198
             IE RIA    +P+EHGEG QVL+Y+ G +Y  H+D+F      +    + GGQR+AT+
Sbjct: 156 AKIEARIAQAVGVPVEHGEGFQVLNYQPGGEYQPHFDFFNPGRSGEARQLEVGGQRVATM 215

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+ V+ GG T FP                   K GL V P +G+A+ F   RPD TL
Sbjct: 216 VIYLNSVQAGGATGFP-------------------KLGLEVAPVKGNAVFFVYKRPDGTL 256

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           D  +LH G PV RG KW ATKW+    Y+
Sbjct: 257 DEDTLHAGLPVERGEKWIATKWLRERPYR 285


>gi|403234403|ref|ZP_10912989.1| Procollagen-proline dioxygenase [Bacillus sp. 10403023]
          Length = 217

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 83/199 (41%), Positives = 118/199 (59%), Gaps = 25/199 (12%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           EP   V  N LS  EC+ LI L+K  + +S + ++         +RTSS TF++  ++ I
Sbjct: 38  EPLIVVLGNVLSDEECDELIRLSKDRINRSKIANANV-----DNMRTSSSTFIEENENII 92

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           +  IEKRI+    IP E+GEG+Q+L+Y+VGQ+Y +H+D+F    N  N   R++TL+MYL
Sbjct: 93  VSRIEKRISQIMNIPTEYGEGLQILNYQVGQEYKSHFDFFSSPHNAIN-NPRISTLVMYL 151

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           SDVE+GGET FP                   K   SV P++G A+ F     D TL+  +
Sbjct: 152 SDVEQGGETYFP-------------------KLHFSVSPQKGMAVYFEYFYNDQTLNELT 192

Query: 263 LHGGCPVIRGNKWSATKWM 281
           LHGG PVI G+KW+AT+WM
Sbjct: 193 LHGGAPVIVGDKWAATQWM 211


>gi|229168980|ref|ZP_04296697.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
 gi|423591765|ref|ZP_17567796.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
 gi|228614572|gb|EEK71680.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
 gi|401231898|gb|EJR38400.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
          Length = 216

 Score =  156 bits (394), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 89/206 (43%), Positives = 119/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC  LI+L+K  M +S     K G S+D + +RTSSG FL+  ++ 
Sbjct: 38  EPLIVVLANVLSDEECAELIELSKSNMKRS-----KVGSSRDVNDIRTSSGAFLE--ENE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+  T +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTWKIEKRISSITNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D  L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQLLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|42783360|ref|NP_980607.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10987]
 gi|42739288|gb|AAS43215.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           ATCC 10987]
          Length = 216

 Score =  155 bits (393), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 118/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  + +S     K G S+D + +RTSSG FL    + 
Sbjct: 38  EPLIVVLGNVLSDEECDELIELSKNKLARS-----KVGSSRDVNDIRTSSGAFLD--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+WM    Y+
Sbjct: 191 TLHGGAPVTKGEKWIATQWMRRGTYR 216


>gi|206978009|ref|ZP_03238895.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           H3081.97]
 gi|423373947|ref|ZP_17351286.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
 gi|206743809|gb|EDZ55230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           H3081.97]
 gi|401094762|gb|EJQ02832.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
          Length = 216

 Score =  155 bits (393), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 118/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  + +S     K G S+D + +RTSSG FL    D 
Sbjct: 38  EPLIVVLGNVLSDEECDKLIELSKNKLARS-----KVGSSRDVNDIRTSSGAFLD--DDE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    Y+
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYR 216


>gi|209522122|ref|ZP_03270769.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
 gi|209497434|gb|EDZ97642.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
          Length = 296

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 91/208 (43%), Positives = 117/208 (56%), Gaps = 24/208 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P A    +FLS  ECE LI LA+P + +STVVD  TG++  +  R+S G F + G+  +I
Sbjct: 102 PAAVHLADFLSADECEQLIALAQPRLDRSTVVDPVTGRNVVAGHRSSHGMFFRLGETPLI 161

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL--DEFNTKN---GGQRMATL 198
             IE RIA  T  P+E+GEG+Q+LHYE G +   H DY +  +E N ++    GQRM TL
Sbjct: 162 VRIEARIAALTGTPVENGEGLQMLHYEEGAESTPHVDYLITGNEANRESIARSGQRMGTL 221

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           LMYL DVE GGETVFP                   + G SV P+RG AL F         
Sbjct: 222 LMYLKDVEGGGETVFP-------------------QIGWSVAPQRGHALYFEYGNRFGLC 262

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           DPSSLH   P+  G+KW ATKW+    +
Sbjct: 263 DPSSLHASTPLRVGDKWVATKWIRTRRF 290


>gi|414587755|tpg|DAA38326.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
          Length = 244

 Score =  155 bits (393), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 75/139 (53%), Positives = 98/139 (70%), Gaps = 2/139 (1%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+++W PR  V+HNFLS  EC+YL+ +A+P +  STVVD  TG+   S VRTSSG F+  
Sbjct: 58  EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSDVRTSSGMFVNS 117

Query: 138 GQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
            + +  +++ IEKRI+ F+ IP E+GE IQVL YE  Q Y  H+DYF D FN K GGQR+
Sbjct: 118 EERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSDTFNLKRGGQRV 177

Query: 196 ATLLMYLSDVEEGGETVFP 214
           AT+LMYL+D   GGET FP
Sbjct: 178 ATMLMYLTDGVVGGETHFP 196


>gi|423478381|ref|ZP_17455096.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
 gi|402428543|gb|EJV60640.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
          Length = 216

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 88/206 (42%), Positives = 118/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  M +S     K G S+D + +RTSSG FL    + 
Sbjct: 38  EPLIVVLGNVLSDEECDELIELSKSKMKRS-----KVGSSRDVNDIRTSSGAFLD--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGAYK 216


>gi|319792090|ref|YP_004153730.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
 gi|315594553|gb|ADU35619.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
          Length = 280

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 86/209 (41%), Positives = 116/209 (55%), Gaps = 26/209 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V+ N LS  ECE LI  A+  + +S  V+++TG    +  RTS G F +RG++ I+
Sbjct: 93  PRVIVFGNLLSTEECEGLIAAARVRLARSLTVETRTGGEVLNVDRTSDGMFFERGENEIV 152

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMATL 198
             +E+R+A     P+E+GEG+Q+L Y  G +Y  HYDYF  +E  T    K GGQR+ATL
Sbjct: 153 ARLEQRLAMLLRWPLEYGEGLQILRYAPGAQYRPHYDYFDPNEPGTPTILKRGGQRVATL 212

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL + E+GG T FP                     GL V P RG  + F   RPD   
Sbjct: 213 VMYLQEPEQGGATTFPDV-------------------GLEVAPVRGTGVFFSYDRPDPVT 253

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
              +LHGG PV+ G KW ATKW+   E+K
Sbjct: 254 --RTLHGGAPVLAGEKWVATKWLREREFK 280


>gi|423400914|ref|ZP_17378087.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
 gi|401653904|gb|EJS71447.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
          Length = 216

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 88/206 (42%), Positives = 118/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  M +S     K G S+D + +RTSSG FL    + 
Sbjct: 38  EPLIVVLGNVLSDEECDELIELSKSKMKRS-----KVGSSRDVNDIRTSSGAFLD--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGAYK 216


>gi|423558182|ref|ZP_17534484.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
 gi|401191450|gb|EJQ98472.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
          Length = 216

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 120/206 (58%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K     + +  SK G S+D + +RTSSG FL+  ++ 
Sbjct: 38  EPLIVVLANVLSDEECDGLIELSK-----NKIKRSKIGSSRDVNDIRTSSGAFLE--ENE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWVATQWVRRGTYK 216


>gi|319652240|ref|ZP_08006358.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
 gi|317396063|gb|EFV76783.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
          Length = 216

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 81/199 (40%), Positives = 116/199 (58%), Gaps = 24/199 (12%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           EP   +  N LS  EC+ LI  +K  M +S V +S     +   +RTSS TF   G++ I
Sbjct: 37  EPLIVILGNVLSDEECDQLIQQSKDRMQRSKVANSL----EVDELRTSSSTFFHEGENEI 92

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           +  IEKRI+    IP+EHGEG+Q+L+Y++GQ+Y AH+D+F    +      R++TL+MYL
Sbjct: 93  VARIEKRISQIMNIPVEHGEGLQILNYKIGQEYKAHFDFF-SSTSRAASNPRISTLVMYL 151

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           +DVE+GGET FP  N                    SV P++G A+ F     D  L+  +
Sbjct: 152 NDVEQGGETYFPKLN-------------------FSVSPQKGMAVYFEYFYNDQNLNDLT 192

Query: 263 LHGGCPVIRGNKWSATKWM 281
           LHGG PV+ G+KW+AT+WM
Sbjct: 193 LHGGAPVVMGDKWAATQWM 211


>gi|423452458|ref|ZP_17429311.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
 gi|401140096|gb|EJQ47653.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
          Length = 216

 Score =  155 bits (392), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 120/206 (58%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K     + +  SK G S+D + +RTSSG FL+  ++ 
Sbjct: 38  EPLIVVLANVLSDEECDGLIELSK-----NKIERSKIGSSRDVNDIRTSSGAFLE--ENE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQEYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWVATQWVRRGTYK 216


>gi|423483822|ref|ZP_17460512.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
 gi|401141373|gb|EJQ48928.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
          Length = 216

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 85/206 (41%), Positives = 121/206 (58%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N +S  EC+ LI+++K  + +ST+     G S+D + +RTSSG FL+  ++ 
Sbjct: 38  EPLIVVLGNVISDEECDELIEMSKNKIKRSTI-----GSSRDVNDIRTSSGAFLE--ENE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|30022316|ref|NP_833947.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
 gi|229129515|ref|ZP_04258486.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
 gi|29897873|gb|AAP11148.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
 gi|228654120|gb|EEL09987.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
          Length = 232

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 119/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+++K  M +S     K G S+D + +RTSSG FL+   ++
Sbjct: 54  EPLIVVLANVLSDEECDELIEMSKNKMERS-----KIGSSRDVNDIRTSSGAFLE--DNK 106

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 107 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 165

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 166 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 206

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 207 TLHGGAPVTKGEKWIATQWVRRGTYK 232


>gi|206971296|ref|ZP_03232247.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           AH1134]
 gi|229081494|ref|ZP_04213993.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
 gi|423411965|ref|ZP_17389085.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
 gi|423432249|ref|ZP_17409253.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
 gi|206734068|gb|EDZ51239.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           AH1134]
 gi|228701801|gb|EEL54288.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
 gi|401104033|gb|EJQ12010.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
 gi|401117005|gb|EJQ24843.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
          Length = 216

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 119/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+++K  M +S     K G S+D + +RTSSG FL+   + 
Sbjct: 38  EPLIVVLANVLSDEECDELIEMSKNKMERS-----KIGSSRDVNDIRTSSGAFLE--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|186474111|ref|YP_001861453.1| procollagen-proline dioxygenase [Burkholderia phymatum STM815]
 gi|184196443|gb|ACC74407.1| Procollagen-proline dioxygenase [Burkholderia phymatum STM815]
          Length = 305

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 82/208 (39%), Positives = 119/208 (57%), Gaps = 24/208 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P+  V+ + LS+ EC+ LI+ A+  + +ST V+ ++G+    ++RTS G + +R +D  I
Sbjct: 116 PQVIVFDDVLSRDECDELIERARHRLKRSTTVNPESGREDVIQLRTSEGFWFQRCEDAFI 175

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE-----FNTKNGGQRMATL 198
             +++RI+     P+EHGEG+Q+LHY  G +Y  H+DYF         +T  GGQR+ATL
Sbjct: 176 ERLDRRISALMNWPLEHGEGLQILHYTKGGEYRPHFDYFPPSQSGSVLHTSRGGQRVATL 235

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YLSDV  GGETVFP A                   GL+V  ++G A+ F  +     L
Sbjct: 236 IVYLSDVAGGGETVFPNA-------------------GLAVMARQGGAIYFRYLNGHRQL 276

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           DP +LHGG PV  G KW  TKWM    Y
Sbjct: 277 DPLTLHGGAPVTNGEKWIMTKWMRERPY 304


>gi|365158975|ref|ZP_09355162.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
 gi|363625964|gb|EHL76973.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
          Length = 248

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 119/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+++K  M +S     K G S+D + +RTSSG FL+   + 
Sbjct: 70  EPLIVVLANVLSDEECDELIEMSKNKMERS-----KIGSSRDVNDIRTSSGAFLE--DNE 122

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 123 LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 181

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 182 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 222

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248


>gi|423426372|ref|ZP_17403403.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
 gi|401111119|gb|EJQ19018.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
          Length = 248

 Score =  155 bits (391), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 119/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+++K  M +S     K G S+D + +RTSSG FL+   + 
Sbjct: 70  EPLIVVLANVLSDEECDELIEISKNKMERS-----KIGSSRDVNDIRTSSGAFLE--DNE 122

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 123 LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 181

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 182 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 222

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248


>gi|423604110|ref|ZP_17580003.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
 gi|401245796|gb|EJR52149.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
          Length = 216

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 118/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  + +S     K G S+D + +RTSSG FL    + 
Sbjct: 38  EPLIVVLGNVLSDEECDELIELSKNKLARS-----KVGSSRDVNDIRTSSGAFLD--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFHQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    Y+
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYR 216


>gi|228902749|ref|ZP_04066896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
           4222]
 gi|228967277|ref|ZP_04128313.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           sotto str. T04001]
 gi|402564350|ref|YP_006607074.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
           [Bacillus thuringiensis HD-771]
 gi|434377355|ref|YP_006611999.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           thuringiensis HD-789]
 gi|228792646|gb|EEM40212.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           sotto str. T04001]
 gi|228856936|gb|EEN01449.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
           4222]
 gi|401793002|gb|AFQ19041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           thuringiensis HD-771]
 gi|401875912|gb|AFQ28079.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           thuringiensis HD-789]
          Length = 216

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 118/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+++K  M +S     K G S+D + +RTSSG FL+   + 
Sbjct: 38  EPLIVVLANVLSDEECDKLIEMSKNKMKRS-----KVGSSRDVNDIRTSSGAFLE--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|389770666|ref|ZP_10192118.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
 gi|388429637|gb|EIL86932.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
          Length = 286

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 82/211 (38%), Positives = 115/211 (54%), Gaps = 24/211 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P   V    LS  EC+ LI  A   + +ST+VD  TG+ +    R+S GTF +   D  
Sbjct: 94  QPVLAVLDGVLSHEECDELIRRAAAKLQRSTIVDPTTGKHETIADRSSEGTFFEINADDF 153

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-----DEFNTKNGGQRMAT 197
           I  +++RI+    +P++HGEG+Q+LHY  G +Y  H+D+F             GGQR++T
Sbjct: 154 IARLDRRISALMNLPVDHGEGLQILHYGPGGEYKPHFDFFPPGDPGSAVQMATGGQRVST 213

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           L+MYL++VE+GG T+FP                   + GLSV PK+G A+ F        
Sbjct: 214 LVMYLNEVEDGGATIFP-------------------ELGLSVLPKKGSAVYFEYTNSRGQ 254

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           LDP +LHGG PV+RG KW  TKWM    Y A
Sbjct: 255 LDPRTLHGGAPVLRGEKWIVTKWMRQRRYGA 285


>gi|423612451|ref|ZP_17588312.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
 gi|401246040|gb|EJR52392.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
          Length = 254

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 88/206 (42%), Positives = 119/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  M +S     K G S++ + +RTSSG FL+  ++ 
Sbjct: 76  EPLIVVLANVLSDEECDELIELSKNKMERS-----KIGSSRNVNDIRTSSGAFLE--ENE 128

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
               IEKRI+  T +P+ HGEG+ +L+Y V Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 129 FTSKIEKRISSITNVPVAHGEGLHILNYAVDQEYKAHYDYFA-EHSRSAANNRISTLVMY 187

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 188 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 228

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+WM    YK
Sbjct: 229 TLHGGAPVTKGEKWIATQWMRRGTYK 254


>gi|75760922|ref|ZP_00740932.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           israelensis ATCC 35646]
 gi|423385740|ref|ZP_17362996.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
 gi|423561293|ref|ZP_17537569.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
 gi|74491592|gb|EAO54798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           israelensis ATCC 35646]
 gi|401201550|gb|EJR08415.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
 gi|401635796|gb|EJS53551.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
          Length = 248

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 118/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+++K  M +S     K G S+D + +RTSSG FL+   + 
Sbjct: 70  EPLIVVLANVLSDEECDKLIEMSKNKMKRS-----KVGSSRDVNDIRTSSGAFLE--DNE 122

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 123 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 181

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 182 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 222

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248


>gi|222111817|ref|YP_002554081.1| procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
 gi|221731261|gb|ACM34081.1| Procollagen-proline dioxygenase [Acidovorax ebreus TPSY]
          Length = 289

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 82/209 (39%), Positives = 119/209 (56%), Gaps = 26/209 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  ++ N LS  EC+ +ID A+P M +S  V + TG  + +  RTS G F +RG+  ++
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMARSLTVQTTTGGEEVNADRTSDGMFFQRGETPVV 161

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMATL 198
           + +E+RIA     P+++GEG+QVLHY  G +Y  HYDYF  D+  T    + GGQR+ATL
Sbjct: 162 QRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQPGTSTIVRRGGQRVATL 221

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL++  +GG T FP                      L V P++G+A+ F   RP  + 
Sbjct: 222 VIYLNNPRKGGGTTFPDVP-------------------LEVAPRQGNAVFFSYERPHPST 262

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
              +LHGG  VI G KW ATKW+   E+K
Sbjct: 263 --RTLHGGASVIEGEKWIATKWLREREFK 289


>gi|30264308|ref|NP_846685.1| prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. Ames]
 gi|47529753|ref|YP_021102.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. 'Ames
           Ancestor']
 gi|65321616|ref|ZP_00394575.1| hypothetical protein Bant_01005109 [Bacillus anthracis str. A2012]
 gi|165873278|ref|ZP_02217887.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0488]
 gi|167634610|ref|ZP_02392930.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0442]
 gi|167638693|ref|ZP_02396969.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0193]
 gi|170687507|ref|ZP_02878724.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0465]
 gi|170709341|ref|ZP_02899757.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0389]
 gi|177655890|ref|ZP_02937082.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0174]
 gi|190566156|ref|ZP_03019075.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Tsiankovskii-I]
 gi|196034803|ref|ZP_03102210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           W]
 gi|227817011|ref|YP_002817020.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           anthracis str. CDC 684]
 gi|228929280|ref|ZP_04092307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pondicheriensis BGSC 4BA1]
 gi|228935557|ref|ZP_04098373.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           andalousiensis BGSC 4AW1]
 gi|229123754|ref|ZP_04252949.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
 gi|229604260|ref|YP_002868528.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0248]
 gi|254683996|ref|ZP_05147856.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. CNEVA-9066]
 gi|254721830|ref|ZP_05183619.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A1055]
 gi|254736344|ref|ZP_05194050.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Western North America USA6153]
 gi|254741382|ref|ZP_05199069.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Kruger B]
 gi|254753983|ref|ZP_05206018.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Vollum]
 gi|254757854|ref|ZP_05209881.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Australia 94]
 gi|386738126|ref|YP_006211307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
 gi|421506493|ref|ZP_15953416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
 gi|421638315|ref|ZP_16078911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
 gi|30258953|gb|AAP28171.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Ames]
 gi|47504901|gb|AAT33577.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. 'Ames Ancestor']
 gi|164710995|gb|EDR16563.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0488]
 gi|167513541|gb|EDR88911.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0193]
 gi|167530062|gb|EDR92797.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0442]
 gi|170125767|gb|EDS94678.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0389]
 gi|170668702|gb|EDT19448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0465]
 gi|172079923|gb|EDT65028.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0174]
 gi|190563075|gb|EDV17041.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Tsiankovskii-I]
 gi|195992342|gb|EDX56303.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           W]
 gi|227005734|gb|ACP15477.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. CDC 684]
 gi|228659889|gb|EEL15534.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
 gi|228824095|gb|EEM69911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           andalousiensis BGSC 4AW1]
 gi|228830570|gb|EEM76180.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pondicheriensis BGSC 4BA1]
 gi|229268668|gb|ACQ50305.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0248]
 gi|384387978|gb|AFH85639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
 gi|401823486|gb|EJT22633.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
 gi|403394741|gb|EJY91981.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
          Length = 216

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 117/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  + +S     K G S+D + +RTSSG FL    + 
Sbjct: 38  EPLIVVLGNVLSDEECDELIELSKSKLARS-----KVGSSRDVNDIRTSSGAFLD--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|229180513|ref|ZP_04307855.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
 gi|228602937|gb|EEK60416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
          Length = 232

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 119/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+++K  M +S     K G S+D + +RTSSG FL+   + 
Sbjct: 54  EPLIVVLANVLSDEECDELIEMSKNKMERS-----KIGSSRDVNDIRTSSGAFLE--DNE 106

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 107 LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 165

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 166 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 206

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 207 TLHGGAPVTKGEKWIATQWVRRGTYK 232


>gi|228960501|ref|ZP_04122151.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pakistani str. T13001]
 gi|229047930|ref|ZP_04193506.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
 gi|423630961|ref|ZP_17606708.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
 gi|423650103|ref|ZP_17625673.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
 gi|228723387|gb|EEL74756.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
 gi|228799198|gb|EEM46165.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pakistani str. T13001]
 gi|401264328|gb|EJR70440.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
 gi|401282521|gb|EJR88420.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
          Length = 248

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 118/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+++K  M +S     K G S+D + +RTSSG FL+   + 
Sbjct: 70  EPLIVVLANVLSDEECDELIEMSKNKMKRS-----KVGSSRDVNDIRTSSGAFLE--DNE 122

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 123 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 181

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 182 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 222

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248


>gi|423527903|ref|ZP_17504348.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
 gi|402451566|gb|EJV83385.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
          Length = 248

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 118/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+++K  M +S     K G S+D + +RTSSG FL+   + 
Sbjct: 70  EPLIVVLANVLSDEECDKLIEMSKNKMKRS-----KVGSSRDVNDIRTSSGAFLE--DNE 122

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 123 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 181

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 182 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 222

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248


>gi|423358724|ref|ZP_17336227.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
 gi|401084596|gb|EJP92842.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
          Length = 248

 Score =  154 bits (390), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 118/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+++K  M +S     K G S+D + +RTSSG FL+   + 
Sbjct: 70  EPLIVVLANVLSDEECDKLIEMSKNKMKRS-----KVGSSRDVNDIRTSSGAFLE--DNE 122

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 123 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 181

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 182 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 222

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248


>gi|388519941|gb|AFK48032.1| unknown [Lotus japonicus]
          Length = 151

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 77/143 (53%), Positives = 96/143 (67%), Gaps = 12/143 (8%)

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           ++  IEKRI+ ++ +P+E+GE +QVL YE  Q Y  H+DYF D FN K GGQR+AT+LMY
Sbjct: 12  MVHAIEKRISVYSQVPIENGELMQVLRYEKNQYYKPHHDYFADTFNLKRGGQRIATMLMY 71

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGK---QGLSVKPKRGDALLFWSMRPDATL 258
           LSD  EGGET FP   +   S         CG    +GLSVKP +G+A+LFWSM  D   
Sbjct: 72  LSDNVEGGETYFPNIGSGQCS---------CGGKTVEGLSVKPTKGNAVLFWSMGLDGQS 122

Query: 259 DPSSLHGGCPVIRGNKWSATKWM 281
           DP S+HGGC V+ G KWSATKWM
Sbjct: 123 DPLSVHGGCEVLAGEKWSATKWM 145


>gi|49480949|ref|YP_038297.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis serovar
           konkukian str. 97-27]
 gi|49332505|gb|AAT63151.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis serovar
           konkukian str. 97-27]
          Length = 232

 Score =  154 bits (390), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 117/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  + +S     K G S+D + +RTSSG FL    + 
Sbjct: 54  EPLIVVLGNVLSDEECDELIELSKNKLARS-----KVGSSRDVNDIRTSSGAFLD--DNE 106

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 107 LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 165

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 166 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 206

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 207 TLHGGAPVTKGEKWIATQWVRRGTYK 232


>gi|423457579|ref|ZP_17434376.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
 gi|401147963|gb|EJQ55456.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
          Length = 216

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 118/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  + +S     K G S+D + +RTSSG FL+   + 
Sbjct: 38  EPLIVVLGNVLSDEECDELIELSKSKLARS-----KVGSSRDVNDIRTSSGAFLE--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTVKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|423657194|ref|ZP_17632493.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
 gi|401289937|gb|EJR95641.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
          Length = 248

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 118/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+++K  M +S     K G S+D + +RTSSG FL+   + 
Sbjct: 70  EPLIVVLANVLSDEECDELIEMSKNKMERS-----KIGSSRDVNDIRTSSGAFLE--DNE 122

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 123 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 181

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 182 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 222

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248


>gi|301055727|ref|YP_003793938.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus biovar
           anthracis str. CI]
 gi|300377896|gb|ADK06800.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus biovar
           anthracis str. CI]
          Length = 216

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 117/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  + +S     K G S+D + +RTSSG FL    + 
Sbjct: 38  EPLIVVLGNVLSDEECDELIELSKNKLARS-----KVGSSRDVNDIRTSSGAFLD--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|402555628|ref|YP_006596899.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus FRI-35]
 gi|401796838|gb|AFQ10697.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus FRI-35]
          Length = 216

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 117/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC  LI+L+K  + +S     K G S+D + +RTSSG FL    + 
Sbjct: 38  EPLIVVLGNVLSDEECGELIELSKNKLARS-----KVGSSRDVNDIRTSSGAFLD--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+WM    Y+
Sbjct: 191 TLHGGAPVTKGEKWIATQWMRRGTYR 216


>gi|149180354|ref|ZP_01858859.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
 gi|148852546|gb|EDL66691.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
          Length = 212

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 80/205 (39%), Positives = 122/205 (59%), Gaps = 28/205 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           EP   V  N LS  EC+ LI L+K  + +S + +++     ++ +RTSS TF++ G+  +
Sbjct: 36  EPLIVVLGNVLSDEECDALIGLSKDKLKRSKIGNTRN----ENDMRTSSSTFMEEGESEV 91

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           +  +EKRI+    IP E+GEG+Q+L+Y++GQ+Y AH+D+F +  N      R++TL+MYL
Sbjct: 92  VTRVEKRISQIMNIPYENGEGLQILNYKIGQEYKAHFDFFKNASNP-----RISTLVMYL 146

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           +DVEEGGET FP  N                    SV P++G A+ F     +  L+  +
Sbjct: 147 NDVEEGGETYFPKLN-------------------FSVSPQKGMAVYFEYFYDNQELNDLT 187

Query: 263 LHGGCPVIRGNKWSATKWMHVHEYK 287
           LHGG PVI G+KW+AT+WM   + K
Sbjct: 188 LHGGAPVIIGDKWAATQWMRRKQVK 212


>gi|196041590|ref|ZP_03108882.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           NVH0597-99]
 gi|218905373|ref|YP_002453207.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           AH820]
 gi|225866219|ref|YP_002751597.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           03BB102]
 gi|423550018|ref|ZP_17526345.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
 gi|196027578|gb|EDX66193.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           NVH0597-99]
 gi|218537435|gb|ACK89833.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           AH820]
 gi|225786013|gb|ACO26230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           03BB102]
 gi|401189634|gb|EJQ96684.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
          Length = 216

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 117/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  + +S     K G S+D + +RTSSG FL    + 
Sbjct: 38  EPLIVVLGNVLSDEECDELIELSKNKLARS-----KVGSSRDVNDIRTSSGAFLD--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|49187135|ref|YP_030387.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. Sterne]
 gi|228947951|ref|ZP_04110238.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           monterrey BGSC 4AJ1]
 gi|49181062|gb|AAT56438.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Sterne]
 gi|228811938|gb|EEM58272.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           monterrey BGSC 4AJ1]
          Length = 232

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 88/206 (42%), Positives = 117/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K     S +  SK G S+D + +RTSSG FL    + 
Sbjct: 54  EPLIVVLGNVLSDEECDELIELSK-----SKLARSKVGSSRDVNDIRTSSGAFLD--DNE 106

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 107 LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 165

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 166 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 206

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 207 TLHGGAPVTKGEKWIATQWVRRGTYK 232


>gi|354334983|gb|AER23925.1| procollagen-proline dioxygenase [Variovorax sp. HH01]
          Length = 280

 Score =  154 bits (389), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 87/209 (41%), Positives = 113/209 (54%), Gaps = 26/209 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V+ N LS  ECE LI  A+  + +S  V+++TG    +  RTS G F +RG++ I+
Sbjct: 93  PRVVVFGNLLSAEECEGLIAAARVRLARSLTVETRTGGEVLNVDRTSDGMFFERGENEIV 152

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMATL 198
             +E+RIA     P+E GEG+Q+L Y  G +Y  HYDYF   E  T    K GGQR+ATL
Sbjct: 153 ARVEQRIAALLRWPLEFGEGLQILRYAPGAQYRPHYDYFDPSEPGTPTILKRGGQRVATL 212

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL + E GG T FP                     GL V P RG  + F   RPD   
Sbjct: 213 VMYLQEPEGGGATTFP-------------------DVGLEVAPARGCGVFFSYDRPDPVT 253

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
              +LHGG PV+ G KW ATKW+   E+K
Sbjct: 254 --RTLHGGAPVLAGEKWVATKWLREREFK 280


>gi|196046329|ref|ZP_03113555.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           03BB108]
 gi|376268135|ref|YP_005120847.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
           F837/76]
 gi|196022799|gb|EDX61480.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           03BB108]
 gi|364513935|gb|AEW57334.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
           F837/76]
          Length = 216

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 117/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  + +S     K G S+D + +RTSSG FL    + 
Sbjct: 38  EPLIVVLGNVLSDEECDELIELSKNKLARS-----KVGSSRDVNDIRTSSGAFLD--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|217961727|ref|YP_002340297.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
           [Bacillus cereus AH187]
 gi|222097680|ref|YP_002531737.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           Q1]
 gi|229198365|ref|ZP_04325071.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
 gi|375286242|ref|YP_005106681.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
           [Bacillus cereus NC7401]
 gi|423354732|ref|ZP_17332357.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
 gi|423566803|ref|ZP_17543050.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
 gi|423574080|ref|ZP_17550199.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
 gi|217067199|gb|ACJ81449.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           AH187]
 gi|221241738|gb|ACM14448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           Q1]
 gi|228585065|gb|EEK43177.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
 gi|358354769|dbj|BAL19941.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           NC7401]
 gi|401086280|gb|EJP94507.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
 gi|401212649|gb|EJR19392.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
 gi|401215318|gb|EJR22035.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
          Length = 216

 Score =  154 bits (389), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 118/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  + +S     K G S+D + +RTSSG FL    + 
Sbjct: 38  EPLIVVLGNVLSDEECDKLIELSKNKLARS-----KVGSSRDVNDIRTSSGAFLD--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    Y+
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYR 216


>gi|229192445|ref|ZP_04319408.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
 gi|228591022|gb|EEK48878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
          Length = 216

 Score =  154 bits (388), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 119/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N +S  EC+ LI+++K  M +S     K G S+D + +RTSSG FL+   + 
Sbjct: 38  EPLIVVLANVISDEECDELIEMSKNKMERS-----KIGSSRDVNDIRTSSGAFLE--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|423615424|ref|ZP_17591258.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
 gi|401259961|gb|EJR66134.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
          Length = 216

 Score =  154 bits (388), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 84/206 (40%), Positives = 121/206 (58%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N +S  EC+ LI+++K  + +ST+     G S+D + +RTSSG FL+  ++ 
Sbjct: 38  EPLIVVLGNVISDEECDELIEMSKNKIKRSTI-----GSSRDVNDIRTSSGAFLE--ENE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSIMNVPVAHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    Y+
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYR 216


>gi|423541303|ref|ZP_17517694.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
 gi|401172491|gb|EJQ79712.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
          Length = 216

 Score =  154 bits (388), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 84/206 (40%), Positives = 121/206 (58%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N +S  EC+ LI+++K  + +ST+     G S+D + +RTSSG FL+  ++ 
Sbjct: 38  EPLIVVLGNVISDEECDELIEMSKNKIKRSTI-----GSSRDVNDIRTSSGAFLE--ENE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    Y+
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYR 216


>gi|423582447|ref|ZP_17558558.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
 gi|401213326|gb|EJR20067.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
          Length = 248

 Score =  154 bits (388), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 118/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+++K  M +S     K G S+D + +RTSSG FL+  +  
Sbjct: 70  EPLIVVLANVLSDEECDELIEMSKNKMKRS-----KVGSSRDVNDIRTSSGAFLEDSE-- 122

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 123 LTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 181

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 182 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 222

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248


>gi|52141260|ref|YP_085568.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
 gi|51974729|gb|AAU16279.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
          Length = 232

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 117/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  + +S     K G S+D + +RTSSG FL    + 
Sbjct: 54  EPLIVVLGNVLSDEECDELIELSKNKLARS-----KVGSSRDVNDIRTSSGAFLD--DNE 106

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 107 LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 165

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 166 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 206

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 207 TLHGGAPVTKGEKWIATQWVRRGTYK 232


>gi|228922987|ref|ZP_04086280.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           huazhongensis BGSC 4BD1]
 gi|228836620|gb|EEM81968.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           huazhongensis BGSC 4BD1]
          Length = 216

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 118/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+++K  M +S     K G S+D + +RTSSG FL+  +  
Sbjct: 38  EPLIVVLANVLSDEECDELIEMSKNKMKRS-----KVGSSRDVNDIRTSSGAFLEDSE-- 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|229140971|ref|ZP_04269515.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
 gi|228642547|gb|EEK98834.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
          Length = 232

 Score =  154 bits (388), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 118/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  + +S     K G S+D + +RTSSG FL    + 
Sbjct: 54  EPLIVVLGNVLSDEECDKLIELSKNKLARS-----KVGSSRDVNDIRTSSGAFLD--DNE 106

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 107 LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 165

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 166 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 206

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    Y+
Sbjct: 207 TLHGGAPVTKGEKWIATQWVRRGTYR 232


>gi|423634936|ref|ZP_17610589.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
 gi|401278922|gb|EJR84852.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
          Length = 248

 Score =  153 bits (387), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 118/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+++K  M +S     K G S+D + +RTSSG FL+  +  
Sbjct: 70  EPLIVVLANVLSDEECDELIEMSKNKMKRS-----KVGSSRDVNDIRTSSGAFLEDSE-- 122

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 123 LTLKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 181

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 182 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 222

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248


>gi|420246706|ref|ZP_14750139.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
 gi|398073616|gb|EJL64785.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
          Length = 282

 Score =  153 bits (387), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 84/208 (40%), Positives = 115/208 (55%), Gaps = 24/208 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P+   + + LS  EC  LI+ A+  + +ST V+ + G     ++RTS G + +R +D  I
Sbjct: 93  PQVIAFDDVLSGEECAELIERARHRLKRSTTVNPENGSEDVIQLRTSEGFWFQRCEDAFI 152

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN-----TKNGGQRMATL 198
             ++ RI+     P+EHGEG+Q+LHY  G +Y  H+DYF    N     T  GGQR+ATL
Sbjct: 153 ERLDHRISALMNWPLEHGEGLQILHYRQGGEYRPHFDYFPPGQNGSVLHTARGGQRVATL 212

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YLSDVE GGETVFP A                   GL+V  ++G A+ F  M     L
Sbjct: 213 IVYLSDVEGGGETVFPDA-------------------GLAVMARQGGAIYFRYMNGRRQL 253

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           DP +LHGG PV  G+KW  TKWM    Y
Sbjct: 254 DPLTLHGGAPVTSGDKWIMTKWMRERPY 281


>gi|390570433|ref|ZP_10250698.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
 gi|389937613|gb|EIM99476.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
          Length = 285

 Score =  153 bits (387), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 84/208 (40%), Positives = 115/208 (55%), Gaps = 24/208 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P+   + + LS  EC  LI+ A+  + +ST V+ + G     ++RTS G + +R +D  I
Sbjct: 96  PQVIAFDDVLSGEECAELIERARHRLKRSTTVNPENGSEDVIQLRTSEGFWFQRCEDAFI 155

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN-----TKNGGQRMATL 198
             ++ RI+     P+EHGEG+Q+LHY  G +Y  H+DYF    N     T  GGQR+ATL
Sbjct: 156 ERLDHRISALMNWPLEHGEGLQILHYRQGGEYRPHFDYFPPGQNGSVLHTARGGQRVATL 215

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YLSDVE GGETVFP A                   GL+V  ++G A+ F  M     L
Sbjct: 216 IVYLSDVEGGGETVFPDA-------------------GLAVMARQGGAIYFRYMNGRRQL 256

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           DP +LHGG PV  G+KW  TKWM    Y
Sbjct: 257 DPLTLHGGAPVTSGDKWIMTKWMRERPY 284


>gi|228954520|ref|ZP_04116545.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. T03a001]
 gi|449091198|ref|YP_007423639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. HD73]
 gi|228805177|gb|EEM51771.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. T03a001]
 gi|449024955|gb|AGE80118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. HD73]
          Length = 216

 Score =  153 bits (387), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 118/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+++K  M +S     K G ++D + +RTSSG FL+   + 
Sbjct: 38  EPLIVVLANVLSDEECDELIEMSKNKMKRS-----KVGSARDVNDIRTSSGAFLE--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|384182063|ref|YP_005567825.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           finitimus YBT-020]
 gi|324328147|gb|ADY23407.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           finitimus YBT-020]
          Length = 216

 Score =  153 bits (387), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 118/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  + +S     K G S+D + +RTSSG FL    + 
Sbjct: 38  EPLIVVLGNVLSDEECDELIELSKNKLARS-----KVGSSRDVNDIRTSSGAFLD--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTAKIEKRISSIMNVPVSHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDRSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    Y+
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYR 216


>gi|218899396|ref|YP_002447807.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           G9842]
 gi|218542449|gb|ACK94843.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           G9842]
          Length = 216

 Score =  153 bits (387), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 118/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+++K  M +S     K G S+D + +RTSSG FL+   + 
Sbjct: 38  EPLIVVLANVLSDEECDELIEMSKNKMKRS-----KVGSSRDVNDIRTSSGAFLE--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAVNNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|229111709|ref|ZP_04241257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
 gi|296504733|ref|YP_003666433.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis BMB171]
 gi|423585282|ref|ZP_17561369.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
 gi|423640681|ref|ZP_17616299.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
 gi|228671703|gb|EEL26999.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
 gi|296325785|gb|ADH08713.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis BMB171]
 gi|401233925|gb|EJR40411.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
 gi|401279742|gb|EJR85664.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
          Length = 248

 Score =  153 bits (387), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 117/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+++K  M +S     K G S+D + +RTSSG FL+   + 
Sbjct: 70  EPLIVVLANVLSDEECDELIEMSKNKMERS-----KIGSSRDVNDIRTSSGAFLE--DNE 122

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
               IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 123 FTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 181

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 182 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 222

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248


>gi|423437685|ref|ZP_17414666.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
 gi|423503075|ref|ZP_17479667.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
 gi|401120840|gb|EJQ28636.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
 gi|402459296|gb|EJV91033.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
          Length = 248

 Score =  153 bits (387), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 118/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+++K  M +S     K G ++D + +RTSSG FL+   + 
Sbjct: 70  EPLIVVLANVLSDEECDELIEMSKNKMKRS-----KVGSARDVNDIRTSSGAFLE--DNE 122

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 123 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 181

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 182 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 222

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248


>gi|121595595|ref|YP_987491.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
 gi|120607675|gb|ABM43415.1| 2OG-Fe(II) oxygenase [Acidovorax sp. JS42]
          Length = 289

 Score =  153 bits (387), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 82/209 (39%), Positives = 119/209 (56%), Gaps = 26/209 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  ++ N LS  EC+ +ID A+P M +S  V + TG  + +  RTS G F +RG+  ++
Sbjct: 102 PRVVLFGNLLSPEECQAIIDAAQPRMARSLTVQTTTGGEEVNADRTSDGMFFQRGETPVV 161

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMATL 198
           + +E+RIA     P+++GEG+QVLHY  G +Y  HYDYF  D+  T    + GGQR+ATL
Sbjct: 162 QRLEERIARLVRWPIQNGEGLQVLHYRPGAEYKPHYDYFDPDQPGTSTIVRRGGQRVATL 221

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL++  +GG T FP                      L V P++G+A+ F   RP  + 
Sbjct: 222 VIYLNNPLKGGGTTFPDVP-------------------LEVAPRQGNAVFFSYERPHPST 262

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
              +LHGG  VI G KW ATKW+   E+K
Sbjct: 263 --RTLHGGASVIEGEKWIATKWLREREFK 289


>gi|229163182|ref|ZP_04291137.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
 gi|228620245|gb|EEK77116.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
          Length = 229

 Score =  153 bits (387), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 116/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K     S +  SK G S+D + +RTS G FL    + 
Sbjct: 51  EPLIVVLGNVLSDEECDELIELSK-----SKLARSKVGSSRDVNDIRTSKGAFLD--DNE 103

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 104 LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 162

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 163 LNDVEEGGETFFPKLN-------------------LSVNPRKGMAVYFEYFYQDQSLNEL 203

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 204 TLHGGAPVTKGEKWIATQWVRRGTYK 229


>gi|228910069|ref|ZP_04073889.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
 gi|228849586|gb|EEM94420.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
          Length = 248

 Score =  153 bits (386), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 118/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+++K  M +S     K G S+D + +RTSSG FL+   + 
Sbjct: 70  EPLIVVLANVLSDEECDELIEMSKNKMKRS-----KVGSSRDVNDIRTSSGAFLE--DNE 122

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 123 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAVNNRISTLVMY 181

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 182 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 222

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248


>gi|47567794|ref|ZP_00238502.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
 gi|47555471|gb|EAL13814.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
          Length = 216

 Score =  153 bits (386), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 116/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  + +S     K G S+D + +RTS G FL    + 
Sbjct: 38  EPLIVVLGNVLSDEECDELIELSKNKLARS-----KVGSSRDVNDIRTSKGAFLD--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|281307110|pdb|3ITQ|A Chain A, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
           Anthracis
 gi|281307111|pdb|3ITQ|B Chain B, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
           Anthracis
          Length = 216

 Score =  153 bits (386), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 116/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  + +S     K G S+D + +RTSSG FL    + 
Sbjct: 38  EPLIVVLGNVLSDEECDELIELSKSKLARS-----KVGSSRDVNDIRTSSGAFLD--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+ Y
Sbjct: 91  LTAKIEKRISSIXNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVXY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGXAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|228941395|ref|ZP_04103947.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           berliner ATCC 10792]
 gi|228974327|ref|ZP_04134896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           thuringiensis str. T01001]
 gi|228980919|ref|ZP_04141223.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
 gi|384188306|ref|YP_005574202.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           chinensis CT-43]
 gi|410676625|ref|YP_006928996.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
 gi|452200698|ref|YP_007480779.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
           thuringiensis serovar thuringiensis str. IS5056]
 gi|228778855|gb|EEM27118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
 gi|228785377|gb|EEM33387.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           thuringiensis str. T01001]
 gi|228818321|gb|EEM64394.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           berliner ATCC 10792]
 gi|326942015|gb|AEA17911.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           chinensis CT-43]
 gi|409175754|gb|AFV20059.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
 gi|452106091|gb|AGG03031.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
           thuringiensis serovar thuringiensis str. IS5056]
          Length = 216

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 117/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC  LI+++K  M +S     K G S+D + +RTSSG FL+   + 
Sbjct: 38  EPLIVVLANVLSDEECGELIEMSKNKMKRS-----KVGSSRDVNDIRTSSGAFLE--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|159481038|ref|XP_001698589.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158282329|gb|EDP08082.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 258

 Score =  153 bits (386), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 74/143 (51%), Positives = 101/143 (70%), Gaps = 3/143 (2%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E ++W PRAF+YHNFLS+AEC++L D+    + +S VVDSKTGQSK   +RTS G    R
Sbjct: 9   ETISWSPRAFIYHNFLSEAECDHLTDIGNKRVSRSLVVDSKTGQSKLDDIRTSYGAAFGR 68

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTK---NGGQR 194
           G+D +I  +E+RIA++T +P E+GE +Q+L Y  GQKYDAH+D+F D  +     + G R
Sbjct: 69  GEDPVIAAVEERIAEWTHLPPEYGEPMQILRYVDGQKYDAHWDWFDDPVHHAAYLHEGNR 128

Query: 195 MATLLMYLSDVEEGGETVFPAAN 217
            AT+L+YLS VE GGET  P A+
Sbjct: 129 YATVLLYLSGVEGGGETNLPLAD 151


>gi|398808448|ref|ZP_10567311.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
 gi|398087480|gb|EJL78066.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
          Length = 280

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 86/209 (41%), Positives = 114/209 (54%), Gaps = 26/209 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V+ N LS  ECE LI  A+  + +S  V+++TG    +  RTS G F +RG++ I+
Sbjct: 93  PRVVVFGNLLSAEECEGLIAAARVRLARSLTVETRTGGEVLNVDRTSDGMFFERGENEIV 152

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMATL 198
             +E+R+A     P+E+GEG+Q+L Y  G +Y  HYDYF   E  T    K GGQR+ATL
Sbjct: 153 ARLEQRLATLLRWPLEYGEGLQILRYAPGAQYRPHYDYFDPGEPGTPTILKRGGQRVATL 212

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL + E GG T FP                     GL V P RG  + F   RPD   
Sbjct: 213 VMYLQEPEGGGATTFP-------------------DVGLEVAPVRGCGVFFSYDRPDPVT 253

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
              +LHGG PV+ G KW ATKW+   E+K
Sbjct: 254 --RTLHGGAPVLAGEKWVATKWLREREFK 280


>gi|332526359|ref|ZP_08402485.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
 gi|332110495|gb|EGJ10818.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
          Length = 224

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 82/203 (40%), Positives = 117/203 (57%), Gaps = 26/203 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V+   LS+ EC+ L+ LA+P +++S  VD+ TG S+ +  RTS G F +RG+  +I
Sbjct: 37  PRVVVFGGLLSEQECDELVALAQPRLLRSETVDNSTGGSEVNAARTSDGMFFERGETPLI 96

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEF-NTKN----GGQRMATL 198
             IE+RIA+    P+E GEG+QVLHY  G +Y  H+D+F      T N    GGQR+ T+
Sbjct: 97  ERIERRIAELVHWPVERGEGLQVLHYRPGAQYKPHHDFFDPAHPGTANILRRGGQRVGTV 156

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+    GG T FP                   + GL V+P +G+A+ F   RP A+ 
Sbjct: 157 VIYLNTPAGGGATTFP-------------------EVGLEVQPIKGNAVFFSYERPLAST 197

Query: 259 DPSSLHGGCPVIRGNKWSATKWM 281
              +LHGG PV+ G KW ATKW+
Sbjct: 198 --RTLHGGAPVLDGEKWVATKWL 218


>gi|229093299|ref|ZP_04224414.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
 gi|228690082|gb|EEL43879.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
          Length = 232

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 116/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  + +S     K G S+D + +RTSSG FL    + 
Sbjct: 54  EPLIVVLGNVLSDEECDELIELSKNKLARS-----KVGSSRDVNDIRTSSGAFLD--DNE 106

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 107 LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 165

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 166 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 206

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW  T+W+    YK
Sbjct: 207 TLHGGAPVTKGEKWITTQWVRRGTYK 232


>gi|228987427|ref|ZP_04147547.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           tochigiensis BGSC 4Y1]
 gi|228772399|gb|EEM20845.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           tochigiensis BGSC 4Y1]
          Length = 232

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 116/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  + +S     K G S+D + +RTS G FL    + 
Sbjct: 54  EPLIVVLGNVLSDEECDELIELSKNKLARS-----KVGSSRDVNDIRTSKGAFLD--DNE 106

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 107 LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 165

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 166 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 206

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 207 TLHGGAPVTKGEKWIATQWVRRGTYK 232


>gi|229157835|ref|ZP_04285910.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
 gi|228625792|gb|EEK82544.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
          Length = 232

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 116/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  + +S     K G S+D + +RTS G FL    + 
Sbjct: 54  EPLIVVLGNVLSDEECDELIELSKNKLARS-----KVGSSRDVNDIRTSKGAFLD--DNE 106

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 107 LTEKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 165

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 166 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 206

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 207 TLHGGAPVTKGEKWIATQWVRRGAYK 232


>gi|403238305|ref|ZP_10916891.1| procollagen-proline dioxygenase [Bacillus sp. 10403023]
          Length = 296

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 80/200 (40%), Positives = 117/200 (58%), Gaps = 22/200 (11%)

Query: 85  RAFVYH--NFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           + F+ H   FLS+ EC+ LI++++  +  STV+D KTG+ K +  RTS G      ++  
Sbjct: 108 KPFILHLDYFLSEEECDQLIEMSRERLKPSTVIDPKTGEEKAATGRTSKGMSFYLQENEF 167

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNTKNGGQRMATLLMY 201
           I+ +EKRIA+    P+E+GEG+QVL+Y +G++Y +H+DYF   +   + GGQR+ T L+Y
Sbjct: 168 IKKVEKRIAELIEFPVENGEGLQVLNYGIGEEYKSHFDYFPQSKVVPEKGGQRVGTFLIY 227

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DV  GGETVFP A                   G+S+ PK+G A+ F        +D  
Sbjct: 228 LNDVPAGGETVFPKA-------------------GVSIVPKKGSAVYFQYGNSKGEVDRM 268

Query: 262 SLHGGCPVIRGNKWSATKWM 281
           SLH   PV  G KW ATKW+
Sbjct: 269 SLHSSIPVSEGEKWVATKWI 288


>gi|363814557|ref|NP_001242754.1| uncharacterized protein LOC100794585 [Glycine max]
 gi|255628535|gb|ACU14612.1| unknown [Glycine max]
          Length = 238

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 77/143 (53%), Positives = 99/143 (69%), Gaps = 2/143 (1%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++ W PR  + HNFLS  EC+YL  LA P +  STVVD+KTG+   S VRTSSG FL  
Sbjct: 82  EVLNWSPRIILLHNFLSMEECDYLRALALPRLHISTVVDTKTGKGIKSDVRTSSGMFLNS 141

Query: 138 GQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
            + +  +++ IEKRI+ ++ IP+E+GE +QVL YE  Q Y  H+DYF D FN K GGQR+
Sbjct: 142 KERKYPMVQAIEKRISVYSQIPIENGELMQVLRYEKNQYYKPHHDYFSDTFNLKRGGQRI 201

Query: 196 ATLLMYLSDVEEGGETVFPAANA 218
           AT+LMYLSD  E GET FP A +
Sbjct: 202 ATMLMYLSDNIERGETYFPLAGS 224


>gi|229186477|ref|ZP_04313640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
 gi|228596991|gb|EEK54648.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
          Length = 216

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 117/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  + +S     K G S+D + +RTSSG FL    + 
Sbjct: 38  EPLIVVLGNVLSDEECDELIELSKNKLARS-----KVGSSRDVNDIRTSSGAFLD--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL++Y
Sbjct: 91  LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVIY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|187920106|ref|YP_001889137.1| procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
 gi|187718544|gb|ACD19767.1| Procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
          Length = 295

 Score =  152 bits (385), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 80/208 (38%), Positives = 120/208 (57%), Gaps = 24/208 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P+  V+ + LS  EC  +I+ ++  + +ST V+ +TG+    R RTS G + +RG+D  I
Sbjct: 106 PQVIVFGDVLSPDECAEMIERSRHRLKRSTTVNPETGKEDVIRNRTSEGIWYQRGEDAFI 165

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE-----FNTKNGGQRMATL 198
             +++RI+     P+E+GEG+Q+LHY    +Y  H+DYF  +      +T  GGQR+ATL
Sbjct: 166 ERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQGGQRVATL 225

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+DV +GGET+FP A                   G+SV  ++G A+ F  M     L
Sbjct: 226 VIYLNDVPDGGETIFPEA-------------------GISVAARQGGAVYFRYMNGQRQL 266

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           DP +LHGG PV+ G+KW  TKWM    Y
Sbjct: 267 DPLTLHGGAPVLGGDKWIMTKWMRERAY 294


>gi|73542634|ref|YP_297154.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
           eutropha JMP134]
 gi|72120047|gb|AAZ62310.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
           eutropha JMP134]
          Length = 282

 Score =  152 bits (384), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 78/210 (37%), Positives = 117/210 (55%), Gaps = 24/210 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P   +Y + LS AEC+ L++LA+  + +S V++  TG       RTS G   + G+  +I
Sbjct: 90  PSIRLYQHLLSDAECDALVELARGRLARSPVINPDTGDENLIDARTSMGAMFQVGEHTLI 149

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-----DEFNTKNGGQRMATL 198
           + IE RIA    +P++HGEG+Q+L+Y+ G +Y  H+D+F      +    + GGQR ATL
Sbjct: 150 QRIEDRIAAVLGVPVDHGEGLQILNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRTATL 209

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+  + GG T FP                   + GL V P +G+A+ F  ++PD  L
Sbjct: 210 VIYLNTPQAGGATAFP-------------------RIGLEVAPVKGNAVYFSYLQPDGKL 250

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           D  +LH G PV  G KW ATKW+  H Y+A
Sbjct: 251 DERTLHAGLPVQSGEKWIATKWLREHPYRA 280


>gi|118479416|ref|YP_896567.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis str. Al
           Hakam]
 gi|118418641|gb|ABK87060.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis str. Al
           Hakam]
          Length = 232

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 117/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  + +S     K G S+D + +RTSSG FL    + 
Sbjct: 54  EPLIVVLGNVLSDEECDELIELSKNKLARS-----KVGSSRDVNDIRTSSGAFLD--DNE 106

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL++Y
Sbjct: 107 LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVIY 165

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 166 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 206

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 207 TLHGGAPVTKGEKWIATQWVRRGTYK 232


>gi|229104864|ref|ZP_04235524.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
 gi|228678581|gb|EEL32798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
          Length = 216

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 84/206 (40%), Positives = 120/206 (58%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N +S  EC  LI+++K  + +ST+     G S+D + +RTSSG FL+  ++ 
Sbjct: 38  EPLIVVLGNVISDEECGELIEMSKNKIKRSTI-----GSSRDVNDIRTSSGAFLE--ENE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    Y+
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYR 216


>gi|229075940|ref|ZP_04208916.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
 gi|229117732|ref|ZP_04247101.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
 gi|407706764|ref|YP_006830349.1| alpha/beta fold family hydrolase [Bacillus thuringiensis MC28]
 gi|423377905|ref|ZP_17355189.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
 gi|423464099|ref|ZP_17440867.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
 gi|423547540|ref|ZP_17523898.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
 gi|423622677|ref|ZP_17598455.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
 gi|228665709|gb|EEL21182.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
 gi|228707255|gb|EEL59452.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
 gi|401179261|gb|EJQ86434.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
 gi|401260797|gb|EJR66965.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
 gi|401636171|gb|EJS53925.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
 gi|402420366|gb|EJV52637.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
 gi|407384449|gb|AFU14950.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis MC28]
          Length = 216

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 83/206 (40%), Positives = 120/206 (58%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N +S  EC  LI+++K  + +ST+     G ++D + +RTSSG FL+  ++ 
Sbjct: 38  EPLIVVLGNVISDEECNELIEMSKNKIKRSTI-----GSARDVNDIRTSSGAFLE--ENE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    Y+
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYR 216


>gi|224056224|ref|XP_002298763.1| predicted protein [Populus trichocarpa]
 gi|222846021|gb|EEE83568.1| predicted protein [Populus trichocarpa]
          Length = 175

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 75/155 (48%), Positives = 102/155 (65%), Gaps = 6/155 (3%)

Query: 129 TSSGTFLKRGQDRI--IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEF 186
           T+  TF+   +D+   +  IE++IA  T IP  HGE   +L YE+GQKYD+HYD F  + 
Sbjct: 18  TTESTFIGGSEDKTGTLDFIERKIAKATMIPQSHGEAFNILRYEIGQKYDSHYDAFNPDE 77

Query: 187 NTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDA 246
                 QR+A+ L+YLS VEEGGET+FP  N +  S  +  E  +C   GL VKP++GD 
Sbjct: 78  YGPQPSQRVASFLLYLSSVEEGGETMFPFENGSAVSSGF--EYKQC--VGLKVKPRQGDG 133

Query: 247 LLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           LLF+S+ P+ T+D +SLHG CPVI+G KW ATKW+
Sbjct: 134 LLFYSLFPNGTIDRTSLHGSCPVIKGEKWVATKWI 168


>gi|421749438|ref|ZP_16186877.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
 gi|409771699|gb|EKN53918.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
          Length = 319

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 81/209 (38%), Positives = 114/209 (54%), Gaps = 24/209 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  ++   L   ECE LI L++  + +S VV+  TG       RTS G   + G+  +I
Sbjct: 127 PRIALFQRLLMPDECEALIALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQVGEHPLI 186

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE-----FNTKNGGQRMATL 198
             +E RIA  T +P+EHGEG+Q+L+Y+ G +Y  HYD+F  +        + GGQRMATL
Sbjct: 187 ERLEARIAAVTGVPVEHGEGLQILNYKPGAEYQPHYDFFNPQRPGEARQLRVGGQRMATL 246

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+DV  GG T FP                   K GL V P +G+A+ F  +  D +L
Sbjct: 247 VIYLNDVPAGGATAFP-------------------KLGLRVNPVQGNAVFFAYLGEDGSL 287

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           D  +LH G PV +G KW ATKW+    Y+
Sbjct: 288 DERTLHAGLPVEQGEKWIATKWLREAPYR 316


>gi|317127314|ref|YP_004093596.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
           2522]
 gi|315472262|gb|ADU28865.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
           2522]
          Length = 229

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 79/199 (39%), Positives = 114/199 (57%), Gaps = 25/199 (12%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           EP   +  N LS+ EC+ LI L+K  + +S + +          +RTSS  F    ++ +
Sbjct: 43  EPLIVLLGNVLSEEECDQLISLSKDRIERSKISNKSVHD-----LRTSSSMFFDDAENDV 97

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           +  +EKR++    IP++HGEGIQ+L+Y +GQ+Y AHYDYF    N+K    R++TL+MYL
Sbjct: 98  VSTVEKRVSQIMKIPVDHGEGIQILNYAIGQEYKAHYDYF-SSGNSKVNNPRISTLVMYL 156

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           +DVE GGET FP  N                     V PK+G A+ F     D TL+  +
Sbjct: 157 NDVEAGGETYFPKLN-------------------FYVAPKKGMAVYFEYFYNDTTLNELT 197

Query: 263 LHGGCPVIRGNKWSATKWM 281
           LHGG PV+ G+KW+AT+WM
Sbjct: 198 LHGGAPVVIGDKWAATQWM 216


>gi|423406337|ref|ZP_17383486.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
 gi|401660331|gb|EJS77813.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
          Length = 216

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 116/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  + +S     K G S+D + +RTS G FL    + 
Sbjct: 38  EPLIVVLGNVLSDEECDKLIELSKNKLARS-----KVGSSRDVNDIRTSKGAFLD--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|228916870|ref|ZP_04080433.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pulsiensis BGSC 4CC1]
 gi|228842793|gb|EEM87878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pulsiensis BGSC 4CC1]
          Length = 232

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 116/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  + +S     K G S+D + +RTS G FL    + 
Sbjct: 54  EPLIVVLGNVLSDEECDELIELSKNKLARS-----KVGSSRDVNDIRTSKGAFLD--DNE 106

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 107 LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 165

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 166 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 206

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 207 TLHGGAPVTKGEKWIATQWVRRGTYK 232


>gi|423395462|ref|ZP_17372663.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
 gi|401654873|gb|EJS72412.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
          Length = 216

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 116/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  + +S     K G S+D + +RTS G FL    + 
Sbjct: 38  EPLIVVLGNVLSDEECDKLIELSKNKLARS-----KVGSSRDVNDIRTSKGAFLD--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTAKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|229174912|ref|ZP_04302432.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
 gi|228608580|gb|EEK65882.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
          Length = 216

 Score =  152 bits (384), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 116/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+L+K  + +S     K G S+D + +RTS G FL    + 
Sbjct: 38  EPLIVVLGNVLSDEECDELIELSKSKLARS-----KVGSSRDVNDIRTSKGAFLD--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTVKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|218231188|ref|YP_002369041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           B4264]
 gi|218159145|gb|ACK59137.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           B4264]
          Length = 216

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 117/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC  LI+++K  M +S     K G S+D + +RTSSG FL+   + 
Sbjct: 38  EPLIVVLANVLSDEECGELIEMSKNKMERS-----KIGSSRDVNDIRTSSGAFLE--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +++  
Sbjct: 150 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSINEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|229152436|ref|ZP_04280628.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
 gi|228631044|gb|EEK87681.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
          Length = 248

 Score =  152 bits (383), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 117/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC  LI+++K  M +S     K G S+D + +RTSSG FL+   + 
Sbjct: 70  EPLIVVLANVLSDEECGELIEMSKNKMERS-----KIGSSRDVNDIRTSSGAFLE--DNE 122

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 123 LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 181

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +++  
Sbjct: 182 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSINEL 222

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 223 TLHGGAPVTKGEKWIATQWVRRGTYK 248


>gi|377810637|ref|YP_005043077.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
           YI23]
 gi|357939998|gb|AET93554.1| proCollegen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia sp.
           YI23]
          Length = 297

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 85/220 (38%), Positives = 121/220 (55%), Gaps = 26/220 (11%)

Query: 74  EQWTEIVA--WEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSS 131
           E+ T ++A    P A +   FL+ +EC+ LI LA+P + +STVVD  TG+   +  R+S 
Sbjct: 90  ERKTRVIARLQRPAAVLLDEFLTGSECDQLIALARPRLSRSTVVDPVTGRDVAAGHRSSD 149

Query: 132 GTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL--DEFNTK 189
           GTF +  +  ++  +E RIA  T +  E+GEG+Q+L Y+ G +   H DY +  +E N +
Sbjct: 150 GTFFRLAETPLVARLEMRIAALTGLAAENGEGLQLLRYQPGAESTPHVDYLVAGNETNRE 209

Query: 190 N---GGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDA 246
           +    GQR+ TLLMYL+DVE GGETVFP                   + G SV P+RG A
Sbjct: 210 SIARSGQRVGTLLMYLNDVEGGGETVFP-------------------QVGCSVVPRRGQA 250

Query: 247 LLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           L F         DP+SLH   P+  G KW ATKW+    +
Sbjct: 251 LYFEYCNRAGVCDPASLHASTPLRSGEKWVATKWIRARRF 290


>gi|91778899|ref|YP_554107.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
 gi|91691559|gb|ABE34757.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
          Length = 292

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 80/208 (38%), Positives = 118/208 (56%), Gaps = 24/208 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P+  V+ + LS  EC  +I+ ++  + +ST V+  TG+    R RTS G + +RG+D  I
Sbjct: 103 PQVIVFADVLSPDECAEMIERSRHRLKRSTTVNPATGKEDVIRNRTSEGIWYQRGEDPFI 162

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE-----FNTKNGGQRMATL 198
             +++RI+     P+E+GEG+Q+LHY    +Y  H+DYF  +      +T  GGQR+ATL
Sbjct: 163 ERMDRRISSLMNWPVENGEGLQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQGGQRVATL 222

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+DV +GGET+FP A                   G+SV   +G A+ F  M     L
Sbjct: 223 VIYLNDVPDGGETIFPEA-------------------GMSVAASQGGAVYFRYMNDRRQL 263

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           DP +LHGG PV+ G+KW  TKWM    Y
Sbjct: 264 DPLTLHGGAPVLAGDKWIMTKWMRERAY 291


>gi|229146822|ref|ZP_04275187.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
 gi|228636650|gb|EEK93115.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
          Length = 216

 Score =  151 bits (382), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 117/206 (56%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI+++K  M +S     K G S+D + +RTSSG FL+   + 
Sbjct: 38  EPLIVVLANVLSDEECDELIEMSKNKMERS-----KIGSSRDVNDIRTSSGAFLE--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F       +L+  
Sbjct: 150 LNDVEEGGETYFPKLN-------------------LSVHPRKGMAVYFEYFYQGQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGTYK 216


>gi|229031885|ref|ZP_04187873.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
 gi|228729503|gb|EEL80492.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
          Length = 216

 Score =  151 bits (381), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 115/206 (55%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC  LI+L+K     S +  SK G S+D + +RTS G FL    + 
Sbjct: 38  EPLIVVLGNVLSDEECGELIELSK-----SKLARSKVGSSRDVNDIRTSKGAFLD--DNE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P  HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTTKIEKRISSIMNVPASHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV +G KW AT+W+    YK
Sbjct: 191 TLHGGAPVTKGEKWIATQWVRRGAYK 216


>gi|423448819|ref|ZP_17425698.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
 gi|401129413|gb|EJQ37096.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
          Length = 216

 Score =  151 bits (381), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 83/206 (40%), Positives = 120/206 (58%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N +S  EC+ LI+++K  + +ST+     G S+D + +RTSSG FL+  ++ 
Sbjct: 38  EPLIVVLGNVISDEECDELIEMSKNKIKRSTI-----GSSRDVNDIRTSSGAFLE--ENE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG  V +G KW AT+W+    Y+
Sbjct: 191 TLHGGASVTKGEKWIATQWVRRGTYR 216


>gi|383757171|ref|YP_005436156.1| putative prolyl 4-hydroxylase alpha subunit [Rubrivivax gelatinosus
           IL144]
 gi|381377840|dbj|BAL94657.1| putative prolyl 4-hydroxylase alpha subunit homologue
           oxidoreductase protein [Rubrivivax gelatinosus IL144]
          Length = 279

 Score =  151 bits (381), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 83/204 (40%), Positives = 114/204 (55%), Gaps = 26/204 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V+   LS  EC+ L+ LA+P + +S  VD+ TG S+ +  RTS G F +RG+  +I
Sbjct: 92  PRVVVFGGLLSDEECDELVALARPRLARSETVDNSTGGSEVNAARTSDGMFFERGEKPLI 151

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEF-NTKN----GGQRMATL 198
             IE+RIA+    P+E GEG+QVL Y  G +Y  H+D+F      T N    GGQR+ T+
Sbjct: 152 ERIERRIAELVRWPVERGEGLQVLRYRPGAQYKPHHDFFDPAHPGTANILRRGGQRVGTV 211

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL+    GG T FP                   + GL V+P +G+A+ F   RP A+ 
Sbjct: 212 VMYLNTPAGGGATTFP-------------------EVGLEVQPVKGNAVFFSYERPLAST 252

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
              +LHGG PV+ G KW ATKWM 
Sbjct: 253 --RTLHGGAPVLDGEKWVATKWMR 274


>gi|195061068|ref|XP_001995918.1| GH14106 [Drosophila grimshawi]
 gi|193891710|gb|EDV90576.1| GH14106 [Drosophila grimshawi]
          Length = 511

 Score =  150 bits (380), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 81/206 (39%), Positives = 118/206 (57%), Gaps = 22/206 (10%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           EIV   P   V+H+ LS  E +YL +LA+P + ++TV     G+    RVRTS G +L+R
Sbjct: 311 EIVLLNPFIVVFHDALSPQEIDYLQNLARPLLKRTTV--HVNGKYVSRRVRTSKGAWLER 368

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNTKNGGQRMA 196
             + + R IE+R+ D T + M+  E   +++Y +G  Y AHYD+F   +  T   G R+A
Sbjct: 369 DLNNLTRRIERRVVDMTELSMQGSEAYNIMNYGLGGHYAAHYDFFNTTKQQTSETGDRIA 428

Query: 197 TLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
           T+L YLSDVE+GG TVFP                      L+V P+RG AL ++++  + 
Sbjct: 429 TVLFYLSDVEQGGATVFPNLK-------------------LAVSPERGMALFWYNLLDNG 469

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWMH 282
           T D  +LHGGCPV+ G+KW  T W+H
Sbjct: 470 TGDTRTLHGGCPVLVGSKWVMTLWIH 495


>gi|307108817|gb|EFN57056.1| hypothetical protein CHLNCDRAFT_143796 [Chlorella variabilis]
          Length = 334

 Score =  150 bits (379), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 80/207 (38%), Positives = 120/207 (57%), Gaps = 10/207 (4%)

Query: 75  QWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTF 134
           Q  ++++  PRA++   FLS+ +C+++I +A+  +  S +       ++++R        
Sbjct: 129 QPMQLLSLYPRAYLMPRFLSQKQCDHVIAMAERRLAPSGLAFKAGDTAENTR-------- 180

Query: 135 LKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
                D ++  IE ++A  T IP  HGE   VL YE  Q YD+HYD F +E       QR
Sbjct: 181 -DEDPDGVLAWIEDKLAAVTMIPAGHGEPFNVLRYEPSQHYDSHYDSFSEEEYGPQFSQR 239

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +AT+L+YL+DVEEGGETVF        +     +   C   G+ VKP++GDALLF+S+  
Sbjct: 240 IATVLLYLADVEEGGETVFLLEGKGGLARLERIDYKAC-DTGIKVKPRQGDALLFFSVSV 298

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWM 281
           + TLD  SLHGGCPV+ G KW+ TKW+
Sbjct: 299 NGTLDKHSLHGGCPVVAGTKWAMTKWI 325


>gi|319943342|ref|ZP_08017624.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
 gi|319743157|gb|EFV95562.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
          Length = 311

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 82/213 (38%), Positives = 114/213 (53%), Gaps = 24/213 (11%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V   P   V    LS  EC+ +I L++  M  S VVD ++G S +S VR S G+  +RG+
Sbjct: 117 VMSNPNIAVIRGLLSDEECDEVIRLSRGKMKTSQVVDRESGGSYESSVRKSEGSHFERGE 176

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE-----FNTKNGGQR 194
           + ++R IE R++    +P+  GE +Q+LHY  G +Y AH D+F  +       T+ GGQR
Sbjct: 177 NELVRRIEARLSALVDLPVNRGEPLQILHYGPGGEYKAHQDFFEPKDPGSAVLTRVGGQR 236

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           + T++MYL+DV EGGET FP                     G S KP +G A+ F     
Sbjct: 237 IGTVVMYLNDVPEGGETAFPDI-------------------GFSAKPIKGSAVYFEYQNA 277

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           D  LD   LH G PVIRG+KW  TKW+    Y+
Sbjct: 278 DGQLDYRCLHAGMPVIRGDKWIMTKWLRERPYE 310


>gi|303279839|ref|XP_003059212.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226459048|gb|EEH56344.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 409

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 90/249 (36%), Positives = 142/249 (57%), Gaps = 39/249 (15%)

Query: 73  GEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVV--DSKTGQSKDSRV--R 128
           G+   E ++  PRA+++  FL+K EC +LI+++ P++ +STVV  D+  G++   R   R
Sbjct: 79  GDARVEKLSDSPRAYLFREFLTKEECAHLIEISTPHLKRSTVVGDDALLGEADGRRSDYR 138

Query: 129 TSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQ---VLHYEVGQKYDAHYDYFLDE 185
           TS+G FL +  D ++  +E+R+  F+ +P E+ E +Q   +L YE+GQ+Y  H    +D 
Sbjct: 139 TSTGAFLPKLYDDVVTRVERRVEAFSRLPFENQEQLQARSLLRYELGQEYRDH----VDG 194

Query: 186 FNTKNGGQRMATLLMYLSDVEEGGETVFPAANAN------FTSVRWWNELSECG------ 233
           F T+NGG+R+AT+LM+L++ EEGGET FP    +        + R   ELS+C       
Sbjct: 195 FATENGGKRVATVLMFLAEPEEGGETAFPNGEPSEAVAARVAAQRARGELSDCAWRGGGG 254

Query: 234 ---------KQGLSVKPKRGDALLFWSMRPDAT-------LDPSSLHGGCPVIRGNKWSA 277
                     +G +VKP+ GDA+LF+S   D         +  +S H  CP  RG KW+A
Sbjct: 255 GTAGGGRGNLRGFAVKPRLGDAVLFFSYDADDDGGYDGAEVSHASTHASCPTTRGVKWTA 314

Query: 278 TKWMHVHEY 286
           TKW+H   +
Sbjct: 315 TKWIHERAF 323


>gi|229098707|ref|ZP_04229647.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
 gi|423441025|ref|ZP_17417931.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
 gi|423533441|ref|ZP_17509859.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
 gi|228684786|gb|EEL38724.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
 gi|402417686|gb|EJV49986.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
 gi|402463660|gb|EJV95360.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
          Length = 216

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 82/206 (39%), Positives = 119/206 (57%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N +S  EC  LI+++K  + +ST+     G ++D + +RTSSG FL+  ++ 
Sbjct: 38  EPLIVVLGNVISDEECNELIEMSKNKIKRSTI-----GSARDVNDIRTSSGAFLE--ENE 90

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKRI+    +P+ HGEG+ +L+YEV Q+Y AHYDYF  E +      R++TL+MY
Sbjct: 91  LTSKIEKRISSIMNVPVTHGEGLHILNYEVDQQYKAHYDYFA-EHSRSAANNRISTLVMY 149

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   LSV P++G A+ F     D +L+  
Sbjct: 150 LNDVEEGGETFFPKLN-------------------LSVHPRKGMAVYFEYFYQDQSLNEL 190

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG  V +G KW AT+W+    Y+
Sbjct: 191 TLHGGASVTKGEKWIATQWVRRGTYR 216


>gi|124267278|ref|YP_001021282.1| hypothetical protein Mpe_A2091 [Methylibium petroleiphilum PM1]
 gi|124260053|gb|ABM95047.1| conserved hypothetical protein [Methylibium petroleiphilum PM1]
          Length = 289

 Score =  149 bits (377), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 84/209 (40%), Positives = 112/209 (53%), Gaps = 26/209 (12%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR  V+   LS AEC+ ++ LA   + +S  VD+ TG S+ +  RTS G F  RG+  +
Sbjct: 101 DPRVIVFSGLLSDAECDEIVALAGARLARSHTVDTATGASEVNAARTSDGMFFTRGEHPV 160

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMAT 197
               E RIA     P+E+GEG+QVLHY  G +Y  HYDYF  D+  T    + GGQR+AT
Sbjct: 161 CARFEARIAALLNWPVENGEGLQVLHYRPGAEYKPHYDYFDPDQPGTPAVLRRGGQRVAT 220

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           L+ YL+    GG T FP                     GL V P +G A+ F   RP  +
Sbjct: 221 LVTYLNTPTRGGGTTFP-------------------DIGLEVTPLKGHAVFFSYDRPHPS 261

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
               SLHGG PV+ G+KW ATKW+ V  +
Sbjct: 262 T--RSLHGGAPVLEGDKWVATKWLRVGRF 288


>gi|357135727|ref|XP_003569460.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 2
           [Brachypodium distachyon]
          Length = 314

 Score =  149 bits (376), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 82/208 (39%), Positives = 122/208 (58%), Gaps = 16/208 (7%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSK----TGQSKDSRVRTSSGTFL 135
           +AW PR F+Y  FLS  EC++L+ +A+  +  S +V++     T  S D+R +      L
Sbjct: 63  LAWHPRVFLYEGFLSGMECDHLVYVARLNIESSLLVNAGARNITQNSTDARFKFQ----L 118

Query: 136 KRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
              +D ++  IE RI+ ++FIP EHGE +Q+L Y   Q      D+  D   + +GG R+
Sbjct: 119 ADSKDIVVSKIEDRISLWSFIPKEHGESMQILKYGSNQS-----DHNKDGTQSSSGGNRL 173

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
            T+LMYLSDV++GGETVFP +    T  +    LSEC   G +VKP +GDA+L +++RPD
Sbjct: 174 VTILMYLSDVKQGGETVFPRSELKDTQAKE-GALSECA--GYAVKPVKGDAILLFNLRPD 230

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHV 283
              D  S +  C V+ G KW A K +H+
Sbjct: 231 GVTDSDSHYEDCSVLEGEKWLAIKHLHI 258


>gi|357417854|ref|YP_004930874.1| procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
 gi|355335432|gb|AER56833.1| Procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
          Length = 283

 Score =  149 bits (376), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 83/227 (36%), Positives = 120/227 (52%), Gaps = 28/227 (12%)

Query: 62  FEKRSSIAEEKGEQWTEIVA--WEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKT 119
            E+      + G++  +++A    PR  V+ N L+  EC+ LI LA+  + +S V D  T
Sbjct: 72  IERNGPALLQAGDRQVQVLASLLHPRVIVFGNLLAAEECDALIALARRQIKRSPVFDPDT 131

Query: 120 GQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHY 179
           GQ +  + RTS G F  RG + +   +E RIA     P+E+GEG+QVL Y  G +Y+ HY
Sbjct: 132 GQDQQHQARTSEGMFFGRGANPLCARVEARIAALLNWPLENGEGLQVLRYGPGAQYEPHY 191

Query: 180 DYFLD-----EFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGK 234
           DYF       E   + GGQR+A+L++YL+   +GG T FP A+                 
Sbjct: 192 DYFDPARPGAEVALRRGGQRVASLVIYLNTPTQGGATTFPDAH----------------- 234

Query: 235 QGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
             L V P +G+A+ F   RP       +LHGG PV+ G KW ATKW+
Sbjct: 235 --LEVAPIKGNAVYFSYDRPHPMT--GTLHGGAPVVEGEKWVATKWL 277


>gi|239814309|ref|YP_002943219.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
 gi|239800886|gb|ACS17953.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
          Length = 279

 Score =  148 bits (373), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 84/209 (40%), Positives = 112/209 (53%), Gaps = 26/209 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V+ N +S  ECE LI  A+  + +S  V+++TG    +  RTS G F +RG++ I+
Sbjct: 92  PRVVVFGNLVSPEECEGLIAAARVRLARSLTVETRTGGEVLNVDRTSEGMFFERGENDIV 151

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMATL 198
             +E+RIA     P+E GEG+Q+L Y  G +Y  HYDYF   E  T    K GGQR+ATL
Sbjct: 152 ARLEQRIAALLRWPVEFGEGLQILRYAPGAQYRPHYDYFDPGEPGTPTILKRGGQRVATL 211

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL +  +GG T FP                     GL V P RG  + F    PD   
Sbjct: 212 VMYLQEPGQGGATTFPDV-------------------GLEVAPVRGTGVFFSYEEPDPAT 252

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
              +LHGG PV+ G KW ATKW+   E+K
Sbjct: 253 --RTLHGGAPVLAGEKWVATKWLREREFK 279


>gi|398804098|ref|ZP_10563100.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
 gi|398094921|gb|EJL85274.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
          Length = 277

 Score =  148 bits (373), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 89/224 (39%), Positives = 125/224 (55%), Gaps = 30/224 (13%)

Query: 71  EKGEQWTEIVAWE--PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVR 128
           + G++W  +      P  +V+ N LS AECE LI  A+  + +S  VD +TG  + +  R
Sbjct: 75  DAGDKWVTVREHRSAPELWVFDNLLSAAECEALIAAAESRLARSLTVDIRTGGEELNHDR 134

Query: 129 TSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFN 187
           TS G F  RG++ +IR IE RIA     P+++GEG+QVL Y  G +Y  HYDYF   E  
Sbjct: 135 TSHGMFYTRGENEVIRRIEARIARLLNWPVQNGEGLQVLRYRRGAEYKPHYDYFDPGEPG 194

Query: 188 T----KNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKR 243
           T    + GGQR+A+L+MYL +  EGG TVFP                     GL V+P++
Sbjct: 195 TAAILRRGGQRVASLIMYLREPGEGGATVFP-------------------DIGLKVRPQQ 235

Query: 244 GDALLF-WSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           G A+ F +++   A+L   +LHGG PV  G KW ATKW+   E+
Sbjct: 236 GSAVFFSYALAHPASL---TLHGGEPVKSGEKWIATKWLREREF 276


>gi|148653656|ref|YP_001280749.1| procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
 gi|148572740|gb|ABQ94799.1| Procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
          Length = 268

 Score =  147 bits (372), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 85/212 (40%), Positives = 114/212 (53%), Gaps = 24/212 (11%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V ++P   V ++FLS  EC+ LI  A   +  S VVD + G   +   RTS+ T   RG+
Sbjct: 75  VCYKPFVTVINDFLSPEECDALISDADQKLKASRVVDPEDGSFVEHSARTSTSTGYHRGE 134

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-----LDEFNTKNGGQR 194
             II+ IE RIAD    P++HGEG+QVL YE G +Y  H+D+F          TK GGQR
Sbjct: 135 IDIIKTIEARIADLINWPVDHGEGLQVLRYEDGGEYRPHFDFFDPAKKSSRLVTKQGGQR 194

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           + T LMYLS+V+ GG T FP  N NF                  ++P +G AL F +   
Sbjct: 195 VGTFLMYLSEVDSGGSTRFP--NLNF-----------------EIRPNKGSALYFANTNL 235

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            A ++P +LH G PV  G K+ ATKW+    Y
Sbjct: 236 KAEIEPLTLHAGMPVTEGVKYLATKWLREKPY 267


>gi|319763870|ref|YP_004127807.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
 gi|330823866|ref|YP_004387169.1| procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
 gi|317118431|gb|ADV00920.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans BC]
 gi|329309238|gb|AEB83653.1| Procollagen-proline dioxygenase [Alicycliphilus denitrificans K601]
          Length = 284

 Score =  147 bits (372), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 83/211 (39%), Positives = 117/211 (55%), Gaps = 30/211 (14%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  ++ N LS  EC+ +I+ A+  M +S  V + +G  + ++ RTS G F +RG++  +
Sbjct: 97  PRVVLFGNLLSPEECQAVIEAARTRMARSLTVQAASGGEEVNKDRTSDGMFFQRGENEAV 156

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMATL 198
             +E+RIA     P+E+GEG+QVLHY  G +Y  HYDYF   E  T    + GGQR+ATL
Sbjct: 157 ARLEERIARLVRWPVENGEGLQVLHYRPGAEYKPHYDYFDPAEPGTPRLLRRGGQRVATL 216

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+D   GG T FP                      L + P++G+A+ F   R     
Sbjct: 217 VIYLNDPVRGGGTTFPDVP-------------------LEIGPRQGNAVFFSYGR----A 253

Query: 259 DPSS--LHGGCPVIRGNKWSATKWMHVHEYK 287
            PSS  LHGG PVI G KW ATKW+   E+K
Sbjct: 254 HPSSRTLHGGAPVIEGEKWIATKWLREREFK 284


>gi|339327280|ref|YP_004686973.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
 gi|338167437|gb|AEI78492.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
          Length = 297

 Score =  147 bits (371), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 77/210 (36%), Positives = 113/210 (53%), Gaps = 24/210 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P+  ++   L+  EC+ L+ L++  + +S VV+  TG       RTS G   +  +  +I
Sbjct: 105 PQVQLFQQLLTDDECDALVALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQVAEHALI 164

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKN-----GGQRMATL 198
             IE RIA  T +P EHGEG+Q+L+Y+ G +Y  H+DYF  +   +      GGQR+ATL
Sbjct: 165 ARIEARIAAVTGVPAEHGEGLQILNYKPGGEYQPHFDYFNPQRPGEARQLSVGGQRIATL 224

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+  E GG T FP                   + GL V P +G+A+ F  + PD TL
Sbjct: 225 VIYLNTPEAGGATAFP-------------------RVGLEVAPVKGNAVYFSYLLPDGTL 265

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           D  +LH G PV  G KW ATKW+    Y++
Sbjct: 266 DERTLHAGLPVASGEKWIATKWLRERPYRS 295


>gi|430808003|ref|ZP_19435118.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
 gi|429499635|gb|EKZ98045.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
          Length = 293

 Score =  147 bits (371), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 79/210 (37%), Positives = 113/210 (53%), Gaps = 24/210 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  +  N L  AEC+ ++ LA+  + +S VV+  TG       RTS G   + G+  ++
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPVVNPDTGDENLIDARTSMGAMFQVGEHALL 160

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-----DEFNTKNGGQRMATL 198
           + IE RIA  T  P+EHGEG QVL+Y+ G +Y  H+D+F      +    + GGQR+AT+
Sbjct: 161 QRIEARIAAVTGWPVEHGEGFQVLNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRVATM 220

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+    GG T FP                   + GL V P +G+A+LF    PD  L
Sbjct: 221 VIYLNSPASGGATAFP-------------------RIGLEVAPVKGNAVLFSYGLPDGAL 261

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           D  +LH G PV  G KW ATKW+  H Y++
Sbjct: 262 DERTLHAGLPVEAGEKWIATKWLREHPYRS 291


>gi|385205097|ref|ZP_10031967.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
 gi|385184988|gb|EIF34262.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
          Length = 292

 Score =  147 bits (371), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 79/208 (37%), Positives = 117/208 (56%), Gaps = 24/208 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P+  V+ + LS  EC  +I+ ++  + +ST V+  TG+    R RTS G + +RG+D  I
Sbjct: 103 PQMIVFADVLSPDECAEMIERSRHRLKRSTTVNPATGKEDVIRNRTSEGIWYQRGEDPFI 162

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE-----FNTKNGGQRMATL 198
             +++RI+     P+E+GEG+Q+L Y    +Y  H+DYF  +      +T  GGQR+ATL
Sbjct: 163 ERMDRRISSLMNWPVENGEGLQLLRYGTTGEYRPHFDYFPPDQPGSTVHTAQGGQRVATL 222

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+DV +GGET+FP A                   G+SV   +G A+ F  M     L
Sbjct: 223 VIYLNDVPDGGETIFPEA-------------------GMSVAASQGGAVYFRYMNGRRQL 263

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           DP +LHGG PV+ G+KW  TKWM    Y
Sbjct: 264 DPLTLHGGAPVLSGDKWIMTKWMRERAY 291


>gi|94312029|ref|YP_585239.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
 gi|93355881|gb|ABF09970.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
          Length = 293

 Score =  147 bits (370), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 79/210 (37%), Positives = 113/210 (53%), Gaps = 24/210 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  +  N L  AEC+ ++ LA+  + +S VV+  TG       RTS G   + G+  ++
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPVVNPDTGDENLIDARTSMGAMFQVGEHALL 160

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-----DEFNTKNGGQRMATL 198
           + IE RIA  T  P+EHGEG QVL+Y+ G +Y  H+D+F      +    + GGQR+AT+
Sbjct: 161 QRIEARIAAVTGWPVEHGEGFQVLNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRVATM 220

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+    GG T FP                   + GL V P +G+A+LF    PD  L
Sbjct: 221 VIYLNSPASGGATAFP-------------------RIGLEVAPVKGNAVLFSYGLPDGAL 261

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           D  +LH G PV  G KW ATKW+  H Y++
Sbjct: 262 DERTLHAGLPVEAGEKWIATKWLREHPYRS 291


>gi|406665340|ref|ZP_11073114.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
 gi|405387266|gb|EKB46691.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
          Length = 211

 Score =  147 bits (370), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 80/204 (39%), Positives = 114/204 (55%), Gaps = 25/204 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP    + N LS  EC+ LID A      S +  SK  + + S +RTSSG F + 
Sbjct: 24  EVLHEEPLIVKFLNVLSDEECQNLIDCAS-----SRLERSKLAKKEISSIRTSSGMFFEE 78

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMAT 197
            ++ +I  IEKRI+    +P+EH EG+QVLHYE GQ++ AH+D+F    +  +   R++T
Sbjct: 79  NENPLISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKAHFDFF-GPNHPSSSNNRIST 137

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           L++YL+DVEEGG T FP                     G+   PK+G A+ F     D  
Sbjct: 138 LVVYLNDVEEGGVTTFPNL-------------------GIVNVPKKGTAVYFEYFYNDQK 178

Query: 258 LDPSSLHGGCPVIRGNKWSATKWM 281
           L+  +LH G PVI+G KW AT+WM
Sbjct: 179 LNELTLHSGEPVIQGEKWVATQWM 202


>gi|194290782|ref|YP_002006689.1| prolyl 4-hydroxylase subunit alpha [Cupriavidus taiwanensis LMG
           19424]
 gi|193224617|emb|CAQ70628.1| putative Prolyl 4-hydroxylase alpha subunit [Cupriavidus
           taiwanensis LMG 19424]
          Length = 296

 Score =  146 bits (369), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 77/210 (36%), Positives = 113/210 (53%), Gaps = 24/210 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P+  ++   LS  EC+ L+ L++  + +S VV+  TG       RTS G   +  +  +I
Sbjct: 104 PQVQLFQQLLSDDECDALVALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQVAEHALI 163

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKN-----GGQRMATL 198
             IE RIA  T +P +HGEG+Q+L+Y+ G +Y  H+DYF  +   +      GGQR+ATL
Sbjct: 164 ARIEARIAAVTGVPADHGEGLQILNYKPGGEYQPHFDYFNPQRPGEARQLSVGGQRIATL 223

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+  E GG T FP                   + GL V P +G+A+ F  + PD TL
Sbjct: 224 VIYLNTPEAGGATAFP-------------------RVGLEVAPVKGNAVYFSYLLPDGTL 264

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           D  +LH G PV  G KW ATKW+    Y++
Sbjct: 265 DDRTLHAGLPVAAGEKWIATKWLRERPYRS 294


>gi|319786559|ref|YP_004146034.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
 gi|317465071|gb|ADV26803.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
          Length = 289

 Score =  146 bits (368), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 82/203 (40%), Positives = 111/203 (54%), Gaps = 26/203 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V    LS  EC+ L++L++P + +ST VD++TG S+    RTS GTF +RG   + 
Sbjct: 102 PRVVVLGGLLSDEECDALVELSRPRLRRSTTVDAQTGGSQVHADRTSRGTFFERGAHPVC 161

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMATL 198
             IE RIA     P+E+GEG+QVLHY  G ++  HYDYF  DE       + GGQR+AT+
Sbjct: 162 ATIEARIARLLEWPVENGEGLQVLHYPPGAEFRPHYDYFDPDEPGAEVLLRQGGQRVATV 221

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL+    GG T FP A+    +V                   +G+A+ F   RP    
Sbjct: 222 VMYLNTPARGGATTFPDAHLEVAAV-------------------KGNAVFFSYDRPHPMT 262

Query: 259 DPSSLHGGCPVIRGNKWSATKWM 281
              +LHGG PV  G KW ATKW+
Sbjct: 263 --RTLHGGAPVTEGEKWIATKWL 283


>gi|294499597|ref|YP_003563297.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
 gi|294349534|gb|ADE69863.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
          Length = 219

 Score =  145 bits (367), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 81/200 (40%), Positives = 114/200 (57%), Gaps = 26/200 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI L+K  M +S     K G +++ + +RTSSG F +  ++ 
Sbjct: 38  EPLVLVLGNVLSNEECDELIQLSKDKMQRS-----KIGAAREVNSIRTSSGMFFEESENE 92

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           ++  IE+R++      +E+ EG+QVL Y   Q+Y AH+DYF    +  +   R++TL+MY
Sbjct: 93  LVHQIERRLSKIMGPSIEYAEGLQVLKYLPDQEYKAHHDYFTSA-SKASKNNRISTLVMY 151

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP                   K GLSV P +G A+ F     DA L+  
Sbjct: 152 LNDVEEGGETYFP-------------------KLGLSVSPTKGMAVYFEYFYSDAELNDR 192

Query: 262 SLHGGCPVIRGNKWSATKWM 281
           +LHGG PVI+G KW AT+WM
Sbjct: 193 TLHGGAPVIKGEKWVATQWM 212


>gi|340357957|ref|ZP_08680560.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
 gi|339616017|gb|EGQ20677.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
          Length = 211

 Score =  145 bits (367), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 114/199 (57%), Gaps = 25/199 (12%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           EP   V  N LS  EC+ LI LA   + +S +  ++    +++ +RTSS  F++  ++ I
Sbjct: 32  EPLIVVLGNVLSDEECDELIQLAGDKVKRSKIGTTR----EENELRTSSSMFIEDDENLI 87

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           +  ++KRI+    IPMEHGEG+Q+L Y  GQ+Y AH+D+F  +    N   R++TL+MYL
Sbjct: 88  VTRVKKRISAIMKIPMEHGEGLQILRYTPGQQYKAHHDFFSSDSKITNN--RISTLVMYL 145

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           +DVE+GGET FP                       SV P++G A+ F     D TL+  +
Sbjct: 146 NDVEQGGETFFPHLK-------------------FSVSPRKGMAVYFEYFYSDQTLNDFT 186

Query: 263 LHGGCPVIRGNKWSATKWM 281
           LHGG PV+ G KW AT+WM
Sbjct: 187 LHGGAPVVEGEKWVATQWM 205


>gi|89096248|ref|ZP_01169141.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
           NRRL B-14911]
 gi|89089102|gb|EAR68210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
           NRRL B-14911]
          Length = 217

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 76/199 (38%), Positives = 118/199 (59%), Gaps = 24/199 (12%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           EP   +  N LS  ECE LI +++  + +S + +++T       +RTSS  F + G++ +
Sbjct: 38  EPLIVILGNVLSDEECEGLIRMSEDKLKRSKIGNTRTVDD----IRTSSSMFFEEGENEL 93

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           +  IE+R++    IP+EHGEG+Q+L+Y +GQ+Y AH+D+F       +   R++TL+MYL
Sbjct: 94  VARIERRLSQIMNIPVEHGEGLQMLNYHIGQEYKAHFDFFSSSSRAAS-NPRISTLVMYL 152

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           +DVEEGGET FP  N                    SV P++G A+ F     +  L+  +
Sbjct: 153 NDVEEGGETYFPKLN-------------------FSVNPQKGSAVYFEYFYDNQDLNDLT 193

Query: 263 LHGGCPVIRGNKWSATKWM 281
           LHGG PVI+G+KW+AT+WM
Sbjct: 194 LHGGAPVIKGSKWAATQWM 212


>gi|386712780|ref|YP_006179102.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
           2266]
 gi|384072335|emb|CCG43825.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
           2266]
          Length = 211

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 81/204 (39%), Positives = 115/204 (56%), Gaps = 26/204 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P   +  N +S+ ECE LI L+K  M +S +      Q + S +RTSS TFL   +D + 
Sbjct: 34  PLIAILGNVVSEEECEELIFLSKNKMNRSKI----GSQHEVSDIRTSSSTFLP--EDDLT 87

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLS 203
             IEKR+A    +P+EHGEG+ +L+Y+ GQ+Y AHYDYF  +    N   R++TL++YL+
Sbjct: 88  NRIEKRVAQIMNVPVEHGEGLHILNYKQGQEYKAHYDYFRSKAKAAN-NPRISTLVLYLN 146

Query: 204 DVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSL 263
           DVEEGGET FP  N                   LS+ P +G A+ F     D  ++  +L
Sbjct: 147 DVEEGGETYFPHMN-------------------LSISPHKGMAVYFEYFYSDPLINERTL 187

Query: 264 HGGCPVIRGNKWSATKWMHVHEYK 287
           HGG PV  G KW+AT W+   +Y+
Sbjct: 188 HGGSPVTSGEKWAATMWVRRKQYR 211


>gi|430751569|ref|YP_007214477.1| 2OG-Fe(II) oxygenase [Thermobacillus composti KWC4]
 gi|430735534|gb|AGA59479.1| 2OG-Fe(II) oxygenase superfamily enzyme [Thermobacillus composti
           KWC4]
          Length = 215

 Score =  145 bits (366), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 79/203 (38%), Positives = 111/203 (54%), Gaps = 25/203 (12%)

Query: 79  IVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRG 138
           ++  EP    +   LS  EC  LI+ A P + +S +V+        S +RTS G F +  
Sbjct: 25  VLHQEPLIVRFERLLSDDECRQLIETAAPRLKESKLVNKVV-----SDIRTSRGMFFEEE 79

Query: 139 QDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATL 198
           +   I  IE+RIA    +P+EH EG+QVLHY  GQ+Y AH+D+F    +      R++TL
Sbjct: 80  ESPFIHRIERRIAQLMNVPIEHAEGLQVLHYGPGQEYKAHHDFFAPG-SPAARNNRISTL 138

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+DVEEGGETVFP                     G+++KPKRG AL F     +  L
Sbjct: 139 IVYLNDVEEGGETVFPLL-------------------GIAMKPKRGAALYFEYFYRNQAL 179

Query: 259 DPSSLHGGCPVIRGNKWSATKWM 281
           +  +LH   PV+RG KW AT+WM
Sbjct: 180 NDLTLHSSVPVVRGEKWVATQWM 202


>gi|418523362|ref|ZP_13089380.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
 gi|410699993|gb|EKQ58573.1| hypothetical protein WS7_20388 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
          Length = 286

 Score =  145 bits (365), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 83/203 (40%), Positives = 109/203 (53%), Gaps = 26/203 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V   FLS  EC+ LI LA+P + +S  VD+  G+      RTS G  L+ GQD + 
Sbjct: 96  PRVVVLGGFLSDGECDALIALARPRLARSRTVDNANGEHLVHAARTSDGMCLRVGQDALC 155

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMATL 198
           + IE RIA     P++HGEG+QVL Y  G +Y  HYDYF  D   T    + GGQR+A+L
Sbjct: 156 QRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAVGTPILLQAGGQRVASL 215

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL+  E GG T FP A+ +  +V                   +G+A+ F   RP    
Sbjct: 216 VMYLNTPERGGATRFPDAHLDVAAV-------------------KGNAVFFSYDRPHPMT 256

Query: 259 DPSSLHGGCPVIRGNKWSATKWM 281
              SLH G PV+ G KW ATKW+
Sbjct: 257 --RSLHAGAPVLAGEKWVATKWL 277


>gi|239816557|ref|YP_002945467.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
 gi|239803134|gb|ACS20201.1| Procollagen-proline dioxygenase [Variovorax paradoxus S110]
          Length = 296

 Score =  144 bits (364), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 116/206 (56%), Gaps = 24/206 (11%)

Query: 88  VYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIE 147
           V  +  S  ECE LI LA+P +  ST VD  TG+++    R+S G F +  ++  +  ++
Sbjct: 103 VLSDVFSAEECEALIALARPRLAPSTSVDPLTGRNRLGAQRSSLGMFFRLRENAFVARLD 162

Query: 148 KRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-----DEFNTKNGGQRMATLLMYL 202
           +R+++   +P+E+GEG+QVLHY  G +   H+D+ +     ++ + +  GQR++TL+ YL
Sbjct: 163 ERLSELMNLPVENGEGLQVLHYPAGAQSLPHFDFLVPSNAANQASLQRSGQRVSTLVAYL 222

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           ++VEEGGETVFP                   + G SV P+RG A+ F        +D +S
Sbjct: 223 NEVEEGGETVFP-------------------ETGWSVSPQRGGAVYFEYCNSLGQVDHAS 263

Query: 263 LHGGCPVIRGNKWSATKWMHVHEYKA 288
           LH G PV+ G KW ATKWM    + A
Sbjct: 264 LHAGAPVLSGEKWVATKWMRQRRFVA 289


>gi|325925807|ref|ZP_08187179.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
           91-118]
 gi|325543793|gb|EGD15204.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
           91-118]
          Length = 286

 Score =  144 bits (364), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 82/203 (40%), Positives = 110/203 (54%), Gaps = 26/203 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V   FLS  EC+ LI LA+P++ +S  VD+  G+      RTS    L+ GQD + 
Sbjct: 96  PRVVVLGGFLSDEECDALIALARPHLARSRTVDNANGEHVVHAARTSDSMCLRLGQDALC 155

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMATL 198
           + IE RIA     P++HGEG+QVL Y  G +Y  HYDYF  D   T    + GGQR+A+L
Sbjct: 156 QRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLVQAGGQRVASL 215

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL+  E GG T FP A+ +  +V                   +G+A+ F   RP    
Sbjct: 216 VMYLNTPERGGATRFPDAHLDVAAV-------------------KGNAVFFSYDRPHPMT 256

Query: 259 DPSSLHGGCPVIRGNKWSATKWM 281
              SLH G PV+ G+KW ATKW+
Sbjct: 257 --RSLHAGAPVLAGDKWVATKWL 277


>gi|428170517|gb|EKX39441.1| hypothetical protein GUITHDRAFT_114401 [Guillardia theta CCMP2712]
          Length = 322

 Score =  144 bits (364), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 91/287 (31%), Positives = 144/287 (50%), Gaps = 37/287 (12%)

Query: 16  TLTLVLSMLFMLTI---VLLMLLAMGIFYIPIGDDDSPPNDLTSFRRRAFEKRSSIAEEK 72
           +L L    LF++ +   VL MLL     ++     D+  +   S  RR+      +A  K
Sbjct: 15  SLRLYSGFLFLVIVGYAVLSMLLQS--LWMTGPKSDALLSKAPSLERRSMTNLGGMA--K 70

Query: 73  GEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLA-KPYMVKSTVVDSKTGQSKDSRVRTSS 131
              W E V+ +PR F+ HN L++ EC++L+ LA +  +  S +    T +  +S  RT+ 
Sbjct: 71  KSTWIETVSVDPRIFIVHNLLTEEECDHLVSLALQKGLSASLITPYGTNKLVESTTRTNK 130

Query: 132 GTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEF----N 187
             +L   QD +++ +E +IA  T    E GE +QVLHY   Q++  H+DYF        N
Sbjct: 131 QAWLDFQQDDVVKRVEDKIAKLTKTTPEQGENLQVLHYAKSQQFTEHHDYFDPATDPPEN 190

Query: 188 TKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDAL 247
            + GG R+ T+++YL   EEGGET F AAN   T+                    +GDA+
Sbjct: 191 YEKGGNRLITVIVYLQAAEEGGETHFGAANLKLTAA-------------------KGDAV 231

Query: 248 LFWSMRPDA------TLDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           +F++++          +D  +LH G P I+G KW ATKW+H   Y++
Sbjct: 232 MFYNLKHGCDGIDPTCVDKQTLHAGLPPIKGEKWVATKWIHERGYQS 278


>gi|393200372|ref|YP_006462214.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
 gi|327439703|dbj|BAK16068.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
          Length = 211

 Score =  144 bits (363), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 79/204 (38%), Positives = 112/204 (54%), Gaps = 25/204 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP    + N LS  EC+ LID A      S +  SK  + + S +RTSSG F + 
Sbjct: 24  EVLHEEPLIVKFLNVLSDEECQNLIDCAS-----SRLERSKLAKKEISSIRTSSGMFFEE 78

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMAT 197
            ++ +I  IEKRI+    +P+EH EG+QVLHYE GQ++  H+D+F    +  +   R+ T
Sbjct: 79  NENPLISEIEKRISSLMHLPIEHAEGLQVLHYEPGQEFKPHFDFFGPN-HPSSSNNRICT 137

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           L++YL+DVEEGG T FP                     G+   PK+G A+ F     D  
Sbjct: 138 LVVYLNDVEEGGVTTFPNL-------------------GIVNVPKKGTAVYFEYFYNDQK 178

Query: 258 LDPSSLHGGCPVIRGNKWSATKWM 281
           L+  +LH G PVI+G KW AT+WM
Sbjct: 179 LNELTLHSGEPVIQGEKWVATQWM 202


>gi|384046522|ref|YP_005494539.1| prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
 gi|345444213|gb|AEN89230.1| Prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
          Length = 219

 Score =  144 bits (363), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 79/200 (39%), Positives = 113/200 (56%), Gaps = 26/200 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI L+K  M +S     K G  ++ + +RTSSG F +  ++ 
Sbjct: 38  EPLVLVLGNVLSNEECDELIQLSKDKMQRS-----KIGAEREVNSIRTSSGMFFEESENE 92

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           ++  IE+R++      +E+ EG+Q+L Y   Q+Y AH+DYF    +  +   R++TL+MY
Sbjct: 93  LVHQIERRLSKIMGPSIEYAEGLQILKYLPDQEYKAHHDYFTSA-SKASKNNRISTLVMY 151

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP                   K GLS+ P +G A+ F     DA L+  
Sbjct: 152 LNDVEEGGETYFP-------------------KLGLSISPTKGMAVYFEYFYSDAELNDR 192

Query: 262 SLHGGCPVIRGNKWSATKWM 281
           +LHGG PVI+G KW AT+WM
Sbjct: 193 TLHGGAPVIKGEKWVATQWM 212


>gi|205374182|ref|ZP_03226981.1| prolyl 4-hydroxylase alpha subunit [Bacillus coahuilensis m4-4]
          Length = 210

 Score =  144 bits (363), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 115/199 (57%), Gaps = 26/199 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           EP   V  N LS  EC+ LI L+K  M +S     K   ++++ +RTS+  FL      +
Sbjct: 32  EPFVAVLGNVLSDEECDELISLSKDRMNRS-----KIAGNQENDIRTSTSVFLPEDASEV 86

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           ++ +EKRI+    IP+EHGEG+Q+L+Y++GQ+Y AH+D+F  +   +N   R++TL++YL
Sbjct: 87  VQRVEKRISQIMNIPVEHGEGLQLLNYQIGQEYKAHFDFFSPKKLIEN--PRISTLVLYL 144

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           +DVEEGG+T FP                      LSV P +G A+ F     D  L+  +
Sbjct: 145 NDVEEGGDTYFPNLK-------------------LSVSPHKGMAVYFEYFYDDPMLNELT 185

Query: 263 LHGGCPVIRGNKWSATKWM 281
           LHGG PV  G+KW+AT WM
Sbjct: 186 LHGGAPVTIGDKWAATMWM 204


>gi|3297815|emb|CAA19873.1| putative protein [Arabidopsis thaliana]
 gi|7270340|emb|CAB80108.1| putative protein [Arabidopsis thaliana]
          Length = 257

 Score =  144 bits (363), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 73/185 (39%), Positives = 115/185 (62%), Gaps = 7/185 (3%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR-VRTSSGTFLK 136
           ++++W PRA  + NF +  +C+ +I+ AK  +  S +   K   +++++  RTSSGTF+ 
Sbjct: 28  QVLSWRPRAIYFPNFATAEQCQAIIERAKVNLKPSALALRKGETAENTKGTRTSSGTFIS 87

Query: 137 RGQDRI--IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
             ++    +  +E++IA  T IP  HGE   +L YE+GQKYD+HYD F          QR
Sbjct: 88  ASEESTGALDFVERKIARATMIPRSHGESFNILRYELGQKYDSHYDVFNPTEYGPQSSQR 147

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +A+ L+YLSDVEEGGET+FP  N +   + +  +  +C   GL VKP++GD LLF+S+ P
Sbjct: 148 IASFLLYLSDVEEGGETMFPFENGSNMGIGY--DYKQC--IGLKVKPRKGDGLLFYSVFP 203

Query: 255 DATLD 259
           + T+D
Sbjct: 204 NGTID 208


>gi|357135725|ref|XP_003569459.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like isoform 1
           [Brachypodium distachyon]
          Length = 303

 Score =  144 bits (363), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 80/204 (39%), Positives = 119/204 (58%), Gaps = 19/204 (9%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           +AW PR F+Y  FLS  EC++L+ +A+  +  S +V++       +R  T + T      
Sbjct: 63  LAWHPRVFLYEGFLSGMECDHLVYVARLNIESSLLVNAG------ARNITQNST-----D 111

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++  IE RI+ ++FIP EHGE +Q+L Y   Q      D+  D   + +GG R+ T+L
Sbjct: 112 DIVVSKIEDRISLWSFIPKEHGESMQILKYGSNQS-----DHNKDGTQSSSGGNRLVTIL 166

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLSDV++GGETVFP +    T  +    LSEC   G +VKP +GDA+L +++RPD   D
Sbjct: 167 MYLSDVKQGGETVFPRSELKDTQAKE-GALSECA--GYAVKPVKGDAILLFNLRPDGVTD 223

Query: 260 PSSLHGGCPVIRGNKWSATKWMHV 283
             S +  C V+ G KW A K +H+
Sbjct: 224 SDSHYEDCSVLEGEKWLAIKHLHI 247


>gi|295704991|ref|YP_003598066.1| 2OG-Fe(II) oxygenase [Bacillus megaterium DSM 319]
 gi|294802650|gb|ADF39716.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium DSM 319]
          Length = 219

 Score =  144 bits (363), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 80/200 (40%), Positives = 113/200 (56%), Gaps = 26/200 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   V  N LS  EC+ LI L+K  M +S     K G +++ + +RTSSG F    ++ 
Sbjct: 38  EPLVLVLGNVLSNEECDELIRLSKDKMQRS-----KIGAAREVNSIRTSSGMFFDESENE 92

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           ++  IE+R++      +E+ EG+Q+L Y   Q+Y AH+DYF    +  +   R++TL+MY
Sbjct: 93  LVHQIERRLSKIMGPSIEYAEGLQILKYLPDQEYKAHHDYFTSA-SKASKNNRISTLVMY 151

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP                   K GLSV P +G A+ F     DA L+  
Sbjct: 152 LNDVEEGGETYFP-------------------KLGLSVSPTKGMAVYFEYFYSDAELNDR 192

Query: 262 SLHGGCPVIRGNKWSATKWM 281
           +LHGG PVI+G KW AT+WM
Sbjct: 193 TLHGGAPVIKGEKWVATQWM 212


>gi|346723630|ref|YP_004850299.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346648377|gb|AEO41001.1| hypothetical protein XACM_0696 [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 286

 Score =  144 bits (363), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 82/203 (40%), Positives = 110/203 (54%), Gaps = 26/203 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V   FLS  EC+ LI LA+P++ +S  VD+  G+      RTS    L+ GQD + 
Sbjct: 96  PRVVVLGGFLSDEECDALIALAQPHLARSRTVDNANGEHVVHAARTSDSMCLRLGQDALC 155

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMATL 198
           + IE RIA     P++HGEG+QVL Y  G +Y  HYDYF  D   T    + GGQR+A+L
Sbjct: 156 QRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLVQAGGQRVASL 215

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL+  E GG T FP A+ +  +V                   +G+A+ F   RP    
Sbjct: 216 VMYLNTPERGGATRFPDAHLDVAAV-------------------KGNAVFFSYDRPHPMT 256

Query: 259 DPSSLHGGCPVIRGNKWSATKWM 281
              SLH G PV+ G+KW ATKW+
Sbjct: 257 --RSLHAGAPVLAGDKWVATKWL 277


>gi|251794605|ref|YP_003009336.1| procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
 gi|247542231|gb|ACS99249.1| Procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
          Length = 209

 Score =  144 bits (363), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 82/206 (39%), Positives = 111/206 (53%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP   +  N LS AEC+ LIDLA   M ++     K G S D S VRTSS  F +  ++ 
Sbjct: 31  EPLILILDNVLSWAECDLLIDLASARMQRA-----KIGSSHDVSEVRTSSSMFFEESENE 85

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
            I  +E R+A+   IP+ H E +QVL Y+ G++Y  H+DYF    +  N   R++TL+MY
Sbjct: 86  CIGQVEARVAELMNIPVSHAEPLQVLRYQPGEQYHPHFDYFTQGSSMNN---RISTLVMY 142

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP+ +                    SV PK+G A+ F     D  L+  
Sbjct: 143 LNDVEEGGETYFPSLH-------------------FSVTPKKGSAVYFEYFYNDTRLNEL 183

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LH G PV  G KW AT+WM    Y+
Sbjct: 184 TLHAGHPVEAGEKWVATQWMRRQRYR 209


>gi|159462456|ref|XP_001689458.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158283446|gb|EDP09196.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 221

 Score =  144 bits (362), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 83/201 (41%), Positives = 108/201 (53%), Gaps = 31/201 (15%)

Query: 87  FVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGI 146
            VYHNFLS  EC ++IDLA   M +STVV SK     D  +RTS GTFL+R  D +I  I
Sbjct: 1   MVYHNFLSDRECRHIIDLAHAQMKRSTVVGSKNAGVVDD-IRTSYGTFLRRVPDPVIAAI 59

Query: 147 EKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVE 206
           E R+A ++ +P  H E +QVL Y    KY  H D          G +R+AT+L+YL   E
Sbjct: 60  EHRLALWSHLPASHQEDMQVLRYGPTNKYGPHID----------GLERVATVLIYLGQAE 109

Query: 207 EGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD-ATLDPSSLHG 265
                                 LS+C +  ++ KPKRGDAL+F+   PD    D  S+H 
Sbjct: 110 RA-------------------NLSQCARGRVAYKPKRGDALMFFDTMPDYKQTDVHSMHT 150

Query: 266 GCPVIRGNKWSATKWMHVHEY 286
           GCPV+ G KW+A KW+H   Y
Sbjct: 151 GCPVVEGVKWNAVKWLHGTPY 171


>gi|433460968|ref|ZP_20418587.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
 gi|432190746|gb|ELK47751.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
          Length = 211

 Score =  144 bits (362), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 83/206 (40%), Positives = 112/206 (54%), Gaps = 28/206 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQDR 141
           EP+  +  N +S+ ECE LI L+K       V  SK G   D S +RTSS  FL    D 
Sbjct: 33  EPKIAILGNVVSEEECEALIRLSK-----DKVNRSKIGSDHDVSDIRTSSSAFLP--DDE 85

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           +   IEKR+A    +P+EHGEGI +LHY+ GQ+Y AH+DYF           R++TL++Y
Sbjct: 86  LTGRIEKRLAQIMNVPVEHGEGIHILHYKPGQEYKAHHDYFRSTSRAAK-NPRISTLVLY 144

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+DVEEGGET FP  N                   L+V P +G A+ F     D  ++  
Sbjct: 145 LNDVEEGGETYFPEMN-------------------LTVSPHKGMAVYFEYFYNDPAINER 185

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           +LHGG PV  G KW+AT W+   +Y+
Sbjct: 186 TLHGGSPVTAGEKWAATMWVRRQQYR 211


>gi|224001336|ref|XP_002290340.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220973762|gb|EED92092.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 483

 Score =  143 bits (361), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 88/239 (36%), Positives = 123/239 (51%), Gaps = 24/239 (10%)

Query: 59  RRAFEKRSSIAEEKGEQWT-EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDS 117
           RR  E   S + +  +Q T E ++  P       FLS  EC+Y+ ++A P +  S+V   
Sbjct: 242 RRTVELDPSASSKSQKQVTIETLSLRPLVVSVEGFLSDEECDYIAEIASPQVKYSSVSLK 301

Query: 118 KTGQSKDSRV-RTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYD 176
              + KDS   RTS   FL    D ++  I+ R+A  T IP  H E +QVL Y  G+KYD
Sbjct: 302 DADKGKDSSEWRTSQSAFLSARDDEVLTEIDHRVASLTRIPRNHQEYVQVLRYGAGEKYD 361

Query: 177 AHYDYF-------------LDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSV 223
           +H+DYF             L E   KN   R AT+  YL+DV +GGET+FP     +   
Sbjct: 362 SHHDYFDPSAYRSDKSTLRLIENGKKN---RYATVFWYLTDVHDGGETIFP----RYGGA 414

Query: 224 RWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGN-KWSATKWM 281
                  +C   GL VKP++G  ++F+S+     +DP SLHG CPV   N KW+A KW+
Sbjct: 415 PAPRSHKDC-SIGLKVKPQKGKVVIFYSLDASGEMDPFSLHGACPVGENNLKWAANKWI 472


>gi|113869198|ref|YP_727687.1| prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
 gi|113527974|emb|CAJ94319.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
          Length = 297

 Score =  143 bits (361), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 76/210 (36%), Positives = 112/210 (53%), Gaps = 24/210 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P+  ++   L+  EC+ L+ L++  + +S VV+  TG       RTS G   +  +  +I
Sbjct: 105 PQVQLFQQLLTDDECDALVALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQVAEHPLI 164

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKN-----GGQRMATL 198
             IE RIA  T +P EHGEG+Q+L+Y+ G +Y  H+DYF  +   +      GGQR+ATL
Sbjct: 165 TRIEARIAAVTGVPAEHGEGLQILNYKPGGEYQPHFDYFNPQRPGEARQLSVGGQRIATL 224

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+  E GG T FP                   + GL V P +G+A+ F  + PD  L
Sbjct: 225 VIYLNTPEAGGATAFP-------------------RVGLEVAPVKGNAVYFSYLLPDGAL 265

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           D  +LH G PV  G KW ATKW+    Y++
Sbjct: 266 DERTLHAGLPVAFGEKWIATKWLRERPYRS 295


>gi|253575459|ref|ZP_04852796.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
 gi|251845106|gb|EES73117.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
          Length = 215

 Score =  143 bits (361), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 78/199 (39%), Positives = 107/199 (53%), Gaps = 25/199 (12%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           EP    +   L+  EC  LI+ A P + +S +V+        S +RTS G F +  ++  
Sbjct: 29  EPLIMRFERLLTDDECRQLIEAAAPRLRESKLVNKVV-----SEIRTSRGMFFEEEENPF 83

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           I  IEKRI+    +P+EH EG+QVLHY  GQ+Y AHYD+F    +      R++TL++YL
Sbjct: 84  IHRIEKRISALMNVPIEHAEGLQVLHYGPGQEYQAHYDFFGPN-SPSASNNRISTLIIYL 142

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           +DVE GGETVFP  +                   L VKP+RG AL F        L+  +
Sbjct: 143 NDVEAGGETVFPLLD-------------------LEVKPERGSALYFEYFYRQQELNNLT 183

Query: 263 LHGGCPVIRGNKWSATKWM 281
           LH   PV+RG KW AT+WM
Sbjct: 184 LHSSVPVVRGEKWVATQWM 202


>gi|428175714|gb|EKX44602.1| hypothetical protein GUITHDRAFT_71994 [Guillardia theta CCMP2712]
          Length = 244

 Score =  143 bits (360), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 80/212 (37%), Positives = 112/212 (52%), Gaps = 14/212 (6%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVV---DSKTGQSK-DSRVRTSSGTFLKRGQ 139
           PR FV  NFLS  ECE +I  A P +  STV+   D   G+ K    VRTS   +L   +
Sbjct: 27  PRLFVVENFLSAEECEEIIKTATPLLAPSTVLKQGDQSNGEEKVKDEVRTSETAWLMDKK 86

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGGQRMA 196
             I+  I +R+ +   IPM + E +QVL Y   Q Y  HYD+F  +       +G  R+ 
Sbjct: 87  VPIVAKIRQRVEELIRIPMSYAEDMQVLKYTFKQHYHVHYDFFDPKMYPGRWSSGHNRLV 146

Query: 197 TLLMYLSDVEEGGETVFPAANAN---FTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           T+  YL+ VE+GGET+FP  N +      ++ W       +  + VKP RG A++F+ M+
Sbjct: 147 TVFFYLTSVEKGGETIFPFGNTSAEEHHKIQSWGPCENAVESSIKVKPVRGSAVIFYLMK 206

Query: 254 P----DATLDPSSLHGGCPVIRGNKWSATKWM 281
           P       LD +SLHGGC  I G KW+A  W+
Sbjct: 207 PHGHTHGELDHTSLHGGCDPIVGEKWAANYWI 238


>gi|78046308|ref|YP_362483.1| 2OG-Fe(II) oxygenase [Xanthomonas campestris pv. vesicatoria str.
           85-10]
 gi|78034738|emb|CAJ22383.1| putative 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas
           campestris pv. vesicatoria str. 85-10]
          Length = 296

 Score =  143 bits (360), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 82/203 (40%), Positives = 109/203 (53%), Gaps = 26/203 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V   FLS  EC+ LI LA+P + +S  VD+  G+      RTS    L+ GQD + 
Sbjct: 106 PRVVVLGGFLSDEECDALIALARPRLARSRTVDNANGEHVVHAARTSDSMCLRLGQDALC 165

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMATL 198
           + IE RIA     P++HGEG+QVL Y  G +Y  HYDYF  D   T    + GGQR+A+L
Sbjct: 166 QRIEARIARLLDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPVLVQAGGQRVASL 225

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL+  E GG T FP A+ +  +V                   +G+A+ F   RP    
Sbjct: 226 VMYLNTPERGGATRFPDAHLDVAAV-------------------KGNAVFFSYDRPHPMT 266

Query: 259 DPSSLHGGCPVIRGNKWSATKWM 281
              SLH G PV+ G+KW ATKW+
Sbjct: 267 --RSLHAGAPVLAGDKWVATKWL 287


>gi|377811809|ref|YP_005044249.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
 gi|357941170|gb|AET94726.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
          Length = 283

 Score =  142 bits (359), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 77/211 (36%), Positives = 114/211 (54%), Gaps = 24/211 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           EP   +  + LS  EC+ LI++ +  + +S+VVD  +G       R S G F+    D +
Sbjct: 90  EPVVALLADVLSPRECDRLIEIGRERVRRSSVVDPDSGGEVLIDARKSEGAFVNGSTDPL 149

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE-----FNTKNGGQRMAT 197
           +  I++RIA+    P+E+GE + +L Y  G +Y  H+DYF +E      + + GGQR+AT
Sbjct: 150 VATIDRRIAELVQQPVENGEDLHILRYGAGGEYRPHFDYFPEEQAGSKHHMQRGGQRIAT 209

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           L++YL+ VEEGG+T FP                     GL++ P+RG AL F  +     
Sbjct: 210 LILYLNQVEEGGDTTFP-------------------DIGLTIHPRRGAALYFEYVNALGQ 250

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
            DP +LH G PV RG KW ATKWM    ++A
Sbjct: 251 TDPRTLHAGMPVERGEKWIATKWMRRGRFRA 281


>gi|77748547|ref|NP_641044.2| hypothetical protein XAC0691 [Xanthomonas axonopodis pv. citri str.
           306]
 gi|381169877|ref|ZP_09879039.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
 gi|380689647|emb|CCG35526.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
          Length = 286

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 82/203 (40%), Positives = 108/203 (53%), Gaps = 26/203 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V   FLS  EC+ LI LA+P + +S  VD+  G+      RTS    L+ GQD + 
Sbjct: 96  PRVVVLGGFLSDGECDALIALARPRLARSRTVDNANGEHMVHAARTSDSMCLRVGQDALC 155

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMATL 198
           + IE RIA     P++HGEG+QVL Y  G +Y  HYDYF  D   T    + GGQR+A+L
Sbjct: 156 QRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPILLQAGGQRVASL 215

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL+  E GG T FP A+ +  +V                   +G+A+ F   RP    
Sbjct: 216 VMYLNTPERGGATRFPDAHLDVAAV-------------------KGNAVFFSYDRPHPMT 256

Query: 259 DPSSLHGGCPVIRGNKWSATKWM 281
              SLH G PV+ G KW ATKW+
Sbjct: 257 --RSLHAGAPVLAGEKWVATKWL 277


>gi|21106803|gb|AAM35580.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
           str. 306]
          Length = 306

 Score =  142 bits (358), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 82/203 (40%), Positives = 108/203 (53%), Gaps = 26/203 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V   FLS  EC+ LI LA+P + +S  VD+  G+      RTS    L+ GQD + 
Sbjct: 116 PRVVVLGGFLSDGECDALIALARPRLARSRTVDNANGEHMVHAARTSDSMCLRVGQDALC 175

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMATL 198
           + IE RIA     P++HGEG+QVL Y  G +Y  HYDYF  D   T    + GGQR+A+L
Sbjct: 176 QRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAAGTPILLQAGGQRVASL 235

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL+  E GG T FP A+ +  +V                   +G+A+ F   RP    
Sbjct: 236 VMYLNTPERGGATRFPDAHLDVAAV-------------------KGNAVFFSYDRPHPMT 276

Query: 259 DPSSLHGGCPVIRGNKWSATKWM 281
              SLH G PV+ G KW ATKW+
Sbjct: 277 --RSLHAGAPVLAGEKWVATKWL 297


>gi|418515355|ref|ZP_13081536.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB1386]
 gi|410708074|gb|EKQ66523.1| hypothetical protein MOU_00890 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB1386]
          Length = 216

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 82/203 (40%), Positives = 108/203 (53%), Gaps = 26/203 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V   FLS  EC+ LI LA+P + +S  VD+  G+      RTS    L+ GQD + 
Sbjct: 26  PRVVVLGGFLSDGECDALIALARPRLARSRTVDNANGEHLVHAARTSDSMCLRVGQDALC 85

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMATL 198
           + IE RIA     P++HGEG+QVL Y  G +Y  HYDYF  D   T    + GGQR+A+L
Sbjct: 86  QRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDAVGTPILLQAGGQRVASL 145

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL+  E GG T FP A+ +  +V                   +G+A+ F   RP    
Sbjct: 146 VMYLNTPERGGATRFPDAHLDVAAV-------------------KGNAVFFSYDRPHPMT 186

Query: 259 DPSSLHGGCPVIRGNKWSATKWM 281
              SLH G PV+ G KW ATKW+
Sbjct: 187 --RSLHAGAPVLAGEKWVATKWL 207


>gi|352086439|ref|ZP_08953941.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
 gi|389799401|ref|ZP_10202396.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
 gi|351679404|gb|EHA62545.1| Procollagen-proline dioxygenase [Rhodanobacter sp. 2APBS1]
 gi|388442818|gb|EIL98985.1| procollagen-proline dioxygenase [Rhodanobacter sp. 116-2]
          Length = 284

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 83/209 (39%), Positives = 111/209 (53%), Gaps = 29/209 (13%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P   V  N LS  ECE LI LA+P + ++  VDS+  Q  D R RTS G F    +  ++
Sbjct: 95  PALRVLENILSTQECEELIALARPRLQRALTVDSEGRQQVDRR-RTSEGMFFTLNEVPLV 153

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN-----TKNGGQRMATL 198
             IE+R+A    +P  HGEG+Q+LHY  GQ+Y+ H+D+F  E       T  GGQR+A++
Sbjct: 154 GRIEQRLAALLRVPASHGEGLQILHYLPGQEYEPHFDWFDPEQPGYGAITAVGGQRIASV 213

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL+    GG T FP                   + GL+V  +RG A+ F         
Sbjct: 214 VMYLNTPARGGGTAFP-------------------ELGLTVTARRGSAVYFAY----EGG 250

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           DPSSLH G PV+ G KW ATKW+    YK
Sbjct: 251 DPSSLHAGLPVLDGEKWIATKWLRERPYK 279


>gi|325922187|ref|ZP_08183974.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
           19865]
 gi|325547306|gb|EGD18373.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
           19865]
          Length = 285

 Score =  142 bits (357), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 80/203 (39%), Positives = 106/203 (52%), Gaps = 26/203 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V  +FLS AEC+ LI LA+P + +S  VD+  G       RTS    L+ GQD + 
Sbjct: 96  PRVVVLGDFLSDAECDALIALAQPRLARSRTVDNDNGAQIVHAARTSDSMCLQLGQDALC 155

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN-----TKNGGQRMATL 198
           + IE RIA     P++HGEG+QVL Y  G +Y  HYDYF           + GGQR+A+L
Sbjct: 156 QRIEARIARLLDWPVDHGEGLQVLRYATGAEYQPHYDYFDPTAAGTPVLLQAGGQRLASL 215

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL+  E GG T FP  + +  +V                   +G+A+ F   RP    
Sbjct: 216 VMYLNTPERGGATRFPDVHLDVAAV-------------------KGNAVFFSYDRPHPMT 256

Query: 259 DPSSLHGGCPVIRGNKWSATKWM 281
              SLH G PV+ G KW ATKW+
Sbjct: 257 --RSLHAGAPVLAGEKWVATKWL 277


>gi|413963357|ref|ZP_11402584.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
 gi|413929189|gb|EKS68477.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
          Length = 286

 Score =  141 bits (355), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 72/211 (34%), Positives = 117/211 (55%), Gaps = 24/211 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P   +  + L   EC+ LI++ + ++ +S+VVD  +G+      R S G F+    D +
Sbjct: 93  QPVIALVADVLDDTECDRLIEIGREHVQRSSVVDPDSGKEITIEERRSEGAFVNASTDAL 152

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE-----FNTKNGGQRMAT 197
           +  I++RIA+    P+E+GE + +L Y +G +Y  HYDYF +E      + + GGQR+AT
Sbjct: 153 VETIDRRIAELFRQPVENGEDLHILRYGMGGEYRPHYDYFPEEQAGSKHHMQRGGQRIAT 212

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           +++YL++VE+GG+T FP                     GL++ P+RG AL F  +     
Sbjct: 213 VILYLNEVEQGGDTTFPDI-------------------GLAIHPRRGSALYFEYVNELGQ 253

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
            DP +LH G PV +G KW ATKW+    ++A
Sbjct: 254 SDPKTLHAGTPVEKGEKWIATKWIRRGRFRA 284


>gi|375106426|ref|ZP_09752687.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
           JOSHI_001]
 gi|374667157|gb|EHR71942.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderiales bacterium
           JOSHI_001]
          Length = 295

 Score =  140 bits (354), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 80/210 (38%), Positives = 111/210 (52%), Gaps = 28/210 (13%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V+   LS  EC+ ++DLA+P + +S  V + +G S+ +  RTS G F  RG+  + 
Sbjct: 108 PRVMVFGGLLSDEECDAMVDLARPRLARSETVHNGSGGSEVNAARTSDGMFFDRGEFPLC 167

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT------KNGGQRMAT 197
           R IE+RIA     P+E+GEG+QVL Y  G +Y AH+DYF D          K GGQR+ T
Sbjct: 168 RTIEQRIAALVNWPVENGEGLQVLRYRPGSEYKAHHDYF-DPAQPGTPTILKRGGQRVGT 226

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           ++MYL+    GG T FP                     GL V P +G+A+ F   R    
Sbjct: 227 VVMYLNHPIRGGGTAFPDV-------------------GLEVAPFKGNAVFFSYDRAHPM 267

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
               +LH G PV+ G KW ATKW+   E++
Sbjct: 268 T--RTLHAGTPVLEGEKWVATKWVREGEFR 295


>gi|319652187|ref|ZP_08006306.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
 gi|317396176|gb|EFV76895.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
          Length = 283

 Score =  140 bits (353), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 76/208 (36%), Positives = 114/208 (54%), Gaps = 22/208 (10%)

Query: 77  TEIVAWEPRAFVYH--NFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTF 134
            ++++   + FV H    LS  EC+ LI L++  +  S VVD  +G+ +    RTS    
Sbjct: 87  VKVLSRNEKPFVLHLDQVLSSEECDELISLSRSRLQPSLVVDRGSGEERAGSGRTSKSMA 146

Query: 135 LKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEF-NTKNGGQ 193
            +  ++ ++  IE RIA+ T  P E+GEG+Q+L+Y +G++Y  H+D+F     +   GGQ
Sbjct: 147 FRLKENELVERIETRIAELTGYPAENGEGLQILNYGLGEEYKPHFDFFPPHMADASKGGQ 206

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+ T L+YL+DVE+GGETVF                    K GLS  PK+G A+ F    
Sbjct: 207 RVGTFLIYLNDVEDGGETVF-------------------SKAGLSFVPKKGAAIYFHYGN 247

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWM 281
               LD  S+H   PV +G KW+ATKW+
Sbjct: 248 AQGQLDRLSVHSSVPVRKGEKWAATKWI 275


>gi|325267002|ref|ZP_08133672.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
 gi|324981502|gb|EGC17144.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
          Length = 279

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 78/208 (37%), Positives = 108/208 (51%), Gaps = 24/208 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P   V  NF++  EC  LI LA+  +  +TVVD  TG+    + RTS      R +  +I
Sbjct: 91  PEVVVLDNFITAEECAQLIALAEGKVEDATVVDPATGEFVKHQDRTSMNAAFARAEHPLI 150

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEF-----NTKNGGQRMATL 198
             +E RIA     P E+GEG+QVL Y  G +Y AH+DYF  +      N + GGQR+ T 
Sbjct: 151 ARLEARIAAAIHWPAENGEGMQVLRYRSGGEYKAHFDYFDTQSEGGRKNMQTGGQRVGTF 210

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L+YL DV+ GG T FPA N                     ++PK+G AL F +  P+   
Sbjct: 211 LVYLCDVDAGGATRFPALN-------------------FEIRPKKGMALFFANTLPNGEG 251

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           +P +LH G PV+ G K+ A+KW+    Y
Sbjct: 252 NPLTLHAGVPVVSGVKYLASKWLREKPY 279


>gi|384429387|ref|YP_005638747.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
           campestris pv. raphani 756C]
 gi|341938490|gb|AEL08629.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
           campestris pv. raphani 756C]
          Length = 286

 Score =  139 bits (351), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 80/203 (39%), Positives = 107/203 (52%), Gaps = 26/203 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V    LS  EC+ LI LA+P + +S  VD++ G       RTS    L+ GQD + 
Sbjct: 96  PRVVVLGGLLSDDECDALIALARPQLARSRTVDNRDGSEIVHAARTSHSMALQPGQDALC 155

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMATL 198
           + IE RIA     P+EHGEG+QVL Y  G +Y  HYDYF  D   T    ++GGQR+A+L
Sbjct: 156 QRIEARIARLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLLQHGGQRVASL 215

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL+  E GG T FP  + +  +V                   +G+A+ F   RP    
Sbjct: 216 VMYLNTPERGGATRFPDVHLDVAAV-------------------KGNAVFFSYDRPHPMT 256

Query: 259 DPSSLHGGCPVIRGNKWSATKWM 281
              +LH G PV+ G KW ATKW+
Sbjct: 257 --RTLHAGAPVLAGEKWVATKWL 277


>gi|363543363|ref|NP_001241692.1| prolyl 4-hydroxylase 8-2 [Zea mays]
 gi|347978834|gb|AEP37759.1| prolyl 4-hydroxylase 8-2 [Zea mays]
          Length = 184

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 64/85 (75%), Positives = 75/85 (88%)

Query: 71  EKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTS 130
           E+G QWTE+++WEPRAFVYHNFLSK ECEYLI LAKP+MVKSTVVDS TG+SKDSRVRTS
Sbjct: 90  ERGAQWTEVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTS 149

Query: 131 SGTFLKRGQDRIIRGIEKRIADFTF 155
           SG FL+RG+D++IR IE+ I   TF
Sbjct: 150 SGMFLQRGRDKVIRAIEELIKRSTF 174


>gi|77761111|ref|YP_241833.2| hypothetical protein XC_0735 [Xanthomonas campestris pv. campestris
           str. 8004]
          Length = 288

 Score =  139 bits (350), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 79/203 (38%), Positives = 107/203 (52%), Gaps = 26/203 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V    L+  EC+ LI LA+P + +S  VD++ G       RTS    L+ GQD + 
Sbjct: 98  PRVVVLGGLLADDECDALIALARPQLARSRTVDNRDGSEIVHAARTSHSMALQPGQDALC 157

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMATL 198
           + IE RIA     P+EHGEG+QVL Y  G +Y  HYDYF  D   T    ++GGQR+A+L
Sbjct: 158 QRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLLQHGGQRVASL 217

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL+  E GG T FP  + +  +V                   +G+A+ F   RP    
Sbjct: 218 VMYLNTPERGGATRFPDVHLDVAAV-------------------KGNAVFFSYDRPHPMT 258

Query: 259 DPSSLHGGCPVIRGNKWSATKWM 281
              +LH G PV+ G KW ATKW+
Sbjct: 259 --RTLHAGAPVLAGEKWVATKWL 279


>gi|294627644|ref|ZP_06706226.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
 gi|292597996|gb|EFF42151.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 11122]
          Length = 296

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 81/203 (39%), Positives = 107/203 (52%), Gaps = 26/203 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P   V   FLS  EC+ LI LA+P + +S  VD+  G+      RTS    L+ GQD + 
Sbjct: 106 PCVVVLGGFLSGGECDALIALARPRLARSRTVDNANGEHVVHAARTSDSMCLRVGQDALC 165

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMATL 198
           + IE RIA     P++HGEG+QVL Y  G +Y  HYDYF  D   T    + GGQR+A+L
Sbjct: 166 QRIEARIARLLDWPVDHGEGLQVLRYGTGAEYRPHYDYFDPDAAGTPVLLQAGGQRVASL 225

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL+  E GG T FP A+ +  +V                   +G+A+ F   RP    
Sbjct: 226 VMYLNTPERGGATRFPDAHLDVAAV-------------------KGNAVFFSYDRPHPMT 266

Query: 259 DPSSLHGGCPVIRGNKWSATKWM 281
              SLH G PV+ G KW ATKW+
Sbjct: 267 --RSLHAGAPVLAGEKWVATKWL 287


>gi|294666178|ref|ZP_06731433.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
 gi|292604043|gb|EFF47439.1| 2OG-Fe II oxygenase superfamily protein [Xanthomonas fuscans subsp.
           aurantifolii str. ICPB 10535]
          Length = 296

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 81/203 (39%), Positives = 107/203 (52%), Gaps = 26/203 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P   V   FLS  EC+ LI LA+P + +S  VD+  G+      RTS    L+ GQD + 
Sbjct: 106 PCVVVLGGFLSGGECDALIALARPRLARSRTVDNANGEHVVHAARTSDSMCLRVGQDALC 165

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMATL 198
           + IE RIA     P++HGEG+QVL Y  G +Y  HYDYF  D   T    + GGQR+A+L
Sbjct: 166 QRIEARIARLLDWPVDHGEGLQVLRYGTGAEYRPHYDYFDPDAAGTPVLLQAGGQRVASL 225

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL+  E GG T FP A+ +  +V                   +G+A+ F   RP    
Sbjct: 226 VMYLNTPERGGATRFPDAHLDVAAV-------------------KGNAVFFSYDRPHPMT 266

Query: 259 DPSSLHGGCPVIRGNKWSATKWM 281
              SLH G PV+ G KW ATKW+
Sbjct: 267 --RSLHAGAPVLAGEKWVATKWL 287


>gi|66572403|gb|AAY47813.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris str. 8004]
          Length = 308

 Score =  139 bits (349), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 79/203 (38%), Positives = 107/203 (52%), Gaps = 26/203 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V    L+  EC+ LI LA+P + +S  VD++ G       RTS    L+ GQD + 
Sbjct: 118 PRVVVLGGLLADDECDALIALARPQLARSRTVDNRDGSEIVHAARTSHSMALQPGQDALC 177

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMATL 198
           + IE RIA     P+EHGEG+QVL Y  G +Y  HYDYF  D   T    ++GGQR+A+L
Sbjct: 178 QRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLLQHGGQRVASL 237

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL+  E GG T FP  + +  +V                   +G+A+ F   RP    
Sbjct: 238 VMYLNTPERGGATRFPDVHLDVAAV-------------------KGNAVFFSYDRPHPMT 278

Query: 259 DPSSLHGGCPVIRGNKWSATKWM 281
              +LH G PV+ G KW ATKW+
Sbjct: 279 --RTLHAGAPVLAGEKWVATKWL 299


>gi|389795384|ref|ZP_10198508.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
 gi|388430823|gb|EIL87950.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
          Length = 293

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 80/208 (38%), Positives = 106/208 (50%), Gaps = 24/208 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P   V    L   EC+ LI  +   + +ST VD   G  +    R+S GTF     D  I
Sbjct: 97  PTIAVLDQVLDDEECDELIRRSADKLQRSTTVDPVNGGYEVIAARSSEGTFFPVNADDFI 156

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-----DEFNTKNGGQRMATL 198
             +++RIA+    P+E+GEG+QVLHY  G +Y  H+DYF       E     GGQR++TL
Sbjct: 157 ARLDRRIAELMNCPVENGEGLQVLHYGEGGEYQPHFDYFSPGDPGSEAQMVVGGQRVSTL 216

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L+YL+DV +GG TVFP                     GL V P++G A+ F     D  +
Sbjct: 217 LIYLNDVAQGGATVFPTL-------------------GLRVLPRKGMAVYFEYSNRDGQV 257

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           DP +LHGG PV +G KW  TKWM    Y
Sbjct: 258 DPLTLHGGEPVEKGEKWIITKWMRQRSY 285


>gi|325915062|ref|ZP_08177391.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325538760|gb|EGD10427.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 286

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 79/203 (38%), Positives = 107/203 (52%), Gaps = 26/203 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V   FLS AEC+ +I LA+P + +S  VD+  G       RTS    L+ GQD + 
Sbjct: 96  PRVMVLGGFLSDAECDAMIALAQPRLARSRTVDNANGAHVVHAARTSDSMCLQLGQDALC 155

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMATL 198
           + IE RIA     P+E+GEG+QVL Y  G +Y  HYDYF  D   T    + GGQR+A+L
Sbjct: 156 QRIEARIARLLDWPVENGEGLQVLRYGTGAEYQPHYDYFDPDAAGTPVLLQAGGQRVASL 215

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL+  + GG T FP  + +  ++                   +G+A+ F   RP    
Sbjct: 216 VMYLNTPDRGGATRFPDVHLDIAAI-------------------KGNAVFFSYDRPHPMT 256

Query: 259 DPSSLHGGCPVIRGNKWSATKWM 281
              SLH G PV+ G KW ATKW+
Sbjct: 257 --RSLHAGAPVLAGEKWVATKWL 277


>gi|402813396|ref|ZP_10862991.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
 gi|402509339|gb|EJW19859.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
          Length = 215

 Score =  138 bits (347), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 76/206 (36%), Positives = 114/206 (55%), Gaps = 26/206 (12%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           EP   +  N LS  EC+ LI+ +K  + +S + + ++     +++RTSSG F +  ++  
Sbjct: 35  EPLIVILGNVLSNEECDELIEHSKERLQRSKIGEERSV----NQIRTSSGVFCE--ENET 88

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           +  IEKRI+    IP+EHG+G+QVL Y  GQ+Y  H+D+F D  +  +   R++TL+MYL
Sbjct: 89  VAKIEKRISQIMNIPIEHGDGLQVLLYAPGQEYKPHFDFFADT-SRASANNRISTLVMYL 147

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           +DVEEGGET FP  N                   LSV P +G A+ F     +  L+  +
Sbjct: 148 NDVEEGGETTFPMLN-------------------LSVFPSKGMAVYFEYFYSNHELNERT 188

Query: 263 LHGGCPVIRGNKWSATKWMHVHEYKA 288
           LH G PV +G KW AT WM    ++ 
Sbjct: 189 LHAGAPVRKGEKWVATMWMRRQTFRV 214


>gi|389793983|ref|ZP_10197143.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
 gi|388433014|gb|EIL89992.1| 2OG-Fe(II) oxygenase [Rhodanobacter fulvus Jip2]
          Length = 282

 Score =  137 bits (346), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 79/203 (38%), Positives = 111/203 (54%), Gaps = 29/203 (14%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P   V    LS+ EC  LI+LA+P + ++  VDS   Q  D R RTS G F + G+  ++
Sbjct: 93  PALRVLDGLLSERECADLIELARPRLQRALTVDSDGKQQIDQR-RTSEGMFFRAGETPLV 151

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF---LDEFN--TKNGGQRMATL 198
             IE+R+A    +P  HGEG+Q+LHY  GQ+Y+ HYD+F   L  ++  T   GQR+A++
Sbjct: 152 AAIEQRLAQLLGVPASHGEGLQILHYGPGQEYEPHYDWFDPALPGYDKLTARAGQRIASV 211

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL+  E GG T FP                   + GL+V  +RG A+ F         
Sbjct: 212 VMYLNTPERGGGTAFP-------------------EIGLTVTARRGAAVYFAY----EGG 248

Query: 259 DPSSLHGGCPVIRGNKWSATKWM 281
           D SSLH G PV++G KW AT W+
Sbjct: 249 DQSSLHAGLPVLQGEKWIATHWL 271


>gi|157111033|ref|XP_001651361.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
 gi|108878552|gb|EAT42777.1| AAEL005714-PA, partial [Aedes aegypti]
          Length = 522

 Score =  137 bits (344), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 72/203 (35%), Positives = 115/203 (56%), Gaps = 29/203 (14%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P+  ++H+ LS  E E L  LAKP + ++T+ + +TG+++ S+ R S  ++        
Sbjct: 328 KPKIVIFHDVLSDTEIELLKRLAKPILERATIANQQTGKAERSKDRVSKSSWFPDEYHST 387

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATL 198
           IR I KR+AD T + M+  E +QV++Y +G +YD H+D+F    L E N      R+AT+
Sbjct: 388 IRTITKRVADMTGLSMDTAEELQVVNYGLGGQYDPHFDFFHWGKLKEVN------RIATV 441

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                   K G++++ ++G A  ++++     L
Sbjct: 442 LFYMSDVSIGGATVFP-------------------KLGVTLEARKGTAAFWYNLHSSGEL 482

Query: 259 DPSSLHGGCPVIRGNKWSATKWM 281
           D S+LHG CPV+ G KW A KW+
Sbjct: 483 DYSTLHGACPVLIGEKWVANKWI 505


>gi|195390835|ref|XP_002054073.1| GJ22993 [Drosophila virilis]
 gi|194152159|gb|EDW67593.1| GJ22993 [Drosophila virilis]
          Length = 525

 Score =  136 bits (343), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 74/212 (34%), Positives = 121/212 (57%), Gaps = 22/212 (10%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
           TE++A +P   +YH+ ++ +E   L  LA P + ++TV + K G++   + RTS  T+L 
Sbjct: 322 TELLALDPYMVLYHDVITPSEIRELQYLAVPTLKRATVFNQKMGRNTVVKTRTSKVTWLT 381

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF---LDEFNTKNGGQ 193
              + +   + +RI+D T   +   E +QV++Y +G  YD H+DYF   + +  TK  G 
Sbjct: 382 DSLNPLTVRLNRRISDMTGFDLYGSEMLQVMNYGLGGHYDLHFDYFNATIAKDLTKLNGD 441

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+AT+L YL+DVE+GG TVFP                   KQ  ++ PK+G A++++++R
Sbjct: 442 RIATVLFYLTDVEQGGATVFPNI-----------------KQ--AIFPKKGTAVMWYNLR 482

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
            +   DP +LH  CPVI G+KW   KW+  H+
Sbjct: 483 HNNDGDPQTLHAACPVIVGSKWVCNKWIREHQ 514


>gi|397568865|gb|EJK46391.1| hypothetical protein THAOC_34939 [Thalassiosira oceanica]
          Length = 488

 Score =  136 bits (343), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 89/242 (36%), Positives = 121/242 (50%), Gaps = 27/242 (11%)

Query: 57  FRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVV- 115
           FRR      S+I         E ++ +P       FL+  EC+Y+++ A P M  S V  
Sbjct: 247 FRREIELAPSTITGNSKNVTIETLSMKPLVLSISGFLADEECDYIMEKAAPTMKYSGVSL 306

Query: 116 -DSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQK 174
            D+  G+   S  RTS  TF+    D I+R IE R A  T +P+ H E +QVL Y V +K
Sbjct: 307 KDADKGRPA-SDWRTSQSTFVAAMGDPILRDIELRTASLTRVPVTHQEFVQVLRYGVTEK 365

Query: 175 YDAHYDYF-------------LDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFT 221
           YDAH+D+F             L E   KN   R AT+  YL+DV  GGET FP       
Sbjct: 366 YDAHHDFFDPSSYRSDPGTLQLIENGKKN---RYATVFWYLTDVARGGETCFPRHGGAPP 422

Query: 222 SVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGN--KWSATK 279
                 + S C   GL VKP++G  ++F+S+     +DP SLHG CPV+     KW+A K
Sbjct: 423 P----RDFSMC--TGLKVKPQKGKVIIFYSLDASGEMDPLSLHGACPVLGKEDIKWAANK 476

Query: 280 WM 281
           W+
Sbjct: 477 WL 478


>gi|77747935|ref|NP_638775.2| hypothetical protein XCC3429 [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
          Length = 288

 Score =  136 bits (343), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 78/203 (38%), Positives = 106/203 (52%), Gaps = 26/203 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V    L+  EC+ LI LA+P + +S  VD++ G       RTS    L+ GQD + 
Sbjct: 98  PRVVVLGGLLADDECDALIALARPQLARSRTVDNRDGSEIVHAARTSHSMALQPGQDALC 157

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMATL 198
           + IE RIA     P+EHGEG+QVL Y  G +Y  HYDYF  D   T    ++GGQR+A+L
Sbjct: 158 QRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLLQHGGQRVASL 217

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL+  E GG T  P  + +  +V                   +G+A+ F   RP    
Sbjct: 218 VMYLNTPERGGATRVPDVHLDVAAV-------------------KGNAVFFSYDRPHPMT 258

Query: 259 DPSSLHGGCPVIRGNKWSATKWM 281
              +LH G PV+ G KW ATKW+
Sbjct: 259 --RTLHAGAPVLAGEKWVATKWL 279


>gi|398810140|ref|ZP_10568970.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
 gi|398083831|gb|EJL74535.1| 2OG-Fe(II) oxygenase superfamily enzyme [Variovorax sp. CF313]
          Length = 296

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 78/207 (37%), Positives = 111/207 (53%), Gaps = 26/207 (12%)

Query: 88  VYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIE 147
           V  +     ECE LI LA+P +  ST VD  +G+      R+S G F +  ++  I  ++
Sbjct: 103 VLDDVFDPQECEELIALARPRLAPSTTVDPLSGRDLVGEQRSSLGMFFRLRENAFIARLD 162

Query: 148 KRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKN------GGQRMATLLMY 201
           +R+++   +P+E+GEG+QVL Y  G +   H+D FL   N  N       GQR++TL+ Y
Sbjct: 163 QRVSELMNLPVENGEGLQVLCYPAGAQSMPHFD-FLVPSNAANKASLARSGQRVSTLVSY 221

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L++VEEGGET+FP                EC   G SV P+RG A+ F        +D +
Sbjct: 222 LNEVEEGGETIFP----------------EC---GWSVPPRRGSAVYFEYCNSLGQVDHA 262

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYKA 288
           SLH G PV+ G KW ATKWM    + A
Sbjct: 263 SLHAGGPVLHGEKWVATKWMRQRRFVA 289


>gi|21114687|gb|AAM42699.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris str. ATCC 33913]
          Length = 308

 Score =  136 bits (343), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 78/203 (38%), Positives = 106/203 (52%), Gaps = 26/203 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V    L+  EC+ LI LA+P + +S  VD++ G       RTS    L+ GQD + 
Sbjct: 118 PRVVVLGGLLADDECDALIALARPQLARSRTVDNRDGSEIVHAARTSHSMALQPGQDALC 177

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNT----KNGGQRMATL 198
           + IE RIA     P+EHGEG+QVL Y  G +Y  HYDYF  D   T    ++GGQR+A+L
Sbjct: 178 QRIEARIAQLLEWPVEHGEGLQVLRYATGAQYAPHYDYFEPDAPGTPVLLQHGGQRVASL 237

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL+  E GG T  P  + +  +V                   +G+A+ F   RP    
Sbjct: 238 VMYLNTPERGGATRVPDVHLDVAAV-------------------KGNAVFFSYDRPHPMT 278

Query: 259 DPSSLHGGCPVIRGNKWSATKWM 281
              +LH G PV+ G KW ATKW+
Sbjct: 279 --RTLHAGAPVLAGEKWVATKWL 299


>gi|428183249|gb|EKX52107.1| hypothetical protein GUITHDRAFT_150687 [Guillardia theta CCMP2712]
          Length = 315

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 90/279 (32%), Positives = 138/279 (49%), Gaps = 29/279 (10%)

Query: 20  VLSMLFMLTI---VLLMLLAMGIFYIPIGDDDSPPNDLTSFRRRAFEKRSSIAEEKGEQW 76
           VL  +F+L I   V+L  +A  +        +  P  L S    A     S A++K E W
Sbjct: 19  VLCGIFVLGIGSGVILTFIAAQVVSRRSEILEVKPQKLRSGMLNAPGTSLSPADKKNEFW 78

Query: 77  TEIVA--WEPRAFVYHNFLSKAECEYLIDLA-KPYMVKSTVVDSKTGQSKDSRVRTSSGT 133
            E ++    PR +V HN L+K ECE L  L     M K+ ++     +  +S  RT++  
Sbjct: 79  IETISDLPGPRIYVLHNILTKEECESLKSLGVMAGMEKALIIPYGGKELVESSTRTNTAA 138

Query: 134 FLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEF----NTK 189
           +L+  Q  ++  +E  +A  T    E+GE +Q+LHY+  Q++  H+DYF        N +
Sbjct: 139 WLEYHQGPVVTKLENLLAKVTNTEPENGENLQILHYQTSQQFKEHHDYFDPATDPPENFE 198

Query: 190 NGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLF 249
            GG R+AT ++YL + EEGGET F                    K    VKP+ G A+LF
Sbjct: 199 PGGNRLATAIIYLQNAEEGGETDF-------------------MKIDTKVKPEAGSAVLF 239

Query: 250 WSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           + ++PD ++D  ++H G P   G KW ATKW+H   Y+ 
Sbjct: 240 YDLKPDGSVDKLTIHSGNPPKGGEKWVATKWIHERRYQG 278


>gi|389809938|ref|ZP_10205598.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
 gi|388441354|gb|EIL97635.1| procollagen-proline dioxygenase [Rhodanobacter thiooxydans LCS2]
          Length = 284

 Score =  136 bits (342), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 81/209 (38%), Positives = 109/209 (52%), Gaps = 29/209 (13%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P   V  N LS  EC+ LI LA+P + ++  VDS+  Q  D R RTS G F    +  ++
Sbjct: 95  PALRVLENILSARECDELIALARPRLQRALTVDSEGRQQVDRR-RTSEGMFFTLDEVPLV 153

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD-----EFNTKNGGQRMATL 198
             IE+R+A    +P  HGEG+Q+LHY  GQ Y+ H+D+F       E  T  GGQR+A++
Sbjct: 154 GRIERRVAALLDVPASHGEGLQILHYLPGQAYEPHFDWFDPDQPGYETITAVGGQRIASV 213

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL+    GG T FPA                    GL+V  +RG A+ F     D   
Sbjct: 214 VMYLNTPARGGGTAFPAL-------------------GLTVTARRGAAVYFAYEGGDC-- 252

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
             SSLH G PV+ G KW ATKW+    Y+
Sbjct: 253 --SSLHAGLPVLEGEKWIATKWLRERPYR 279


>gi|159490898|ref|XP_001703410.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
 gi|158280334|gb|EDP06092.1| prolyl 4-hydroxylase [Chlamydomonas reinhardtii]
          Length = 429

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 85/210 (40%), Positives = 117/210 (55%), Gaps = 10/210 (4%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQ-SKDSRVRTSSGTFLK 136
           +I++  PR  V+ NF+ KA  E +I LA  +M  S +      Q   + +VRTS GTFL 
Sbjct: 216 QILSLYPRIKVFPNFVDKARREEIIALASKFMYPSGLAYRPGEQVEAEQQVRTSKGTFLG 275

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMA 196
                 +  +E +IA  T IP ++GE   VL+Y+  Q YD+H D F  +   +   QR+A
Sbjct: 276 GDSSPALTWLESKIAAVTDIPRQNGEFWNVLNYKHTQHYDSHMDSFDPKEYGQQYSQRIA 335

Query: 197 TLLMYLSDVE-EGGETVFP---AANANFTSVRWWNELSEC-GKQGLSVKPKRGDALLFWS 251
           T+++ LSD    GGETVF     AN +     W    ++C    GL  KP+ GDA+LFWS
Sbjct: 336 TVIVVLSDEGLVGGETVFKREGKANIDKPITNW----TDCDADGGLRYKPRAGDAVLFWS 391

Query: 252 MRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
             PD  LD  +LHG CPV+ GNKW A KW+
Sbjct: 392 AFPDGRLDQHALHGSCPVVTGNKWVAVKWI 421


>gi|195390833|ref|XP_002054072.1| GJ22994 [Drosophila virilis]
 gi|194152158|gb|EDW67592.1| GJ22994 [Drosophila virilis]
          Length = 496

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 78/205 (38%), Positives = 116/205 (56%), Gaps = 22/205 (10%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           EI    P   V+H+ LS  E + L  LA+P + ++TVV  K  + KDSR RTS GT+++R
Sbjct: 297 EIRLLNPFIIVFHDVLSPREIDELQKLARPLLERTTVVKFKKYE-KDSR-RTSKGTWIER 354

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEF-NTKNGGQRMA 196
             + + + IE+RI D   + + + E  QV++Y +G  Y AH D+  D + + K    R+A
Sbjct: 355 DHNNLTKRIERRITDMVELDLRYSEPFQVMNYGLGGHYAAHEDFLGDTWADKKEEDDRIA 414

Query: 197 TLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
           T+L YL+DVE+GG TVF   N                    +V PKRG AL ++++  + 
Sbjct: 415 TVLFYLTDVEQGGATVFTILNQ-------------------AVSPKRGTALFWYNLHRNG 455

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWM 281
           T D  +LHGGCPV+ G+KW  T W+
Sbjct: 456 TGDTRTLHGGCPVLVGSKWIMTLWI 480


>gi|226314793|ref|YP_002774689.1| hypothetical protein BBR47_52080 [Brevibacillus brevis NBRC 100599]
 gi|226097743|dbj|BAH46185.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599]
          Length = 215

 Score =  135 bits (340), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 78/206 (37%), Positives = 113/206 (54%), Gaps = 26/206 (12%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           EP   V  N LS +EC+ LI+ ++  + +S + + ++  S    +RTSSG F +  Q   
Sbjct: 35  EPLVVVLGNVLSDSECDELIEHSRERLQRSKIGEDRSVNS----IRTSSGVFCE--QTET 88

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           I  IEKRI+    IP+EHG+G+QVL Y  GQ+Y  HYD+F  E +  +   R++TL+MYL
Sbjct: 89  ITRIEKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFFA-ETSRASTNNRISTLVMYL 147

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           +DVE+GGETVFP  +                   LSV P +G A+ F     +  ++  +
Sbjct: 148 NDVEQGGETVFPLLH-------------------LSVFPTKGMAVYFEYFYRNQEVNEFT 188

Query: 263 LHGGCPVIRGNKWSATKWMHVHEYKA 288
           LH G  VI G KW AT WM    ++ 
Sbjct: 189 LHAGAQVIHGEKWVATMWMRRQSFRV 214


>gi|195113239|ref|XP_002001175.1| GI10638 [Drosophila mojavensis]
 gi|193917769|gb|EDW16636.1| GI10638 [Drosophila mojavensis]
          Length = 511

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 69/204 (33%), Positives = 121/204 (59%), Gaps = 24/204 (11%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  +P   ++H+ LS  E + L ++A+P++ +S VV  +       R+  S+GT+++R
Sbjct: 313 EVLVLDPLVVIFHDVLSSREIDGLQEIARPHLERSMVVKYRANVQGKHRI--SAGTWVER 370

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMAT 197
             + +   IE+RIAD   + +E  E   V++Y +G +Y AH+D+F  +    N   R+AT
Sbjct: 371 KYNNLTWRIERRIADMVDLNLEGSEPFYVINYGIGGQYKAHWDFFGADTVEDN---RLAT 427

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           +L Y++DVE+GG TVFP                   + G +V+ KRG+AL +++M+ + T
Sbjct: 428 VLFYMNDVEQGGATVFP-------------------RLGQTVRAKRGNALFWYNMQHNGT 468

Query: 258 LDPSSLHGGCPVIRGNKWSATKWM 281
           +D  +LHGGCP++ G+KW  T+W+
Sbjct: 469 VDDRTLHGGCPILVGSKWIFTQWI 492


>gi|357459545|ref|XP_003600053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
 gi|355489101|gb|AES70304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
          Length = 156

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 68/125 (54%), Positives = 91/125 (72%), Gaps = 4/125 (3%)

Query: 92  FLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIA 151
           + SK ECE+LI L KPY+ +S + D +TG+  ++R   + G F+K   D+II+ IE+RI 
Sbjct: 25  YESKEECEHLIKLGKPYLERSRISDKRTGKGIENRFAYACGGFVK---DKIIKNIEQRIP 81

Query: 152 DFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGET 211
           D   IP+E+GEG+QV+HY VGQK+  HYD   +E +  NGG R+AT LMYLSDVEEGGET
Sbjct: 82  DIISIPVENGEGLQVIHYGVGQKFVPHYDSRSNE-SFWNGGPRVATFLMYLSDVEEGGET 140

Query: 212 VFPAA 216
           VFP+A
Sbjct: 141 VFPSA 145


>gi|398818543|ref|ZP_10577128.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
 gi|398027481|gb|EJL21031.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
          Length = 220

 Score =  135 bits (339), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 79/206 (38%), Positives = 112/206 (54%), Gaps = 26/206 (12%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           EP   V  N LS +EC+ LI+ ++  + +S + +  +  S    +RTSSG F +  Q   
Sbjct: 40  EPLVVVLGNVLSDSECDELIEHSRERLQRSKIGEDGSVNS----IRTSSGVFCE--QTET 93

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           I  IEKRI+    IP+EHG+G+QVL Y  GQ+Y  HYD+F  E +  +   R++TL+MYL
Sbjct: 94  ITRIEKRISQIMNIPIEHGDGLQVLRYTPGQEYKPHYDFFA-ETSRASTNNRISTLVMYL 152

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           +DVE+GGETVFP  +                   LSV P +G A+ F     +  L+  +
Sbjct: 153 NDVEQGGETVFPLLH-------------------LSVFPTKGMAVYFEYFYSNQELNDFT 193

Query: 263 LHGGCPVIRGNKWSATKWMHVHEYKA 288
           LH G  VI G KW AT WM    ++ 
Sbjct: 194 LHAGTQVIHGEKWVATMWMRRQSFRV 219


>gi|224122338|ref|XP_002318810.1| predicted protein [Populus trichocarpa]
 gi|222859483|gb|EEE97030.1| predicted protein [Populus trichocarpa]
          Length = 310

 Score =  134 bits (337), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 73/209 (34%), Positives = 121/209 (57%), Gaps = 8/209 (3%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+W+PR FVY  FL+  EC++LI LA+     S   D  +G+ + +R+  SS + L    
Sbjct: 61  VSWQPRVFVYKGFLTDEECDHLISLAQGTKETSEGKDDDSGRIERNRLFASSTSLLNM-D 119

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+  IE+R++ +T +P E+ + +QV+HY + +    ++DYF ++    +    MATL+
Sbjct: 120 DNILSRIEERVSAWTLLPKENSKPLQVMHYGI-EDAKNYFDYFGNKSAIISSEPLMATLV 178

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            YLS+V +GGE  FP +       + W   S+C K   S++P +G+A+LF+++ P+ + D
Sbjct: 179 FYLSNVTQGGEIFFPKSEVK---NKIW---SDCTKISDSLRPIKGNAILFFTVHPNTSPD 232

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
             S H  CPV+ G  W ATK  ++   K 
Sbjct: 233 MGSSHSRCPVLEGEMWYATKKFYLRAIKV 261


>gi|347972274|ref|XP_001237637.3| AGAP004611-PA [Anopheles gambiae str. PEST]
 gi|333469330|gb|EAU76664.3| AGAP004611-PA [Anopheles gambiae str. PEST]
          Length = 514

 Score =  133 bits (335), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 79/243 (32%), Positives = 121/243 (49%), Gaps = 27/243 (11%)

Query: 45  GDDDSPPNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDL 104
           GDD  P  +L   R R    R+        +  E V  +P   +YH+ +S  E + +I +
Sbjct: 278 GDDQRPAKELAKLRCRYEHNRTPFLRISPLKLQE-VNHDPMIVMYHDVISNKEIDAIISI 336

Query: 105 AKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGI 164
           +KP M +S V D    +   S+ RTSS  +L      ++R + +R  D T + M   E +
Sbjct: 337 SKPLMHRSMVGDDH--EKAVSKTRTSSNAWLDDVMHPVVRTLSQRTEDMTNLAMTAAERL 394

Query: 165 QVLHYEVGQKYDAHYDYFLDE-----FNTKNGGQRMATLLMYLSDVEEGGETVFPAANAN 219
           QV +Y +G  Y  HYDY + E     + +   G R+AT++ YLSDV  GG TVFP     
Sbjct: 395 QVGNYGIGGHYLPHYDYAVAEEGKEVYPSIGKGNRIATVMYYLSDVAIGGATVFP----- 449

Query: 220 FTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATK 279
                         + GL V P++G A+ ++++  + T+D  +LHG CPV  G+KW   K
Sbjct: 450 --------------QLGLGVFPQKGSAIFWYNLHANGTVDHRTLHGACPVFVGSKWVGNK 495

Query: 280 WMH 282
           W+H
Sbjct: 496 WIH 498


>gi|242051901|ref|XP_002455096.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
 gi|241927071|gb|EES00216.1| hypothetical protein SORBIDRAFT_03g004265 [Sorghum bicolor]
          Length = 303

 Score =  133 bits (335), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 76/204 (37%), Positives = 114/204 (55%), Gaps = 18/204 (8%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PR F+Y  FLS  EC++LI +A      S VV    G +       S G  +   +
Sbjct: 62  LSWHPRVFLYEGFLSDMECDHLISMAHGKKQSSLVVGGSAGNN-------SQGASI---E 111

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D I+  IE RI+ ++F+P + GE +Q+L YEV +      DY   E  + +G  R+ T+L
Sbjct: 112 DTIVSTIEDRISVWSFLPKDFGESMQILKYEVNKS-----DYNNYESQSSSGHDRLVTVL 166

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           MYLSDV+ GGET FP +    T V      SEC   G +V+P RG+A+L ++++PD  +D
Sbjct: 167 MYLSDVKRGGETAFPRSELKGTKVELAAP-SECA--GYAVQPVRGNAILLFNLKPDGVID 223

Query: 260 PSSLHGGCPVIRGNKWSATKWMHV 283
             S +  C V+ G +W A K +H+
Sbjct: 224 KDSQYEMCSVLEGEEWLAIKHIHL 247


>gi|268536692|ref|XP_002633481.1| C. briggsae CBR-PHY-2 protein [Caenorhabditis briggsae]
 gi|94442973|emb|CAJ98659.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
          Length = 539

 Score =  133 bits (335), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 75/210 (35%), Positives = 114/210 (54%), Gaps = 23/210 (10%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
            EI+ ++P A ++ N +S +E E + +LA P + ++TV +SKTG+ + +  R S   +LK
Sbjct: 318 VEILRFDPLAVLFKNVISDSEIEVIKELASPKLKRATVQNSKTGELEHATYRISKSAWLK 377

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGG 192
              D +I  + +RI DFT +     E +QV +Y +G  YD H+D+   E    F T N G
Sbjct: 378 GDLDPVIDRVNRRIEDFTGLNQATSEELQVANYGLGGHYDPHFDFARKEEKNAFKTLNTG 437

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT+L Y+S  E GG TVF             N L      G +V P + DAL ++++
Sbjct: 438 NRIATVLFYMSQPERGGATVF-------------NHL------GTAVFPSKNDALFWYNL 478

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           R D   D  + H  CPV+ G KW + KW+H
Sbjct: 479 RRDGEGDLRTRHAACPVLLGVKWVSNKWIH 508


>gi|195113237|ref|XP_002001174.1| GI10637 [Drosophila mojavensis]
 gi|193917768|gb|EDW16635.1| GI10637 [Drosophila mojavensis]
          Length = 529

 Score =  133 bits (335), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 71/207 (34%), Positives = 113/207 (54%), Gaps = 22/207 (10%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+++ +P   +YH+ +S +E   L  LA P + ++TV + ++ ++   + RTS  T+L  
Sbjct: 324 ELISLDPYMVIYHDVISPSEISELQSLAVPGLKRATVFNQQSMRNHVVKTRTSKVTWLLD 383

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN---TKNGGQR 194
             +++   + +RI D T   M   E +QV++Y +G  YD HYDYF        T+  G R
Sbjct: 384 TLNQLTIRLNRRITDMTGFDMYGSEMLQVMNYGLGGHYDKHYDYFNSSVAADLTRLNGDR 443

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +AT+L YL+DVE+GG TVFP                       +V PK G A++++++R 
Sbjct: 444 IATVLFYLTDVEQGGATVFPNIEK-------------------AVFPKSGTAVVWYNLRH 484

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWM 281
           D   DP +LH  CPVI G+KW   KW+
Sbjct: 485 DGNGDPQTLHAACPVIVGSKWVCNKWI 511


>gi|374370415|ref|ZP_09628419.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
 gi|373098067|gb|EHP39184.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
          Length = 454

 Score =  133 bits (335), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 75/205 (36%), Positives = 106/205 (51%), Gaps = 24/205 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  ++   L+ AEC+ L+ LA+  + +S V++  TG       RTS G   + G+  +I
Sbjct: 132 PRVTLFQQLLTDAECDALVALARGRLARSPVINPDTGDENLIEARTSLGAMFQVGEHPLI 191

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-----LDEFNTKNGGQRMATL 198
             IE  IA  T I  E GEG+Q+L+Y+ G +Y  HYD+F      +    K GGQR+ TL
Sbjct: 192 ERIEDCIAAVTGIAAERGEGLQILNYKPGGEYQPHYDFFNPQRPGEARQLKVGGQRVGTL 251

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+    GG T FP                   K GL V P +G+A+ F   + D  L
Sbjct: 252 VIYLNSPLAGGATAFP-------------------KLGLEVAPVKGNAVYFSYRKSDGAL 292

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHV 283
           D  +LH G PV  G KW ATKW++ 
Sbjct: 293 DERTLHAGLPVEAGEKWIATKWLNA 317


>gi|388515007|gb|AFK45565.1| unknown [Lotus japonicus]
          Length = 154

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 65/99 (65%), Positives = 81/99 (81%), Gaps = 1/99 (1%)

Query: 51  PNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMV 110
           PNDLTS      ++ +   + +GEQW E+++WEPRAFVYHNFL+K ECEYLID+AKP M 
Sbjct: 55  PNDLTSIAHNTLDR-TDDDDGRGEQWVEVISWEPRAFVYHNFLTKEECEYLIDIAKPNMH 113

Query: 111 KSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKR 149
           KSTVVDS+TG+SKDSRVRTSSGTFL RG+ +I+R IEK+
Sbjct: 114 KSTVVDSETGKSKDSRVRTSSGTFLPRGRGKIVRNIEKK 152


>gi|389728965|ref|ZP_10189244.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
 gi|388441204|gb|EIL97500.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
          Length = 285

 Score =  133 bits (334), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 83/215 (38%), Positives = 108/215 (50%), Gaps = 31/215 (14%)

Query: 79  IVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRG 138
           + A  P   V+   LS  EC  LI+LAKP + ++  V     Q  D   RTS G F   G
Sbjct: 90  LAAETPPLRVFDGLLSDDECAALIELAKPRLQRARTVAEDGAQQIDEH-RTSDGMFFGLG 148

Query: 139 QDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN------TKNGG 192
           +  +I  IE RIA    IP++HGEG+QVLHY  GQ+Y+ H D+F D         T  GG
Sbjct: 149 EQPLIERIEARIAALLGIPVDHGEGLQVLHYLPGQQYEPHQDWF-DPTQPGYAAITATGG 207

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
           QR+A+L++YL+  + GG T FP                   + GL+V   RG A+ F   
Sbjct: 208 QRIASLVIYLNTPDAGGGTAFP-------------------EIGLTVTALRGSAVCFTYE 248

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
             D      SLH G PV RG KW ATKW+    Y+
Sbjct: 249 SGDVF----SLHAGLPVTRGEKWIATKWLRERPYR 279


>gi|147834798|emb|CAN75013.1| hypothetical protein VITISV_039948 [Vitis vinifera]
          Length = 282

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 75/194 (38%), Positives = 99/194 (51%), Gaps = 39/194 (20%)

Query: 127 VRTSSGTFLKRGQDRI--IRGIEKRIADFTFIPMEHGE---------------------- 162
           +R  SG F+   +D+   +  IE++IA    IP  HGE                      
Sbjct: 90  IRLCSGVFISASEDKTGTLDLIEQKIARVIMIPRTHGEIKPKENCLNWLGQVPPFEFVVM 149

Query: 163 -----------GIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGET 211
                         +L YE+GQ+Y++HYD F           R+AT L+YLSDVEEGGET
Sbjct: 150 KRFLTDVVYHVAFNILRYEIGQRYNSHYDAFDPAEYGPQKSHRIATFLVYLSDVEEGGET 209

Query: 212 VFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIR 271
           +FP  N       +  +   C   GL VKP +GD LLF+SM P+ T+DP+SLHG CPVI+
Sbjct: 210 MFPFENGLNMDKDY--DFQRC--IGLKVKPHQGDGLLFYSMFPNGTIDPTSLHGSCPVIK 265

Query: 272 GNKWSATKWMHVHE 285
           G KW ATKW+   E
Sbjct: 266 GEKWVATKWIRDQE 279


>gi|428182311|gb|EKX51172.1| hypothetical protein GUITHDRAFT_92735 [Guillardia theta CCMP2712]
          Length = 190

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 78/190 (41%), Positives = 108/190 (56%), Gaps = 21/190 (11%)

Query: 109 MVKSTVVDS-KTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVL 167
           M +ST+ ++    ++     RTSS  +L +  D ++  I  R+A+   +PME  E +QVL
Sbjct: 1   MGRSTIAEAGNEAKNGVGSARTSSTAWLSKTADPLVAKIRTRVAELVKLPMELAEDMQVL 60

Query: 168 HYEVGQKYDAHYDYFLDE----FNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSV 223
           HY   Q Y AH+D+F       F T  G  R  T+  YLSDVEEGGETVFP AN +   V
Sbjct: 61  HYSKNQHYWAHHDFFDPNIYRGFVTSPGQNRFITVFFYLSDVEEGGETVFPFANGDDRRV 120

Query: 224 RWWNELSECGKQGLSVKPKRGDALLFWSM---------RPD---ATLDPSSLHGGCPVIR 271
               + ++C + GL VKPK G+A++F+SM          PD     LD  SLHGGC VI+
Sbjct: 121 ---TDFADCSR-GLKVKPKAGNAIIFYSMLAKRQQEICPPDDLGCNLDVRSLHGGCDVIK 176

Query: 272 GNKWSATKWM 281
           G+KW+A  W+
Sbjct: 177 GDKWAANYWI 186


>gi|195505255|ref|XP_002099425.1| GE23368 [Drosophila yakuba]
 gi|194185526|gb|EDW99137.1| GE23368 [Drosophila yakuba]
          Length = 528

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 70/210 (33%), Positives = 114/210 (54%), Gaps = 21/210 (10%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+V  +P   +YH+ LS  E + L  +A P + ++TV  + +G+++  R RTS   +   
Sbjct: 326 ELVGLDPYMVLYHDVLSAKEIKELQGMATPGLKRATVFQAASGRNEVVRTRTSKVAWFPD 385

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF--LDEFNTKNGGQRM 195
           G   +   +  RI D T   +   E +Q+++Y +G  YD HYDYF  ++   T   G R+
Sbjct: 386 GYSPLTVRLNARITDMTGFNLHGSEMLQLMNYGLGGHYDQHYDYFNTINSNLTAMSGDRI 445

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           AT+L YL+DVE+GG TVFP       ++R             +V P+RG  +++++++ D
Sbjct: 446 ATVLFYLTDVEQGGATVFP-------NIRK------------AVFPQRGSVIMWYNLKDD 486

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
             +D  +LH  CPVI G+KW   KW+   E
Sbjct: 487 GQIDTQTLHAACPVIVGSKWVCNKWIRERE 516


>gi|357605723|gb|EHJ64752.1| prolyl 4-hydroxylase alpha subunit [Danaus plexippus]
          Length = 235

 Score =  132 bits (332), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 71/201 (35%), Positives = 110/201 (54%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P   V+++ LS  E +Y+  +A+P   ++TV D  TG+   +  R S   +LK  +  ++
Sbjct: 22  PDIIVFNDVLSDYEIDYIKRIAQPRFRRATVHDPATGELVPAHYRISKSAWLKDEESAVV 81

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+AD T + M   E +QV++Y +G  YD H+D+   E N   K  G R+AT+L Y
Sbjct: 82  ARVSRRVADITGLSMTTAEELQVVNYGIGGHYDPHFDFARKEENAFEKFNGNRIATVLFY 141

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDV +GG TVF              EL      GLSV P+RG A+ + ++ P    D +
Sbjct: 142 MSDVAQGGATVF-------------TEL------GLSVFPRRGSAVFWLNLHPSGEGDLA 182

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+RG+KW   KW+H
Sbjct: 183 TRHAACPVLRGSKWVCNKWIH 203


>gi|17541712|ref|NP_502317.1| Protein PHY-2 [Caenorhabditis elegans]
 gi|32171589|sp|Q20065.1|P4HA2_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|3876769|emb|CAA93469.1| Protein PHY-2 [Caenorhabditis elegans]
          Length = 539

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 74/210 (35%), Positives = 113/210 (53%), Gaps = 23/210 (10%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
            EI+ ++P A ++ N +  +E E + +LA P + ++TV +SKTG+ + +  R S   +LK
Sbjct: 318 VEILRFDPLAVLFKNVIHDSEIEVIKELASPKLKRATVQNSKTGELEHATYRISKSAWLK 377

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGG 192
              D +I  + +RI DFT +     E +QV +Y +G  YD H+D+   E    F T N G
Sbjct: 378 GDLDPVIDRVNRRIEDFTNLNQATSEELQVANYGLGGHYDPHFDFARKEEKNAFKTLNTG 437

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT+L Y+S  E GG TVF             N L      G +V P + DAL ++++
Sbjct: 438 NRIATVLFYMSQPERGGATVF-------------NHL------GTAVFPSKNDALFWYNL 478

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           R D   D  + H  CPV+ G KW + KW+H
Sbjct: 479 RRDGEGDLRTRHAACPVLLGVKWVSNKWIH 508


>gi|354483223|ref|XP_003503794.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
           [Cricetulus griseus]
          Length = 534

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 72/204 (35%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   +D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFQELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|26336999|dbj|BAC32183.1| unnamed protein product [Mus musculus]
 gi|148700261|gb|EDL32208.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_b [Mus
           musculus]
          Length = 534

 Score =  131 bits (330), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 72/204 (35%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   +D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|170064960|ref|XP_001867743.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
 gi|167882146|gb|EDS45529.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
          Length = 545

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 68/204 (33%), Positives = 112/204 (54%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           EP   +YH  +S AE E +  LAKP   ++TV + KTG+ + +  R S   +LK  +  +
Sbjct: 341 EPYIVIYHEVMSDAEIEVIKRLAKPRFRRATVQNYKTGELEVANYRISKSAWLKDEEHSV 400

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATL 198
           +R + +R+ D T + M   E +QV++Y +G  Y+ H+D+   E    F +   G R+AT+
Sbjct: 401 VRTVGQRVEDMTGLTMTTAEELQVVNYGIGGHYEPHFDFARREEKNAFKSLGTGNRIATV 460

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV +GG TVFP       S+R            ++++PK+G A  ++++      
Sbjct: 461 LFYMSDVSQGGATVFP-------SIR------------VALRPKKGTAAFWYNLHASGHG 501

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D ++ H  CPV+ G KW + KW+H
Sbjct: 502 DYATRHAACPVLTGTKWVSNKWIH 525


>gi|474940|emb|CAA55546.1| gamma-butyrobetaine,2-oxoglutarate dioxygenase [Rattus norvegicus]
          Length = 534

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 72/204 (35%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   +D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|74148153|dbj|BAE36242.1| unnamed protein product [Mus musculus]
          Length = 454

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 72/204 (35%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   +D +
Sbjct: 254 KPRIIRFHDIISDAENEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPV 313

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 314 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATW 373

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 374 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 414

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 415 DYSTRHAACPVLVGNKWVSNKWLH 438


>gi|51036657|ref|NP_742059.2| prolyl 4-hydroxylase subunit alpha-1 precursor [Rattus norvegicus]
 gi|90111077|sp|P54001.2|P4HA1_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|50927553|gb|AAH78703.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [Rattus norvegicus]
 gi|149038787|gb|EDL93076.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_a
           [Rattus norvegicus]
          Length = 534

 Score =  131 bits (329), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 72/204 (35%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   +D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|196011902|ref|XP_002115814.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
 gi|190581590|gb|EDV21666.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
          Length = 534

 Score =  130 bits (328), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 76/248 (30%), Positives = 124/248 (50%), Gaps = 33/248 (13%)

Query: 42  IPIGDDDSPPNDLTSFRRRAFEKR--SSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECE 99
           +  GD+    N LT ++ R   +   S I         E+++ +P   +YHN L+  E E
Sbjct: 297 VKAGDNKMVSNHLTCYQLRQHARLLFSPIN-------VEVISLQPYILIYHNLLNDLEVE 349

Query: 100 YLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPME 159
            L  LA P + ++TV +  TG+ + +  R S   +L      ++R I   I D T + ME
Sbjct: 350 ALKTLAAPMLQRATVHNKDTGKLEYATYRISKSAWLNDDDHPLVRRISTLIEDVTGLTME 409

Query: 160 HGEGIQVLHYEVGQKYDAHYDYF-----LDEFNTKNGGQRMATLLMYLSDVEEGGETVFP 214
             E +Q+ +Y +G  Y+ H+D+       D F T  GG R+AT+L+YLS VE GG TVF 
Sbjct: 410 SAEALQIANYGIGGHYEPHFDHADVRSGTDVFKTWKGGNRIATMLIYLSSVELGGATVFS 469

Query: 215 AANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNK 274
           +A                   G+ ++P++G A  ++++  +   +  + H  CPV+ G+K
Sbjct: 470 SA-------------------GVRIEPRQGSAAFWYNLHRNGNGNNLTRHAACPVLIGSK 510

Query: 275 WSATKWMH 282
           W A KW+H
Sbjct: 511 WIANKWIH 518


>gi|224006596|ref|XP_002292258.1| hypothetical protein THAPSDRAFT_263536 [Thalassiosira pseudonana
           CCMP1335]
 gi|220971900|gb|EED90233.1| hypothetical protein THAPSDRAFT_263536 [Thalassiosira pseudonana
           CCMP1335]
          Length = 206

 Score =  130 bits (328), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 80/218 (36%), Positives = 118/218 (54%), Gaps = 31/218 (14%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLA-KPYMVKSTVVDSKTGQSKDSRVRTSSGTF------- 134
           +PR F  HNFLS  E + L+  +  P    +    ++ G +     RTS   F       
Sbjct: 1   DPRVFYVHNFLSADEADELVAFSMAPSTGGTHKAWNQGGSNAKLTTRTSMNAFDITTKLS 60

Query: 135 --LKRGQDRIIR--GIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFN-- 187
             +KR   R++R    ++ +AD          GIQ+L YE+GQ Y AH+DYF + + N  
Sbjct: 61  FRIKRRAFRLLRMGAYKENLAD----------GIQILRYELGQAYIAHHDYFPVRQSNDH 110

Query: 188 ----TKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKR 243
               +K G  R AT+ +YLSDVE GG+T+    +A   +  W ++L +     L+V P+R
Sbjct: 111 LWDPSKGGSNRFATIFLYLSDVEVGGQTL--EKDAGVDAGSWEDKLVDQCYSKLAVPPRR 168

Query: 244 GDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           GDA+LF+S  PD  LDP+SLHG CP+++G KW A  W+
Sbjct: 169 GDAILFYSQYPDGHLDPNSLHGACPILKGTKWGANLWV 206


>gi|347964867|ref|XP_309164.4| AGAP000971-PA [Anopheles gambiae str. PEST]
 gi|333466515|gb|EAA04901.5| AGAP000971-PA [Anopheles gambiae str. PEST]
          Length = 553

 Score =  130 bits (328), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 75/240 (31%), Positives = 121/240 (50%), Gaps = 23/240 (9%)

Query: 47  DDSPPNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAK 106
           +  PP +L S     +   SS     G    E     P   +YH+ +S  E E +   A+
Sbjct: 312 EQQPPIELRSQLVCRYTTNSSPFLRIGPLKLEEAYLRPYIVIYHDVMSDREIERIKHYAR 371

Query: 107 PYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQV 166
           P   ++TV + KTG+ + +  R S   +LK  +D +IR I +R+ D T + ME  E +QV
Sbjct: 372 PRFRRATVQNYKTGELEFANYRISKSAWLKDAEDEMIRTISQRVEDMTGLTMETAEELQV 431

Query: 167 LHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTS 222
           ++Y +G  Y+ H+D+   E    F +   G R+AT+L Y+SDV +GG TVFP+ N     
Sbjct: 432 VNYGIGGHYEPHFDFARREERNAFKSLGTGNRIATVLFYMSDVTQGGATVFPSLN----- 486

Query: 223 VRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                         L++ P++G A  ++++      D ++ H  CPV+ G KW + KW+H
Sbjct: 487 --------------LALWPRKGTAAFWFNLHASGRGDYATRHAACPVLTGTKWVSNKWIH 532


>gi|345305838|ref|XP_001508476.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Ornithorhynchus
           anatinus]
          Length = 493

 Score =  130 bits (328), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 73/204 (35%), Positives = 107/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   YH  +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   +D +
Sbjct: 293 KPRIVRYHEIISDAEIETVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPV 352

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 353 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 412

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 413 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 453

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 454 DYSTRHAACPVLVGNKWVSNKWIH 477


>gi|319795182|ref|YP_004156822.1| procollagen-proline dioxygenase [Variovorax paradoxus EPS]
 gi|315597645|gb|ADU38711.1| Procollagen-proline dioxygenase [Variovorax paradoxus EPS]
          Length = 296

 Score =  130 bits (327), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 109/204 (53%), Gaps = 26/204 (12%)

Query: 91  NFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRI 150
           N +   EC+ LI++AKP +  ST+VD  +G+   S  R S G F +  ++ ++  +++R+
Sbjct: 107 NVVDAHECKALIEMAKPRLAPSTLVDPMSGRDVVSDKRASWGMFFRLCENDLVARLDRRL 166

Query: 151 ADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKN------GGQRMATLLMYLSD 204
           +    +P+E+GEG+ +L+Y  G   + H+DY L   N  N       GQR++TL+ YL+D
Sbjct: 167 SALMNLPLENGEGLHLLYYPTGAGSEPHHDY-LAPTNAANRESIARSGQRVSTLVTYLND 225

Query: 205 VEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLH 264
             EGG+TVFP                   + GL+V P RG+A  F     +  +D  SLH
Sbjct: 226 APEGGQTVFP-------------------QLGLAVSPIRGNACYFEYCDGNGRVDARSLH 266

Query: 265 GGCPVIRGNKWSATKWMHVHEYKA 288
              PV RG+KW  TKWM    + A
Sbjct: 267 ASAPVTRGDKWVMTKWMRERRFVA 290


>gi|383418721|gb|AFH32574.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
           mulatta]
          Length = 534

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|380813208|gb|AFE78478.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
           mulatta]
 gi|384947330|gb|AFI37270.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
           mulatta]
          Length = 534

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|115434812|ref|NP_001042164.1| Os01g0174500 [Oryza sativa Japonica Group]
 gi|55296794|dbj|BAD68120.1| prolyl 4-hydroxylase -like [Oryza sativa Japonica Group]
 gi|113531695|dbj|BAF04078.1| Os01g0174500 [Oryza sativa Japonica Group]
 gi|222617830|gb|EEE53962.1| hypothetical protein OsJ_00571 [Oryza sativa Japonica Group]
          Length = 303

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 81/255 (31%), Positives = 129/255 (50%), Gaps = 40/255 (15%)

Query: 49  SPPNDLTSFRRRAFEKRSSIAEEK-----------------GEQWTEIVAWEPRAFVYHN 91
           +PP  L S RR       SIA+EK                     ++ ++W PR F+Y  
Sbjct: 19  APPGALASSRRFDL----SIAQEKLVNSTGGSTASSSHLVFDPSKSKRLSWHPRIFLYEG 74

Query: 92  FLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIA 151
           FLS  EC++L+ + +  M  S         S  + +           +D ++  IE RI+
Sbjct: 75  FLSDMECDHLVSMGRGNMESSLAFTDGDRNSSYNNI-----------EDIVVSKIEDRIS 123

Query: 152 DFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGET 211
            ++F+P E+GE IQVL Y V +          +E  + +G  R+AT+LMYLSDV++GGET
Sbjct: 124 LWSFLPKENGESIQVLKYGVNRS-----GSIKEEPKSSSGAHRLATILMYLSDVKQGGET 178

Query: 212 VFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIR 271
           VFP +       +     S+C   G +V+P +G+A+L +++RPD   D  S +  CPV+ 
Sbjct: 179 VFPRSEMKDAQAKE-GAPSQC--SGYAVRPAKGNAILLFNLRPDGETDKDSQYEECPVLE 235

Query: 272 GNKWSATKWMHVHEY 286
           G KW A K +++ ++
Sbjct: 236 GEKWLAIKHINLRKF 250


>gi|397490069|ref|XP_003816032.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Pan paniscus]
          Length = 488

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 288 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 347

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 348 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 407

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 408 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 448

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 449 DYSTRHAACPVLVGNKWVSNKWLH 472


>gi|301770069|ref|XP_002920453.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Ailuropoda
           melanoleuca]
          Length = 534

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|190786|gb|AAA36534.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
          Length = 534

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|334314085|ref|XP_001363658.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
           [Monodelphis domestica]
          Length = 537

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 72/204 (35%), Positives = 107/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H  +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   +D +
Sbjct: 337 KPRIVRFHEIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPV 396

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 397 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 456

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 457 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 497

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 498 DYSTRHAACPVLVGNKWVSNKWIH 521


>gi|410251924|gb|JAA13929.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 566

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|332244067|ref|XP_003271193.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-1 [Nomascus leucogenys]
          Length = 502

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 302 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 361

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 362 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 421

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 422 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 462

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 463 DYSTRHAACPVLVGNKWVSNKWLH 486


>gi|302143843|emb|CBI22704.3| unnamed protein product [Vitis vinifera]
          Length = 317

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 72/208 (34%), Positives = 122/208 (58%), Gaps = 11/208 (5%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD----SRVRTSSGTFL 135
           ++W+PRAF+Y  FLS  EC++LI LA   + K   + +  G S +      +++S G   
Sbjct: 60  LSWQPRAFLYRGFLSDEECDHLISLA---LGKKEELATNGGDSGNVVLKRLLKSSEGPLY 116

Query: 136 KRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
               D +   IEKRI+ +TF+P E+ E ++V+ Y+  +     Y+YF ++  +K G   M
Sbjct: 117 I--DDEVAARIEKRISAWTFLPKENSEPLEVVQYQF-ENAKQKYNYFSNKSTSKFGEPLM 173

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           AT+L++LS+V  GGE  FP +    +  +    LS+C +    ++P +G+A+LF+++ P+
Sbjct: 174 ATVLLHLSNVTRGGELFFPESELKNSQSKS-GILSDCTESSSGLRPVKGNAILFFNVHPN 232

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHV 283
           A+ D SS +  CPV+ G  W ATK+ H+
Sbjct: 233 ASPDKSSSYARCPVLEGEMWCATKFFHL 260


>gi|63252886|ref|NP_000908.2| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Homo
           sapiens]
 gi|114631173|ref|XP_508168.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 13 [Pan
           troglodytes]
 gi|602676|gb|AAA59069.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
 gi|62897481|dbj|BAD96680.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I variant [Homo
           sapiens]
 gi|119574852|gb|EAW54467.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_a [Homo
           sapiens]
 gi|119574853|gb|EAW54468.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_b [Homo
           sapiens]
 gi|410349609|gb|JAA41408.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
 gi|410349613|gb|JAA41410.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 534

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|296220402|ref|XP_002756291.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Callithrix
           jacchus]
          Length = 534

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|395820524|ref|XP_003783614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Otolemur
           garnettii]
          Length = 534

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|348576112|ref|XP_003473831.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cavia
           porcellus]
          Length = 534

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|291404182|ref|XP_002718471.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 1
           [Oryctolagus cuniculus]
          Length = 534

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|432106758|gb|ELK32410.1| Prolyl 4-hydroxylase subunit alpha-1 [Myotis davidii]
          Length = 534

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|73952886|ref|XP_850682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Canis
           lupus familiaris]
          Length = 534

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|90085216|dbj|BAE91349.1| unnamed protein product [Macaca fascicularis]
          Length = 244

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 44  KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 103

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 104 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 163

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 164 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 204

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 205 DYSTRHAACPVLVGNKWVSNKWLH 228


>gi|307190793|gb|EFN74662.1| Prolyl 4-hydroxylase subunit alpha-2 [Camponotus floridanus]
          Length = 476

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 66/204 (32%), Positives = 111/204 (54%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR  VYHN +   E E +  +A+P   ++TV + KTG  + +  R S   +L+  + + 
Sbjct: 268 DPRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKH 327

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATL 198
           +  + KR+   T + +E  E +QV++Y +G  Y+ H+D+   E    F +   G R+AT+
Sbjct: 328 VAAVSKRVEHMTSMSIETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATV 387

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDVE+GG TVF A N                   +S+ P++G A  +++++P+   
Sbjct: 388 LYYMSDVEQGGGTVFTAIN-------------------ISLWPRKGSAAFWYNLKPNGEG 428

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D  + H  CPV+ G+KW A KW+H
Sbjct: 429 DFKTRHAACPVLTGSKWVANKWLH 452


>gi|328876967|gb|EGG25330.1| putative prolyl 4-hydroxylase alpha subunit [Dictyostelium
           fasciculatum]
          Length = 244

 Score =  129 bits (325), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 78/208 (37%), Positives = 114/208 (54%), Gaps = 38/208 (18%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDR-- 141
           PR +   +FLS AECE+LID++K  +     + S          R+  G F+K G++   
Sbjct: 33  PRVYRVPDFLSPAECEHLIDISKNKLRPCNEISSGVH-------RSGWGLFMKEGEEDHD 85

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNG-------GQR 194
           +++ I +R+     +  E+ E +QV+ Y  G++  AHYDYF     T NG       GQR
Sbjct: 86  VVKKIFQRMKMLVNL-TENCEVMQVIRYHPGEETSAHYDYF--NPLTTNGAMKIGLYGQR 142

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           + T+LMYLS+VEEGGET FP                     G+ VKP +GDA+LF++ +P
Sbjct: 143 VCTILMYLSEVEEGGETSFPEV-------------------GVKVKPVKGDAVLFYNCKP 183

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMH 282
           +  +DP SLH G PVI+G KW A K ++
Sbjct: 184 NGEVDPLSLHQGDPVIKGTKWVAIKLIN 211


>gi|301093292|ref|XP_002997494.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110636|gb|EEY68688.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 324

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 77/221 (34%), Positives = 114/221 (51%), Gaps = 16/221 (7%)

Query: 72  KGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRV-RTS 130
           KG+   E ++  P  F    FL   E + ++ L+  ++  STV      + + +   RTS
Sbjct: 103 KGDVVLETLSLTPLVFSVDEFLKDDEIDIIMALSLEHLKPSTVTLMDGHEDRAATDWRTS 162

Query: 131 SGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKN 190
           +  FL   +   +  I++R+AD T +P++H E +QVL YE  QKYD H DYF  E +  +
Sbjct: 163 TTYFLSSSKHSKLDEIDQRVADLTKVPVDHQEDVQVLRYEETQKYDHHTDYFPVEHHKNS 222

Query: 191 G----------GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVK 240
                        RM T+  Y+SDV +GG T+FP A            + +C   GL V 
Sbjct: 223 PHVLESIDYGYKNRMITVFWYMSDVAKGGHTIFPRAGGAPRP----QSMKDCS-TGLKVS 277

Query: 241 PKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           PK+   ++F+SM P+   DP SLHGGCPV  G K+S  KW+
Sbjct: 278 PKKRKVIVFYSMLPNGQGDPMSLHGGCPVEDGIKYSGNKWV 318


>gi|395501518|ref|XP_003755140.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Sarcophilus
           harrisii]
          Length = 385

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 72/204 (35%), Positives = 107/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H  +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   +D +
Sbjct: 185 KPRIVRFHEIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPV 244

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 245 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 304

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 305 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 345

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 346 DYSTRHAACPVLVGNKWVSNKWIH 369


>gi|410295850|gb|JAA26525.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
 gi|410295854|gb|JAA26527.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 534

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|402880501|ref|XP_003903839.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like, partial
           [Papio anubis]
          Length = 379

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 179 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 238

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 239 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 298

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 299 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 339

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 340 DYSTRHAACPVLVGNKWVSNKWLH 363


>gi|195341536|ref|XP_002037362.1| GM12882 [Drosophila sechellia]
 gi|194131478|gb|EDW53521.1| GM12882 [Drosophila sechellia]
          Length = 550

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 76/242 (31%), Positives = 122/242 (50%), Gaps = 24/242 (9%)

Query: 45  GDDDSPPNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDL 104
           G+    P+DL S R R    R       G    E V  +P   +YH+ +  +E + +  +
Sbjct: 308 GELKPSPSDLRSLRCRYVTNRVPFLR-LGPLKLEEVHADPYIVIYHDAMYDSEIDLIKRM 366

Query: 105 AKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGI 164
           A+P   ++TV +S TG  + +  R S   +LK  +DR+I  + +R AD T + M+  E +
Sbjct: 367 ARPRFRRATVQNSVTGALETANYRISKSAWLKTQEDRVIETVVQRTADMTGLDMDSAEEL 426

Query: 165 QVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATLLMYLSDVEEGGETVFPAANANF 220
           QV++Y +G  Y+ H+D+   E    F   N G R+AT+L Y+SDVE+GG TVF + +   
Sbjct: 427 QVVNYGIGGHYEPHFDFARKEEERAFEGINLGNRIATVLFYMSDVEQGGATVFTSLHT-- 484

Query: 221 TSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKW 280
                            ++ PK+G A  + ++  D   D  + H  CPV+ G KW + KW
Sbjct: 485 -----------------ALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKWVSNKW 527

Query: 281 MH 282
           +H
Sbjct: 528 IH 529


>gi|195575089|ref|XP_002105512.1| GD21521 [Drosophila simulans]
 gi|194201439|gb|EDX15015.1| GD21521 [Drosophila simulans]
          Length = 550

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 76/242 (31%), Positives = 122/242 (50%), Gaps = 24/242 (9%)

Query: 45  GDDDSPPNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDL 104
           G+    P+DL S R R    R       G    E V  +P   +YH+ +  +E + +  +
Sbjct: 308 GELKPSPSDLRSLRCRYVTNRVPFLR-LGPLKLEEVHADPYIVIYHDAMYDSEIDLIKRM 366

Query: 105 AKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGI 164
           A+P   ++TV +S TG  + +  R S   +LK  +DR+I  + +R AD T + M+  E +
Sbjct: 367 ARPRFRRATVQNSVTGALETANYRISKSAWLKTQEDRVIETVVQRTADMTGLDMDSAEEL 426

Query: 165 QVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATLLMYLSDVEEGGETVFPAANANF 220
           QV++Y +G  Y+ H+D+   E    F   N G R+AT+L Y+SDVE+GG TVF + +   
Sbjct: 427 QVVNYGIGGHYEPHFDFARKEEERAFEGLNLGNRIATVLFYMSDVEQGGATVFTSLHT-- 484

Query: 221 TSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKW 280
                            ++ PK+G A  + ++  D   D  + H  CPV+ G KW + KW
Sbjct: 485 -----------------ALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKWVSNKW 527

Query: 281 MH 282
           +H
Sbjct: 528 IH 529


>gi|344274272|ref|XP_003408941.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
           [Loxodonta africana]
          Length = 534

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 334 KPRIVRFHDIISDAEIEVVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPDV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|359490628|ref|XP_002271805.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Vitis
           vinifera]
          Length = 312

 Score =  129 bits (324), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 72/208 (34%), Positives = 121/208 (58%), Gaps = 16/208 (7%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD----SRVRTSSGTFL 135
           ++W+PRAF+Y  FLS  EC++LI LA   + K   + +  G S +      +++S G   
Sbjct: 60  LSWQPRAFLYRGFLSDEECDHLISLA---LGKKEELATNGGDSGNVVLKRLLKSSEGPLY 116

Query: 136 KRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
               D +   IEKRI+ +TF+P E+ E ++V+ Y+  +     Y+YF ++  +K G   M
Sbjct: 117 I--DDEVAARIEKRISAWTFLPKENSEPLEVVQYQF-ENAKQKYNYFSNKSTSKFGEPLM 173

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           AT+L++LS+V  GGE  FP + +          LS+C +    ++P +G+A+LF+++ P+
Sbjct: 174 ATVLLHLSNVTRGGELFFPESESKS------GILSDCTESSSGLRPVKGNAILFFNVHPN 227

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHV 283
           A+ D SS +  CPV+ G  W ATK+ H+
Sbjct: 228 ASPDKSSSYARCPVLEGEMWCATKFFHL 255


>gi|24651407|ref|NP_733371.1| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
 gi|20269806|gb|AAM18058.1|AF495536_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]EFB
           [Drosophila melanogaster]
 gi|15292529|gb|AAK93533.1| SD05564p [Drosophila melanogaster]
 gi|23172692|gb|AAF57053.2| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
 gi|220946562|gb|ACL85824.1| PH4alphaEFB-PA [synthetic construct]
          Length = 550

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 76/242 (31%), Positives = 122/242 (50%), Gaps = 24/242 (9%)

Query: 45  GDDDSPPNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDL 104
           G+    P+DL S R R    R       G    E V  +P   +YH+ +  +E + +  +
Sbjct: 308 GELKPSPSDLRSLRCRYVTNRVPFLR-LGPLKLEEVHADPYIVIYHDAMYDSEIDLIKRM 366

Query: 105 AKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGI 164
           A+P   ++TV +S TG  + +  R S   +LK  +DR+I  + +R AD T + M+  E +
Sbjct: 367 ARPRFRRATVQNSVTGALETANYRISKSAWLKTQEDRVIETVVQRTADMTGLDMDSAEEL 426

Query: 165 QVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATLLMYLSDVEEGGETVFPAANANF 220
           QV++Y +G  Y+ H+D+   E    F   N G R+AT+L Y+SDVE+GG TVF + +   
Sbjct: 427 QVVNYGIGGHYEPHFDFARKEEQRAFEGLNLGNRIATVLFYMSDVEQGGATVFTSLHT-- 484

Query: 221 TSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKW 280
                            ++ PK+G A  + ++  D   D  + H  CPV+ G KW + KW
Sbjct: 485 -----------------ALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKWVSNKW 527

Query: 281 MH 282
           +H
Sbjct: 528 IH 529


>gi|397643670|gb|EJK76008.1| hypothetical protein THAOC_02250 [Thalassiosira oceanica]
          Length = 480

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 90/255 (35%), Positives = 118/255 (46%), Gaps = 66/255 (25%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLA----KPY-MVKSTVVDSKT-GQSKDSRV---RTSSGT 133
           EPR F  HNFLS AE +  +  +     PY M  ST    K   Q  D  V   RTS   
Sbjct: 204 EPRVFYVHNFLSAAEADEFVKFSTAPENPYKMAPSTGGTHKAWNQGGDGAVLTTRTSENA 263

Query: 134 F---------LKRGQDRIIR--GIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF 182
           F         +K+   R++R  G ++ +AD          GIQ+L Y+VGQ Y AH+DYF
Sbjct: 264 FDITTKQSFDVKKRAFRLLRMNGYQENMAD----------GIQILRYKVGQAYVAHHDYF 313

Query: 183 L----DEFN---TKNGGQRMATLLMYLSDVEEGGETVFP--------------------- 214
                 +FN      G  R AT+ +YLSDV  GG+TVFP                     
Sbjct: 314 PTHQSKDFNWDPLSGGSNRFATIFLYLSDVSYGGQTVFPNCEKLSAEKSPELVERLGESP 373

Query: 215 --------AANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGG 266
                    +NA      W + L     +  +V P+RGDA+LF+S RPD  LD +SLHG 
Sbjct: 374 SASELKEFVSNAGLMEGSWEDNLIHKCYEKFAVPPRRGDAILFYSQRPDGLLDTNSLHGA 433

Query: 267 CPVIRGNKWSATKWM 281
           CP++ G KW A  W+
Sbjct: 434 CPILNGTKWGANLWV 448


>gi|195341590|ref|XP_002037389.1| GM12139 [Drosophila sechellia]
 gi|194131505|gb|EDW53548.1| GM12139 [Drosophila sechellia]
          Length = 525

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 69/210 (32%), Positives = 115/210 (54%), Gaps = 21/210 (10%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+V  EP   +YH+ LS  E   L  +A P + ++TV  + +G+++  + RTS   +   
Sbjct: 323 ELVGLEPYMVLYHDVLSPKEITELQGMATPGLKRATVYQASSGRNEVVKTRTSKVAWFPD 382

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN--TKNGGQRM 195
           G + +   +  RI+D T   +   E +Q+++Y +G  YD HYD+F +  +  T   G R+
Sbjct: 383 GYNPLTVRLNARISDMTGFNLYGSEMLQLMNYGLGGHYDQHYDFFNNTNSNMTAMSGDRI 442

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           AT+L YL+DVE+GG TVFP       ++R             +V P+RG  ++++++R +
Sbjct: 443 ATVLFYLTDVEQGGATVFP-------NIRK------------AVFPQRGSVVMWYNLRDN 483

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
             +D  +LH  CPVI G+KW   KW+   E
Sbjct: 484 GQIDTQTLHAACPVIVGSKWVCNKWIRERE 513


>gi|349604936|gb|AEQ00344.1| Prolyl 4-hydroxylase subunit alpha-1-like protein, partial [Equus
           caballus]
          Length = 302

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 102 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 161

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 162 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 221

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 222 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 262

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 263 DYSTRHAACPVLVGNKWVSNKWLH 286


>gi|345481336|ref|XP_001600680.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Nasonia
           vitripennis]
          Length = 556

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 64/204 (31%), Positives = 112/204 (54%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR  +YH+ +   E E +  +A+P   ++TV + KTG+ + +  R S   +L+  + + 
Sbjct: 348 DPRIVIYHDVIYDDEIETIKRMAQPRFKRATVQNYKTGELEIANYRISKSAWLQEHEHKH 407

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATL 198
           +R + +R+   T + +E  E +QV++Y +G  Y+ H+D+   E    F +   G R+AT+
Sbjct: 408 VRAVSQRVEHMTSMSIETAEELQVVNYGIGGHYEPHFDFARREEKNAFKSLGTGNRIATV 467

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDVE+GG TVF   N                   +S+ PK+G A  +++++P+   
Sbjct: 468 LYYMSDVEQGGGTVFTKIN-------------------ISLWPKKGSAAFWYNLKPNGEG 508

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D  + H  CPV+ G+KW A KW+H
Sbjct: 509 DYKTRHAACPVLTGSKWVANKWLH 532


>gi|193688213|ref|XP_001943683.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
           [Acyrthosiphon pisum]
          Length = 552

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 66/205 (32%), Positives = 112/205 (54%), Gaps = 23/205 (11%)

Query: 82  WEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDR 141
           + PR  +Y + L   E E +  +A+P + ++TV + KTG+ + +  R S   +LK  +D 
Sbjct: 345 FSPRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFADYRISKSAWLKEHEDV 404

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMAT 197
           ++  + KR+   T +  E  E +QV++Y VG  YD HYD+     ++ F +   G R+AT
Sbjct: 405 VVANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIAT 464

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           +L Y+SDV +GG TVFP          W          G++++P +G A +++++ P   
Sbjct: 465 VLFYMSDVAQGGATVFP----------WL---------GVALQPVKGTAAVWFNLYPSGN 505

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMH 282
            D  + H  CPV++G+KW   KW+H
Sbjct: 506 GDLRTRHAACPVLQGSKWVCNKWLH 530


>gi|328696638|ref|XP_003240086.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
           [Acyrthosiphon pisum]
          Length = 534

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 66/205 (32%), Positives = 112/205 (54%), Gaps = 23/205 (11%)

Query: 82  WEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDR 141
           + PR  +Y + L   E E +  +A+P + ++TV + KTG+ + +  R S   +LK  +D 
Sbjct: 327 FSPRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFADYRISKSAWLKEHEDV 386

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMAT 197
           ++  + KR+   T +  E  E +QV++Y VG  YD HYD+     ++ F +   G R+AT
Sbjct: 387 VVANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIAT 446

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           +L Y+SDV +GG TVFP          W          G++++P +G A +++++ P   
Sbjct: 447 VLFYMSDVAQGGATVFP----------WL---------GVALQPVKGTAAVWFNLYPSGN 487

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMH 282
            D  + H  CPV++G+KW   KW+H
Sbjct: 488 GDLRTRHAACPVLQGSKWVCNKWLH 512


>gi|348683507|gb|EGZ23322.1| hypothetical protein PHYSODRAFT_310730 [Phytophthora sojae]
          Length = 417

 Score =  129 bits (323), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 79/223 (35%), Positives = 118/223 (52%), Gaps = 20/223 (8%)

Query: 72  KGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKS--TVVDSKTGQSKDSRVRT 129
           KG+   E ++  P  F    FL   E + +++L+  ++  S  T++D    ++  +  RT
Sbjct: 196 KGDVVLETLSMTPLVFSVEEFLKDDEIDIIMNLSLEHLKPSGVTLMDGHENRAA-TDWRT 254

Query: 130 SSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTK 189
           S+  FL       I  I++R++D T +P++H E +QVL YE  QKYD H DYF  E + K
Sbjct: 255 STTYFLPSDAHPKIDEIDQRVSDLTKVPIDHQEDVQVLRYEKTQKYDHHTDYFPVEHH-K 313

Query: 190 NGGQ-----------RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLS 238
           N              RM T+  Y+SDV +GG T+FP A            + +C   GL+
Sbjct: 314 NAPHILESIDYGYKNRMITVFWYMSDVAKGGHTIFPRAGGAPRP----TSMKDC-TTGLN 368

Query: 239 VKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           V PK+   ++F+SM P+   DP SLHGGCPV  G K+S  KW+
Sbjct: 369 VPPKKRKVIVFYSMLPNGEGDPMSLHGGCPVEEGVKYSGNKWV 411


>gi|151556370|gb|AAI47868.1| P4HA1 protein [Bos taurus]
          Length = 534

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEVVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVLAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|383864775|ref|XP_003707853.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Megachile
           rotundata]
          Length = 550

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 65/204 (31%), Positives = 111/204 (54%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR  +YHN +   E E +  +A+P   ++TV + KTG  + +  R S   +L+  + + 
Sbjct: 342 DPRIVIYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKH 401

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATL 198
           +  + KR+   T + +E  E +QV++Y +G  Y+ H+D+   E    F +   G R+AT+
Sbjct: 402 VAAVSKRVEHMTSLNVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATV 461

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDVE+GG TVF A N                   +S+ P++G A  +++++P+   
Sbjct: 462 LYYMSDVEQGGGTVFTAIN-------------------ISLWPRKGSAAFWFNLKPNGEG 502

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D  + H  CPV+ G+KW A KW+H
Sbjct: 503 DLRTRHAACPVLTGSKWVANKWLH 526


>gi|239792190|dbj|BAH72464.1| ACYPI007079 [Acyrthosiphon pisum]
          Length = 249

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 66/205 (32%), Positives = 112/205 (54%), Gaps = 23/205 (11%)

Query: 82  WEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDR 141
           + PR  +Y + L   E E +  +A+P + ++TV + KTG+ + +  R S   +LK  +D 
Sbjct: 42  FSPRIILYRDVLYDNEIEVIKRMAQPRLKRATVQNYKTGELEFADYRISKSAWLKEHEDV 101

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMAT 197
           ++  + KR+   T +  E  E +QV++Y VG  YD HYD+     ++ F +   G R+AT
Sbjct: 102 VVANVAKRVEVMTGLTTETAEELQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIAT 161

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           +L Y+SDV +GG TVFP          W          G++++P +G A +++++ P   
Sbjct: 162 VLFYMSDVAQGGATVFP----------WL---------GVALQPVKGTAAVWFNLYPSGN 202

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMH 282
            D  + H  CPV++G+KW   KW+H
Sbjct: 203 GDLRTRHAACPVLQGSKWVCNKWLH 227


>gi|426255746|ref|XP_004021509.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Ovis
           aries]
          Length = 534

 Score =  128 bits (322), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVLAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|195425415|ref|XP_002061004.1| GK10713 [Drosophila willistoni]
 gi|194157089|gb|EDW71990.1| GK10713 [Drosophila willistoni]
          Length = 502

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 74/213 (34%), Positives = 115/213 (53%), Gaps = 24/213 (11%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
            EI+   P   +YH+ L   E E L  LA P + +ST+ D     +     RTS+  FL 
Sbjct: 280 VEILNNLPFVAIYHDVLYDREIEELKRLAVPTITRSTIYDYDKEGNVPVNFRTSNSVFLL 339

Query: 137 RGQDRIIRGIEKRIADFTFIPM--EHGEGIQVLHYEVGQKYDAHYDYF-LDEFNTKNGGQ 193
                ++  + +R+AD T + +     + +QV++Y +G  Y  H+D+F  DE   K  G 
Sbjct: 340 NNASYLVDILRQRVADMTHLNVFKNSSDDLQVMNYGLGGYYRYHFDFFGKDESPNKLLGD 399

Query: 194 RMATLLMYLSDVEEGGETVFPAAN-ANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
           R+ T+L+Y++DV++GG TVFPA    NF                    PK+G AL+F ++
Sbjct: 400 RIITVLIYMTDVQQGGATVFPALRITNF--------------------PKKGSALIFRNL 439

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
             + + DPS+LH GCPV+ G+KW+ATKW++  E
Sbjct: 440 DNNISPDPSTLHAGCPVLFGSKWAATKWIYSAE 472


>gi|218187602|gb|EEC70029.1| hypothetical protein OsI_00603 [Oryza sativa Indica Group]
          Length = 549

 Score =  128 bits (321), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 71/206 (34%), Positives = 113/206 (54%), Gaps = 19/206 (9%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PR F+Y  FLS  EC++L+   +  M  S         S  + +           +
Sbjct: 309 LSWHPRIFLYEGFLSDMECDHLVSTGRGNMDSSLAFTDGDRNSSYNNI-----------E 357

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++  IE RI+ ++F+P E+GE IQVL Y V ++         +E  +  GG  +AT+L
Sbjct: 358 DIVVSKIEDRISLWSFLPKENGENIQVLKYGVNRR-----GSIKEEPKSSTGGHWLATIL 412

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           +YLSDV++GGETVFP +       +     S+C   G +V+P +G+ALL +++RPD  +D
Sbjct: 413 IYLSDVKQGGETVFPRSEMKDAQAKE-GAPSQC--SGYAVRPAKGNALLLFNLRPDGEID 469

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHE 285
             S +  CPV+ G KW A K +H+ +
Sbjct: 470 KDSQYEECPVLEGEKWLAIKHIHLRK 495


>gi|343172438|gb|AEL98923.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
           [Silene latifolia]
 gi|343172440|gb|AEL98924.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein, partial
           [Silene latifolia]
          Length = 120

 Score =  127 bits (320), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 63/121 (52%), Positives = 81/121 (66%), Gaps = 5/121 (4%)

Query: 165 QVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVR 224
            VL YEVGQKY++HYD F          QR+A+ L+YLSDVEEGGET+FP  N N  S  
Sbjct: 3   NVLRYEVGQKYNSHYDAFHPAEYGPQKSQRIASFLLYLSDVEEGGETMFPYENDNIDSNY 62

Query: 225 WWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVH 284
              +  +C   GL VKP++GD LLF+S+  + T+DP+S+HG CPVI+G KW ATKW+   
Sbjct: 63  ---DYVQC--IGLKVKPRQGDGLLFYSLFSNGTIDPTSIHGSCPVIKGEKWVATKWIRNE 117

Query: 285 E 285
           E
Sbjct: 118 E 118


>gi|321474876|gb|EFX85840.1| hypothetical protein DAPPUDRAFT_309107 [Daphnia pulex]
          Length = 528

 Score =  127 bits (320), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 67/204 (32%), Positives = 106/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P   +YH  +  AE + +  LA+P   ++ V D  TG+S   + R +   FLK  +  +
Sbjct: 328 KPLLVIYHGVIFDAEIDVVKKLAQPRFKRTGVTDRDTGRSMPVQYRIAKAAFLKDSEHNL 387

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATL 198
           I  + +R+ D T + M   E +QV +Y +G  Y  H+DY     +      + G R+AT 
Sbjct: 388 IVKMSRRVGDITGLDMAASEDLQVCNYGIGGHYVPHFDYARQGEIHGPRDLDWGNRIATW 447

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDVE GG TVFPA                    G ++ P++G A  ++++RP+   
Sbjct: 448 LFYMSDVEAGGATVFPAV-------------------GAALWPQKGSAAFWYNLRPNGNG 488

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D  +LH GCPV+ G+KW + KW+H
Sbjct: 489 DEDTLHAGCPVLTGSKWVSNKWIH 512


>gi|194765138|ref|XP_001964684.1| GF23317 [Drosophila ananassae]
 gi|190614956|gb|EDV30480.1| GF23317 [Drosophila ananassae]
          Length = 520

 Score =  127 bits (320), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 66/206 (32%), Positives = 110/206 (53%), Gaps = 20/206 (9%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           EIV   P   +YH+ LS AE + + ++A P + ++TV  +  G+++  + RTS   +   
Sbjct: 321 EIVGLNPYMVIYHDVLSSAEIDEMKEMATPSLKRATVYKASLGKNEVVKTRTSKVAWFPD 380

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNTKNGGQRMA 196
             + +   +  RI D T   +   E +Q+++Y +G  YD HYD+F   E ++   G R+A
Sbjct: 381 SYNSLTLRLNARIHDMTGFDLSGSEMLQLMNYGLGGHYDKHYDFFNATEKSSSLTGDRIA 440

Query: 197 TLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
           T+L Y+SDVE+GG TVFP                       +V P+RG A+++++++ D 
Sbjct: 441 TVLFYMSDVEQGGATVFPNIYK-------------------TVYPQRGTAVMWYNLKDDG 481

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWMH 282
             D  +LH  CPV+ G+KW   KW+ 
Sbjct: 482 QPDEQTLHAACPVLVGSKWVCNKWIR 507


>gi|389775678|ref|ZP_10193553.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
 gi|388437120|gb|EIL93940.1| procollagen-proline dioxygenase [Rhodanobacter spathiphylli B39]
          Length = 284

 Score =  127 bits (320), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 77/208 (37%), Positives = 107/208 (51%), Gaps = 29/208 (13%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P   V  N L+  ECE LI LA+P + ++  V S      D R RTS G F    +  ++
Sbjct: 95  PALRVLENLLAAEECEELIALAQPRLKRALTVASDGSNQVDQR-RTSEGMFFTLNELPLV 153

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN-----TKNGGQRMATL 198
             IE+R+A    +P+ HGEG+Q+LHY  GQ+Y+ H+D+F  +       T  GGQR+A++
Sbjct: 154 GRIEQRLATLLGMPVSHGEGLQILHYLPGQEYEPHFDWFDPQQPGYDTITAVGGQRVASV 213

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           +MYL+   +GG T FP                   + GL+V  +RG A+ F     D   
Sbjct: 214 VMYLNTPAQGGGTAFP-------------------ELGLTVTARRGAAVYFAYEGGDQ-- 252

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
              SLH G PV RG KW ATKW+    Y
Sbjct: 253 --QSLHAGLPVQRGEKWIATKWLRERPY 278


>gi|350416719|ref|XP_003491070.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
           impatiens]
          Length = 557

 Score =  127 bits (319), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 65/204 (31%), Positives = 110/204 (53%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR  VYHN +   E E +  +A+P   ++TV + KTG  + +  R S   +L+  +   
Sbjct: 349 DPRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHEH 408

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATL 198
           +  + +R+   T + ++  E +QV++Y +G  Y+ H+D+   E    F +   G R+AT+
Sbjct: 409 VAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATV 468

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDVE+GG TVF A N                   +S+ PK+G A  +++++P+   
Sbjct: 469 LYYMSDVEQGGGTVFTAIN-------------------ISLWPKKGSAAFWYNLKPNGEG 509

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D  + H  CPV+ G+KW A KW+H
Sbjct: 510 DFKTRHAACPVLTGSKWVANKWLH 533


>gi|340722330|ref|XP_003399560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
           terrestris]
          Length = 557

 Score =  127 bits (319), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 65/204 (31%), Positives = 110/204 (53%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR  VYHN +   E E +  +A+P   ++TV + KTG  + +  R S   +L+  +   
Sbjct: 349 DPRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHEH 408

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATL 198
           +  + +R+   T + ++  E +QV++Y +G  Y+ H+D+   E    F +   G R+AT+
Sbjct: 409 VAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATV 468

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDVE+GG TVF A N                   +S+ PK+G A  +++++P+   
Sbjct: 469 LYYMSDVEQGGGTVFTAIN-------------------ISLWPKKGSAAFWYNLKPNGEG 509

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D  + H  CPV+ G+KW A KW+H
Sbjct: 510 DFKTRHAACPVLTGSKWVANKWLH 533


>gi|195061074|ref|XP_001995919.1| GH14105 [Drosophila grimshawi]
 gi|193891711|gb|EDV90577.1| GH14105 [Drosophila grimshawi]
          Length = 513

 Score =  127 bits (319), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 69/208 (33%), Positives = 112/208 (53%), Gaps = 19/208 (9%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  +P   +YH+ +S  E E L  LA P + ++ VVD  T ++   + RTS  T+L  
Sbjct: 315 ELLQLDPYMVLYHDAISPREIEDLQFLAMPRLKRAKVVDQVTHRNMMVKERTSKVTWLGD 374

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMAT 197
             +     + KRI D +   M   E +QV++Y +G  Y +HYD+      T+  G R+AT
Sbjct: 375 ATNAFTMRLNKRIEDMSGFTMYGSEMLQVMNYGLGGHYASHYDFLNATSKTRLNGDRIAT 434

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           ++ YLSDVE+GG TVFP                   K   +V P+RG A+++++++ +  
Sbjct: 435 VMFYLSDVEQGGATVFP-------------------KIQKAVFPQRGTAIIWYNLKENGD 475

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHE 285
            D +++H  CPVI G+KW   KW+  +E
Sbjct: 476 FDTNTIHAACPVIVGSKWVCNKWIRENE 503


>gi|332026992|gb|EGI67088.1| Prolyl 4-hydroxylase subunit alpha-1 [Acromyrmex echinatior]
          Length = 415

 Score =  127 bits (319), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 66/204 (32%), Positives = 110/204 (53%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR  VYHN +   E E +  +A+P   ++TV + KTG  + +  R S   +L+  + + 
Sbjct: 207 DPRIVVYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKH 266

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATL 198
           +  + KR+   T + +E  E +QV++Y +G  Y+ H+D+   E    F +   G R+AT+
Sbjct: 267 VAAVSKRVEHMTSMSVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATV 326

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDVE+GG TVF A N                   +S+ P++G A  + +++P+   
Sbjct: 327 LYYMSDVEQGGGTVFTAIN-------------------ISLWPRKGSAAFWHNLKPNGEG 367

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D  + H  CPV+ G+KW A KW+H
Sbjct: 368 DFKTRHAACPVLTGSKWVANKWLH 391


>gi|195575145|ref|XP_002105540.1| GD16902 [Drosophila simulans]
 gi|194201467|gb|EDX15043.1| GD16902 [Drosophila simulans]
          Length = 525

 Score =  127 bits (319), Expect = 5e-27,   Method: Compositional matrix adjust.
 Identities = 67/210 (31%), Positives = 111/210 (52%), Gaps = 21/210 (10%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+V  +P   +YH+ LS  E   L  +A P + ++TV  + +G+++  + RTS   +   
Sbjct: 323 ELVGLDPYMVLYHDVLSPKEITELQGMATPGLKRATVYQASSGRNEVVKTRTSKVAWFPD 382

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF--LDEFNTKNGGQRM 195
           G + +   +  RI+D T   +   E +Q+++Y +G  YD HYD+F   +   T   G R+
Sbjct: 383 GYNPLTVRLNARISDMTGFNLYGSEMLQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDRI 442

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           AT+L YL+DVE+GG TVFP                       +V P+RG  ++++++R +
Sbjct: 443 ATVLFYLTDVEQGGATVFPNIRK-------------------AVFPQRGSVVMWYNLRDN 483

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
             +D  +LH  CPVI G+KW   KW+   E
Sbjct: 484 GQIDTQTLHAACPVIVGSKWVCNKWIRERE 513


>gi|307211752|gb|EFN87747.1| Prolyl 4-hydroxylase subunit alpha-1 [Harpegnathos saltator]
          Length = 415

 Score =  127 bits (319), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 65/204 (31%), Positives = 110/204 (53%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   YHN +   E E +  +A+P   ++TV + KTG  + +  R S   +L+  + + 
Sbjct: 207 DPRIVFYHNVIYDEEIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKH 266

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATL 198
           +  + KR+   T + +E  E +QV++Y +G  Y+ H+D+   E    F +   G R+AT+
Sbjct: 267 VAAVSKRVEHMTSMSVETAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATV 326

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDVE+GG TVF A N                   +S+ P++G A  +++++P+   
Sbjct: 327 LYYMSDVEQGGGTVFTAIN-------------------ISLWPRKGSAAFWYNLKPNGEG 367

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D  + H  CPV+ G+KW A KW+H
Sbjct: 368 DFKTRHAACPVLTGSKWVANKWLH 391


>gi|836898|gb|AAC52197.1| prolyl 4-hydroxylase alpha(I)-subunit, partial [Mus musculus]
 gi|1096887|prf||2112362A Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=I
          Length = 526

 Score =  127 bits (319), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 71/204 (34%), Positives = 107/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E +  LAKP + ++TV D +TG+   ++ R S   +L   +D +
Sbjct: 326 KPRIIRFHDIISDAEIEIVKYLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYEDPV 385

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 386 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATW 445

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 446 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 486

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 487 DYSTRHAACPVLVGNKWVSNKWLH 510


>gi|321474898|gb|EFX85862.1| hypothetical protein DAPPUDRAFT_309117 [Daphnia pulex]
          Length = 541

 Score =  127 bits (319), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 69/207 (33%), Positives = 110/207 (53%), Gaps = 21/207 (10%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+ + +PR  +YHN ++  E E    LA+  + +STV +S TG S+ ++ R +   FL+ 
Sbjct: 336 ELASLKPRLVIYHNVVTDEEIETAKKLAQSRLRRSTVQNSLTGASEPTKYRIAKAAFLQN 395

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD-EFNTKNG-GQRM 195
            +   I  + +RI D T + M   E +QV +Y +G  Y+ HYD+    E     G G R+
Sbjct: 396 SEHDHIVKMTRRIGDVTGLDMTTAEELQVCNYGIGGHYEPHYDHARKGEVQKDFGWGNRI 455

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           AT + Y+SDVE GG TVFP  N                   L++ P++G A  ++++ P+
Sbjct: 456 ATWMFYMSDVEAGGATVFPQIN-------------------LALWPQKGSAAFWFNLHPN 496

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMH 282
              D  + H  CPV+ G+KW + KW+H
Sbjct: 497 GEGDDLTQHAACPVLTGSKWVSNKWIH 523


>gi|321474952|gb|EFX85916.1| hypothetical protein DAPPUDRAFT_45616 [Daphnia pulex]
          Length = 537

 Score =  127 bits (319), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 67/206 (32%), Positives = 108/206 (52%), Gaps = 23/206 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P   VYH+ +S  E E +  +AKP   ++T+ +SKTG+ + +  R S   +LK  +   
Sbjct: 335 KPMIVVYHDVMSDDEIETVKKMAKPRFKRATIRNSKTGELEPANYRISKSAWLKSEEHDH 394

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATL 198
           I  + +R+ D T + M   E +QV++Y +G  Y+ H+DY   E    F     G R+AT 
Sbjct: 395 ILKVTRRVGDITGLDMSTAEDLQVVNYGIGGHYEPHFDYARTETTEAFKELGWGNRIATW 454

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDVE GG TVFP                     G +V P++G A  ++++ P+   
Sbjct: 455 LFYMSDVEAGGATVFPPT-------------------GAAVWPRKGSAAFWYNLYPNGKG 495

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVH 284
           +  + H  CPV+ G+KW + +W+H H
Sbjct: 496 NELTRHAACPVLSGSKWVSNRWIHEH 521


>gi|66820122|ref|XP_643703.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
 gi|60471803|gb|EAL69758.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
          Length = 221

 Score =  127 bits (318), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 76/212 (35%), Positives = 112/212 (52%), Gaps = 38/212 (17%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++  PR +    FL+  ECE+LID +K  +     + S          R+  G F+K G+
Sbjct: 28  LSQAPRIYRIPGFLTDEECEFLIDTSKNKLRPCNEISSGVH-------RSGWGLFMKEGE 80

Query: 140 D--RIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNG------ 191
           +  +I + I  ++  F  I  E  E +QV+ Y  G++  +H+DYF     T NG      
Sbjct: 81  EDHQITKNIFNKMKSFVNIS-ESCEVMQVIRYNQGEETSSHFDYF--NPLTTNGSMKIGL 137

Query: 192 -GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFW 250
            GQR+ T+LMYL DVEEGGET FP                     G+ VKP +GDA+LF+
Sbjct: 138 YGQRVCTILMYLCDVEEGGETTFPEV-------------------GIKVKPIKGDAVLFY 178

Query: 251 SMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           + +P+  +DP SLH G PV++GNKW A K ++
Sbjct: 179 NCKPNGDVDPLSLHQGDPVLKGNKWVAIKLIN 210


>gi|21711777|gb|AAM75079.1| RE70601p [Drosophila melanogaster]
          Length = 316

 Score =  127 bits (318), Expect = 8e-27,   Method: Compositional matrix adjust.
 Identities = 67/210 (31%), Positives = 115/210 (54%), Gaps = 21/210 (10%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+V  +P   +YH+ LS  E + L  +A P + ++TV  + +G+++  + RTS   +   
Sbjct: 114 ELVGLDPYMVLYHDVLSPKEIKELQGMATPSLKRATVYQASSGRNEVVKTRTSKVAWFPD 173

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF--LDEFNTKNGGQRM 195
           G + +   +  RI+D T   +   E +Q+++Y +G  YD HYD+F   +   T   G R+
Sbjct: 174 GYNPLTVRLNARISDMTGFNLYGSEMLQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDRI 233

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           AT+L YL+DVE+GG TVFP       ++R             +V P+RG  +++++++ +
Sbjct: 234 ATVLFYLTDVEQGGATVFP-------NIRK------------AVFPQRGSVVMWYNLKDN 274

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
             +D  +LH  CPVI G+KW   KW+   E
Sbjct: 275 GQIDTQTLHAACPVIVGSKWVCNKWIRERE 304


>gi|329913962|ref|ZP_08276011.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
           IMCC9480]
 gi|327545257|gb|EGF30515.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
           IMCC9480]
          Length = 280

 Score =  126 bits (317), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 68/208 (32%), Positives = 107/208 (51%), Gaps = 24/208 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V  N LS  EC+ +  +++    +ST +D+ +G ++    RTS    ++RG+  +I
Sbjct: 92  PRIVVLGNVLSDDECDAIAAMSRTRFARSTTIDNASGINRFDDSRTSESAHIQRGETELI 151

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEF-----NTKNGGQRMATL 198
             I+ R+A  +  P++HGE +Q+  Y+ G +Y  H+D+F         + +  GQR+AT+
Sbjct: 152 ARIDARLAALSGWPVDHGEPLQLQKYQAGNEYRPHFDWFDPALAGTAKHLEKSGQRLATI 211

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YL+DVEEGG T FP                     GL V P++G AL F +  P    
Sbjct: 212 ILYLTDVEEGGGTSFPGI-------------------GLDVHPQKGGALFFRNTTPYGVP 252

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           D  + H G PV +G K  A KW+    Y
Sbjct: 253 DRKTQHAGLPVEKGTKIIANKWLREKPY 280


>gi|24651477|ref|NP_733395.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
 gi|20269812|gb|AAM18061.1|AF495539_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]PV [Drosophila
           melanogaster]
 gi|23172718|gb|AAN14252.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
          Length = 525

 Score =  126 bits (317), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 66/210 (31%), Positives = 112/210 (53%), Gaps = 21/210 (10%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+V  +P   +YH+ LS  E + L  +A P + ++TV  + +G+++  + RTS   +   
Sbjct: 323 ELVGLDPYMVLYHDVLSPKEIKELQGMATPGLKRATVYQASSGRNEVVKTRTSKVAWFPD 382

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF--LDEFNTKNGGQRM 195
           G + +   +  RI+D T   +   E +Q+++Y +G  YD HYD+F   +   T   G R+
Sbjct: 383 GYNPLTVRLNARISDMTGFNLYGSEMLQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDRI 442

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           AT+L YL+DVE+GG TVFP                       +V P+RG  +++++++ +
Sbjct: 443 ATVLFYLTDVEQGGATVFPNIRK-------------------AVFPQRGSVVMWYNLKDN 483

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
             +D  +LH  CPVI G+KW   KW+   E
Sbjct: 484 GQIDTQTLHAACPVIVGSKWVCNKWIRERE 513


>gi|354483225|ref|XP_003503795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
           [Cricetulus griseus]
          Length = 534

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 70/204 (34%), Positives = 105/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++T+ +  TG  +    R S   +L   +D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGNLETVHYRISKSAWLSGYEDPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFQELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|410927705|ref|XP_003977281.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
           rubripes]
          Length = 531

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 71/203 (34%), Positives = 107/203 (52%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    YH+ LS  E E + +LAKP + ++TV D +TGQ   +  R S   +L   +  ++
Sbjct: 332 PHIVRYHDILSNREMETVKELAKPRLRRATVHDPQTGQLTTAPYRVSKSAWLGAFEHPVV 391

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATLL 199
             I +RI D T + +   E +QV +Y VG +Y+ HYD+      D F     G R+AT L
Sbjct: 392 DRINQRIEDITGLDVSTAEDLQVANYGVGGQYEPHYDFGRKDEPDAFKELGTGNRIATWL 451

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           +Y+S+V+ GG TV       FT +            G SV PK+G A+ ++++ P    D
Sbjct: 452 LYMSEVQAGGATV-------FTDI------------GASVSPKKGSAVFWYNLHPSGDGD 492

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ GNKW + KW+H
Sbjct: 493 YRTRHAACPVLLGNKWVSNKWIH 515


>gi|157114985|ref|XP_001658091.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
 gi|108877086|gb|EAT41311.1| AAEL007038-PA [Aedes aegypti]
          Length = 545

 Score =  126 bits (317), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 70/204 (34%), Positives = 112/204 (54%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P   +YH+ +S+AE E +  LAKP   ++TV + KTG+ + +  R S   +LK  +   
Sbjct: 341 KPYIVIYHDVISEAEMELVKRLAKPRFRRATVQNYKTGELEVANYRISKSAWLKDHEHPY 400

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATL 198
           I+ I +R+ D T + M   E +QV++Y +G  Y+ H+D+   E    F +   G R+AT+
Sbjct: 401 IKAIGERVEDMTGLTMSTAEELQVVNYGIGGHYEPHFDFARREETNAFKSLGTGNRIATV 460

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV +GG TVFP       S+R            L++ PK+G A  ++++      
Sbjct: 461 LFYMSDVTQGGATVFP-------SLR------------LALWPKKGAAAFWFNLHASGQG 501

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ G KW + KW+H
Sbjct: 502 DYSTRHAACPVLTGTKWVSNKWIH 525


>gi|74225936|dbj|BAE28745.1| unnamed protein product [Mus musculus]
          Length = 561

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 70/204 (34%), Positives = 105/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++T+ +  TG  +    R S   +L   +D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVHYRISKSAWLSGYEDPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|74224984|dbj|BAE38205.1| unnamed protein product [Mus musculus]
          Length = 534

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 70/204 (34%), Positives = 105/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++T+ +  TG  +    R S   +L   +D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVHYRISKSAWLSGYEDPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|33859596|ref|NP_035160.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Mus musculus]
 gi|20455506|sp|Q60715.2|P4HA1_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|16307134|gb|AAH09654.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide [Mus musculus]
 gi|74144306|dbj|BAE36020.1| unnamed protein product [Mus musculus]
 gi|74146660|dbj|BAE41331.1| unnamed protein product [Mus musculus]
 gi|148700260|gb|EDL32207.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_a [Mus
           musculus]
          Length = 534

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 70/204 (34%), Positives = 105/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++T+ +  TG  +    R S   +L   +D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVHYRISKSAWLSGYEDPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|194905290|ref|XP_001981166.1| GG11918 [Drosophila erecta]
 gi|190655804|gb|EDV53036.1| GG11918 [Drosophila erecta]
          Length = 525

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 67/210 (31%), Positives = 114/210 (54%), Gaps = 21/210 (10%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+V  +P   +YH+ LS  E + L  +A P + ++TV  + +G+++  + RTS   +   
Sbjct: 323 ELVGLDPYMVLYHDVLSAKEIKELQGMATPGLTRATVFQASSGRNEVVKTRTSKVAWFPD 382

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF--LDEFNTKNGGQRM 195
             + +   +  RIAD T   +   E +Q+++Y +G  YD HYD+F  ++   T   G R+
Sbjct: 383 SYNPLTVRLNARIADMTGFNLYGSEMLQLMNYGLGGHYDQHYDFFNTINSNLTAMSGDRI 442

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           AT+L YL+DVE+GG TVFP       ++R             +V P+RG  +++++++ +
Sbjct: 443 ATVLFYLTDVEQGGATVFP-------NIRK------------AVFPQRGSVIMWYNLQDN 483

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
              D  +LH  CPVI G+KW   KW+   E
Sbjct: 484 GQTDNKTLHAACPVIVGSKWVCNKWIRERE 513


>gi|344254200|gb|EGW10304.1| Prolyl 4-hydroxylase subunit alpha-1 [Cricetulus griseus]
          Length = 507

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 70/204 (34%), Positives = 105/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++T+ +  TG  +    R S   +L   +D +
Sbjct: 307 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGNLETVHYRISKSAWLSGYEDPV 366

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 367 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFQELGTGNRIATW 426

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 427 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 467

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 468 DYSTRHAACPVLVGNKWVSNKWLH 491


>gi|380025232|ref|XP_003696381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Apis florea]
          Length = 537

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 65/204 (31%), Positives = 110/204 (53%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR  VYHN +   E E +  +A+P   ++TV + KTG  + +  R S   +L+  + + 
Sbjct: 329 DPRIVVYHNVIYDDEIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKH 388

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATL 198
           +  + +R+   T + ++  E +QV++Y +G  Y+ H+D+   E    F +   G R+AT+
Sbjct: 389 VAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATV 448

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDVE+GG TVF A N     +  W              PK+G A  +++++P+   
Sbjct: 449 LYYMSDVEQGGGTVFTAIN-----IALW--------------PKKGSAAFWYNLKPNGEG 489

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D  + H  CPV+ G+KW A KW+H
Sbjct: 490 DFKTRHAACPVLTGSKWVANKWLH 513


>gi|291190274|ref|NP_001167096.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide precursor [Salmo
           salar]
 gi|223648100|gb|ACN10808.1| Prolyl 4-hydroxylase subunit alpha-1 precursor [Salmo salar]
          Length = 545

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 71/203 (34%), Positives = 110/203 (54%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR   YH+ LS +E E + +LAKP + ++T+ +  TG  + +  R S   +L   +D ++
Sbjct: 346 PRIIRYHDVLSNSEIEKVKELAKPRLRRATISNPITGVLETAHYRISKSAWLTAYEDPVV 405

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATLL 199
             I +RI D T + ++  E +QV +Y VG +Y+ H+D+      D F     G R+AT L
Sbjct: 406 DKINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWL 465

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           +Y+SDV  GG TV       FT V            G +V PK+G A+ ++++ P    D
Sbjct: 466 IYMSDVPSGGATV-------FTDV------------GAAVWPKKGSAVFWYNLFPSGEGD 506

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
            S+ H  CPV+ GNKW + KW+H
Sbjct: 507 YSTRHAACPVLVGNKWVSNKWIH 529


>gi|149038788|gb|EDL93077.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_b
           [Rattus norvegicus]
          Length = 534

 Score =  126 bits (316), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 70/204 (34%), Positives = 105/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++T+ +  TG  +    R S   +L   +D +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPVTGALETVHYRISKSAWLSGYEDPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|302830268|ref|XP_002946700.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
 gi|300267744|gb|EFJ51926.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
          Length = 186

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 70/161 (43%), Positives = 94/161 (58%), Gaps = 5/161 (3%)

Query: 124 DSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL 183
           + +VRTS GTFL       +R +E +IA  T +P  +GE   VL+Y+  Q YD+H D F 
Sbjct: 17  EQQVRTSKGTFLGGDSSPALRWLEDKIAAVTLLPRTNGEFWNVLNYKHSQHYDSHMDSFD 76

Query: 184 DEFNTKNGGQRMATLLMYLSDVE-EGGETVFPAANANFTS--VRWWNELSECGKQGLSVK 240
            +       QR+AT+++ LSD    GGETVF     +  +  +  W +    G  GL  K
Sbjct: 77  PKEYGPQYSQRIATVIVVLSDDGLMGGETVFKREGKSSINKPISNWTDCDADG--GLKYK 134

Query: 241 PKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           P+ GDA+LFWS RPD  LDP +LHG CPV+ GNKW A KW+
Sbjct: 135 PRAGDAVLFWSARPDGQLDPHALHGSCPVVTGNKWVAVKWL 175


>gi|328790718|ref|XP_392392.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Apis mellifera]
          Length = 415

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 64/204 (31%), Positives = 111/204 (54%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR  VYHN +   E E +  +A+P   ++TV + KTG  + +  R S   +L+  + + 
Sbjct: 207 DPRIVVYHNVIYDDEIETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEHEHKH 266

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATL 198
           +  + +R+   T + ++  E +QV++Y +G  Y+ H+D+   E    F +   G R+AT+
Sbjct: 267 VAAVSRRVEHMTSMTVDTAEELQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATV 326

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDVE+GG TVF A N                   +++ PK+G A  +++++P+   
Sbjct: 327 LYYMSDVEQGGGTVFTAIN-------------------IALWPKKGSAAFWYNLKPNGEG 367

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D  + H  CPV+ G+KW A KW+H
Sbjct: 368 DFKTRHAACPVLTGSKWVANKWLH 391


>gi|170064953|ref|XP_001867740.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
 gi|167882143|gb|EDS45526.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
          Length = 509

 Score =  125 bits (315), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 73/212 (34%), Positives = 118/212 (55%), Gaps = 26/212 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  +P   VYH+  S  E   LI+LAK  + ++T+ D   G+ + S  RTS   +L  
Sbjct: 303 EVLNLDPFITVYHDVASDREISKLIELAKSRISRATIRDD--GEPQVSNARTSQNAWLDA 360

Query: 138 GQDRIIRGIEKRIADFTF-IPMEHGEGIQVLHYEVGQKYDAHYDYFLDE--FNTKNGGQR 194
           G DR++  +++R+ D T  +  +  E +QV +Y VG  Y AH+D+ ++   +     G R
Sbjct: 361 GDDRVVTTLDRRVGDMTGGLRQQSYEMLQVNNYGVGGHYVAHHDWAMEAVPYAGLRVGNR 420

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +AT++ YLSDVE GG TVFP                   + GL+V P++G A+L++++  
Sbjct: 421 IATVMFYLSDVEIGGATVFP-------------------QLGLAVFPRKGSAILWYNLYR 461

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           +   D  +LH  CPV+ G+KW A +W  +HEY
Sbjct: 462 NGKGDRRTLHAACPVLSGSKWVANQW--IHEY 491


>gi|170064951|ref|XP_001867739.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
 gi|167882142|gb|EDS45525.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
          Length = 516

 Score =  125 bits (314), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 81/248 (32%), Positives = 125/248 (50%), Gaps = 27/248 (10%)

Query: 40  FYIPI--GDDDSPPNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAE 97
            Y P+  GD   PP++ ++   R +   +S          E+V  +P   VYH+  S AE
Sbjct: 275 LYEPLCRGDHQRPPSETSNLYCR-YHMSTSPFLRLAPLKQEVVNLDPFVAVYHDAASDAE 333

Query: 98  CEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIP 157
              +I+L +P + +S V D+   + + S+ RTS  ++L      ++  + +R  D     
Sbjct: 334 INKVIELGRPQINRSMVGDA--AKKEVSKSRTSQNSWLTDYDHPVVAALSRRTKDMALGL 391

Query: 158 MEHG-EGIQVLHYEVGQKYDAHYDYFLDE--FNTKNGGQRMATLLMYLSDVEEGGETVFP 214
            E   E +QV +Y +G  Y  HYD+  +E  +   N G R+ATL+ YLSDVEEGG TVFP
Sbjct: 392 DETAYESLQVNNYGIGGHYLPHYDWSREENPYPELNTGNRIATLMFYLSDVEEGGATVFP 451

Query: 215 AANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNK 274
                                G+ V PK+G A+ ++++R     D  +LHG CPV+ G+K
Sbjct: 452 -------------------HLGVGVFPKKGTAIFWYNLRASGKGDEKTLHGACPVLIGSK 492

Query: 275 WSATKWMH 282
           W A KW+H
Sbjct: 493 WVANKWIH 500


>gi|432949777|ref|XP_004084253.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Oryzias
           latipes]
          Length = 532

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 72/201 (35%), Positives = 107/201 (53%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y N LS  E E + +LAKP + ++TV D KTG    +  R S   +L+   D +I
Sbjct: 335 PHIVRYLNILSDQEIEKIKELAKPRLARATVRDPKTGVLTTAPYRVSKSAWLEGEDDPVI 394

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEF--NTKNGGQRMATLLMY 201
             + +RI D T + +E  E +QV +Y VG +Y+ H+D+    F  N K  G R+AT L Y
Sbjct: 395 DRVNQRIQDITGLTVETAELLQVANYGVGGQYEPHFDFSRRPFDSNLKVDGNRLATFLNY 454

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP    +F               G S+ P++G A+ ++++      D  
Sbjct: 455 MSDVEAGGATVFP----DF---------------GASIWPRKGTAVFWYNLFRSGEGDYR 495

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G+KW + KW+H
Sbjct: 496 TRHAACPVLVGSKWVSNKWIH 516


>gi|198449500|ref|XP_001357604.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
 gi|198130634|gb|EAL26738.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
          Length = 528

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 66/207 (31%), Positives = 109/207 (52%), Gaps = 22/207 (10%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+V  +P   +YH+ +S  E   L D+A P + ++TV  +   +S+  + RTS   +   
Sbjct: 324 ELVGLDPYMVLYHDVISAPEISQLQDMATPGLKRATVYKASGRRSEVVKTRTSKVAWFPD 383

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN---TKNGGQR 194
             + +   + +RIAD T   +   E +Q ++Y +G  YD HYD+F        T+  G R
Sbjct: 384 TFNELTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKHYDFFNASTATNLTQMNGDR 443

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +AT+L YL+DVE+GG TVFP                       +V P+RG A+++++++ 
Sbjct: 444 IATVLFYLTDVEQGGATVFPNIRK-------------------AVFPQRGSAIIWYNLKD 484

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWM 281
           D   +P +LH  CPV+ G+KW   KW+
Sbjct: 485 DGDPNPQTLHAACPVLVGSKWVCNKWI 511


>gi|239915958|ref|NP_001070123.2| prolyl 4-hydroxylase alpha II-like precursor [Danio rerio]
          Length = 490

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 74/200 (37%), Positives = 111/200 (55%), Gaps = 32/200 (16%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           EP+   YH+ +S  E E L D+A+P + +S     +TG    S +RTS   FL+  +   
Sbjct: 307 EPKIIRYHDVISDTEIETLKDIARPELTRS-----QTGWGVISDIRTSQSVFLE--EVGT 359

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           +  I +RIAD T + +E  E + V +Y +G +Y  H+D   DE N     +R AT L+Y+
Sbjct: 360 VARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDTG-DEVN-----ERTATFLIYM 413

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           SDVE GG TVF       T+V            G++VKP++G A+ ++++  +  LD  +
Sbjct: 414 SDVEVGGATVF-------TNV------------GVAVKPEKGSAVFWYNLHKNGELDLKT 454

Query: 263 LHGGCPVIRGNKWSATKWMH 282
            H GCPV+ GNKW A KW+H
Sbjct: 455 KHAGCPVLVGNKWVANKWIH 474


>gi|334314087|ref|XP_003339988.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
           [Monodelphis domestica]
          Length = 537

 Score =  125 bits (313), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 70/204 (34%), Positives = 105/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H  +S AE E + DLAKP + ++T+ +  TG  + +  R S   +L   +D +
Sbjct: 337 KPRIVRFHEIISDAEIEIVKDLAKPRLRRATISNPITGVLETAHYRISKSAWLSGYEDPV 396

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 397 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 456

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 457 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 497

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 498 DYSTRHAACPVLVGNKWVSNKWIH 521


>gi|92096574|gb|AAI15350.1| LOC557059 protein [Danio rerio]
          Length = 508

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 74/200 (37%), Positives = 111/200 (55%), Gaps = 32/200 (16%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           EP+   YH+ +S  E E L D+A+P + +S     +TG    S +RTS   FL+  +   
Sbjct: 325 EPKIIRYHDVISDTEIETLKDIARPELTRS-----QTGWGVISDIRTSQSVFLE--EVGT 377

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           +  I +RIAD T + +E  E + V +Y +G +Y  H+D   DE N     +R AT L+Y+
Sbjct: 378 VARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDTG-DEVN-----ERTATFLIYM 431

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           SDVE GG TVF       T+V            G++VKP++G A+ ++++  +  LD  +
Sbjct: 432 SDVEVGGATVF-------TNV------------GVAVKPEKGSAVFWYNLHKNGELDLKT 472

Query: 263 LHGGCPVIRGNKWSATKWMH 282
            H GCPV+ GNKW A KW+H
Sbjct: 473 KHAGCPVLVGNKWVANKWIH 492


>gi|195159142|ref|XP_002020441.1| GL13994 [Drosophila persimilis]
 gi|194117210|gb|EDW39253.1| GL13994 [Drosophila persimilis]
          Length = 493

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 67/208 (32%), Positives = 112/208 (53%), Gaps = 22/208 (10%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+V  +P   +YH+ +S  E   L D+A P + ++TV  +   +S+  + RTS   +   
Sbjct: 289 ELVGLDPYMVLYHDVISALEISQLQDMATPGLKRATVYKASGRRSEVVKTRTSKVAWFPD 348

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN---TKNGGQR 194
             + +   + +RIAD T   +   E +Q ++Y +G  YD HYD+F        T+  G R
Sbjct: 349 TFNELTERLNRRIADMTNFDLLGSEMLQAMNYGLGGHYDKHYDFFNASTAANLTQMNGDR 408

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +AT+L YL+DVE+GG TVFP       ++R             +V P+RG A+++++++ 
Sbjct: 409 IATVLFYLTDVEQGGATVFP-------NIRK------------AVFPQRGSAIIWYNLKD 449

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMH 282
           D   +P +LH  CPV+ G+KW   KW+ 
Sbjct: 450 DGDPNPQTLHAACPVLVGSKWVCNKWIR 477


>gi|410914996|ref|XP_003970973.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Takifugu
           rubripes]
          Length = 538

 Score =  124 bits (312), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 70/203 (34%), Positives = 107/203 (52%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y +FLS  E E + +LAKP + ++TV D K+G    +  R S   +L+  +D II
Sbjct: 339 PNIVRYLDFLSNEEIEKIKELAKPKLARATVRDPKSGVLTTASYRVSKSAWLEGEEDPII 398

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATLL 199
             + +RI D T + ++  E +QV +Y VG +Y+ H+D+      D F     G R+AT L
Sbjct: 399 ARVNQRIEDLTGLTVKTAELLQVANYGVGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFL 458

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP    +F +  W               P++G A+ ++++      D
Sbjct: 459 NYMSDVEAGGATVFP----DFGAAIW---------------PRKGTAVFWYNLFKSGEGD 499

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ GNKW + KW+H
Sbjct: 500 YRTRHAACPVLVGNKWVSNKWIH 522


>gi|355562502|gb|EHH19096.1| hypothetical protein EGK_19739 [Macaca mulatta]
 gi|355782842|gb|EHH64763.1| hypothetical protein EGM_18071 [Macaca fascicularis]
 gi|383418719|gb|AFH32573.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
           mulatta]
          Length = 534

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 69/204 (33%), Positives = 105/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++T+ +  TG  +    R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|321474875|gb|EFX85839.1| hypothetical protein DAPPUDRAFT_309105 [Daphnia pulex]
          Length = 545

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 70/202 (34%), Positives = 106/202 (52%), Gaps = 21/202 (10%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR  VYH+ +S  E E +  LA+P   ++TV   ++G+ + SR R +   +LK  +   
Sbjct: 345 KPRIVVYHDIISDEEIETIKRLAQPRFERATVQKKESGEREFSRYRIAKSAWLKHEEHDY 404

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD-EFNTKNG-GQRMATLLM 200
           +  I  R+ D T + M   E +QV +Y +G  Y+ HYDY    E     G G R+AT L 
Sbjct: 405 VSDINFRVGDITGLDMATSEDLQVCNYGIGGHYEPHYDYARKGEVQQDFGWGGRIATWLF 464

Query: 201 YLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDP 260
           Y+SDVE GG TVFP  N                   LS+ P++G A  ++++ P+   + 
Sbjct: 465 YMSDVEAGGATVFPKLN-------------------LSLWPQKGSAAFWFNLYPNGEGNE 505

Query: 261 SSLHGGCPVIRGNKWSATKWMH 282
            + H GCPV+ G+KW A  W+H
Sbjct: 506 MTQHAGCPVLTGSKWVANYWIH 527


>gi|156370133|ref|XP_001628326.1| predicted protein [Nematostella vectensis]
 gi|156215300|gb|EDO36263.1| predicted protein [Nematostella vectensis]
          Length = 526

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 74/211 (35%), Positives = 114/211 (54%), Gaps = 23/211 (10%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           EIV+  P+  ++HN LS+ E E +++LA+P + ++ V + +TG+ +D   R S   +L  
Sbjct: 320 EIVSVNPQITLFHNVLSEMEIEQMLELARPRLRRARVNNLETGEIEDVDYRISQIAWLSD 379

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTK----NGGQ 193
               I+R I +R+   T +    GE +QV +Y VG  Y+ H+D+ LD  N+       G 
Sbjct: 380 SDGDIVRRINRRVGFITGLNTNTGECLQVNNYGVGGHYEPHFDHSLDMENSPIASLGQGN 439

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+AT + YLS+VE GG TVF                    K G+   P +G A+ +++++
Sbjct: 440 RIATFMFYLSEVEAGGSTVFI-------------------KTGVKTNPFKGGAVFWYNLK 480

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWMHVH 284
                D  SLH GCPV+ GNKW A KW+H H
Sbjct: 481 KSGEGDWDSLHAGCPVLIGNKWVANKWLHEH 511


>gi|380813206|gb|AFE78477.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
           mulatta]
 gi|384947328|gb|AFI37269.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
           mulatta]
          Length = 534

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 69/204 (33%), Positives = 105/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++T+ +  TG  +    R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|190788|gb|AAA36535.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
          Length = 534

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 69/204 (33%), Positives = 105/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++T+ +  TG  +    R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|395820526|ref|XP_003783615.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Otolemur
           garnettii]
          Length = 534

 Score =  124 bits (312), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 69/204 (33%), Positives = 105/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++T+ +  TG  +    R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|410251926|gb|JAA13930.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 566

 Score =  124 bits (311), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 69/204 (33%), Positives = 105/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++T+ +  TG  +    R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|291404184|ref|XP_002718472.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 2
           [Oryctolagus cuniculus]
          Length = 534

 Score =  124 bits (311), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 69/204 (33%), Positives = 105/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++T+ +  TG  +    R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|63252888|ref|NP_001017962.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
           sapiens]
 gi|197099666|ref|NP_001125733.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Pongo abelii]
 gi|217272849|ref|NP_001136067.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
           sapiens]
 gi|114631177|ref|XP_001140234.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Pan
           troglodytes]
 gi|114631181|ref|XP_001140652.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 7 [Pan
           troglodytes]
 gi|2507090|sp|P13674.2|P4HA1_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|75061858|sp|Q5RAG8.1|P4HA1_PONAB RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|602675|gb|AAA59068.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
 gi|23271226|gb|AAH34998.1| Prolyl 4-hydroxylase, alpha polypeptide I [Homo sapiens]
 gi|55729010|emb|CAH91242.1| hypothetical protein [Pongo abelii]
 gi|56403853|emb|CAI29712.1| hypothetical protein [Pongo abelii]
 gi|119574854|gb|EAW54469.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_c [Homo
           sapiens]
 gi|119574855|gb|EAW54470.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_d [Homo
           sapiens]
 gi|123981532|gb|ABM82595.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [synthetic
           construct]
 gi|123996359|gb|ABM85781.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [synthetic
           construct]
 gi|261861532|dbj|BAI47288.1| prolyl 4-hydroxylase, alpha polypeptide I [synthetic construct]
 gi|410295852|gb|JAA26526.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
 gi|410349611|gb|JAA41409.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 534

 Score =  124 bits (311), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 69/204 (33%), Positives = 105/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++T+ +  TG  +    R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|170064956|ref|XP_001867741.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
 gi|167882144|gb|EDS45527.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
          Length = 520

 Score =  124 bits (311), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 72/208 (34%), Positives = 111/208 (53%), Gaps = 24/208 (11%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+V  EP   VYH  +S  E   LI+LA+P + +S V D+++ Q   S++R S   + + 
Sbjct: 318 EVVNLEPLIVVYHEAVSDREIAKLIELARPLIKRSAVGDTRSEQI--SKIRISQNAWFEN 375

Query: 138 GQDRIIRGIEKRIADFTFIPMEHG-EGIQVLHYEVGQKYDAHYDYFL--DEFNTKNGGQR 194
             D I+  + +R  D      E   E +QV +Y +G  Y  HYD+    + F  K  G R
Sbjct: 376 EHDPIVETLNQRARDMAGGLNEPSYELLQVNNYGLGGFYSIHYDWSTSANPFPNKGMGNR 435

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +ATL+ YLSDV+EGG TVFP  N                   L+V+P++G A+ ++++  
Sbjct: 436 IATLMFYLSDVQEGGSTVFPRLN-------------------LAVRPRKGTAIFWYNLHR 476

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMH 282
           +   +  +LH  CPV+ G+KW A KW+H
Sbjct: 477 NGKGNKKTLHAACPVLIGSKWVANKWIH 504


>gi|291190128|ref|NP_001167431.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
 gi|223649060|gb|ACN11288.1| Prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
          Length = 538

 Score =  124 bits (311), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 70/203 (34%), Positives = 107/203 (52%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y N LS +E E + +LAKP + ++TV D KTG    +  R S   +L+  +D +I
Sbjct: 339 PHIVRYLNALSDSEIEKIKELAKPRLARATVRDPKTGVLTTANYRVSKSAWLEGEEDPVI 398

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATLL 199
             + +RI D T +  +  E +Q+ +Y VG +Y+ H+D+      D F T   G R+AT L
Sbjct: 399 ERVNQRIEDITGLTTQTAELLQIANYGVGGQYEPHFDFSRKDEPDAFKTLGTGNRVATFL 458

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP    +F               G ++ PK+G A+ ++++      D
Sbjct: 459 NYMSDVEAGGATVFP----DF---------------GAAIYPKKGTAVFWYNLFRSGEGD 499

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW+H
Sbjct: 500 YRTRHAACPVLVGCKWVSNKWIH 522


>gi|330799463|ref|XP_003287764.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
 gi|325082219|gb|EGC35708.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
          Length = 220

 Score =  124 bits (311), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 74/212 (34%), Positives = 111/212 (52%), Gaps = 38/212 (17%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++ +PR +    FL++ EC +LID +K  +     + S          R+  G F+K G+
Sbjct: 28  LSQKPRVYRIPEFLTEEECNHLIDTSKNKLRPCNEISSGVH-------RSGWGLFMKEGE 80

Query: 140 DR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNG------ 191
           +   + + I  ++ +F  I  +  E +Q++ Y  G++  AHYDYF     T NG      
Sbjct: 81  EEHPVTKNIFNKMKNFVNI-SDSCEVMQIIRYNPGEETSAHYDYF--NPLTTNGSMKIGL 137

Query: 192 -GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFW 250
            GQR+ T+LMYL DVEEGGET FP                     G+ VKP RGDA+LF+
Sbjct: 138 YGQRICTILMYLCDVEEGGETSFPEV-------------------GIKVKPIRGDAVLFY 178

Query: 251 SMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           + +P+  +DP SLH G PV +G KW A K ++
Sbjct: 179 NCKPNGDVDPLSLHQGDPVTKGTKWVAIKLIN 210


>gi|281206564|gb|EFA80750.1| putative prolyl 4-hydroxylase alpha subunit [Polysphondylium
           pallidum PN500]
          Length = 251

 Score =  124 bits (310), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 76/212 (35%), Positives = 112/212 (52%), Gaps = 38/212 (17%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+ +PR +    FL+  ECE+LI+ +K  +     + S          R+  G F+K G+
Sbjct: 60  VSQKPRIYRIPKFLTDEECEHLIETSKNKLKPCNEISSGVH-------RSGWGLFMKEGE 112

Query: 140 DR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNG------ 191
           +   + + I  R+  F  +  E  E +QV+ Y  G++  AH+DYF     T NG      
Sbjct: 113 EDHPVTQNIFNRMKTFVNL-TESSEVMQVIRYNPGEETSAHFDYF--NPLTTNGAMKIGL 169

Query: 192 -GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFW 250
            GQR+ T+LMYL+DVEEGGET FP  N                   + VKP +GDA+LF+
Sbjct: 170 YGQRICTILMYLADVEEGGETSFPEVN-------------------VKVKPIKGDAVLFY 210

Query: 251 SMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           + +P+  +DP SLH G PVI+G KW A K ++
Sbjct: 211 NCKPNGEVDPLSLHQGDPVIKGTKWIAIKLVN 242


>gi|344274274|ref|XP_003408942.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
           [Loxodonta africana]
          Length = 534

 Score =  124 bits (310), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 69/204 (33%), Positives = 105/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++T+ +  TG  +    R S   +L   ++ +
Sbjct: 334 KPRIVRFHDIISDAEIEVVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPDV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|312032356|ref|NP_001185665.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Gallus
           gallus]
          Length = 536

 Score =  124 bits (310), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 69/204 (33%), Positives = 105/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   + + +S  E E + +LAKP + ++TV D +TG+   +  R S   +L   +  +
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPV 395

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 396 VSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 455

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++ P    
Sbjct: 456 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFPSGEG 496

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 497 DYSTRHAACPVLVGNKWVSNKWLH 520


>gi|445499353|ref|ZP_21466208.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
 gi|444789348|gb|ELX10896.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
          Length = 272

 Score =  123 bits (309), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 75/210 (35%), Positives = 104/210 (49%), Gaps = 26/210 (12%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P+  +  N LS  EC+ +I        +STV     G S     RTS   F++RG+  +
Sbjct: 82  QPQIILLGNVLSDEECDAIIAHCGTRYTRSTVTGEADGSSMVHEGRTSEMAFIQRGEAEV 141

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKN------GGQRMA 196
              IE+R+A     P E  E  Q+  Y+  Q+Y  HYD+ LD  ++ +      GGQR+A
Sbjct: 142 AERIERRLAALAHWPAECSEPFQLQKYDATQEYRPHYDW-LDPDSSGHRSHLARGGQRLA 200

Query: 197 TLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
           T ++YLSDVE+GG TVFP                     GL V PK+G AL F +   + 
Sbjct: 201 TFILYLSDVEQGGGTVFPGL-------------------GLEVYPKKGSALWFLNTDINH 241

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
             D  +LHGG PV+RG K  A KW+    Y
Sbjct: 242 QPDKRTLHGGAPVVRGTKIIANKWLRQGRY 271


>gi|194765194|ref|XP_001964712.1| GF22904 [Drosophila ananassae]
 gi|190614984|gb|EDV30508.1| GF22904 [Drosophila ananassae]
          Length = 547

 Score =  123 bits (309), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 66/204 (32%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           EP   +YH+ +  +E E +  +A+P   ++TV +S TG  + +  R S   +LK  +D +
Sbjct: 342 EPYIVIYHDAMYDSEIELIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTEEDHV 401

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATL 198
           I  + +R AD T + M+  E +QV++Y +G  Y+ H+D+   E    F   N G R+AT+
Sbjct: 402 IGTVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDFARKEEKRAFEGLNLGNRIATV 461

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDVE+GG TVF + +                    ++ PK+G A  + ++  D   
Sbjct: 462 LFYMSDVEQGGATVFTSLHT-------------------ALFPKKGTAAFWMNLHRDGEG 502

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D  + H  CPV+ G KW + KW+H
Sbjct: 503 DVRTRHAACPVLTGTKWVSNKWIH 526


>gi|112984520|ref|NP_001037195.1| prolyl 4-hydroxylase alpha subunit precursor [Bombyx mori]
 gi|37543673|gb|AAM21932.1| prolyl 4-hydroxylase alpha subunit [Bombyx mori]
          Length = 550

 Score =  123 bits (309), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 68/202 (33%), Positives = 109/202 (53%), Gaps = 21/202 (10%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P  F++H  ++  E E++   AKP   ++ V D KTG+   +  R S  ++L+  +  +
Sbjct: 336 KPDIFMFHEVMTDDEIEFIKKRAKPRFKRAVVHDPKTGELTPAHYRISKSSWLRDEESPV 395

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN--TKNGGQRMATLLM 200
           I  I +R+ D T + M H E +QV++Y +G  Y+ H+D+     N  TK GG R+AT+L 
Sbjct: 396 IARITQRVTDMTGLSMLHAEELQVVNYGIGGHYEPHFDFARKRENPFTKFGGNRIATVLF 455

Query: 201 YLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDP 260
           Y+SDV +GG TVF              EL      GLS+ P +  A  + ++      D 
Sbjct: 456 YMSDVAQGGATVF-------------TEL------GLSLFPIKRAAAFWLNLHASGEGDL 496

Query: 261 SSLHGGCPVIRGNKWSATKWMH 282
           ++ H  CPV+RG+KW + KW+H
Sbjct: 497 ATRHAACPVLRGSKWVSNKWIH 518


>gi|115495019|ref|NP_001069238.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
 gi|122144801|sp|Q1RMU3.1|P4HA1_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|92097479|gb|AAI14709.1| Prolyl 4-hydroxylase, alpha polypeptide I [Bos taurus]
 gi|296472132|tpg|DAA14247.1| TPA: prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
 gi|440892721|gb|ELR45796.1| Prolyl 4-hydroxylase subunit alpha-1 [Bos grunniens mutus]
          Length = 534

 Score =  123 bits (309), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 69/204 (33%), Positives = 105/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++T+ +  TG  +    R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEVVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVLAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|195505190|ref|XP_002099397.1| GE10881 [Drosophila yakuba]
 gi|194185498|gb|EDW99109.1| GE10881 [Drosophila yakuba]
          Length = 487

 Score =  123 bits (309), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 65/204 (31%), Positives = 109/204 (53%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P   +YH+ +  +E + +  +A+P   ++TV +S TG  + +  R S   +LK  +DR+
Sbjct: 282 DPYIVIYHDAMYDSEIDVIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTHEDRV 341

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATL 198
           I  + +R AD T + ME  E +QV++Y +G  Y+ H+D+   E    F   N G R+AT+
Sbjct: 342 IGTVVQRTADMTGLDMESAEELQVVNYGIGGHYEPHFDFARKEEERAFEGLNLGNRIATV 401

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDVE+GG TVF + +                    ++ P++G A  + ++  D   
Sbjct: 402 LFYMSDVEQGGATVFTSLHT-------------------ALFPRKGTAAFWMNLHRDGQG 442

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D  + H  CPV+ G KW + KW+H
Sbjct: 443 DVRTRHAACPVLTGTKWVSNKWIH 466


>gi|312032354|ref|NP_001185664.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Gallus
           gallus]
          Length = 536

 Score =  123 bits (308), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 69/204 (33%), Positives = 105/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   + + +S  E E + +LAKP + ++TV D +TG+   +  R S   +L   +  +
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPV 395

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 396 VSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 455

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++ P    
Sbjct: 456 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFPSGEG 496

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 497 DYSTRHAACPVLVGNKWVSNKWLH 520


>gi|326923463|ref|XP_003207955.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
           [Meleagris gallopavo]
          Length = 536

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 69/204 (33%), Positives = 105/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   + + +S  E E + +LAKP + ++TV D +TG+   +  R S   +L   +  +
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPV 395

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 396 VSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 455

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++ P    
Sbjct: 456 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFPSGEG 496

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 497 DYSTRHAACPVLVGNKWVSNKWLH 520


>gi|218665910|ref|YP_002425647.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
 gi|218518123|gb|ACK78709.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
           ferrooxidans ATCC 23270]
          Length = 248

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 69/197 (35%), Positives = 99/197 (50%), Gaps = 20/197 (10%)

Query: 91  NFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRI 150
             L+   C+ LI + +  +  +TV D +TGQ      R S   + KR    I++ + + I
Sbjct: 70  GLLTPENCQNLIAIGQSLLRPATVTDEQTGQEVAHGERVSEMAWPKRDDYPILQSLAEGI 129

Query: 151 ADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT-KNGGQRMATLLMYLSDVEEGG 209
           A  T IP++  E +Q+LHY  G +Y  HYD F  +  T + GG R ATL++YL+ VEEGG
Sbjct: 130 AQLTGIPIDCQEPLQILHYRPGGEYKPHYDAFAADAPTLRQGGNRQATLILYLNAVEEGG 189

Query: 210 ETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPV 269
           ET FP                   + GL V P  G  + F ++  +    P SLH G PV
Sbjct: 190 ETAFP-------------------ELGLQVSPIPGGGVFFRNLNEEGQRHPLSLHAGLPV 230

Query: 270 IRGNKWSATKWMHVHEY 286
            +G KW AT+W+    Y
Sbjct: 231 RKGEKWIATQWIRQEAY 247


>gi|426255744|ref|XP_004021508.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Ovis
           aries]
          Length = 534

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 69/204 (33%), Positives = 105/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++T+ +  TG  +    R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVLAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|195110919|ref|XP_002000027.1| GI24860 [Drosophila mojavensis]
 gi|193916621|gb|EDW15488.1| GI24860 [Drosophila mojavensis]
          Length = 487

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 66/204 (32%), Positives = 109/204 (53%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P   +YH+ +  +E E L  +A+P   ++TV ++ TG  + +  R S   +LK  + R+
Sbjct: 282 DPYIVIYHDAMFDSEIEVLKRMARPRFRRATVQNAVTGALETANYRISKSAWLKTAEHRV 341

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATL 198
           I  + +R AD T + M+  E +QV++Y +G  Y+ H+D+   E    F   N G R+AT+
Sbjct: 342 IGTVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDFARREEIRAFEGLNLGNRIATV 401

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDVE+GG TVF + +A                    +KPK+G A  + ++      
Sbjct: 402 LFYMSDVEQGGATVFTSLHA-------------------VLKPKKGTAAFWMNLHRSGEG 442

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D  + H  CPV+ G+KW + KW+H
Sbjct: 443 DVRTRHAACPVLTGSKWVSNKWIH 466


>gi|125772807|ref|XP_001357662.1| GA15946 [Drosophila pseudoobscura pseudoobscura]
 gi|54637394|gb|EAL26796.1| GA15946 [Drosophila pseudoobscura pseudoobscura]
          Length = 549

 Score =  123 bits (308), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 68/204 (33%), Positives = 109/204 (53%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P   +YH+ +  +E + +  +A+P   ++TV +S TG  + +  R S   +LK  +D +
Sbjct: 344 DPYIVIYHDAMYDSEMDLIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTEEDSV 403

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATL 198
           I  + +R AD T + ME  E +QV++Y +G  Y  H+D+   E    F   N G R+AT+
Sbjct: 404 IAKVVQRTADMTGLDMESAEELQVVNYGIGGHYAPHFDFARREEKRAFEGLNLGNRIATV 463

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDVE+GG TV       FT++R             ++ PKRG A  + ++  D   
Sbjct: 464 LFYMSDVEQGGATV-------FTTLR------------TALWPKRGTAAFWMNLHRDGEG 504

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D  + H  CPV+ G KW + KW+H
Sbjct: 505 DKRTQHAACPVLTGTKWVSNKWIH 528


>gi|195452776|ref|XP_002073495.1| GK13117 [Drosophila willistoni]
 gi|194169580|gb|EDW84481.1| GK13117 [Drosophila willistoni]
          Length = 487

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 70/215 (32%), Positives = 113/215 (52%), Gaps = 30/215 (13%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  +P   +YH+ +S  E   L ++AKP + ++ V +S     + S+ RT+   +   
Sbjct: 282 ELIGLDPYMVLYHDVISPNEIAELQEMAKPQLKRARVYNSTKNTDQLSKTRTAKLAWFLD 341

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNG------ 191
             +++   + +RI D T   +   E +QV++Y +G  Y  H+DY    FNT  G      
Sbjct: 342 TFNQLTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDY----FNTTKGPHITQI 397

Query: 192 -GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFW 250
            G R+AT+L YL+DVE+GG TVFP                E  K   +V PKRG A++++
Sbjct: 398 NGDRIATVLFYLNDVEQGGATVFP----------------EIKK---AVFPKRGSAIMWY 438

Query: 251 SMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
           +++ D   +  +LH GCPVI G+KW   KW+   E
Sbjct: 439 NLKDDGEGNRDTLHAGCPVIVGSKWVCNKWIRERE 473


>gi|212530|gb|AAA49002.1| prolyl 4-hydroxylase, alpha subunit (EC 1.14.11.2), partial [Gallus
           gallus]
          Length = 489

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 69/204 (33%), Positives = 105/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   + + +S  E E + +LAKP + ++TV D +TG+   +  R S   +L   +  +
Sbjct: 289 KPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPV 348

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 349 VSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 408

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++ P    
Sbjct: 409 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFPSGEG 449

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 450 DYSTRHAACPVLVGNKWVSNKWLH 473


>gi|129365|sp|P16924.1|P4HA1_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1
          Length = 516

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 69/204 (33%), Positives = 105/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   + + +S  E E + +LAKP + ++TV D +TG+   +  R S   +L   +  +
Sbjct: 316 KPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPV 375

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 376 VSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 435

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++ P    
Sbjct: 436 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFPSGEG 476

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 477 DYSTRHAACPVLVGNKWVSNKWLH 500


>gi|195575143|ref|XP_002105539.1| GD16913 [Drosophila simulans]
 gi|194201466|gb|EDX15042.1| GD16913 [Drosophila simulans]
          Length = 534

 Score =  122 bits (307), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 73/213 (34%), Positives = 109/213 (51%), Gaps = 29/213 (13%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E +  +P   +YH  LS  E   LI  A   M K+T V  + G  K +R RT+ G + K+
Sbjct: 320 EQIGLDPYVVLYHEVLSAREISMLIGKAAQNM-KNTRVHKEQGVPKKNRGRTAKGFWFKK 378

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNG-- 191
             + + +GI +RI D T   +   EG QV++Y +G  Y  H DYF     +  +T++G  
Sbjct: 379 ESNELTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYLLHMDYFDFASSNHTDTRSGYS 438

Query: 192 ---GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALL 248
              G R+AT+L YL+DVE+GG TVF                      G SV P+ G A+ 
Sbjct: 439 MDLGDRIATVLFYLTDVEQGGATVF-------------------ADVGYSVYPQAGTAIF 479

Query: 249 FWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           ++++  +   DP + H  CPVI G+KW  T+W+
Sbjct: 480 WYNLDTNGKGDPRTRHAACPVIVGSKWVMTEWI 512


>gi|195159323|ref|XP_002020531.1| GL13463 [Drosophila persimilis]
 gi|194117300|gb|EDW39343.1| GL13463 [Drosophila persimilis]
          Length = 487

 Score =  122 bits (307), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 68/204 (33%), Positives = 109/204 (53%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P   +YH+ +  +E + +  +A+P   ++TV +S TG  + +  R S   +LK  +D +
Sbjct: 282 DPYIVIYHDAMYDSEMDLIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTEEDSV 341

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATL 198
           I  + +R AD T + ME  E +QV++Y +G  Y  H+D+   E    F   N G R+AT+
Sbjct: 342 IAKVVQRTADMTGLDMESAEELQVVNYGIGGHYAPHFDFARREEKRAFEGLNLGNRIATV 401

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDVE+GG TV       FT++R             ++ PKRG A  + ++  D   
Sbjct: 402 LFYMSDVEQGGATV-------FTTLR------------TALWPKRGTAAFWMNLHRDGEG 442

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D  + H  CPV+ G KW + KW+H
Sbjct: 443 DKRTQHAACPVLTGTKWVSNKWIH 466


>gi|195452726|ref|XP_002073473.1| GK14136 [Drosophila willistoni]
 gi|194169558|gb|EDW84459.1| GK14136 [Drosophila willistoni]
          Length = 550

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 66/204 (32%), Positives = 109/204 (53%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P   +YH+ +  +E + +  +A+P   ++TV +S TG  + +  R S   +LK  +D++
Sbjct: 345 DPYIVIYHDAMYDSEMDLIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTEEDQV 404

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATL 198
           I  + +R AD T + M+  E +QV++Y +G  Y+ H+D+   E    F   N G R+AT+
Sbjct: 405 IGTVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDFARREEKRAFEGLNLGNRIATV 464

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDVE+GG TVF + +A      W               PK+G A  + ++  D   
Sbjct: 465 LFYMSDVEQGGATVFTSLHAAL----W---------------PKKGTAAFWMNLHRDGEG 505

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D  + H  CPV+ G KW + KW+H
Sbjct: 506 DVRTRHAACPVLTGTKWVSNKWIH 529


>gi|416009427|ref|ZP_11561250.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
           [Acidithiobacillus sp. GGI-221]
 gi|339836568|gb|EGQ64151.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
           [Acidithiobacillus sp. GGI-221]
          Length = 196

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 68/197 (34%), Positives = 98/197 (49%), Gaps = 20/197 (10%)

Query: 91  NFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRI 150
             L+   C+ LI + +  +  +TV D +TGQ      R S   + KR    I++ + + I
Sbjct: 18  GLLTPENCQNLIAIGQSLLRPATVTDEQTGQEVAHGERVSEMAWPKRDDHPILQSLAEGI 77

Query: 151 ADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT-KNGGQRMATLLMYLSDVEEGG 209
           A  T IP++  E +Q+LHY  G +Y  HYD F  +  T + GG R  TL++YL+ VEEGG
Sbjct: 78  AQLTGIPIDCQEPLQILHYRPGGEYKPHYDAFAADAPTLRQGGNRQGTLILYLNAVEEGG 137

Query: 210 ETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPV 269
           ET FP                   + GL V P  G  + F ++  +    P SLH G PV
Sbjct: 138 ETAFP-------------------ELGLQVSPIPGGGVFFRNLNEEGQRHPLSLHAGLPV 178

Query: 270 IRGNKWSATKWMHVHEY 286
            +G KW AT+W+    Y
Sbjct: 179 RKGEKWIATQWIRQEAY 195


>gi|449280261|gb|EMC87600.1| Prolyl 4-hydroxylase subunit alpha-1 [Columba livia]
          Length = 536

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 68/204 (33%), Positives = 105/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   + + +S  E E + +LAKP + ++TV D +TG+   +  R S   +L   +  +
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPV 395

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 396 VSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 455

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV P++G A+ ++++ P    
Sbjct: 456 LFYMSDVSAGGATVFPEV-------------------GASVWPRKGTAVFWYNLFPSGEG 496

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 497 DYSTRHAACPVLVGNKWVSNKWLH 520


>gi|148233143|ref|NP_001090904.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Sus scrofa]
 gi|83778522|gb|ABC47142.1| procollagen-proline 2-oxoglutarate-4-dioxygenase [Sus scrofa]
          Length = 534

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 67/204 (32%), Positives = 105/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE + + DLAKP + ++T+ +  TG  +    R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIDIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  +  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRLNMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 495 DYSTRHAACPVLVGNKWVSNKWLH 518


>gi|327267604|ref|XP_003218589.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Anolis
           carolinensis]
          Length = 542

 Score =  122 bits (306), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 69/203 (33%), Positives = 104/203 (51%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR   +   +S  E E + +LAKP + ++TV D +TG+   +  R S   +L   ++ I+
Sbjct: 343 PRIVRFVEIISDEEIETVKELAKPRLSRATVHDPQTGKLTTAHYRVSKSAWLSGYENPIV 402

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATLL 199
             I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT L
Sbjct: 403 ARINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWL 462

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDV  GG TVFP                     G SV P++G A+ ++++ P    D
Sbjct: 463 FYMSDVSAGGATVFPEV-------------------GASVWPRKGTAVFWYNLFPSGEGD 503

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
            S+ H  CPV+ GNKW + KW+H
Sbjct: 504 YSTRHAACPVLVGNKWVSNKWIH 526


>gi|290243077|ref|YP_003494747.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
 gi|288945582|gb|ADC73280.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
          Length = 575

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 78/209 (37%), Positives = 109/209 (52%), Gaps = 27/209 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E ++ +P       FL   ECE LI LA+  M ++ V  S  G S  S+ RT S  +L+ 
Sbjct: 51  ETLSQDPLVVYLDEFLEPGECEALIHLAQGRMKRALV--SLDGSSGVSQGRTGSNCWLRY 108

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYD-YFLDEFN----TKNGG 192
            ++ + R I +R+A     P+E+ E +QV+HY   Q+Y  HYD Y LD       T+ GG
Sbjct: 109 QEEPLARRIGERVAKRVGFPLEYAEPLQVIHYGHEQEYRPHYDAYDLDTPRGLRCTRQGG 168

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
           QRM T L+YL++VEEGG T FP A                   G+ V P++G   +F ++
Sbjct: 169 QRMVTALLYLNEVEEGGATAFPNA-------------------GVEVAPRKGRIAIFNNV 209

Query: 253 RPD-ATLDPSSLHGGCPVIRGNKWSATKW 280
             D     P SLHGG PV  G KW+A+ W
Sbjct: 210 GADPGRPHPRSLHGGMPVKSGEKWAASIW 238


>gi|91091610|ref|XP_969386.1| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
           putative [Tribolium castaneum]
 gi|270001037|gb|EEZ97484.1| hypothetical protein TcasGA2_TC011321 [Tribolium castaneum]
          Length = 536

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 61/203 (30%), Positives = 110/203 (54%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P  F++ + L+ +E   +  +A+P   ++TV ++ TG+ + ++ R S   +LK  + + I
Sbjct: 332 PDIFIFRDVLADSEIATIKRMAQPRFKRATVQNTDTGELEIAQYRISKSAWLKEEEHKHI 391

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             + +R++D T + M   E +QV++Y +G  Y+ H+D+      + F +   G R+AT+L
Sbjct: 392 ADVSQRVSDMTGLTMSTAEELQVVNYGIGGHYEPHFDFARRDERNAFKSLGTGNRIATVL 451

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE+GG TVFP+       V  W              P++G A  ++++ P    D
Sbjct: 452 FYMSDVEQGGATVFPS-----IQVSLW--------------PQKGSAAFWYNLHPSGDGD 492

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G+KW + KW+H
Sbjct: 493 KMTRHAACPVLTGSKWVSNKWIH 515


>gi|292619367|ref|XP_001922562.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Danio rerio]
          Length = 541

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 66/203 (32%), Positives = 106/203 (52%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR   YH  +++ E E + +L+KP + ++T+ +  TG  + +  R S   +L   +  ++
Sbjct: 342 PRIIRYHEIITEQEIEKIKELSKPRLRRATISNPITGVLETAHYRISKSAWLAAYEHPVV 401

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATLL 199
             I +RI D T + ++  E +QV +Y VG +Y+ H+D+      D F     G R+AT L
Sbjct: 402 DRINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWL 461

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDV  GG TVFP                     G +VKP +G A+ ++++ P    D
Sbjct: 462 FYMSDVAAGGATVFPEV-------------------GAAVKPLKGTAVFWYNLFPSGEGD 502

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
            S+ H  CPV+ GNKW + KW+H
Sbjct: 503 YSTRHAACPVLVGNKWVSNKWIH 525


>gi|195452778|ref|XP_002073496.1| GK13116 [Drosophila willistoni]
 gi|194169581|gb|EDW84482.1| GK13116 [Drosophila willistoni]
          Length = 521

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 67/211 (31%), Positives = 113/211 (53%), Gaps = 22/211 (10%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  +P   +YH+ +S  E   L ++AKP + ++TV +S    ++  + RT+   +   
Sbjct: 319 ELIGLDPYMVLYHDVISPNEIAELQEMAKPELKRATVYNSTKNTNQFVKTRTAKVAWFLD 378

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN---TKNGGQR 194
             +++   + +RI D T   +   E +QV++Y +G  Y  H+DYF    N   ++  G R
Sbjct: 379 TFNQLTERLNQRIMDMTNFVLNGSEMLQVMNYGLGGYYVKHFDYFNTTTNPHISQINGDR 438

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +AT+L YL+DVE+GG TVFP                E  K   +V PKRG A+++++++ 
Sbjct: 439 IATVLFYLNDVEQGGATVFP----------------EIKK---AVFPKRGSAIMWYNLKD 479

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
           D   +  +LH  CPVI G+KW   KW+   E
Sbjct: 480 DGEGNRDTLHAACPVIVGSKWVCNKWIRERE 510


>gi|47213360|emb|CAF90979.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 511

 Score =  121 bits (304), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 72/210 (34%), Positives = 109/210 (51%), Gaps = 30/210 (14%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR   YH+ LS  E E + +LA+P + ++TV D +TGQ   +  R S   +L   +  I+
Sbjct: 307 PRIVRYHDVLSNREMEKVKELARPRLRRATVHDPRTGQLTTAPYRVSKSAWLGAFEHPIV 366

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATLL 199
             I +RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT L
Sbjct: 367 DQINQRIEDITGLDVSTAEDLQVANYGVGGQYEPHFDFGQKDEPDAFEELGTGNRIATWL 426

Query: 200 MY-------LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
           +Y       +SDV+ GG TV       FT +            G SV P++G A+ ++++
Sbjct: 427 LYVSAAVLRMSDVQAGGATV-------FTDI------------GASVLPQKGSAVFWYNL 467

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           RP    D  + H  CPV+ GNKW + KW+H
Sbjct: 468 RPSGDGDYRTRHAACPVLLGNKWVSNKWIH 497


>gi|147791524|emb|CAN70717.1| hypothetical protein VITISV_029140 [Vitis vinifera]
          Length = 173

 Score =  121 bits (304), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 66/143 (46%), Positives = 85/143 (59%), Gaps = 32/143 (22%)

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           ++ IEKRI+ ++ +P+E+GE IQ                    FN K GGQR+AT+L+YL
Sbjct: 55  LQAIEKRISVYSQVPVENGELIQ--------------------FNLKRGGQRVATMLIYL 94

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQ---GLSVKPKRGDALLFWSMRPDATLD 259
           SD  EGGET FP A + F           CG +   GLSV P +G+A+LFWSM  D   D
Sbjct: 95  SDNVEGGETYFPMAGSGFC---------RCGGKSVRGLSVAPVKGNAVLFWSMGLDGQSD 145

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
           P+S+HGGC V+ G KWSATKWM 
Sbjct: 146 PNSIHGGCEVLAGEKWSATKWMR 168


>gi|167519971|ref|XP_001744325.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777411|gb|EDQ91028.1| predicted protein [Monosiga brevicollis MX1]
          Length = 492

 Score =  121 bits (304), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 64/198 (32%), Positives = 103/198 (52%), Gaps = 21/198 (10%)

Query: 85  RAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIR 144
           R  ++ NF S  EC +L +  +  +  S  V    G  +    R S+  +L+   D ++ 
Sbjct: 305 RLQIFRNFASAQECAHLREEGRKKL--SRAVAWTDGAFRPVEFRISTAAWLQPDHDDVVT 362

Query: 145 GIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSD 204
            +  RIAD T + +E  E +QV +Y +G  Y+ HYD+          G R+AT ++YL+ 
Sbjct: 363 NLHTRIADATQLDLEFAEALQVSNYGIGGFYETHYDHHASRERELPEGDRIATFMIYLNQ 422

Query: 205 VEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLH 264
           VE+GG T FP                   + G +V+P  GDA+ ++++ PD   D ++LH
Sbjct: 423 VEQGGYTAFP-------------------RLGAAVEPGHGDAVFWYNLLPDGESDNNTLH 463

Query: 265 GGCPVIRGNKWSATKWMH 282
           G CPV++G+KW A KW+H
Sbjct: 464 GACPVLQGSKWVANKWIH 481


>gi|195391754|ref|XP_002054525.1| GJ24502 [Drosophila virilis]
 gi|194152611|gb|EDW68045.1| GJ24502 [Drosophila virilis]
          Length = 487

 Score =  121 bits (304), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 66/204 (32%), Positives = 108/204 (52%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P   +YH+ +  +E E +  +A+P   ++TV +S TG  + +  R S   +LK  + R+
Sbjct: 282 DPYIVIYHDAMYDSEIEIIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTAEHRV 341

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATL 198
           I  + +R AD T + M+  E +QV++Y +G  Y+ H+D+   E    F   N G R+AT+
Sbjct: 342 IGTVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDFARREEKRAFEGLNLGNRIATM 401

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDVE+GG TVF + +A      W               PK+G A  + ++      
Sbjct: 402 LFYMSDVEQGGATVFTSLHAAL----W---------------PKKGTAAFWMNLHRSGEG 442

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D  + H  CPV+ G+KW + KW+H
Sbjct: 443 DVRTRHAACPVLTGSKWVSNKWIH 466


>gi|301613004|ref|XP_002936004.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
           (Silurana) tropicalis]
          Length = 526

 Score =  121 bits (304), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 65/204 (31%), Positives = 106/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   YH+ +S  E   + +LAKP + ++T+ +  TG  + ++ R +   +L   +D +
Sbjct: 326 KPRIVRYHDIISDEEISKVKELAKPRLRRATISNPITGVLETAQYRITKSAWLSGYEDPV 385

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  + +RI   T + M   E +QV +Y +G +Y+ H+D+      D F     G R+AT 
Sbjct: 386 VARLNRRIEGVTGLDMSTAEELQVANYGIGGQYEPHFDFLRKYEPDAFKKLGTGNRVATW 445

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDVE GG TVFP                     G +V PK+G A+ ++++      
Sbjct: 446 LFYMSDVEAGGATVFPEV-------------------GAAVYPKKGTAVFWYNLLESGEG 486

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 487 DYSTRHAACPVLVGNKWVSNKWIH 510


>gi|195341588|ref|XP_002037388.1| GM12140 [Drosophila sechellia]
 gi|194131504|gb|EDW53547.1| GM12140 [Drosophila sechellia]
          Length = 534

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 73/215 (33%), Positives = 107/215 (49%), Gaps = 33/215 (15%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E +  +P   +YH  LS  E   LI  A   M K+T V  + G  K +R RT+ G + K+
Sbjct: 320 EQIGLDPYVVLYHEVLSAREISMLIGKATQNM-KNTRVHKEQGVPKKNRGRTAKGFWFKK 378

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNG------ 191
             + + +GI +RI D T   +   EG QV++Y +G  Y  H DYF  +F + N       
Sbjct: 379 ESNELTKGITRRIMDMTGFDLADSEGFQVINYGIGGHYLLHMDYF--DFASSNHTDTRSS 436

Query: 192 -----GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDA 246
                G R+AT+L YL+DVE+GG TVF                      G SV P+ G A
Sbjct: 437 YSMDLGDRIATVLFYLTDVEQGGATVF-------------------ADVGYSVYPQAGTA 477

Query: 247 LLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           + ++++  +   DP + H  CPVI G+KW  T+W+
Sbjct: 478 IFWYNLDTNGKGDPRTKHAACPVIVGSKWVMTEWI 512


>gi|221460681|ref|NP_733394.3| CG31013 [Drosophila melanogaster]
 gi|220903261|gb|AAF57073.4| CG31013 [Drosophila melanogaster]
          Length = 534

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 73/216 (33%), Positives = 107/216 (49%), Gaps = 33/216 (15%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
           TE +  +P   +YH  LS  E   LI  A   M K+T +  +    K +R RT+ G +LK
Sbjct: 319 TEQIGLDPYVVLYHEVLSAREISMLIGKAAQNM-KNTKIHKERAVPKKNRGRTAKGFWLK 377

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNG----- 191
           +  + + + I +RI D T   +   EG QV++Y +G  Y  H DYF  +F + N      
Sbjct: 378 KESNELTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYF--DFASSNHTDTRS 435

Query: 192 ------GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGD 245
                 G R+AT+L YL+DVE+GG TVF                   G  G  V P+ G 
Sbjct: 436 RYSIDLGDRIATVLFYLTDVEQGGATVF-------------------GDVGYYVSPQAGT 476

Query: 246 ALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           A+ ++++  D   DP + H  CPVI G+KW  T+W+
Sbjct: 477 AIFWYNLDTDGNGDPRTRHAACPVIVGSKWVMTEWI 512


>gi|85857698|gb|ABC86384.1| IP10964p [Drosophila melanogaster]
          Length = 534

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 73/216 (33%), Positives = 107/216 (49%), Gaps = 33/216 (15%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
           TE +  +P   +YH  LS  E   LI  A   M K+T +  +    K +R RT+ G +LK
Sbjct: 319 TEQIGLDPYVVLYHEVLSAREISMLIGKAAQNM-KNTKIHKERAVPKKNRGRTAKGFWLK 377

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNG----- 191
           +  + + + I +RI D T   +   EG QV++Y +G  Y  H DYF  +F + N      
Sbjct: 378 KESNELTKRITRRIMDMTGFDLADSEGFQVINYGIGGHYFLHMDYF--DFASSNHTDTRS 435

Query: 192 ------GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGD 245
                 G R+AT+L YL+DVE+GG TVF                   G  G  V P+ G 
Sbjct: 436 RYSIDLGDRIATVLFYLTDVEQGGATVF-------------------GDVGYYVSPQAGT 476

Query: 246 ALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           A+ ++++  D   DP + H  CPVI G+KW  T+W+
Sbjct: 477 AIFWYNLDTDGNGDPRTRHAACPVIVGSKWVMTEWI 512


>gi|383642155|ref|ZP_09954561.1| hypothetical protein SeloA3_06917 [Sphingomonas elodea ATCC 31461]
          Length = 327

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 71/200 (35%), Positives = 103/200 (51%), Gaps = 25/200 (12%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFL-KRGQDR 141
           +PR   +  FLS+ EC ++   A+  +  S V+D  +G+     +RTS G  +    ++ 
Sbjct: 139 DPRVEHFPGFLSREECAHVATTAQDLLEPSFVLDPNSGRPIPHPIRTSDGGAIGPTNENL 198

Query: 142 IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMY 201
           ++R I  RIA  T   +E GE + VL Y  GQ+Y  H D      N     QR+AT ++Y
Sbjct: 199 VVRAINLRIAAATGTAVEQGESLTVLRYARGQEYRRHLDTIAGAEN-----QRIATFIVY 253

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+D  EGGET FP  N                   + V+P+ GDA+ F ++RPD T DP 
Sbjct: 254 LNDGFEGGETHFPLLN-------------------IQVRPRIGDAIRFDTIRPDGTPDPR 294

Query: 262 SLHGGCPVIRGNKWSATKWM 281
            +H G PV  G KW AT+W+
Sbjct: 295 LVHAGQPVRNGVKWIATRWI 314


>gi|47550697|ref|NP_999856.1| prolyl 4-hydroxylase, alpha polypeptide I b precursor [Danio rerio]
 gi|28277826|gb|AAH45890.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [Danio rerio]
          Length = 536

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 70/203 (34%), Positives = 106/203 (52%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR   YH  +S +E E + ++AKP + ++T+ +  TG  + +  R S   +L   +   I
Sbjct: 337 PRIVRYHEIISDSEIETVKEMAKPRLRRATISNPITGVLETAPYRISKSAWLSGYEHSTI 396

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATLL 199
             I +RI D T + M+  E +QV +Y VG +Y+ H+D+      D F     G R+AT L
Sbjct: 397 ERINQRIEDVTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWL 456

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDV  GG TV       FT V            G +V PK+G A+ ++++ P    D
Sbjct: 457 FYMSDVSAGGATV-------FTDV------------GAAVWPKKGTAVFWYNLFPSGEGD 497

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
            S+ H  CPV+ GNKW + KW+H
Sbjct: 498 YSTRHAACPVLVGNKWVSNKWIH 520


>gi|194905294|ref|XP_001981167.1| GG11919 [Drosophila erecta]
 gi|190655805|gb|EDV53037.1| GG11919 [Drosophila erecta]
          Length = 533

 Score =  121 bits (303), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 73/215 (33%), Positives = 108/215 (50%), Gaps = 30/215 (13%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
           TE +  +P   +YH  LS  E   L+  A   M K+T V S+   + + R RT+ G +LK
Sbjct: 319 TEQIGLKPYVVLYHEVLSAREISMLMGKAAQNM-KNTRVQSEKAVNTN-RERTAKGYWLK 376

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNG----- 191
           +  + + R I +RI D T   +   E  QV++Y +G  Y  H+DYF    +   G     
Sbjct: 377 KESNEMTRRITRRIVDMTGFDLADSEDFQVINYGIGGHYSLHFDYFGFASSNYTGERSHH 436

Query: 192 ----GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDAL 247
               G R+AT+L YL+DVE+GG TVF                   G  G SV P+ G A+
Sbjct: 437 SIVLGDRIATVLFYLTDVEQGGATVF-------------------GNVGYSVYPQAGTAI 477

Query: 248 LFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
            ++++  D   DP + H  CPV+ G+KW  T+W+H
Sbjct: 478 FWYNLDTDGNGDPLTRHASCPVVVGSKWVMTEWIH 512


>gi|321474953|gb|EFX85917.1| hypothetical protein DAPPUDRAFT_309108 [Daphnia pulex]
          Length = 549

 Score =  121 bits (303), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 67/205 (32%), Positives = 106/205 (51%), Gaps = 24/205 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P   +YH+ +   E E +  LA P   ++TV++S TG+ + ++ R S   FLK  +   
Sbjct: 346 KPLLVIYHDVIFDEEIETVKKLAHPRFKRTTVMNSATGKLETAKYRISKAAFLKNKEHHH 405

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNG-GQRMAT 197
           +  + +R+   T + M   E +QV +Y +G  Y+ H+DY        FN  +G   R+AT
Sbjct: 406 VLKMSRRVGAITGLDMSTAEDLQVCNYGIGGHYEPHFDYARKNETIGFNKDSGWRNRIAT 465

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
            L Y+SDVE GG TVFPA N     V  W              P++G A  ++++ P+  
Sbjct: 466 WLFYMSDVEAGGATVFPALN-----VALW--------------PQKGSAAFWYNLFPNGE 506

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMH 282
            +  + H  CPV+ G+KW A KW+H
Sbjct: 507 GNELTRHAACPVLTGSKWVANKWIH 531


>gi|224052167|ref|XP_002191912.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Taeniopygia
           guttata]
          Length = 536

 Score =  120 bits (302), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 68/204 (33%), Positives = 104/204 (50%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   + + +S  E E + +LAKP + ++TV D +TG+   +  R S   +L   +  +
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGYESPV 395

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 396 VSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 455

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV P++G A+ ++++ P    
Sbjct: 456 LFYMSDVSAGGATVFPEV-------------------GASVWPRKGTAVFWYNLFPSGEG 496

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW   KW+H
Sbjct: 497 DYSTRHAACPVLVGNKWVFNKWLH 520


>gi|195505251|ref|XP_002099423.1| GE23370 [Drosophila yakuba]
 gi|194185524|gb|EDW99135.1| GE23370 [Drosophila yakuba]
          Length = 534

 Score =  120 bits (302), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 76/214 (35%), Positives = 107/214 (50%), Gaps = 30/214 (14%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
           TE +  +P   +YH  LS  E   LI  A   M K+T V  +T + K +R RT+ G +LK
Sbjct: 320 TEQIGLDPYVVLYHEVLSAREISMLISKAAQNM-KNTRVHRET-KPKTNRGRTAKGHWLK 377

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNG----- 191
           +  + + R I +RI D T   +   E  QV++Y +G  Y  H DYF    +   G     
Sbjct: 378 KESNELTRRITRRIVDMTGFDLADSEDFQVINYGIGGHYFLHMDYFDYASSNYTGPRSRQ 437

Query: 192 ----GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDAL 247
               G R+AT+L YLSDVE+GG TVF                   G  G SV P+ G A+
Sbjct: 438 SKVLGDRIATVLFYLSDVEQGGATVF-------------------GNVGYSVYPQAGTAI 478

Query: 248 LFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
            ++++  D   DP + H  CPVI G+KW  T+W+
Sbjct: 479 FWYNLDTDGNGDPLTRHASCPVIVGSKWVMTEWI 512


>gi|242018356|ref|XP_002429643.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
           humanus corporis]
 gi|212514628|gb|EEB16905.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
           humanus corporis]
          Length = 534

 Score =  120 bits (302), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 65/204 (31%), Positives = 106/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR  +YH+ LS  E + +  LA P   ++TV +S+TG+ + +  R S   +L+      
Sbjct: 333 DPRIVLYHDVLSDREIKTIQQLAVPRFKRATVQNSETGKLEVAHYRISKSAWLEDVDHPY 392

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATL 198
           +  + +R+ D T + M   E +QV++Y +G  Y+ H+D+   E    F +   G R+AT+
Sbjct: 393 VAKVSQRVEDITGLNMATAESLQVVNYGIGGHYEPHFDFARKEEKNAFQSLGTGNRIATI 452

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV +GG TVFP        V  W              PK+G A  ++++R +   
Sbjct: 453 LFYMSDVSQGGATVFPG-----IKVSLW--------------PKKGTAAFWYNLRKNGEG 493

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D  + H  CPV+ G+KW   KW+H
Sbjct: 494 DYLTRHAACPVLTGSKWVCNKWIH 517


>gi|115313004|gb|AAI24075.1| Zgc:152670 [Danio rerio]
          Length = 235

 Score =  120 bits (302), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 72/200 (36%), Positives = 107/200 (53%), Gaps = 32/200 (16%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           EP+   YH+ +S  E E L D+A+P + +S     +TG    S +RTS   FL   +   
Sbjct: 52  EPKIIRYHDVISDTEIETLKDIARPELTRS-----QTGWGVISEIRTSQSVFLD--EVGT 104

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           +  I +RIAD T + +E  E + V +Y +G +Y  H+D   D        +R AT L+Y+
Sbjct: 105 VARISQRIADITGLSVESAEKLHVQNYGIGGRYTPHFDAGGDV------NERTATFLIYM 158

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           SDVE GG TVF       T+V            G++VKP++G A+ + ++  +  LD  +
Sbjct: 159 SDVEVGGATVF-------TNV------------GVAVKPEKGSAVFWNNLHKNGELDLKT 199

Query: 263 LHGGCPVIRGNKWSATKWMH 282
            H GCPV+ GNKW A KW+H
Sbjct: 200 KHAGCPVLVGNKWVANKWIH 219


>gi|443709454|gb|ELU04126.1| hypothetical protein CAPTEDRAFT_167710 [Capitella teleta]
          Length = 535

 Score =  120 bits (301), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 67/208 (32%), Positives = 110/208 (52%), Gaps = 22/208 (10%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E + ++P   VYH  +S  + + +  LA P + ++TVV+S TG+ + ++ R S   +LK 
Sbjct: 331 ETMNFDPWIAVYHQLMSDKDIDDIKALATPRLARATVVNSVTGELEFAKYRISKSGWLKD 390

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNTKNG--GQR 194
            +   +  I  R +  T + +   E +Q+ +Y +G  Y+ H+DY  L E  + +   G R
Sbjct: 391 EEHPTVAKISNRCSALTNLSLSTVEELQIANYGIGGHYEPHFDYSRLAEVTSFDHWRGNR 450

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           + T++ YLSDVE GG TVF  A                   G  ++P++G A +++++ P
Sbjct: 451 ILTVIFYLSDVEAGGGTVFMTA-------------------GTKLRPEKGAAAVWYNLHP 491

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMH 282
           D T D  + H  CPV+ GNKW A KW H
Sbjct: 492 DGTGDDETKHAACPVLTGNKWVANKWFH 519


>gi|403183473|gb|EJY58123.1| AAEL017524-PA, partial [Aedes aegypti]
          Length = 212

 Score =  120 bits (300), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 70/210 (33%), Positives = 111/210 (52%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E  + +P   +YHN +S  E E +I ++KP + +S V +S + +   S  RTS   +L  
Sbjct: 7   EEASLDPLIVIYHNAISDKEIEQIIQVSKPMLKRSMVGESFSKEV--SNERTSQNAWLAD 64

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-----DEFNTKNGG 192
               +++ +  R  D T +  +  E +QV +Y +G  Y  H+D+       + +     G
Sbjct: 65  YDFELVKVLSLRTEDMTGLDRKSYESLQVNNYGIGGFYLPHFDWVRTNGTEEPYKDMGLG 124

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+ATL+ YLSDVE+GG TVFP                   + G+ V PK+G A+ ++++
Sbjct: 125 NRIATLMYYLSDVEQGGATVFP-------------------QIGVGVFPKKGSAIFWYNL 165

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
            PD T D  +LHG CPV+ G+KW A KW+H
Sbjct: 166 LPDGTGDERTLHGACPVLLGSKWVANKWIH 195


>gi|260825357|ref|XP_002607633.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
 gi|229292981|gb|EEN63643.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
          Length = 520

 Score =  120 bits (300), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 69/208 (33%), Positives = 109/208 (52%), Gaps = 22/208 (10%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E V  +P+ +V HN L+  E E +  LA+P + ++ V    TG+ + +  R S   +L  
Sbjct: 316 EQVFDKPKLWVLHNILTDPEMEVIKKLAQPRLRRARVESPTTGEGELASYRISKSAWLYD 375

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL--DEFNTK-NGGQR 194
            + R+IR + +R+ D T + ME  E +QV++Y +G  Y+ H+D     +EF    N G R
Sbjct: 376 WEHRVIRRVNQRVEDVTGLTMETAELLQVVNYGIGGHYEPHFDCATKDEEFALDPNEGDR 435

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +AT+L Y+SDVE GG TVFP                     G  V P++G    ++++  
Sbjct: 436 IATMLFYMSDVEAGGATVFPQV-------------------GARVVPEKGAGAFWYNLLK 476

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMH 282
               D  + H GCPV+ G+KW + KW+H
Sbjct: 477 SGEGDMLTEHAGCPVLVGSKWVSNKWIH 504


>gi|255545252|ref|XP_002513687.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223547595|gb|EEF49090.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 309

 Score =  120 bits (300), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 68/200 (34%), Positives = 109/200 (54%), Gaps = 12/200 (6%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W PR F+Y  FL+  EC+ LI LA      +  +    G    + ++ +S        
Sbjct: 61  LSWRPRVFLYKGFLTDEECDRLISLAH----GAKEISKGKGDGSRNNIQLASSESRSHIY 116

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++  IE+RI+ +TFIP E+ + +QV+HY + +  + H+DYF D     +    MATL+
Sbjct: 117 DDLLARIEERISAWTFIPKENSKPLQVMHYGIEEARE-HFDYF-DNKTLISNVSLMATLV 174

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           +YLS+V  GGE +FP   +      W    S+C K    ++P +G+A+L ++   +A+ D
Sbjct: 175 LYLSNVTRGGEILFP--KSELKDKVW----SDCTKDSSILRPVKGNAVLIFNAHLNASAD 228

Query: 260 PSSLHGGCPVIRGNKWSATK 279
             S HG CPV+ G  W ATK
Sbjct: 229 SRSTHGRCPVLEGEMWCATK 248


>gi|312032360|ref|NP_001185667.1| prolyl 4-hydroxylase subunit alpha-1 isoform 4 precursor [Gallus
           gallus]
          Length = 536

 Score =  120 bits (300), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 67/204 (32%), Positives = 104/204 (50%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   + + +S  E E + +LAKP + ++T+ +  TG  + +  R S   +L   +  +
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALETAHYRISKSAWLSGYESPV 395

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 396 VSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 455

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++ P    
Sbjct: 456 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFPSGEG 496

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 497 DYSTRHAACPVLVGNKWVSNKWLH 520


>gi|308476969|ref|XP_003100699.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
 gi|308264511|gb|EFP08464.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
          Length = 573

 Score =  120 bits (300), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 72/228 (31%), Positives = 113/228 (49%), Gaps = 41/228 (17%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
            EI+ ++P A ++ N +S +E + + +LA P + ++TV +SKTG+ + +  R S   +LK
Sbjct: 334 VEILRFDPLAVLFKNVISDSEIKVIKELASPKLKRATVQNSKTGELEHATYRISKSAWLK 393

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL------------- 183
                +I  + +RI DFT +     E +QV +Y +G  YD H+D+               
Sbjct: 394 GDLHPVIERVNRRIEDFTGLYQGTSEELQVANYGLGGHYDPHFDFARIANYGLGGHYEPH 453

Query: 184 ---------DEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGK 234
                    + F T N G R+AT+L Y+S  E GG TVF             N L     
Sbjct: 454 YDMSLKEEKNAFKTLNTGNRIATVLFYMSQPERGGATVF-------------NHL----- 495

Query: 235 QGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
            G +V P + DAL ++++R D   D  + H  CPV+ G KW + KW+H
Sbjct: 496 -GTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLLGVKWVSNKWIH 542


>gi|324507368|gb|ADY43128.1| Prolyl 4-hydroxylase subunit alpha-2 [Ascaris suum]
          Length = 534

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 71/210 (33%), Positives = 108/210 (51%), Gaps = 23/210 (10%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
            EI+ + P   ++   +S  E E +  LA P + ++TV +++TG  + +  R S   +LK
Sbjct: 318 VEILRFSPLVVLFKQVISDYEIEVIEKLAIPKLKRATVQNARTGDLEYANYRISKSAWLK 377

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGG 192
                 I  I KRI   T +  E  E +Q  +Y +G  YD H+D+     ++ F T N G
Sbjct: 378 GTDHPAIDRINKRIDLMTNLNQETAEELQAQNYGIGGHYDPHFDFARKEDINAFKTLNTG 437

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT+L+Y+SDVE GG TVF             N L      G +V P + DAL ++++
Sbjct: 438 NRIATILIYMSDVESGGATVF-------------NHL------GNAVFPSKYDALFWYNL 478

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           R D   D  + H  CPV+ G KW + KW+H
Sbjct: 479 RRDGEGDLRTRHAACPVLTGIKWVSNKWIH 508


>gi|326923461|ref|XP_003207954.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
           [Meleagris gallopavo]
          Length = 536

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 67/204 (32%), Positives = 104/204 (50%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   + + +S  E E + +LAKP + ++T+ +  TG  + +  R S   +L   +  +
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALETAHYRISKSAWLSGYESPV 395

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 396 VSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 455

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++ P    
Sbjct: 456 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFPSGEG 496

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 497 DYSTRHAACPVLVGNKWVSNKWLH 520


>gi|312032358|ref|NP_001185666.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Gallus
           gallus]
          Length = 536

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 67/204 (32%), Positives = 104/204 (50%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   + + +S  E E + +LAKP + ++T+ +  TG  + +  R S   +L   +  +
Sbjct: 336 KPRIVRFLDIISDEEIETVKELAKPRLRRATISNPITGALETAHYRISKSAWLSGYESPV 395

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 396 VSRINTRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATW 455

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++ P    
Sbjct: 456 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFPSGEG 496

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D S+ H  CPV+ GNKW + KW+H
Sbjct: 497 DYSTRHAACPVLVGNKWVSNKWLH 520


>gi|194905436|ref|XP_001981196.1| GG11753 [Drosophila erecta]
 gi|190655834|gb|EDV53066.1| GG11753 [Drosophila erecta]
          Length = 550

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 72/242 (29%), Positives = 118/242 (48%), Gaps = 24/242 (9%)

Query: 45  GDDDSPPNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDL 104
           G+    P+DL S R R            G    E    +P   ++H+ +   E + +  +
Sbjct: 308 GELKPSPSDLRSLRCRYVTNGVPFLR-LGPLKLEEAHADPYIVIFHDAMYDGEIDLIKRM 366

Query: 105 AKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGI 164
           A+P   ++TV +S TG  + +  R S   +LK  + R+I  + +R AD T + M+  E +
Sbjct: 367 ARPRFRRATVQNSVTGALETANYRISKSAWLKTPEHRVIETVVQRTADMTGLDMDSAEEL 426

Query: 165 QVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATLLMYLSDVEEGGETVFPAANANF 220
           QV++Y +G  Y+ H+D+   E    F   N G R+AT+L Y+SDVE+GG TVF + +   
Sbjct: 427 QVVNYGIGGHYEPHFDFARKEEQRAFEGLNLGNRIATVLFYMSDVEQGGATVFTSLHT-- 484

Query: 221 TSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKW 280
                            ++ PK+G A  + ++  D   D  + H  CPV+ G KW + KW
Sbjct: 485 -----------------ALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKWVSNKW 527

Query: 281 MH 282
           +H
Sbjct: 528 IH 529


>gi|432904500|ref|XP_004077362.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
           latipes]
          Length = 555

 Score =  119 bits (299), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 67/203 (33%), Positives = 105/203 (51%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y + +S+AE + +  LAKP + ++T+ +  TG  + +  R S   +L   +D ++
Sbjct: 351 PYIVRYIDIISEAEMDKIKQLAKPRLRRATISNPVTGVLETAPYRISKSAWLTAYEDPVV 410

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATLL 199
             I +RI D T + M+  E +QV +Y VG +Y+ H+D+      D F     G R+AT L
Sbjct: 411 EKINQRIEDLTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWL 470

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDV  GG TVFP                     G SV P++G A+ ++++      D
Sbjct: 471 FYMSDVSAGGATVFPDV-------------------GASVGPQKGTAVFWYNLFASGEGD 511

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
            S+ H  CPV+ GNKW + KW+H
Sbjct: 512 YSTRHAACPVLVGNKWVSNKWIH 534


>gi|395509387|ref|XP_003758979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
           [Sarcophilus harrisii]
          Length = 534

 Score =  119 bits (298), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 66/201 (32%), Positives = 108/201 (53%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ LS  E E + +LAKP + ++TV D KTG    +  R S  ++L+ G D +I
Sbjct: 337 PHIVRYYDVLSDEEIERIKELAKPKLARATVRDPKTGVLTVANYRVSKSSWLEEGDDPVI 396

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 397 AQLNRRMHYITGLSVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 456

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP    +F +  W               PK+G ++ ++++      D  
Sbjct: 457 MSDVEAGGATVFP----DFGATIW---------------PKKGTSVFWYNLFRSGEGDYR 497

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G+KW + KW H
Sbjct: 498 TRHAACPVLVGSKWVSNKWFH 518


>gi|356559784|ref|XP_003548177.1| PREDICTED: uncharacterized protein LOC100795761 [Glycine max]
          Length = 264

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 117/206 (56%), Gaps = 20/206 (9%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR-VRTSSGTFLKRG 138
           ++W+PR F+Y  FLS  EC+YL+ LA  Y VK    +  +G    S  V TS        
Sbjct: 20  ISWQPRVFLYKGFLSDKECDYLVSLA--YAVK----EKSSGNGGLSEGVETSLDM----- 68

Query: 139 QDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATL 198
           +D I+  IE+R++ + F+P E+ + +QV+HY   Q    + DYF ++   +  G  MAT+
Sbjct: 69  EDDILARIEERLSVWAFLPKEYSKPLQVMHYGPEQN-GRNLDYFTNKTQLELSGPLMATI 127

Query: 199 LMYLS-DVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           ++YLS DV +GG+ +FP       SV   +  S C      ++P +G+A+LF+S+ P A+
Sbjct: 128 ILYLSNDVTQGGQILFP------ESVPGSSSWSSCSNSSNILQPVKGNAILFFSLHPSAS 181

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHV 283
            D SS H  CPV+ G+ WSA K+ + 
Sbjct: 182 PDKSSFHARCPVLEGDMWSAIKYFYA 207


>gi|195055779|ref|XP_001994790.1| GH14110 [Drosophila grimshawi]
 gi|193892553|gb|EDV91419.1| GH14110 [Drosophila grimshawi]
          Length = 487

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 67/204 (32%), Positives = 109/204 (53%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P   +YH+ +  +E E L  +A+P   ++TV +S TG  + +  R S   +LK  +  I
Sbjct: 282 DPYIVIYHDAMYDSEIEVLKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTPEHEI 341

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATL 198
           I  + +R AD T + M+  E +QV++Y +G  Y+ H+D+   E    F   N G R+AT+
Sbjct: 342 IGTVVQRTADMTGLDMDSAEELQVVNYGIGGHYEPHFDFARREEKLAFEGLNLGNRIATM 401

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV++GG TV       FTS+R             ++ PK+G A  + ++      
Sbjct: 402 LFYMSDVQQGGATV-------FTSLR------------TALWPKKGTAAFWMNLHRSGEG 442

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           D  + H  CPV+ G+KW + KW+H
Sbjct: 443 DARTRHAACPVLTGSKWVSNKWIH 466


>gi|321474877|gb|EFX85841.1| hypothetical protein DAPPUDRAFT_208740 [Daphnia pulex]
          Length = 545

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 63/204 (30%), Positives = 106/204 (51%), Gaps = 23/204 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P   +YHN ++  E E +  +A+P   ++TV +S TG  + +  R S   +LK  +   
Sbjct: 343 KPLIVIYHNVINDDEIETVKKMAQPRFKRATVQNSVTGNLEPANYRISKSAWLKSEEHDH 402

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATL 198
           +  + +R+ D T + M   E +QV++Y +G  Y+ H+DY   E    F     G R+AT 
Sbjct: 403 VFKVTRRVGDVTGLDMATAEDLQVVNYGIGGHYEPHFDYARKEEVNAFKDLGWGNRVATW 462

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+S+VE GG TVFP  N                   L++ P++G A  ++++ P+   
Sbjct: 463 LFYMSEVEAGGATVFPKLN-------------------LALWPQKGSAAFWYNLHPNGEG 503

Query: 259 DPSSLHGGCPVIRGNKWSATKWMH 282
           +  + H  CPV+ G+KW + KW+H
Sbjct: 504 NELTRHAACPVLTGSKWVSNKWIH 527


>gi|390176896|ref|XP_002136934.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
 gi|388858831|gb|EDY67492.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
          Length = 513

 Score =  119 bits (298), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 66/207 (31%), Positives = 115/207 (55%), Gaps = 23/207 (11%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSK--DSRVRTSSGTF 134
           TEI++  P   +YH+ ++  E   L +L+KP+M +  +  +K       DS  RTS+  +
Sbjct: 315 TEILSLSPYMVLYHDVITPLESLTLKNLSKPHMKRRAMTFNKQKLRPLIDSG-RTSNSVW 373

Query: 135 LKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
           L   ++ ++  +E+R+   T   ME+ E  Q+++Y +G  Y  H D+F +    + GG R
Sbjct: 374 LTSHENAVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHF-ETPQHRGGGDR 432

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +AT+L YLSDV +GG T+FP  N                   +SV+P++GDALL++++  
Sbjct: 433 IATVLFYLSDVPQGGATLFPRLN-------------------ISVQPRQGDALLWYNLND 473

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWM 281
               +  ++H  CP+I+G+KW+  KW+
Sbjct: 474 RGQGEIGTVHTSCPIIQGSKWALVKWI 500


>gi|387016440|gb|AFJ50339.1| Prolyl 4-hydroxylase subunit alpha-1-like [Crotalus adamanteus]
          Length = 543

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 66/203 (32%), Positives = 105/203 (51%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR   + + +S  E E + +L+KP + ++T+ +  TG  + +  R S   +L   ++ ++
Sbjct: 344 PRIVRFLDIISNEEIEKVKELSKPRLRRATISNPITGVLETAHYRISKSAWLSGYENPVV 403

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATLL 199
             I +RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT L
Sbjct: 404 ARINQRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWL 463

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDV  GG TVFP                     G SV PK+G A+ ++++ P    D
Sbjct: 464 FYMSDVAAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFPSGEGD 504

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
            S+ H  CPV+ GNKW + KW+H
Sbjct: 505 YSTRHAACPVLVGNKWVSNKWIH 527


>gi|195159311|ref|XP_002020525.1| GL13465 [Drosophila persimilis]
 gi|194117294|gb|EDW39337.1| GL13465 [Drosophila persimilis]
          Length = 578

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 65/208 (31%), Positives = 115/208 (55%), Gaps = 23/208 (11%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSK--DSRVRTSSGTF 134
           TE+++  P   +YH+ ++  E   L +L+KP+M +  +  +K       DS  RTS+  +
Sbjct: 378 TELLSLAPYMVLYHDVITPLESLTLKNLSKPHMKRRAMTFNKQKLRPLIDSG-RTSNSVW 436

Query: 135 LKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNTKNGGQ 193
           L   ++ ++  +E+R+   T   ME+ E  Q+++Y +G  Y  H D+F   +   + GG 
Sbjct: 437 LTSHENAVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFETPQLEHRGGGD 496

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+AT+L YLSDV +GG T+FP  N                   +SV+P++GDALL++++ 
Sbjct: 497 RIATVLFYLSDVPQGGATLFPRLN-------------------ISVQPRQGDALLWYNLN 537

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWM 281
                +  ++H  CP+I+G+KW+  KW+
Sbjct: 538 DRGQGEIGTVHTSCPIIKGSKWALVKWI 565


>gi|432926124|ref|XP_004080841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
           latipes]
          Length = 523

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 67/203 (33%), Positives = 106/203 (52%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    YH+  S+ E E + +LAKP + ++TV D +TG+   ++ R S   +L   +  I+
Sbjct: 324 PYIVRYHDVASEKEMETVKELAKPRLRRATVHDPQTGKLTTAQYRVSKSAWLGSHEHPIV 383

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             I +RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT L
Sbjct: 384 DRINQRIEDITGLDVSTAEDLQVANYGVGGQYEPHFDFGRKDEADAFEELGTGNRIATWL 443

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           +Y+SDV+ GG TVF     +  +V W               PK+G A+ ++++      D
Sbjct: 444 LYMSDVQAGGNTVF----TDIGAVVW---------------PKKGTAVFWYNLHRSGEGD 484

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ GNKW + KW+H
Sbjct: 485 YRTRHAACPVLVGNKWVSNKWIH 507


>gi|301115862|ref|XP_002905660.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262110449|gb|EEY68501.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 215

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 72/215 (33%), Positives = 107/215 (49%), Gaps = 28/215 (13%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRV-RTSSGTFLKRGQDRI 142
           P  F    FL   E + +++L+ P++  S V      +++ +   RTS+  +L      +
Sbjct: 3   PLVFSVEEFLRDDEIDVILELSMPHLAPSGVTLQDGHENRPATDWRTSTTYWLDSSSHPV 62

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGG---------- 192
           ++ I+KR AD   +P+ H E +QVL YE  Q YD H DYF  E +  +            
Sbjct: 63  VQTIDKRTADLVKVPISHQESVQVLRYEPTQHYDQHLDYFSAERHRNSPDVLKRIEYGYK 122

Query: 193 QRMATLLMYLSDVEEGGETVF------PAANANFTSVRWWNELSECGKQGLSVKPKRGDA 246
            RM T+  Y+SDV +GG T F      P  ++N           +C  QG+SV PK+   
Sbjct: 123 NRMITVFWYMSDVAKGGHTNFARSGGLPRPSSN----------KDCS-QGISVAPKKRKV 171

Query: 247 LLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           ++F+SM P+   DP SLH GCPV  G K S  KW+
Sbjct: 172 VVFYSMLPNGEGDPMSLHAGCPVEEGIKLSGNKWI 206


>gi|374620441|ref|ZP_09692975.1| 2OG-Fe(II) oxygenase superfamily enzyme [gamma proteobacterium
           HIMB55]
 gi|374303668|gb|EHQ57852.1| 2OG-Fe(II) oxygenase superfamily enzyme [gamma proteobacterium
           HIMB55]
          Length = 570

 Score =  119 bits (297), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 75/223 (33%), Positives = 116/223 (52%), Gaps = 28/223 (12%)

Query: 70  EEKGE-QWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVR 128
           E KG  + TE  + +P   V +N +S  EC YLI+LAKP++ ++ VV  +    K+S  R
Sbjct: 6   ELKGSYEATEAYSLDPLVGVRNNVISPVECAYLIELAKPHIKRAGVVLDE--GYKESEGR 63

Query: 129 TSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-----L 183
           T S  +LK  +D +++ + +RI+D   +P+E+ E +Q++HY   Q+Y  H+D F      
Sbjct: 64  TGSNHWLKYDEDDVVQSVGQRISDIVGLPLEYAESMQIIHYGPEQEYRPHFDAFNLSLPK 123

Query: 184 DEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKR 243
            +   K GGQR+ T L+YL+ VE GG T FP                   K G++V    
Sbjct: 124 GQRAAKWGGQRLVTALVYLNKVEAGGATQFP-------------------KLGITVPALP 164

Query: 244 GDALLFWSMRPDAT-LDPSSLHGGCPVIRGNKWSATKWMHVHE 285
           G  ++F +   D +   P SLH G PV  G KW+   W  + +
Sbjct: 165 GRMVIFHNTTHDISGPHPLSLHAGMPVEAGEKWAFNMWFRLQD 207


>gi|405964867|gb|EKC30309.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
          Length = 591

 Score =  118 bits (296), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 77/244 (31%), Positives = 114/244 (46%), Gaps = 43/244 (17%)

Query: 60  RAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKT 119
           R F + + I   K ++  E+V +EPR  ++H+ +S    E+L  +A     +STV    T
Sbjct: 345 RCFLRETVIPYYKAKE--EVVNYEPRIAIFHDVISPTSIEHLKSVASKGFTRSTVFLENT 402

Query: 120 GQS------KDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPME------HGEGIQVL 167
           G        K   VR S  ++L   +   +  +E RI   T +  E      H E  QVL
Sbjct: 403 GPDGHVTYGKLDNVRVSQTSWLGTDEYPELSRLENRIKLTTGLSAEYKSVRSHSEKFQVL 462

Query: 168 HYEVGQKYDAHYDYF----------LDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAAN 217
           +Y VG  Y  HYDY           LD  + +  G+RMAT + YL+DV+ GG TVFP   
Sbjct: 463 NYGVGGMYTVHYDYTGYMLGIPSNPLDSDDIRTSGERMATWMFYLNDVKAGGATVFPEVK 522

Query: 218 ANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSA 277
                                +   +G A  ++++RP    DP +LHGGCPV+ G+KW +
Sbjct: 523 TR-------------------IPVAKGGAAFWYNVRPSGATDPRTLHGGCPVLVGSKWVS 563

Query: 278 TKWM 281
            KW+
Sbjct: 564 NKWI 567


>gi|198449502|ref|XP_001357605.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
 gi|198130635|gb|EAL26739.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
          Length = 510

 Score =  118 bits (296), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 66/204 (32%), Positives = 109/204 (53%), Gaps = 23/204 (11%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++   P   VYH+ LS +E   ++++A+  M +++ V      S  S  RT+ G +LKR
Sbjct: 313 ELLGEHPYVVVYHDVLSDSEIAEILEMAERRMARTSTVAQPNRTS--SPTRTAMGAWLKR 370

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMAT 197
             + + R I +R+ D + + +E  E +QV++Y +G  Y  H D+F    + +  G R+AT
Sbjct: 371 SSNALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKDWFTQ--HPEVMGNRLAT 428

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           +L YL+DVE+GG T+F  A                      V P+RG AL ++++  D  
Sbjct: 429 VLFYLTDVEQGGATMFNKAEHK-------------------VLPRRGTALFWYNLHTDGE 469

Query: 258 LDPSSLHGGCPVIRGNKWSATKWM 281
            D S+ H  CP+I G+KW  T+W+
Sbjct: 470 GDWSTTHAACPIIVGSKWVLTQWI 493


>gi|398806116|ref|ZP_10565064.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
 gi|398089832|gb|EJL80333.1| 2OG-Fe(II) oxygenase superfamily enzyme [Polaromonas sp. CF318]
          Length = 294

 Score =  118 bits (295), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 69/198 (34%), Positives = 94/198 (47%), Gaps = 21/198 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR  V  NFLS  EC+ L + A+P    +TVVD        +  R++    L      ++
Sbjct: 95  PRIVVLDNFLSSEECDGLCEEARPAFAPATVVDPHQDAVHAAHFRSNDSAQLPAAGSELV 154

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLS 203
           R +E RI   T  P    E +Q+  Y  GQ Y  HYD+F  +     GGQR+ATL++YL 
Sbjct: 155 RRVEARIERLTGWPSAFCETLQLQRYAQGQDYRPHYDFFGQDMVEAQGGQRLATLILYLR 214

Query: 204 DVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSL 263
             E GG T F    AN                G+ + P++G AL F    PD   +  +L
Sbjct: 215 APEAGGATYF----ANL---------------GMRIAPRKGSALFF--TYPDPGNNSGTL 253

Query: 264 HGGCPVIRGNKWSATKWM 281
           HGG  V+ G KW AT+W 
Sbjct: 254 HGGEAVLAGEKWIATQWF 271


>gi|326436053|gb|EGD81623.1| p4ha2 protein [Salpingoeca sp. ATCC 50818]
          Length = 548

 Score =  118 bits (295), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 64/198 (32%), Positives = 106/198 (53%), Gaps = 23/198 (11%)

Query: 85  RAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIR 144
           R  V+  F S  EC +L    K  + ++  V    G+ +    R S+  +L+   D I++
Sbjct: 339 RLQVFRQFASPEECRHLQHAGKRRLERA--VAWTDGRFQPVEFRISTAAWLQPDHDAIVK 396

Query: 145 GIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSD 204
            I  RI D T + +E+ E +Q+ +Y +G  Y+ H+D+      T   G+R+AT ++YL+ 
Sbjct: 397 RIHGRIEDATQVDIEYAEALQISNYGMGGFYEPHFDH--SSRGTNPDGERLATFMIYLNP 454

Query: 205 VEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLH 264
           V++GG T FP                   + G +V+P  GDA+ +++++P    DP +LH
Sbjct: 455 VKQGGFTAFP-------------------RLGAAVQPGYGDAVFWYNLQPSGVGDPLTLH 495

Query: 265 GGCPVIRGNKWSATKWMH 282
           G CPV+RG+KW A KW+H
Sbjct: 496 GACPVLRGSKWVANKWIH 513


>gi|195159144|ref|XP_002020442.1| GL13995 [Drosophila persimilis]
 gi|194117211|gb|EDW39254.1| GL13995 [Drosophila persimilis]
          Length = 535

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 66/204 (32%), Positives = 107/204 (52%), Gaps = 23/204 (11%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++   P   VYH+ LS +E   ++++A+  M +++ V      S  S  RT+ G +LKR
Sbjct: 338 ELLGEHPYVVVYHDVLSDSEIAEILEMAERRMARTSTVAQPNRTS--SPTRTALGAWLKR 395

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMAT 197
             + + R I +R+ D + + +E  E +QV++Y +G  Y  H D+F         G R+AT
Sbjct: 396 SSNALTRRIARRVRDMSGLQLEGSERMQVINYGIGGHYVPHKDWFTQHPEVM--GNRLAT 453

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           +L YL+DVE+GG T+F  A                      V P+RG AL ++++  D  
Sbjct: 454 VLFYLTDVEQGGATMFNKAEHK-------------------VLPRRGTALFWYNLHTDGE 494

Query: 258 LDPSSLHGGCPVIRGNKWSATKWM 281
            D S+ H  CP+I G+KW  T+W+
Sbjct: 495 GDWSTTHAACPIIVGSKWVLTQWI 518


>gi|340367965|ref|XP_003382523.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Amphimedon
           queenslandica]
          Length = 525

 Score =  118 bits (295), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 66/211 (31%), Positives = 111/211 (52%), Gaps = 25/211 (11%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
           TE+   +P+ +++++ ++  E E L +LA P + ++TV   + G+   +  R S   +L 
Sbjct: 319 TEVAFVKPKIYIFYDIVTDREIERLKELANPKLNRATV-HGENGELLHATYRISKSGWLS 377

Query: 137 RGQDRI--IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL---DEFNTKNG 191
              D +  +  I++RI D T + M   E +QV++Y +G +Y+ HYD+     D F +   
Sbjct: 378 GSDDPLGYVDRIDQRIEDVTGLTMSTAEQLQVVNYGIGGQYEPHYDFARTGEDTFTSLGS 437

Query: 192 GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWS 251
           G R++TLL+Y+SDVE+GG TVFP   A                      P +  A  +W+
Sbjct: 438 GNRISTLLIYMSDVEKGGATVFPGVGARLV-------------------PIKRAAAYWWN 478

Query: 252 MRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           ++     D S+ H GCPV+ G+KW   KW+H
Sbjct: 479 LKRSGDGDYSTRHAGCPVLVGSKWVCNKWIH 509


>gi|356530852|ref|XP_003533993.1| PREDICTED: uncharacterized protein LOC100775928 [Glycine max]
          Length = 302

 Score =  117 bits (293), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 85/262 (32%), Positives = 135/262 (51%), Gaps = 21/262 (8%)

Query: 26  MLTIVLLMLLAMGIFYIPIGDDDSPPNDLTSFRRRAFEK-RSSIAEEKGEQWTEIV--AW 82
           M +I LL  L +  F I     +S   +L S +  A +    SI        + +V  +W
Sbjct: 1   MTSISLLHALFVFFFLIATSLTESSRKELRSKQETALQMLEHSIHYSNRINPSRVVQISW 60

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR F+Y  FLS  EC+YL+ LA  Y VK          S +        TFL   +D I
Sbjct: 61  QPRVFLYKGFLSDKECDYLVSLA--YAVKEK-------SSGNGGFSEGVETFLDI-EDDI 110

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           +  IE+R++ + F+P E+ + +QV+HY   +    + DYF ++   +  G  MAT+++YL
Sbjct: 111 LARIEERLSLWAFLPKEYSKPLQVMHYG-PEPNGRNLDYFTNKTQLELSGPLMATIVLYL 169

Query: 203 SDVE-EGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           S+   +GG+ +FP       SV   +  S C      ++P +G+A+LF+S+ P A+ D +
Sbjct: 170 SNAATQGGQILFP------ESVPRSSSWSSCSNSSNILQPVKGNAILFFSLHPSASPDKN 223

Query: 262 SLHGGCPVIRGNKWSATKWMHV 283
           S H  CPV+ GN WSA K+ + 
Sbjct: 224 SFHARCPVLEGNMWSAIKYFYA 245


>gi|348518914|ref|XP_003446976.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Oreochromis
           niloticus]
          Length = 536

 Score =  117 bits (293), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 67/203 (33%), Positives = 105/203 (51%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y + LS  E E + +LAKP + ++TV D KTG    +  R S   +L+  +D +I
Sbjct: 337 PHIVRYLDLLSDEEIEKIKELAKPRLARATVRDPKTGVLTTANYRVSKSAWLEGEEDPVI 396

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATLL 199
             + +RI   T + +E  E +QV +Y VG +Y+ H+D+      D F     G R+AT L
Sbjct: 397 DRVNQRIEAITGLTVETAELLQVANYGVGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFL 456

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP    +F +  W               P++G ++ ++++      D
Sbjct: 457 NYMSDVEAGGATVFP----DFGAAIW---------------PRKGTSVFWYNLFRSGEGD 497

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G+KW + KW+H
Sbjct: 498 YRTRHAACPVLVGSKWVSNKWIH 520


>gi|395736139|ref|XP_003776705.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
          Length = 575

 Score =  117 bits (292), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 105/201 (52%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 378 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 437

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 438 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 497

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 498 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 538

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 539 TRHAACPVLVGCKWVSNKWFH 559


>gi|332221656|ref|XP_003259979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Nomascus
           leucogenys]
 gi|332221658|ref|XP_003259980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Nomascus
           leucogenys]
          Length = 535

 Score =  117 bits (292), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 105/201 (52%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 397

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 398 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 457

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 458 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 498

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 499 TRHAACPVLVGCKWVSNKWFH 519


>gi|332221662|ref|XP_003259982.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Nomascus
           leucogenys]
          Length = 556

 Score =  117 bits (292), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 105/201 (52%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 359 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 418

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 419 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 478

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 479 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 519

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 520 TRHAACPVLVGCKWVSNKWFH 540


>gi|386780652|ref|NP_001247763.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Macaca mulatta]
 gi|383422579|gb|AFH34503.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
           mulatta]
 gi|384939466|gb|AFI33338.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
           mulatta]
          Length = 533

 Score =  117 bits (292), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 105/201 (52%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 456 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 497 TRHAACPVLVGCKWVSNKWFH 517


>gi|426349879|ref|XP_004042513.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Gorilla gorilla
           gorilla]
          Length = 565

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 105/201 (52%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 427

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 428 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 487

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 488 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 528

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 529 TRHAACPVLVGCKWVSNKWFH 549


>gi|119582748|gb|EAW62344.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_c
           [Homo sapiens]
          Length = 565

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 105/201 (52%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 427

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 428 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 487

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 488 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 528

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 529 TRHAACPVLVGCKWVSNKWFH 549


>gi|63252891|ref|NP_001017973.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
           sapiens]
 gi|63252893|ref|NP_001017974.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
           sapiens]
 gi|217272861|ref|NP_001136070.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
           sapiens]
 gi|18073925|emb|CAC85688.1| Prolyl 4-hydroxylase alpha IIa subunit [Homo sapiens]
 gi|23274221|gb|AAH35813.1| Prolyl 4-hydroxylase, alpha polypeptide II [Homo sapiens]
 gi|37183058|gb|AAQ89329.1| P4HA2 [Homo sapiens]
 gi|119582745|gb|EAW62341.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_a
           [Homo sapiens]
 gi|119582750|gb|EAW62346.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_a
           [Homo sapiens]
 gi|123983232|gb|ABM83357.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II [synthetic
           construct]
 gi|157928048|gb|ABW03320.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II [synthetic
           construct]
          Length = 533

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 105/201 (52%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 456 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 497 TRHAACPVLVGCKWVSNKWFH 517


>gi|403255937|ref|XP_003920661.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Saimiri
           boliviensis boliviensis]
 gi|403255939|ref|XP_003920662.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Saimiri
           boliviensis boliviensis]
 gi|403255943|ref|XP_003920664.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Saimiri
           boliviensis boliviensis]
          Length = 533

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 105/201 (52%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 456 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 497 TRHAACPVLVGCKWVSNKWFH 517


>gi|114601548|ref|XP_001162501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 9 [Pan
           troglodytes]
 gi|114601562|ref|XP_001162805.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 16 [Pan
           troglodytes]
 gi|114601564|ref|XP_517917.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 19 [Pan
           troglodytes]
 gi|397518354|ref|XP_003829356.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Pan
           paniscus]
 gi|397518356|ref|XP_003829357.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
           paniscus]
 gi|397518360|ref|XP_003829359.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Pan
           paniscus]
 gi|410215942|gb|JAA05190.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410255606|gb|JAA15770.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410331277|gb|JAA34585.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410331281|gb|JAA34587.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
          Length = 533

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 105/201 (52%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 456 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 497 TRHAACPVLVGCKWVSNKWFH 517


>gi|57997558|emb|CAI46066.1| hypothetical protein [Homo sapiens]
          Length = 533

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 105/201 (52%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 456 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 497 TRHAACPVLVGCKWVSNKWFH 517


>gi|297675927|ref|XP_002815905.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pongo
           abelii]
 gi|395736137|ref|XP_003776704.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
          Length = 533

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 105/201 (52%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 456 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 497 TRHAACPVLVGCKWVSNKWFH 517


>gi|198477152|ref|XP_002136738.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
 gi|198145043|gb|EDY71755.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
          Length = 517

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 67/210 (31%), Positives = 114/210 (54%), Gaps = 25/210 (11%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVV---DSKTGQSKDSRVRTSSGT 133
           TEI++  P   +YH+ ++  E   L +L+KP M +  +V   + K     DS  RTS+  
Sbjct: 315 TEILSLSPYMVLYHDVITPLESLTLKNLSKPLMKRRAMVMVNNLKVRPFIDSG-RTSNSV 373

Query: 134 FLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF--LDEFNTKNG 191
           +L   ++ ++  +E+R+   T   ME+ E  Q+++Y +G  Y  H D+F        + G
Sbjct: 374 WLASHENAVMERLERRVGVMTNFEMENSEVYQLINYGIGGHYKPHTDHFETPQAPEHRGG 433

Query: 192 GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWS 251
           G R+AT+L YLSDV +GG T+FP  N                   +SV+P++GDALL+++
Sbjct: 434 GDRIATVLFYLSDVPQGGATLFPRLN-------------------ISVQPRQGDALLWYN 474

Query: 252 MRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           +      +  ++H  CP+I+G+KW+  KW+
Sbjct: 475 LNDRGQGEIGTVHTSCPIIQGSKWALVKWI 504


>gi|119582749|gb|EAW62345.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_d
           [Homo sapiens]
          Length = 488

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 105/201 (52%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 291 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 350

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 351 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 410

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 411 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 451

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 452 TRHAACPVLVGCKWVSNKWFH 472


>gi|348523976|ref|XP_003449499.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
           niloticus]
          Length = 594

 Score =  117 bits (292), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 64/203 (31%), Positives = 108/203 (53%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    YHN +S+ + E + +LAKP + ++T+ +  TG  + +  R S   +L   +  ++
Sbjct: 395 PHIVRYHNIVSEKDMEKVKELAKPRLRRATISNPVTGVLETAHYRISKSAWLGAYEHPVV 454

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATLL 199
             I + I D T + ++  E +QV +Y +G +Y+ H+D+      D F     G R+AT L
Sbjct: 455 DKINQLIEDVTGLNVKTAEDLQVANYGLGGQYEPHFDFGRKDEPDAFEELGTGNRIATWL 514

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           +Y++DV+ GG TV       FT +            G +VKPK+G A+ ++++ P    D
Sbjct: 515 LYMTDVQAGGATV-------FTDI------------GAAVKPKKGTAVFWYNLYPSGEGD 555

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ GNKW + KW+H
Sbjct: 556 YRTRHAACPVLLGNKWVSNKWIH 578


>gi|395509389|ref|XP_003758980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
           [Sarcophilus harrisii]
          Length = 536

 Score =  116 bits (291), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 66/203 (32%), Positives = 106/203 (52%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ LS  E E + +LAKP + ++TV D KTG    +  R S  ++L+ G D +I
Sbjct: 337 PHIVRYYDVLSDEEIERIKELAKPKLARATVRDPKTGVLTVANYRVSKSSWLEEGDDPVI 396

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+      D F     G R+AT L
Sbjct: 397 AQLNRRMHYITGLSVKTAELLQVANYGMGGQYEPHFDFSRKGEQDAFKHLGTGNRVATFL 456

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP    +F +  W               PK+G ++ ++++      D
Sbjct: 457 NYMSDVEAGGATVFP----DFGATIW---------------PKKGTSVFWYNLFRSGEGD 497

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G+KW + KW H
Sbjct: 498 YRTRHAACPVLVGSKWVSNKWFH 520


>gi|327265288|ref|XP_003217440.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Anolis
           carolinensis]
          Length = 554

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 66/203 (32%), Positives = 104/203 (51%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y+N LS  E E + +LAKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 355 PHIVRYYNVLSDEEIEKIKELAKPKLARATVRDPKTGVLTVANYRVSKSSWLEEEDDLVV 414

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+      D F     G R+AT L
Sbjct: 415 AKVNQRMEHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKEEPDAFKRLGTGNRVATFL 474

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP    +F +  W               PK+G A+ ++++      D
Sbjct: 475 NYMSDVEAGGATVFP----DFGAAIW---------------PKKGTAVFWYNLFRSGEGD 515

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 516 YRTRHAACPVLVGCKWVSNKWFH 538


>gi|291230950|ref|XP_002735430.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saccoglossus
           kowalevskii]
          Length = 533

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 61/209 (29%), Positives = 106/209 (50%), Gaps = 23/209 (11%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+V  +P+  ++H+ +   E   +  LA P + ++T+ +S TG  + +  R S   +L  
Sbjct: 327 EVVFDKPKLIIFHDAILTNEIRKVKALASPRLRRATIQNSVTGNLEFAEYRISKSAWLSE 386

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQ 193
               ++  +  RI  +T + M+  E +QV +Y +G  Y+ H+D+     ++ F + N G 
Sbjct: 387 DDGDVVHRLNHRIEQYTGLTMDTAEELQVANYGLGGHYEPHFDFARKEEINAFKSLNTGN 446

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+AT L Y+SDVE GG TVFP   A                      P++G A  ++++ 
Sbjct: 447 RIATFLFYMSDVEAGGATVFPQVGARLI-------------------PEKGSAAFWYNLL 487

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWMH 282
            +   D S+ H  CPV+ G+KW + KW+H
Sbjct: 488 KNGEGDYSTRHAACPVLVGSKWVSNKWIH 516


>gi|116283554|gb|AAH17062.1| P4HA2 protein [Homo sapiens]
          Length = 504

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 105/201 (52%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 307 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 366

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 367 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 426

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 427 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 467

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 468 TRHAACPVLVGCKWVSNKWFH 488


>gi|326928728|ref|XP_003210527.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Meleagris
           gallopavo]
          Length = 535

 Score =  116 bits (291), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 106/201 (52%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E +  LAKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 338 PHIVRYYDVMSDEEIEKIKQLAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 397

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+    F++  K+ G R+AT L Y
Sbjct: 398 AKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSTLKSEGNRLATFLNY 457

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP    +F +  W               PK+G A+ ++++      D  
Sbjct: 458 MSDVEAGGATVFP----DFGAAIW---------------PKKGTAVFWYNLFRSGEGDYR 498

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 499 TRHAACPVLVGCKWVSNKWFH 519


>gi|334311009|ref|XP_001371555.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Monodelphis
           domestica]
          Length = 534

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 65/201 (32%), Positives = 106/201 (52%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ LS  E E + +++KP + ++TV D KTG       R S  ++LK   D II
Sbjct: 337 PHIVRYYDVLSDEEIEKIKEISKPKLSRATVRDPKTGHLIVVSYRISKSSWLKEDDDPII 396

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 397 AQVNRRMQYITGLSVKTAELLQVSNYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 456

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP    +F +  W               PK+G ++ ++++      D  
Sbjct: 457 MSDVEAGGATVFP----DFGAAIW---------------PKKGTSVFWYNLFRSGECDYR 497

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G+KW + KW H
Sbjct: 498 TRHAACPVLVGSKWVSNKWFH 518


>gi|321463241|gb|EFX74258.1| hypothetical protein DAPPUDRAFT_22132 [Daphnia pulex]
          Length = 523

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 69/214 (32%), Positives = 110/214 (51%), Gaps = 33/214 (15%)

Query: 81  AWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKRGQ 139
           ++EP  + +H+ LS  E E + +LAKP + +S +V  K G   + S VRTS   +L  G 
Sbjct: 316 SFEPAIYTFHDVLSDEEIETIKELAKPLLARS-MVQGKLGVGHEVSNVRTSKTAWLPEGL 374

Query: 140 DRIIRGIEKRIADFTFI---PM-EHGEGIQVLHYEVGQKYDAHYDYFLDE--------FN 187
             ++  + +RI   T +   P+ +  E +QV +Y +G  Y  H+DY + +          
Sbjct: 375 HPLLNRLSRRIGLITGLKTDPIRDEAELLQVANYGIGGHYSPHHDYLMKDKADFEYMHHR 434

Query: 188 TKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDAL 247
               G R+AT + YL+DVE GG T FP A                   G++VKP +G A 
Sbjct: 435 ELQAGDRIATFMFYLNDVERGGSTAFPRA-------------------GVAVKPVKGGAA 475

Query: 248 LFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
            +++++     DP +LHG CPV+ G+KW + KW+
Sbjct: 476 FWFNLKRSGKPDPLTLHGACPVLLGHKWVSNKWI 509


>gi|57525020|ref|NP_001006155.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Gallus gallus]
 gi|82082587|sp|Q5ZLK5.1|P4HA2_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|53129464|emb|CAG31388.1| hypothetical protein RCJMB04_5l17 [Gallus gallus]
          Length = 534

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 106/201 (52%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E +  LAKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 337 PHIVRYYDVMSDEEIEKIKQLAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 396

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+    F++  K+ G R+AT L Y
Sbjct: 397 AKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSTLKSEGNRLATFLNY 456

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP    +F +  W               PK+G A+ ++++      D  
Sbjct: 457 MSDVEAGGATVFP----DFGAAIW---------------PKKGTAVFWYNLFRSGEGDYR 497

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 498 TRHAACPVLVGCKWVSNKWFH 518


>gi|424863736|ref|ZP_18287648.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
           SAR86A]
 gi|400757057|gb|EJP71269.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
           SAR86A]
          Length = 205

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 67/205 (32%), Positives = 103/205 (50%), Gaps = 27/205 (13%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P  +V +NFLS  ECE  +++ K  M ++ V+     +   SR  T+   +L+     +
Sbjct: 16  DPIVYVVNNFLSDDECEAFVEMGKGKMERAKVISDDESEFHASR--TNDFCWLEHSASDV 73

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-----LDEFNTKNGGQRMAT 197
           I  + KR +    +P+ + E  Q+++Y  G +Y  H+D F       + N   GGQRM T
Sbjct: 74  IHEVSKRFSVLVKMPINNAEQFQLVYYGPGNEYKPHFDAFDKTTKEGQNNWFPGGQRMVT 133

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
            L YL+DVEEGG T FP  N                   +SVKP +GD ++F +     T
Sbjct: 134 ALAYLNDVEEGGATDFPKIN-------------------VSVKPNKGDVVVFHNCIEGTT 174

Query: 258 -LDPSSLHGGCPVIRGNKWSATKWM 281
            ++P +LHGG PV+ G KW+   W 
Sbjct: 175 EINPQALHGGSPVVAGEKWAVNLWF 199


>gi|198449524|ref|XP_002136918.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
 gi|198130646|gb|EDY67476.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
          Length = 530

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 76/212 (35%), Positives = 112/212 (52%), Gaps = 28/212 (13%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E ++ +P   VYHN LS AE   +  +A+P ++KS  V  +   SK S+VRT+ G ++  
Sbjct: 320 EELSLDPYIVVYHNVLSDAEIAKVERVAEP-LLKSIGV-GEMDNSKKSKVRTALGAWIPD 377

Query: 138 GQDRI-----IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN-TKNG 191
               I     I+ I +RI D T + ++HG+ +Q++ Y  G  YD H+DY  D    T+  
Sbjct: 378 KNMHISGWPVIQRIVRRIHDMTGLIIKHGQVVQLIKYGYGGHYDTHFDYLNDSLPITQAL 437

Query: 192 GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWS 251
           G RMAT+L YL+DV+ GG TVFP                      L V  +RG  L++++
Sbjct: 438 GDRMATVLFYLNDVKHGGSTVFPVLK-------------------LKVPSERGKVLVWYN 478

Query: 252 MRPDA-TLDPSSLHGGCPVIRGNKWSATKWMH 282
           M  +   LD  +LHG CPVI G K   + W+H
Sbjct: 479 MHGETHDLDSRTLHGSCPVIDGAKTVLSCWIH 510


>gi|291387300|ref|XP_002710241.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
           precursor (predicted)-like isoform 1 [Oryctolagus
           cuniculus]
          Length = 533

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 105/201 (52%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             I +R+   T + ++  E +QV +Y +G +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 396 ARINRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 456 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 497 TRHAACPVLVGCKWVSNKWFH 517


>gi|170591592|ref|XP_001900554.1| prolyl 4-hydroxylase [Brugia malayi]
 gi|16415740|emb|CAC82616.1| prolyl 4-hydroxylase [Brugia malayi]
 gi|21425621|emb|CAD19314.1| prolyl 4-hydroxylase [Brugia malayi]
 gi|158592166|gb|EDP30768.1| prolyl 4-hydroxylase, putative [Brugia malayi]
          Length = 541

 Score =  116 bits (290), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 66/210 (31%), Positives = 108/210 (51%), Gaps = 23/210 (10%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
            EI+ + P A ++ + ++  E   +  LA P + ++TV +S TG+ + +  RTS   +LK
Sbjct: 325 VEILRFNPLAVLFRDVITDEEVTMIQMLATPRLRRATVQNSITGELETASYRTSKSAWLK 384

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGG 192
             +  ++  I KRI   T +  E  E +QV +Y +G  YD H+D+     ++ F + N G
Sbjct: 385 DEEHEVVHRINKRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAFQSLNTG 444

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+ATLL Y++  E GG TV       FT V+             +V P + DAL ++++
Sbjct: 445 NRLATLLFYMTQPESGGATV-------FTEVK------------TTVMPSKNDALFWYNL 485

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  + H  CPV+ G KW + KW+H
Sbjct: 486 LRSGEGDLRTRHAACPVLTGTKWVSNKWIH 515


>gi|428178571|gb|EKX47446.1| hypothetical protein GUITHDRAFT_152114 [Guillardia theta CCMP2712]
          Length = 262

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 74/225 (32%), Positives = 115/225 (51%), Gaps = 33/225 (14%)

Query: 76  WTEIVAWEPRAFVYHNFLSKAECEYLIDLA-KPYMVKSTVVDSKTGQSKDSRVRTSSGTF 134
           + E +   PR F   N L+K ECE+L+ LA +  + K+ ++   T +  +S  RT+ G +
Sbjct: 56  YLEQINASPRVFRIRNLLTKQECEHLMLLAFRKGLSKTMIMPYGTHKLVESTTRTNDGAW 115

Query: 135 LKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVG-QKYDAHYDYFLDEFNT----K 189
           L   QD ++R +E+ +   T    + GE +QVLHY  G Q +  HYDYF    +     +
Sbjct: 116 LDFLQDDVVRRLEETLGKLTKTTPQQGENLQVLHYSNGAQFFQEHYDYFDPARDPPESFE 175

Query: 190 NGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLF 249
            GG R  T+++YL    EGGET FP             EL      GL +  + GDAL+F
Sbjct: 176 QGGNRYITVIVYLEAALEGGETHFP-------------EL------GLKLTAQPGDALMF 216

Query: 250 WSMR-------PDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           ++++       PD  ++  ++H   P +RG KW A KW+H   Y+
Sbjct: 217 YNLKEHCSGTDPDC-VEKKTIHAALPPVRGEKWVAVKWIHEKPYQ 260


>gi|224068121|ref|XP_002191580.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Taeniopygia
           guttata]
          Length = 539

 Score =  115 bits (289), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 106/201 (52%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E +  LAKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 342 PHIVRYYDVMSDEEIEKIKQLAKPRLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 401

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+    F++  K+ G R+AT L Y
Sbjct: 402 AKVNQRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSTLKSEGNRLATFLNY 461

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP    +F +  W               PK+G A+ ++++      D  
Sbjct: 462 MSDVEAGGATVFP----DFGAAIW---------------PKKGTAVFWYNLFRSGEGDYR 502

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 503 TRHAACPVLVGCKWVSNKWFH 523


>gi|344175386|emb|CCA88057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
          Length = 331

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 68/210 (32%), Positives = 104/210 (49%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQS--KDSRVRTSSGTFL 135
           + V+  PRA +  + LS  EC+ LI+ A+  +  S V++ ++GQ    ++    S  +F 
Sbjct: 126 QFVSHHPRAALISDLLSTQECDALIEQARSRLTTSYVIEYESGQEVVNEATRSCSCASFP 185

Query: 136 KRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEF--NTK---N 190
                 + + I +R A     P  H EG+    Y  G+++  H DYF      N K   +
Sbjct: 186 PEEMSMLQKRIVERAARLVGQPGAHCEGVTFARYLPGEQFRPHVDYFRGAVLNNDKIMGS 245

Query: 191 GGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFW 250
            G R+AT+L+YL++VE GG T FP                     G  V+P++G AL F 
Sbjct: 246 SGHRIATVLLYLNEVEAGGATFFP-------------------NPGFEVRPQKGGALYFA 286

Query: 251 SMRPDATLDPSSLHGGCPVIRGNKWSATKW 280
             + D ++DP+SLH GC V +G KW AT W
Sbjct: 287 YQQADGSMDPTSLHEGCAVTQGEKWIATLW 316


>gi|195055773|ref|XP_001994787.1| GH17427 [Drosophila grimshawi]
 gi|193892550|gb|EDV91416.1| GH17427 [Drosophila grimshawi]
          Length = 538

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 65/201 (32%), Positives = 105/201 (52%), Gaps = 20/201 (9%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P    YH+ LS  +   L  +A P+M +STV      QSK S  R S   +L+     +
Sbjct: 331 DPLVVSYHDMLSPQQIIELRQMAVPHMKRSTVNPLPGRQSKKSAFRVSKNAWLEYDTHPM 390

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD-EFNTKNGGQRMATLLMY 201
           +  + + ++D T + M + E +QV +Y VG  Y+ H+D+F+D +      G R+AT + Y
Sbjct: 391 MGRMLRDLSDATGLDMTYCEQLQVANYGVGGHYEPHWDFFVDSQHYPAEEGNRIATAIFY 450

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           LSDVE+GG T FP  N                    +V+P+ G+ L ++++     +D  
Sbjct: 451 LSDVEQGGATAFPFLN-------------------FAVRPQLGNILFWYNLHRSLDMDYR 491

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H GCPV++G+KW A  W+H
Sbjct: 492 TKHAGCPVLKGSKWIANIWIH 512


>gi|402593814|gb|EJW87741.1| hypothetical protein WUBG_01349 [Wuchereria bancrofti]
          Length = 541

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 66/210 (31%), Positives = 108/210 (51%), Gaps = 23/210 (10%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
            EI+ + P A ++ + ++  E   +  LA P + ++TV +S TG+ + +  RTS   +LK
Sbjct: 325 VEILRFNPLAVLFRDVITDEEITMIQMLATPRLRRATVQNSITGELETASYRTSKSAWLK 384

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGG 192
             +  ++  I KRI   T +  E  E +QV +Y +G  YD H+D+     ++ F + N G
Sbjct: 385 DEEHEVVHRINKRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAFQSLNTG 444

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+ATLL Y++  E GG TV       FT V+             +V P + DAL ++++
Sbjct: 445 NRLATLLFYMTQPESGGATV-------FTEVK------------TTVMPSKNDALFWYNL 485

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  + H  CPV+ G KW + KW+H
Sbjct: 486 LRSGEGDLRTRHAACPVLTGTKWVSNKWIH 515


>gi|414587754|tpg|DAA38325.1| TPA: hypothetical protein ZEAMMB73_894856 [Zea mays]
          Length = 169

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 55/109 (50%), Positives = 75/109 (68%), Gaps = 2/109 (1%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+++W PR  V+HNFLS  EC+YL+ +A+P +  STVVD  TG+   S VRTSSG F+  
Sbjct: 58  EVISWTPRIIVFHNFLSSEECDYLMAIARPRLQISTVVDVATGKGVKSDVRTSSGMFVNS 117

Query: 138 GQDR--IIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD 184
            + +  +++ IEKRI+ F+ IP E+GE IQVL YE  Q Y  H+DYF D
Sbjct: 118 EERKSPVVQAIEKRISVFSQIPKENGELIQVLRYEASQYYRPHHDYFSD 166


>gi|148226320|ref|NP_001087703.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
           laevis]
 gi|51703693|gb|AAH81114.1| MGC83530 protein [Xenopus laevis]
          Length = 533

 Score =  115 bits (289), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 67/201 (33%), Positives = 103/201 (51%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR   Y + LS  E E + +LAKP + ++TV D KTG    +  R S   +L+   D +I
Sbjct: 336 PRIVRYLDVLSDEEIEKIKELAKPRLARATVRDPKTGVLTVANYRVSKSAWLEEYDDPVI 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEF--NTKNGGQRMATLLMY 201
             +  R+   T +  +  E +QV +Y +G +Y+ H+D+    F  N K  G R+AT L Y
Sbjct: 396 GRVNSRMQAITGLTKDTAELLQVANYGMGGQYEPHFDFSRRPFDSNLKTEGNRLATYLNY 455

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP    +F +  W               P++G A+ ++++      D  
Sbjct: 456 MSDVEAGGATVFP----DFGAAIW---------------PRKGTAVFWYNLFRSGEGDYR 496

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G+KW + KW H
Sbjct: 497 TRHAACPVLVGSKWVSNKWFH 517


>gi|607947|gb|AAA62207.1| prolyl 4-hydroxylase alpha subunit [Caenorhabditis elegans]
          Length = 558

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 64/210 (30%), Positives = 105/210 (50%), Gaps = 23/210 (10%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
            EI  + P A ++ + +S  E   + +LAKP + ++TV DS TG+   +  R S   +LK
Sbjct: 320 VEIKRFNPLAVLFKDVISDDEVAAIQELAKPKLARATVHDSVTGKLVTATYRISKSAWLK 379

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGG 192
             +  ++  + KRI   T + ME  E +Q+ +Y +G  YD H+D+   E    F +   G
Sbjct: 380 EWEGDVVETVNKRIGYMTNLEMETAEELQIANYGIGGHYDPHFDHAKKEESKSFESLGTG 439

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT+L Y+S    GG TVF  A +                   ++ P + DAL ++++
Sbjct: 440 NRIATVLFYMSQPSHGGGTVFTEAKS-------------------TILPTKNDALFWYNL 480

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 +P + H  CPV+ G KW + KW+H
Sbjct: 481 YKQGDGNPDTRHAACPVLVGIKWVSNKWIH 510


>gi|326914688|ref|XP_003203656.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Meleagris
           gallopavo]
          Length = 539

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 70/214 (32%), Positives = 112/214 (52%), Gaps = 28/214 (13%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E +  +P   +YH+F+S AE E +  LA P++ +S V   +  Q  + R+  S+  +LK 
Sbjct: 335 ETLRLQPYIVLYHDFVSDAEAETIKGLAGPWLQRSVVASGEKQQKVEYRISKSA--WLKD 392

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++R +E R+A  T + +   + E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 393 TADPVVRALELRMAAITGLDLRPPYAEYLQVVNYGLGGHYEPHFDHATSRKSPLYRMKSG 452

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT+++YLS VE GG T F  AN                    SV   +  AL +W++
Sbjct: 453 NRIATVMIYLSAVEAGGSTAFIYAN-------------------FSVPVVKNAALFWWNL 493

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           R +   D  +LH GCPV+ G+KW A KW  +HEY
Sbjct: 494 RRNGDGDGDTLHAGCPVLAGDKWVANKW--IHEY 525


>gi|17552840|ref|NP_499464.1| Protein DPY-18 [Caenorhabditis elegans]
 gi|20455505|sp|Q10576.2|P4HA1_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; AltName: Full=Protein dumpy-18; Flags:
           Precursor
 gi|3881011|emb|CAA21045.1| Protein DPY-18 [Caenorhabditis elegans]
 gi|6900013|emb|CAB71298.1| prolyl 4-hydroxylase alpha subunit 1 [Caenorhabditis elegans]
          Length = 559

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 64/210 (30%), Positives = 105/210 (50%), Gaps = 23/210 (10%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
            EI  + P A ++ + +S  E   + +LAKP + ++TV DS TG+   +  R S   +LK
Sbjct: 321 VEIKRFNPLAVLFKDVISDDEVAAIQELAKPKLARATVHDSVTGKLVTATYRISKSAWLK 380

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGG 192
             +  ++  + KRI   T + ME  E +Q+ +Y +G  YD H+D+   E    F +   G
Sbjct: 381 EWEGDVVETVNKRIGYMTNLEMETAEELQIANYGIGGHYDPHFDHAKKEESKSFESLGTG 440

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT+L Y+S    GG TVF  A +                   ++ P + DAL ++++
Sbjct: 441 NRIATVLFYMSQPSHGGGTVFTEAKS-------------------TILPTKNDALFWYNL 481

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 +P + H  CPV+ G KW + KW+H
Sbjct: 482 YKQGDGNPDTRHAACPVLVGIKWVSNKWIH 511


>gi|410948132|ref|XP_003980795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Felis
           catus]
 gi|410948136|ref|XP_003980797.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Felis
           catus]
          Length = 533

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 105/201 (52%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 456 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 497 TRHAACPVLVGCKWVSNKWFH 517


>gi|354474415|ref|XP_003499426.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
           [Cricetulus griseus]
          Length = 533

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 105/201 (52%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 456 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 497 TRHAACPVLVGCKWVSNKWFH 517


>gi|209862961|ref|NP_001129548.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Mus
           musculus]
 gi|17390970|gb|AAH18411.1| P4ha2 protein [Mus musculus]
 gi|18073922|emb|CAC85690.1| Prolyl 4-hydroxylase alpha IIa subunit [Mus musculus]
 gi|74211515|dbj|BAE26490.1| unnamed protein product [Mus musculus]
          Length = 535

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 105/201 (52%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 397

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 398 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 457

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 458 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 498

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 499 TRHAACPVLVGCKWVSNKWFH 519


>gi|443709455|gb|ELU04127.1| hypothetical protein CAPTEDRAFT_149240 [Capitella teleta]
          Length = 532

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 65/208 (31%), Positives = 109/208 (52%), Gaps = 22/208 (10%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E  + EP   VYH  ++  E E + ++A P + ++TV +S TGQ + ++ R S   +L+ 
Sbjct: 330 ETASLEPWIAVYHQLMNDHEIERIKEMATPRLARATVHNSATGQLEHAKYRISKSGWLRD 389

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGGQR 194
            +D +I  I +R +  T + +   E +QV++Y +G +Y+ H+D+      T   K  G R
Sbjct: 390 EEDPLIARISERCSALTNLSLTTVEELQVVNYGIGGQYEPHFDFSRRSEPTAFEKWRGNR 449

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           + T++ Y++DVE GG TVF  A                   G+ V P++G A ++ ++ P
Sbjct: 450 ILTVIYYMTDVEAGGATVFLDA-------------------GVKVYPEKGSAAVWHNLLP 490

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMH 282
               D  + H  CPV+ G+KW A KW H
Sbjct: 491 SGEGDMRTRHAACPVLTGSKWVANKWFH 518


>gi|148701600|gb|EDL33547.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_e [Mus
           musculus]
          Length = 593

 Score =  115 bits (288), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 105/201 (52%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 396 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 455

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 456 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 515

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 516 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 556

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 557 TRHAACPVLVGCKWVSNKWFH 577


>gi|395817618|ref|XP_003782262.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Otolemur
           garnettii]
          Length = 538

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 104/201 (51%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 341 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 400

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             +  R+   T + ++  E +QV +Y VG +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 401 ARVNHRMQHITGLSVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRVATFLNY 460

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 461 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 501

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 502 TRHAACPVLVGCKWVSNKWFH 522


>gi|195110931|ref|XP_002000033.1| GI24862 [Drosophila mojavensis]
 gi|193916627|gb|EDW15494.1| GI24862 [Drosophila mojavensis]
          Length = 549

 Score =  115 bits (288), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 68/211 (32%), Positives = 113/211 (53%), Gaps = 27/211 (12%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
            E ++ +P   ++H+ + ++E + L+ LAK  + ++TV    +  S  S  RTS  TFL 
Sbjct: 327 VEELSHDPLLVLFHDVIYQSEIDTLMRLAKNKIHRATVTGHNS--SVVSNARTSQFTFLP 384

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD-EFNTKN----- 190
           + + +++R I++R+AD T + +E+ E  Q+ +Y +G  Y  H D+F    F TK      
Sbjct: 385 KTRHKVLRTIDQRVADMTDLHLEYAEDHQLANYGIGGHYAQHMDWFYPITFETKQVSNPE 444

Query: 191 GGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFW 250
            G R+ T+L YLSDVE+GG T FPA                  KQ L  +PK+  A  ++
Sbjct: 445 MGNRIGTVLFYLSDVEQGGATAFPAL-----------------KQLL--RPKKHAAAFWY 485

Query: 251 SMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           ++      D  ++HG CP+I G+KW   +W+
Sbjct: 486 NLHASGVGDARTMHGACPIIVGSKWVLNRWI 516


>gi|156370129|ref|XP_001628324.1| predicted protein [Nematostella vectensis]
 gi|156215298|gb|EDO36261.1| predicted protein [Nematostella vectensis]
          Length = 541

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 69/230 (30%), Positives = 115/230 (50%), Gaps = 44/230 (19%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQ--------SKD----- 124
           E V  +P   ++ NF++ +E + + +LA P + ++TV D  TG+        SK      
Sbjct: 304 ERVFVKPEVLIFRNFITDSEIKRIKELATPRLKRATVKDPVTGELIFANYRISKRRATIQ 363

Query: 125 ---------SRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKY 175
                    +  R S   +L+  +D +++ I  R+  ++ + M   E +QV++Y +G  Y
Sbjct: 364 HPVTGKLEFANYRISKSGWLRDEEDELVKRISYRVQAYSGLNMTTSEDLQVVNYGIGGHY 423

Query: 176 DAHYDYFLD---EFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSEC 232
           + HYD+  D   +F +   G R+AT L YLSDVE GG TVF                   
Sbjct: 424 EPHYDFARDGEDKFTSLGTGNRIATFLSYLSDVEAGGGTVFT------------------ 465

Query: 233 GKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
            + G +V P++GDA  +++++     D S+ H  CPV+ G+KW A KW+H
Sbjct: 466 -RVGATVWPQKGDAAFWYNLKRSGDGDSSTRHAACPVLVGSKWVANKWIH 514


>gi|452752943|ref|ZP_21952682.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
           proteobacterium JLT2015]
 gi|451959765|gb|EMD82182.1| eukaryotic Peptidyl prolyl 4-hydroxylase, alpha subunit [alpha
           proteobacterium JLT2015]
          Length = 314

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 73/220 (33%), Positives = 106/220 (48%), Gaps = 26/220 (11%)

Query: 69  AEEKGEQW-TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRV 127
           A E+ E W TE V+  P   +  +  S AEC YL  ++ P +  ST++D +TG  +   V
Sbjct: 113 AAEEEENWRTEPVSETPSIRMVRHLFSSAECAYLQQMSAPRLRPSTILDPQTGARRPDPV 172

Query: 128 RTSSGTFLKRGQDRIIRG-IEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEF 186
           RTS G  L   ++ ++ G + +RIA  T      GE + +L Y   Q+Y  H+D      
Sbjct: 173 RTSVGAALSPVEEDLVVGMLNRRIAAATGTDRMQGEPLHILRYSGAQEYRPHHDAVAGLE 232

Query: 187 NTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDA 246
           N     QR  TL++YL+   EGGET FP                   + G  ++ ++GDA
Sbjct: 233 N-----QRSHTLIVYLTADYEGGETAFP-------------------ELGFRLRGRQGDA 268

Query: 247 LLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           LLF ++R D   D    H G P   G KW AT+W+    Y
Sbjct: 269 LLFANLREDGRPDLRMRHAGLPATSGAKWIATRWIRTRPY 308


>gi|281350467|gb|EFB26051.1| hypothetical protein PANDA_009188 [Ailuropoda melanoleuca]
          Length = 511

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 66/197 (33%), Positives = 100/197 (50%), Gaps = 23/197 (11%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++T+ +  TG  +    R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATL 198
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+      D F     G R+AT 
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATW 453

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L Y+SDV  GG TVFP                     G SV PK+G A+ ++++      
Sbjct: 454 LFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEG 494

Query: 259 DPSSLHGGCPVIRGNKW 275
           D S+ H  CPV+ GNKW
Sbjct: 495 DYSTRHAACPVLVGNKW 511


>gi|224009604|ref|XP_002293760.1| prolyl 4-hydroxylase alpha subunit [Thalassiosira pseudonana
           CCMP1335]
 gi|220970432|gb|EED88769.1| prolyl 4-hydroxylase alpha subunit [Thalassiosira pseudonana
           CCMP1335]
          Length = 206

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 73/209 (34%), Positives = 109/209 (52%), Gaps = 18/209 (8%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLA---KPYMVKSTVVDSKTGQSKDS--RVRTSSG 132
           ++++  PRAF   NFLS+ E ++++ L    K +   +   D  T   +DS    RTS  
Sbjct: 4   KVLSCAPRAFEIENFLSQTEVDHIMYLTTGMKLHRSTTAGSDQITADERDSTRNTRTSLN 63

Query: 133 TFLKRGQDRIIRGIEKRIADFTFIPMEH-GEGIQVLHYEVGQKYDAHYDYFLDEFNTKNG 191
           T++ R +  II  I +R AD   +      E +Q++HY+VGQ+Y AH+D+   + + +  
Sbjct: 64  TWVYREKSAIIDTIYRRAADLQLMNEALIAEALQLVHYDVGQEYTAHHDWGHPDIDNEYQ 123

Query: 192 GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWS 251
             R  TLL+YL++  EGG T FP         RW N  +   + GL V+PK G A+LF+S
Sbjct: 124 PARYCTLLLYLNEGMEGGATQFP---------RWVNAET---RNGLDVEPKIGKAVLFYS 171

Query: 252 MRPDATLDPSSLHGGCPVIRGNKWSATKW 280
             PD  +D  S H   PV  G KW    W
Sbjct: 172 QLPDGNMDDWSHHAAMPVRVGEKWLMNLW 200


>gi|348557544|ref|XP_003464579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
           [Cavia porcellus]
          Length = 533

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 105/201 (52%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEEDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 396 ARVNRRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 456 MSDVEAGGATVFP-------------------DLGAALWPKKGTAVFWYNLLRSGEGDYR 496

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 497 TRHAACPVLVGCKWVSNKWFH 517


>gi|346724248|ref|YP_004850917.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
           citrumelo F1]
 gi|346648995|gb|AEO41619.1| hypothetical protein XACM_1335 [Xanthomonas axonopodis pv.
           citrumelo F1]
          Length = 418

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 71/212 (33%), Positives = 97/212 (45%), Gaps = 31/212 (14%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR   Y   LS  EC  L+ LA+P++  S V+D     +  + VRTS G  L    D II
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKVIDPNDASTGRAPVRTSHGATL----DPII 283

Query: 144 -----RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD---EFNTKNGGQRM 195
                R  + R+A    +P+ H E + VL Y  G++Y AH DY        +    G R 
Sbjct: 284 EDFAARAAQSRLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQ 343

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
            T+ +YL+DV  GGET FP A                   G+ V+P+ G  + F ++  D
Sbjct: 344 RTVCVYLNDVGAGGETEFPVA-------------------GVRVRPRPGTLVCFDNLHAD 384

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
              D  SLH G PV  G+KW  T W     Y+
Sbjct: 385 GRPDADSLHAGLPVTAGSKWLGTLWFRQQRYR 416


>gi|344264849|ref|XP_003404502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
           [Loxodonta africana]
          Length = 534

 Score =  115 bits (287), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 104/201 (51%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E +  +AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 337 PHIVRYYDVMSDEEIERIKQIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 396

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 397 AQVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 456

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 457 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 497

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 498 TRHAACPVLVGCKWVSNKWFH 518


>gi|78046960|ref|YP_363135.1| hypothetical protein XCV1404 [Xanthomonas campestris pv.
           vesicatoria str. 85-10]
 gi|78035390|emb|CAJ23035.1| conserved hypothetical protein [Xanthomonas campestris pv.
           vesicatoria str. 85-10]
          Length = 418

 Score =  114 bits (286), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 71/212 (33%), Positives = 97/212 (45%), Gaps = 31/212 (14%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR   Y   LS  EC  L+ LA+P++  S V+D     +  + VRTS G  L    D II
Sbjct: 228 PRIEEYAAVLSADECRLLMLLARPHLRASKVIDPNDASTGRAPVRTSHGATL----DPII 283

Query: 144 -----RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD---EFNTKNGGQRM 195
                R  + R+A    +P+ H E + VL Y  G++Y AH DY        +    G R 
Sbjct: 284 EDFAARAAQSRLAACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQ 343

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
            T+ +YL+DV  GGET FP A                   G+ V+P+ G  + F ++  D
Sbjct: 344 RTVCVYLNDVGAGGETEFPVA-------------------GVRVRPRPGTLVCFDNLHAD 384

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
              D  SLH G PV  G+KW  T W     Y+
Sbjct: 385 GRPDADSLHAGLPVTAGSKWLGTLWFRQQRYR 416


>gi|410900628|ref|XP_003963798.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
           rubripes]
          Length = 548

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 66/203 (32%), Positives = 103/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y + +S  E E +  LAKP + ++T+ +  TG  + +  R S   +L   +  +I
Sbjct: 349 PYIVRYIDIISDKEIETVKKLAKPRLRRATISNPITGVLETASYRISKSAWLTGYEHPVI 408

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATLL 199
             I +RI D T + M+  E +QV +Y VG +Y+ H+D+      D F     G R+AT L
Sbjct: 409 EIINQRIEDLTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWL 468

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDV  GG TVFP                     G +V P++G A+ ++++  +   D
Sbjct: 469 FYMSDVAAGGATVFPDV-------------------GAAVWPQKGTAVFWYNLFANGEGD 509

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
            S+ H  CPV+ GNKW + KW+H
Sbjct: 510 YSTRHAACPVLVGNKWVSNKWIH 532


>gi|56118630|ref|NP_001007975.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
           (Silurana) tropicalis]
 gi|51513259|gb|AAH80485.1| p4ha2 protein [Xenopus (Silurana) tropicalis]
          Length = 527

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 68/205 (33%), Positives = 104/205 (50%), Gaps = 23/205 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR   Y N LS  E   + +LAKP + ++TV D KTG    +  R S   +L+   D +I
Sbjct: 338 PRIVRYLNALSDEEIAKIKELAKPKLARATVRDPKTGVLSVANYRVSKSAWLEENDDPVI 397

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEF--NTKNGGQRMATLLMY 201
             +  R+   T + ++  E +QV +Y +G +Y+ H+D+    F  N K  G R+AT L Y
Sbjct: 398 ARVNLRMQAITGLTVDTAELLQVANYGMGGQYEPHFDFSRRPFDSNLKTDGNRLATFLNY 457

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP    +F +  W               PK+G A+ ++++      D  
Sbjct: 458 MSDVEAGGATVFP----DFGAAIW---------------PKKGTAVFWYNLFRSGEGDYR 498

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEY 286
           + H  CPV+ G+KW   KW H  ++
Sbjct: 499 TRHAACPVLVGSKWG--KWTHTQDH 521


>gi|190402274|gb|ACE77683.1| prolyl 4-hydroxylase subunit alpha-2 precursor (predicted) [Sorex
           araneus]
          Length = 533

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 104/201 (51%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTTASYRVSKSSWLEETDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             +  R+   T + ++  E +QV +Y +G +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 396 ARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 456 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 497 TRHAACPVLVGCKWVSNKWFH 517


>gi|395521232|ref|XP_003764722.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Sarcophilus
           harrisii]
          Length = 521

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 66/210 (31%), Positives = 109/210 (51%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP   +YH+F+S +E + +   A P++ +S V   +  Q  + R+  S+  +LK 
Sbjct: 317 EVLHLEPYIVLYHDFVSDSEAQKIRGFAAPWLQRSVVASGEKQQQVEYRISKSA--WLKD 374

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D I+  +++RIA  T + ++  + E +QV++Y +G  Y+ H+D+     +     N G
Sbjct: 375 TVDPILVSLDRRIAALTGLNVQPPYAEHLQVVNYGIGGHYEPHFDHATSPSSPLYRMNSG 434

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F  AN                    SV   +  AL +W++
Sbjct: 435 NRVATFMIYLSSVEAGGSTAFIYAN-------------------FSVPVVKNAALFWWNL 475

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH GCPV+ G+KW A KW+H
Sbjct: 476 HRSGQGDGDTLHAGCPVLVGDKWVANKWIH 505


>gi|308497208|ref|XP_003110791.1| CRE-DPY-18 protein [Caenorhabditis remanei]
 gi|308242671|gb|EFO86623.1| CRE-DPY-18 protein [Caenorhabditis remanei]
          Length = 559

 Score =  114 bits (286), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 64/210 (30%), Positives = 104/210 (49%), Gaps = 23/210 (10%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
            EI  + P A ++ + +S  E   + +LAKP + ++TV DS TG+   +  R S   +LK
Sbjct: 321 VEIKRFNPLAVLFKDVISDDEVATIQELAKPKLARATVHDSATGKLVTATYRISKSAWLK 380

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGG 192
             +  ++  + KRI   T + ME  E +Q+ +Y +G  YD H+D+   E    F +   G
Sbjct: 381 EWEHEVVERVNKRIELMTNLEMETAEELQIANYGIGGHYDPHFDHAKKEESKSFESLGTG 440

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT+L Y+S    GG TVF    +                   +V P + DAL ++++
Sbjct: 441 NRIATVLFYMSQPSHGGGTVFTEVKS-------------------TVLPTKNDALFWYNL 481

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 +P + H  CPV+ G KW + KW+H
Sbjct: 482 FKQGDGNPDTRHAACPVLVGIKWVSNKWIH 511


>gi|332221664|ref|XP_003259983.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Nomascus
           leucogenys]
          Length = 558

 Score =  114 bits (285), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 63/203 (31%), Positives = 103/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 359 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 418

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+      D F     G R+AT L
Sbjct: 419 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFL 478

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 479 NYMSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGD 519

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 520 YRTRHAACPVLVGCKWVSNKWFH 542


>gi|393909803|gb|EFO21561.2| prolyl 4-hydroxylase 2 [Loa loa]
          Length = 542

 Score =  114 bits (285), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 66/210 (31%), Positives = 108/210 (51%), Gaps = 23/210 (10%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
            EI+ + P A  + + ++  E   +  LA P + ++TV +S TG+ + +  RTS   +LK
Sbjct: 326 VEILRFSPLAVFFRDVITDEEVTIIQMLATPRLRRATVQNSITGELETASYRTSKSAWLK 385

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGG 192
             +  I+  I +RI   T +  E  E +QV +Y +G  YD H+D+     ++ F + N G
Sbjct: 386 DEEHEIVHRINRRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAFQSLNTG 445

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+ATLL Y++  E GG TV       FT V+             +V P + DAL ++++
Sbjct: 446 NRLATLLFYMTQPESGGATV-------FTEVK------------TTVMPSKNDALFWYNL 486

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  + H  CPV+ G+KW + KW+H
Sbjct: 487 LRSGEGDLRTRHAACPVLIGSKWVSNKWIH 516


>gi|332221660|ref|XP_003259981.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Nomascus
           leucogenys]
          Length = 537

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 63/203 (31%), Positives = 103/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 397

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+      D F     G R+AT L
Sbjct: 398 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFL 457

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 458 NYMSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGD 498

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 499 YRTRHAACPVLVGCKWVSNKWFH 521


>gi|348501574|ref|XP_003438344.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
           niloticus]
          Length = 615

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 67/203 (33%), Positives = 101/203 (49%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y + +S AE E +  LAKP + ++T+ +  TG  + +  R S   +L    D +I
Sbjct: 416 PYIVRYLDIISDAEIERVKQLAKPRLRRATISNPITGVLETASYRISKSAWLTEYDDPMI 475

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATLL 199
             I  RI   T + M+  E +QV +Y VG +Y+ H+D+      D F     G R+AT L
Sbjct: 476 EKINDRIEGVTGLEMDTAEELQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWL 535

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDV  GG TVFP                     G +V P++G A+ ++++      D
Sbjct: 536 FYMSDVSAGGATVFPDV-------------------GAAVWPQKGTAVFWYNLFASGEGD 576

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
            S+ H  CPV+ GNKW + KW+H
Sbjct: 577 YSTRHAACPVLVGNKWVSNKWIH 599


>gi|391342914|ref|XP_003745760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Metaseiulus
           occidentalis]
          Length = 525

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 61/209 (29%), Positives = 107/209 (51%), Gaps = 23/209 (11%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++   P   ++H+ +S  E + +I+L+ P + ++TV ++K+G+ + +  R S   +LK 
Sbjct: 320 EVIHERPYLALFHDIMSDDEIQTVIELSAPRLKRATVQNAKSGELEVANYRISKSAWLKN 379

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQ 193
               ++  +  R    T +     E +QV++Y +G  Y+AH+D+      D F     G 
Sbjct: 380 HDHEVVERLSFRFEYLTGLTHLTAEELQVVNYGIGGHYEAHFDFARRDEKDAFKQLGTGN 439

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+AT + Y+SDV+ GG TVFP                   + GL+V P++G A  +W++ 
Sbjct: 440 RIATWINYMSDVKAGGATVFP-------------------RLGLTVWPEKGSAAFWWNLH 480

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                D  + H  CPV+ G+KW + KW H
Sbjct: 481 RSGEGDILTRHAACPVLAGSKWVSNKWFH 509


>gi|363729586|ref|XP_417248.3| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Gallus gallus]
          Length = 542

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 69/214 (32%), Positives = 111/214 (51%), Gaps = 28/214 (13%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E +  +P   +YH+F+S AE E +  LA P++ +S V   +  Q  + R+  S+  +LK 
Sbjct: 338 ETLRLQPYIVLYHDFVSDAEAETIKGLAGPWLQRSVVASGEKQQKVEYRISKSA--WLKD 395

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDE---FNTKNGG 192
             D +++ +E R+A  T + +   + E +QV++Y +G  Y+ H+D+             G
Sbjct: 396 TADPVVQALELRMAAITGLDLRPPYAEYLQVVNYGLGGHYEPHFDHATSRKSPLYRMKSG 455

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT+++YLS VE GG T F  AN                    SV   +  AL +W++
Sbjct: 456 NRIATVMIYLSAVEAGGSTAFIYAN-------------------FSVPVVKNAALFWWNL 496

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           R +   D  +LH GCPV+ G+KW A KW  +HEY
Sbjct: 497 RRNGDGDGDTLHAGCPVLAGDKWVANKW--IHEY 528


>gi|4758868|ref|NP_004190.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
           sapiens]
 gi|217272863|ref|NP_001136071.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
           sapiens]
 gi|20455169|sp|O15460.1|P4HA2_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|2439985|gb|AAB71339.1| prolyl 4-hydroxylase alpha (II) subunit [Homo sapiens]
 gi|18073926|emb|CAC85689.1| Prolyl 4-hydroxylase alpha IIb subunit [Homo sapiens]
 gi|119582746|gb|EAW62342.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_b
           [Homo sapiens]
 gi|119582747|gb|EAW62343.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_b
           [Homo sapiens]
          Length = 535

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 63/203 (31%), Positives = 103/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+      D F     G R+AT L
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFL 455

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 456 NYMSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFH 519


>gi|403255941|ref|XP_003920663.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Saimiri
           boliviensis boliviensis]
 gi|403255945|ref|XP_003920665.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Saimiri
           boliviensis boliviensis]
          Length = 535

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 63/203 (31%), Positives = 103/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+      D F     G R+AT L
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDAFKHLGTGNRVATFL 455

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 456 NYMSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFH 519


>gi|119582752|gb|EAW62348.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_f
           [Homo sapiens]
          Length = 567

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 63/203 (31%), Positives = 103/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 368 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 427

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+      D F     G R+AT L
Sbjct: 428 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFL 487

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 488 NYMSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGD 528

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 529 YRTRHAACPVLVGCKWVSNKWFH 551


>gi|312080225|ref|XP_003142509.1| prolyl 4-hydroxylase 2 [Loa loa]
          Length = 541

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 66/210 (31%), Positives = 108/210 (51%), Gaps = 23/210 (10%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
            EI+ + P A  + + ++  E   +  LA P + ++TV +S TG+ + +  RTS   +LK
Sbjct: 325 VEILRFSPLAVFFRDVITDEEVTIIQMLATPRLRRATVQNSITGELETASYRTSKSAWLK 384

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGG 192
             +  I+  I +RI   T +  E  E +QV +Y +G  YD H+D+     ++ F + N G
Sbjct: 385 DEEHEIVHRINRRIDLMTNLEQETSEELQVGNYGIGGHYDPHFDFARREEVNAFQSLNTG 444

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+ATLL Y++  E GG TV       FT V+             +V P + DAL ++++
Sbjct: 445 NRLATLLFYMTQPESGGATV-------FTEVK------------TTVMPSKNDALFWYNL 485

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  + H  CPV+ G+KW + KW+H
Sbjct: 486 LRSGEGDLRTRHAACPVLIGSKWVSNKWIH 515


>gi|297675929|ref|XP_002815906.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pongo
           abelii]
          Length = 535

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 63/203 (31%), Positives = 103/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+      D F     G R+AT L
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFL 455

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 456 NYMSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFH 519


>gi|114601566|ref|XP_001162222.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
           troglodytes]
 gi|114601568|ref|XP_001162843.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 17 [Pan
           troglodytes]
 gi|397518358|ref|XP_003829358.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pan
           paniscus]
 gi|397518362|ref|XP_003829360.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Pan
           paniscus]
 gi|410215944|gb|JAA05191.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410255608|gb|JAA15771.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410331279|gb|JAA34586.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
          Length = 535

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 63/203 (31%), Positives = 103/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+      D F     G R+AT L
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFL 455

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 456 NYMSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFH 519


>gi|323452216|gb|EGB08091.1| hypothetical protein AURANDRAFT_26622 [Aureococcus anophagefferens]
          Length = 190

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 73/204 (35%), Positives = 101/204 (49%), Gaps = 28/204 (13%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           PR F+    LS+ EC+++I+L    + KS V     G    S+ RTS   +L+R    I+
Sbjct: 1   PRVFLVREMLSEFECDHIIELGTKVVRKSMV---GQGGGFTSKTRTSENGWLRRSASPIL 57

Query: 144 RGIEKRIADFTFIPME------HGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMAT 197
             I KR  D   I  +      + E +QV+ Y+  Q+Y  H+D F D+   +   QR  T
Sbjct: 58  ENIYKRFGDVLGIDHDLLRSGKNAEELQVVRYDRSQEYAPHHD-FGDDGTPQ---QRFLT 113

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           LL+Y+   EEGG T FP AN                  G+ V P RGDA+LF+SM PD  
Sbjct: 114 LLLYIQLPEEGGATSFPKANDGM---------------GVQVVPARGDAVLFYSMLPDGN 158

Query: 258 LDPSSLHGGCPVIRGNKWSATKWM 281
            D  +LH G PV +G KW    W+
Sbjct: 159 ADDLALHAGMPVRKGQKWVCNLWV 182


>gi|395736141|ref|XP_003776706.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
          Length = 577

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 63/203 (31%), Positives = 103/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 378 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 437

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+      D F     G R+AT L
Sbjct: 438 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFL 497

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 498 NYMSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGD 538

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 539 YRTRHAACPVLVGCKWVSNKWFH 561


>gi|113682363|ref|NP_001038463.1| prolyl 4-hydroxylase, alpha polypeptide I a precursor [Danio rerio]
          Length = 522

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 67/225 (29%), Positives = 108/225 (48%), Gaps = 45/225 (20%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVV----------------------DSKTGQ 121
           PR   YH  +++ E E + +L+KP + ++T+                       D +TG+
Sbjct: 301 PRIIRYHEIITEQEIEKIKELSKPRLRRATISNPITGVLETAHYRISKRRATVHDPQTGK 360

Query: 122 SKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDY 181
              ++ R S   +L   +  ++  I +RI D T + ++  E +QV +Y VG +Y+ H+D+
Sbjct: 361 LTTAQYRVSKSAWLAAYEHPVVDRINQRIEDITGLNVKTAEELQVANYGVGGQYEPHFDF 420

Query: 182 FL----DEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGL 237
                 D F     G R+AT L Y+SDV  GG TVFP                     G 
Sbjct: 421 GRKDEPDAFKELGTGNRIATWLFYMSDVAAGGATVFPEV-------------------GA 461

Query: 238 SVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           +VKP +G A+ ++++ P    D S+ H  CPV+ GNKW + KW+H
Sbjct: 462 AVKPLKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKWVSNKWIH 506


>gi|260825355|ref|XP_002607632.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
 gi|229292980|gb|EEN63642.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
          Length = 519

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 68/208 (32%), Positives = 106/208 (50%), Gaps = 22/208 (10%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E V  +P+ +V HN LS  E E +  LA+P +  +   +  TG +  S  R S   +L  
Sbjct: 315 EQVFDKPKLWVLHNILSDPEMEVIKKLAQPRLRPAATQNPTTGGAVLSSYRISKNAWLYY 374

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL--DEFN-TKNGGQR 194
            + R+I  +++R+ D T + ME  E +QV++Y +G  Y+ H+D     +EF    N G R
Sbjct: 375 WEHRLINRVKQRVEDATGLTMETAEPLQVINYGIGGHYEPHFDCATKDEEFALDPNEGDR 434

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +AT+L Y+SDVE GG TVFP                     G  V P++G    ++++  
Sbjct: 435 IATMLFYMSDVEAGGATVFPQV-------------------GARVVPEKGAGAFWYNLLK 475

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMH 282
               D  + H GCPV+ G+KW +  W+H
Sbjct: 476 SGEGDMLTEHAGCPVLVGSKWVSNMWIH 503


>gi|345326417|ref|XP_001510155.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like
           [Ornithorhynchus anatinus]
          Length = 888

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 65/203 (32%), Positives = 104/203 (51%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ LS  E E + +LAKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 689 PHIVRYYDVLSDEEIEKIKELAKPKLARATVRDPKTGVLTVANYRVSKSSWLEEEDDPVV 748

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+      D F     G R+AT L
Sbjct: 749 AQVNRRMQYITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFL 808

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP    +F +  W               PK+G A+ ++++      D
Sbjct: 809 NYMSDVEAGGATVFP----DFGAAIW---------------PKKGTAVFWYNLFRSGEGD 849

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 850 YRTRHAACPVLVGCKWVSNKWFH 872


>gi|387016442|gb|AFJ50340.1| Prolyl 4-hydroxylase subunit alpha-2-like [Crotalus adamanteus]
          Length = 533

 Score =  114 bits (285), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 65/201 (32%), Positives = 102/201 (50%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y+  LS  E E + +LAKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYEVLSDEEIEKIKELAKPKLARATVRDPKTGVLTVANYRVSKSSWLEEEDDLVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             +  R+   T +  +  E +QV +Y +G +Y+ H+D+    F+   K  G R+AT L Y
Sbjct: 396 ARVNHRMEQITGLTTKTAELLQVANYGMGGQYEPHFDFSRRPFDITLKTEGNRLATFLNY 455

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP    +F +  W               PK+G A+ ++++      D  
Sbjct: 456 MSDVEAGGATVFP----DFGAAIW---------------PKKGTAVFWYNLFRSGEGDYR 496

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 497 TRHAACPVLVGCKWVSNKWFH 517


>gi|195391760|ref|XP_002054528.1| GJ22757 [Drosophila virilis]
 gi|194152614|gb|EDW68048.1| GJ22757 [Drosophila virilis]
          Length = 534

 Score =  114 bits (284), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 63/201 (31%), Positives = 103/201 (51%), Gaps = 20/201 (9%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P    +H+ LS+     L ++A P+M +STV     GQ + S  R S   +L       
Sbjct: 327 DPLVVTFHDMLSQHRIAELREMAVPHMQRSTVNPLPGGQRRKSAFRVSKNAWLPYSTHPT 386

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNG-GQRMATLLMY 201
           +  + + ++D T + M   E +QV +Y VG  Y+ H+D+F D  +     G R+AT + Y
Sbjct: 387 MGRMLRDVSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDSRHYPAAEGNRIATAIFY 446

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           LSDVE+GG T FP  N                    +V+P+ G+ L ++++   + +D  
Sbjct: 447 LSDVEQGGATAFPFLN-------------------FAVRPQLGNILFWYNLHRSSDMDFR 487

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H GCPV++G+KW A  W+H
Sbjct: 488 TKHAGCPVLKGSKWIANIWIH 508


>gi|426229221|ref|XP_004008689.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
           [Ovis aries]
          Length = 487

 Score =  114 bits (284), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 104/201 (51%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 290 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 349

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             +  R+   T + ++  E +QV +Y +G +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 350 ARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 409

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 410 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 450

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 451 TRHAACPVLVGCKWVSNKWFH 471


>gi|73970649|ref|XP_850109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Canis
           lupus familiaris]
          Length = 533

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 104/201 (51%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             +  R+   T + ++  E +QV +Y +G +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 396 ARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 456 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 497 TRHAACPVLVGCKWVSNKWFH 517


>gi|355709025|gb|AES03456.1| prolyl 4-hydroxylase, alpha polypeptide II [Mustela putorius furo]
          Length = 532

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 104/201 (51%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             +  R+   T + ++  E +QV +Y +G +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 396 ARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 456 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 497 TRHAACPVLVGCKWVSNKWFH 517


>gi|125772813|ref|XP_001357665.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
 gi|54637397|gb|EAL26799.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
          Length = 534

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 67/201 (33%), Positives = 100/201 (49%), Gaps = 20/201 (9%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P    YH+ LS  +   L  +A P M +STV     GQ+K S  R S   +L       
Sbjct: 327 DPFVVTYHDMLSPRKIADLRLMAVPRMHRSTVNPLPGGQNKKSSFRVSKNAWLAYDSHPT 386

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD-EFNTKNGGQRMATLLMY 201
           + G+   ++D T + M   E +QV +Y VG  Y+ H+D+F D +      G RMAT + Y
Sbjct: 387 MGGMLSDLSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDPDHYPAEEGNRMATAIFY 446

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           LSDVE+GG T FP  N                    +VKP+ G+ L ++++     +D  
Sbjct: 447 LSDVEQGGATAFPFLN-------------------FAVKPQLGNVLFWYNVHRSLDVDYR 487

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H GCPV++G+KW    W+H
Sbjct: 488 TKHAGCPVLKGSKWIGNVWIH 508


>gi|195444366|ref|XP_002069834.1| GK11733 [Drosophila willistoni]
 gi|194165919|gb|EDW80820.1| GK11733 [Drosophila willistoni]
          Length = 517

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 75/235 (31%), Positives = 118/235 (50%), Gaps = 28/235 (11%)

Query: 50  PPNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYM 109
           PP  L+ +     +    IA  K     E++   P    Y++ L+ +E E L  ++ P +
Sbjct: 294 PPKGLSCYYEYGADPFLRIAPFK----VELLNRSPYVAAYYDVLNDSEIEELKLMSSPQI 349

Query: 110 VKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFT--FIPMEHGEGIQVL 167
            +S + +      +    RTS+  F++     ++  I +R AD T  ++     E +QV+
Sbjct: 350 RRSLLYNHTLDIDQADVDRTSNSVFMEETGITLLETISQRAADMTDLYVTAISSEDLQVI 409

Query: 168 HYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWN 227
           +Y +G +Y  H DYF DE N +NG  R+AT+L YL+DV++GG TVFP             
Sbjct: 410 NYGLGGQYTPHCDYF-DE-NAENGD-RLATVLFYLTDVQQGGATVFPFLR---------- 456

Query: 228 ELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                    LS  PK+G AL+F ++    + D  S H  CPV+ GNKW ATKW++
Sbjct: 457 ---------LSYFPKKGSALIFRNLDNAMSGDKDSTHSACPVLFGNKWVATKWIY 502


>gi|74353841|gb|AAI03334.1| Prolyl 4-hydroxylase, alpha polypeptide II [Bos taurus]
          Length = 487

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 104/201 (51%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 290 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 349

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             +  R+   T + ++  E +QV +Y +G +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 350 ARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 409

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 410 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 450

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 451 TRHAACPVLVGCKWVSNKWFH 471


>gi|195159317|ref|XP_002020528.1| GL14042 [Drosophila persimilis]
 gi|194117297|gb|EDW39340.1| GL14042 [Drosophila persimilis]
          Length = 534

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 67/201 (33%), Positives = 100/201 (49%), Gaps = 20/201 (9%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P    YH+ LS  +   L  +A P M +STV     GQ+K S  R S   +L       
Sbjct: 327 DPFVVTYHDMLSPRKIADLRLMAVPRMHRSTVNPLPGGQNKKSSFRVSKNAWLAYDSHPT 386

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD-EFNTKNGGQRMATLLMY 201
           + G+   ++D T + M   E +QV +Y VG  Y+ H+D+F D +      G RMAT + Y
Sbjct: 387 MGGMLSDLSDATGLDMTFCEQLQVANYGVGGHYEPHWDFFRDPDHYPAEEGNRMATAIFY 446

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           LSDVE+GG T FP  N                    +VKP+ G+ L ++++     +D  
Sbjct: 447 LSDVEQGGATAFPFLN-------------------FAVKPQLGNVLFWYNVHRSLDVDYR 487

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H GCPV++G+KW    W+H
Sbjct: 488 TKHAGCPVLKGSKWIGNVWIH 508


>gi|312383453|gb|EFR28539.1| hypothetical protein AND_03427 [Anopheles darlingi]
          Length = 341

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 62/209 (29%), Positives = 112/209 (53%), Gaps = 10/209 (4%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P   +YH+ +S  E E +   A+P   ++TV + KTG+ + +  R S   +LK  +  +
Sbjct: 113 QPDIVIYHDVMSDREIELIKHYARPRFRRATVQNYKTGELEFANYRISKSAWLKDTEHEV 172

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATL 198
           IR + +R+ D T + M   E +QV++Y +G  Y+ H+D+   E    F +   G R+AT+
Sbjct: 173 IRTVNQRVEDMTGLTMATAEELQVVNYGIGGHYEPHFDFARREERNAFKSLGTGNRIATV 232

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQ-----GLSVKPKRGDALLFWSMR 253
           L Y+SD+     T    A+  F SV   +++++ G        L+++P++G A  + ++ 
Sbjct: 233 LFYVSDL-CLCHTSHTNADFRFLSVGQMSDVTQGGATVFPSLNLALRPRKGTAAFWHNLH 291

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                D ++ H  CPV+ G KW + KW+H
Sbjct: 292 ASGNGDYATRHAACPVLTGTKWVSNKWIH 320


>gi|291387304|ref|XP_002710243.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
           precursor (predicted)-like isoform 3 [Oryctolagus
           cuniculus]
          Length = 535

 Score =  114 bits (284), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 63/203 (31%), Positives = 103/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             I +R+   T + ++  E +QV +Y +G +Y+ H+D+      D F     G R+AT L
Sbjct: 396 ARINRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRNNERDAFKRLGTGNRVATFL 455

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 456 NYMSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFH 519


>gi|440912197|gb|ELR61789.1| Prolyl 4-hydroxylase subunit alpha-2, partial [Bos grunniens mutus]
          Length = 535

 Score =  113 bits (283), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 104/201 (51%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 397

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             +  R+   T + ++  E +QV +Y +G +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 398 ARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 457

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 458 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 498

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 499 TRHAACPVLVGCKWVSNKWFH 519


>gi|226874885|ref|NP_001029465.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Bos
           taurus]
 gi|296485623|tpg|DAA27738.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Bos taurus]
          Length = 533

 Score =  113 bits (283), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 104/201 (51%), Gaps = 21/201 (10%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMY 201
             +  R+   T + ++  E +QV +Y +G +Y+ H+D+    F++  K  G R+AT L Y
Sbjct: 396 ARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           +SDVE GG TVFP                     G ++ PK+G A+ ++++      D  
Sbjct: 456 MSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYR 496

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H  CPV+ G KW + KW H
Sbjct: 497 TRHAACPVLVGCKWVSNKWFH 517


>gi|449267219|gb|EMC78185.1| Prolyl 4-hydroxylase subunit alpha-2 [Columba livia]
          Length = 538

 Score =  113 bits (283), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 64/203 (31%), Positives = 103/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E +  LAKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 339 PHIVRYYDVMSDEEIEKIKQLAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 398

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+      D F     G R+AT L
Sbjct: 399 AKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFL 458

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP    +F +  W               PK+G A+ ++++      D
Sbjct: 459 NYMSDVEAGGATVFP----DFGAAIW---------------PKKGTAVFWYNLFRSGEGD 499

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 500 YRTRHAACPVLVGCKWVSNKWFH 522


>gi|268572523|ref|XP_002641343.1| C. briggsae CBR-DPY-18 protein [Caenorhabditis briggsae]
 gi|94442971|emb|CAJ98658.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
          Length = 559

 Score =  113 bits (283), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 64/210 (30%), Positives = 104/210 (49%), Gaps = 23/210 (10%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
            EI  + P A ++ + +S  E   + +LAKP + ++TV DS TG+   +  R S   +LK
Sbjct: 321 VEIKRFNPLAVLFKDVISDEEVATIQELAKPKLARATVHDSVTGKLVTATYRISKSAWLK 380

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGG 192
             +  ++  + KRI   T + ME  E +Q+ +Y +G  YD H+D+   E    F +   G
Sbjct: 381 AWEHEVVERVNKRIDLMTNLEMETAEELQIANYGIGGHYDPHFDHAKKEESKSFESLGTG 440

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT+L Y+S    GG TVF    +                   +V P + DAL ++++
Sbjct: 441 NRIATVLFYMSQPSHGGGTVFTEVKS-------------------TVLPTKNDALFWYNL 481

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 +P + H  CPV+ G KW + KW+H
Sbjct: 482 YKQGDGNPDTRHAACPVLVGIKWVSNKWIH 511


>gi|355691582|gb|EHH26767.1| hypothetical protein EGK_16829 [Macaca mulatta]
 gi|355750162|gb|EHH54500.1| hypothetical protein EGM_15360 [Macaca fascicularis]
 gi|384939464|gb|AFI33337.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Macaca
           mulatta]
          Length = 535

 Score =  113 bits (283), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 62/203 (30%), Positives = 104/203 (51%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+  ++    F     G R+AT L
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRNDERHTFKHLGTGNRVATFL 455

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 456 NYMSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFH 519


>gi|54792285|emb|CAG28668.1| prolyl 4-hydroxylase alpha-2 subunit [Gallus gallus]
          Length = 538

 Score =  113 bits (283), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 64/203 (31%), Positives = 103/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E +  LAKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 340 PHIVRYYDVMSDEEIEKIKQLAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 399

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+      D F     G R+AT L
Sbjct: 400 AKVNQRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFL 459

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP    +F +  W               PK+G A+ ++++      D
Sbjct: 460 NYMSDVEAGGATVFP----DFGAAIW---------------PKKGTAVFWYNLFRSGEGD 500

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 501 YRTRHAACPVLVGCKWVSNKWFH 523


>gi|351706369|gb|EHB09288.1| Prolyl 4-hydroxylase subunit alpha-2 [Heterocephalus glaber]
          Length = 535

 Score =  113 bits (283), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 63/203 (31%), Positives = 103/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y+N +S  E + + +LAKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYNVMSDEEIDRIKELAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+      D F     G R+AT L
Sbjct: 396 ARVNRRMQYITGLTVQTAELLQVANYGMGGQYEPHFDFSRNHERDAFKRLGTGNRVATFL 455

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 456 NYMSDVEAGGATVFP-------------------DLGAALWPKKGTAVFWYNLLRSGEGD 496

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFH 519


>gi|196011912|ref|XP_002115819.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
 gi|190581595|gb|EDV21671.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
          Length = 300

 Score =  113 bits (283), Expect = 8e-23,   Method: Compositional matrix adjust.
 Identities = 63/209 (30%), Positives = 105/209 (50%), Gaps = 23/209 (11%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD--SRVRTSSGTFL 135
           E ++ +P   +YHN  S AE E L  LA   +  + V  + +  +++     R +   F+
Sbjct: 98  EEMSRDPLIILYHNLTSNAEMESLKALAAKQLQPAGVYHTTSADNRNLEGYTRIAKMAFI 157

Query: 136 KRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKN--GGQ 193
              +  +   I +R+ D T + M   E +QV++Y +  +Y  HYD F  +   ++     
Sbjct: 158 LDEESAVASAITQRLQDVTGLNMNFSEPLQVINYGIAGQYTPHYDTFPAKSGDRSHPSHD 217

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+AT ++YLSDVE GG TVF   N                   + V P++G+ +++++  
Sbjct: 218 RLATAILYLSDVERGGATVFTNIN-------------------VRVLPRKGNVIIWYNYL 258

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           PD  L P +LH GCPV+ G+KW A KW+ 
Sbjct: 259 PDGNLHPGTLHAGCPVLVGSKWIANKWIQ 287


>gi|219113719|ref|XP_002186443.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|209583293|gb|ACI65913.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 230

 Score =  113 bits (283), Expect = 9e-23,   Method: Compositional matrix adjust.
 Identities = 77/230 (33%), Positives = 111/230 (48%), Gaps = 39/230 (16%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTG--------QSKDSRVRT 129
           ++++  PRAF   NFLS+ E E+++ LA    +K +     TG        Q+   R RT
Sbjct: 6   KVLSCAPRAFEIENFLSRQEVEHIVQLASGVDLKLSSTGDITGHKETPKELQTDSRRTRT 65

Query: 130 SSGTFLKRGQDRIIRGIEKRIADFTFIP---MEH----------------GEGIQVLHYE 170
           S  +++ R +  II  I +R AD   I    + H                 E +Q++HY 
Sbjct: 66  SYNSWVPREKSPIIDAIYRRAADVMRIDEALLRHRSDHTEWTNLTSTKPLAEQLQLVHYG 125

Query: 171 VGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELS 230
            GQ+Y AH+D+     + +  G R  TLL+YL++   GGET FP         RW N  +
Sbjct: 126 PGQEYTAHHDFGFSRIDDQFQGARFGTLLLYLNEGMTGGETSFP---------RWSN--A 174

Query: 231 ECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKW 280
           E   + LS+KP+ G A+LF+S  PD  LD  S H   PV  G KW    W
Sbjct: 175 ETFHE-LSIKPEVGKAVLFYSQLPDGNLDDLSHHAAKPVTDGEKWLINLW 223


>gi|395817620|ref|XP_003782263.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Otolemur
           garnettii]
          Length = 540

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 63/203 (31%), Positives = 102/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 341 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 400

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             +  R+   T + ++  E +QV +Y VG +Y+ H+D+      D F     G R+AT L
Sbjct: 401 ARVNHRMQHITGLSVKTAELLQVANYGVGGQYEPHFDFSRNHERDAFKRLGTGNRVATFL 460

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 461 NYMSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGD 501

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 502 YRTRHAACPVLVGCKWVSNKWFH 524


>gi|157818741|ref|NP_001101745.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Rattus norvegicus]
 gi|149052604|gb|EDM04421.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide (predicted),
           isoform CRA_a [Rattus norvegicus]
          Length = 535

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 62/203 (30%), Positives = 103/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+      D F     G R+AT L
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDERDAFKRLGTGNRVATFL 455

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 456 NYMSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFH 519


>gi|410948134|ref|XP_003980796.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Felis
           catus]
          Length = 535

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 62/203 (30%), Positives = 103/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+      D F     G R+AT L
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKNEQDAFKRLGTGNRVATFL 455

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 456 NYMSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFH 519


>gi|354474413|ref|XP_003499425.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
           [Cricetulus griseus]
          Length = 535

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 62/203 (30%), Positives = 103/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+      D F     G R+AT L
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRVATFL 455

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 456 NYMSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFH 519


>gi|226874876|ref|NP_035161.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Mus
           musculus]
 gi|148701601|gb|EDL33548.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_f [Mus
           musculus]
          Length = 537

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 62/203 (30%), Positives = 103/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 397

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+      D F     G R+AT L
Sbjct: 398 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRVATFL 457

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 458 NYMSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGD 498

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 499 YRTRHAACPVLVGCKWVSNKWFH 521


>gi|195159164|ref|XP_002020452.1| GL13506 [Drosophila persimilis]
 gi|194117221|gb|EDW39264.1| GL13506 [Drosophila persimilis]
          Length = 536

 Score =  113 bits (282), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 75/212 (35%), Positives = 111/212 (52%), Gaps = 28/212 (13%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E ++ +P   VYHN LS AE   +  +A+P ++KS  V  +   SK S+VRT+ G ++  
Sbjct: 326 EELSLDPYIVVYHNVLSDAEIAKVERVAEP-LLKSIGV-GEMDNSKKSKVRTALGAWIPD 383

Query: 138 GQDRI-----IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN-TKNG 191
               I     I+ I +RI D T + ++ G+ +Q++ Y  G  YD H+DY  D    T+  
Sbjct: 384 ENMHISGWPVIQRIVRRIHDMTGLIIKRGQVVQLIKYGYGGHYDTHFDYLNDSLPITQAL 443

Query: 192 GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWS 251
           G RMAT+L YL+DV+ GG TVFP                      L V  +RG  L++++
Sbjct: 444 GDRMATVLFYLNDVKHGGSTVFPVLQ-------------------LKVPSERGKVLVWYN 484

Query: 252 MRPDA-TLDPSSLHGGCPVIRGNKWSATKWMH 282
           M  +   LD  +LHG CPVI G K   + W+H
Sbjct: 485 MHGETHDLDSRTLHGSCPVIDGAKTVLSCWIH 516


>gi|332140647|ref|YP_004426385.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
           'Deep ecotype']
 gi|327550669|gb|AEA97387.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
           'Deep ecotype']
          Length = 376

 Score =  112 bits (281), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 70/199 (35%), Positives = 100/199 (50%), Gaps = 25/199 (12%)

Query: 88  VYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ-DRIIRGI 146
           VY + LS+ EC YLI      +  S VVD  TG+ K   VRTS    ++    D I R +
Sbjct: 180 VYESILSEYECRYLITKFNALLKPSMVVDPVTGRGKIDSVRTSYVAVIEPAHCDWITRKL 239

Query: 147 EKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT----KNGGQRMATLLMYL 202
           +K I+  T    ++GE + +L Y  GQ+Y  HYD  L+E N     K+G QR+ T L+YL
Sbjct: 240 DKTISQITHTLRQNGEALNLLRYSPGQQYKPHYD-GLNEINDALMFKDGKQRIKTALVYL 298

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           + + EGGET+FP                   K  + + PK G  ++F +   +  L  +S
Sbjct: 299 NTISEGGETLFP-------------------KLDIRIAPKSGTMVVFSNSDENGKLLLNS 339

Query: 263 LHGGCPVIRGNKWSATKWM 281
            H G P +  NKW  TKW+
Sbjct: 340 YHAGAPTVSENKWLVTKWI 358


>gi|2498741|sp|Q60716.1|P4HA2_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|836900|gb|AAC52198.1| prolyl 4-hydroxylase alpha(II)-subunit [Mus musculus]
 gi|18073923|emb|CAC85691.1| Prolyl 4-hydroxylase alpha IIb subunit [Mus musculus]
 gi|1096888|prf||2112362B Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=II
          Length = 537

 Score =  112 bits (281), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 62/203 (30%), Positives = 103/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 338 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 397

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+      D F     G R+AT L
Sbjct: 398 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDDEDAFKRLGTGNRVATFL 457

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 458 NYMSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGD 498

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 499 YRTRHAACPVLVGCKWVSNKWFH 521


>gi|194765178|ref|XP_001964704.1| GF23330 [Drosophila ananassae]
 gi|190614976|gb|EDV30500.1| GF23330 [Drosophila ananassae]
          Length = 537

 Score =  112 bits (281), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 100/201 (49%), Gaps = 20/201 (9%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P    YH+ LS  +   L  +A P M +STV     GQ+K S  R S   +L       
Sbjct: 330 DPYVVTYHDMLSAQKIRDLRQMAVPRMRRSTVNPLPGGQNKKSAFRVSKNAWLAYESHPT 389

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN-TKNGGQRMATLLMY 201
           + G+ + + D T +   + E +QV +Y VG  Y+ H+D+F D  +     G R+AT + Y
Sbjct: 390 MEGMLRDLKDATGLDTTYCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFY 449

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           LSDVE+GG T FP  +                    +VKP+ G+ L ++++     +D  
Sbjct: 450 LSDVEQGGATAFPFLD-------------------FAVKPQLGNVLFWYNLHRSLDMDYR 490

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H GCPV++G+KW    W+H
Sbjct: 491 TKHAGCPVLKGSKWIGNVWIH 511


>gi|335283456|ref|XP_003354320.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Sus scrofa]
          Length = 535

 Score =  112 bits (281), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 62/203 (30%), Positives = 103/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+      D F     G R+AT L
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEQDAFKRLGTGNRVATFL 455

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 456 NYMSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFH 519


>gi|195452734|ref|XP_002073476.1| GK13124 [Drosophila willistoni]
 gi|194169561|gb|EDW84462.1| GK13124 [Drosophila willistoni]
          Length = 536

 Score =  112 bits (281), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 104/201 (51%), Gaps = 20/201 (9%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P    YH+ LS  +   L ++A P+M +STV     GQ+K S  R S   +L       
Sbjct: 329 DPFVVTYHDMLSPNKIAQLREMAVPHMRRSTVNPLPGGQNKKSSFRVSKNAWLAYETHPT 388

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD-EFNTKNGGQRMATLLMY 201
           +  + + ++D T + M + E +QV +Y VG  Y+ H+D+F + +      G R+AT + Y
Sbjct: 389 MGKMLRDLSDTTGLDMTYCEQLQVANYGVGGHYEPHWDFFRNPDHYPAEEGNRIATAIYY 448

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           LS+VE+GG T FP  N                    +V+P+ G+ L ++++   + +D  
Sbjct: 449 LSEVEQGGATAFPFLN-------------------FAVRPQLGNVLFWYNLHRSSDMDYR 489

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H GCPV++G+KW    W+H
Sbjct: 490 TKHAGCPVLKGSKWIGNVWIH 510


>gi|149052606|gb|EDM04423.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide (predicted),
           isoform CRA_c [Rattus norvegicus]
          Length = 506

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 62/203 (30%), Positives = 103/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 307 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 366

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+      D F     G R+AT L
Sbjct: 367 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDERDAFKRLGTGNRVATFL 426

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 427 NYMSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGD 467

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 468 YRTRHAACPVLVGCKWVSNKWFH 490


>gi|348557542|ref|XP_003464578.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
           [Cavia porcellus]
          Length = 535

 Score =  112 bits (281), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 62/203 (30%), Positives = 103/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEEDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+      D F     G R+AT L
Sbjct: 396 ARVNRRMQQITGLTVKTAELLQVANYGMGGQYEPHFDFSRSHERDAFKRLGTGNRVATFL 455

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 456 NYMSDVEAGGATVFP-------------------DLGAALWPKKGTAVFWYNLLRSGEGD 496

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFH 519


>gi|148701597|gb|EDL33544.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_b [Mus
           musculus]
          Length = 506

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 62/203 (30%), Positives = 103/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 307 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 366

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+      D F     G R+AT L
Sbjct: 367 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRVATFL 426

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 427 NYMSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGD 467

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 468 YRTRHAACPVLVGCKWVSNKWFH 490


>gi|344264847|ref|XP_003404501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
           [Loxodonta africana]
          Length = 536

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 62/203 (30%), Positives = 102/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E +  +AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 337 PHIVRYYDVMSDEEIERIKQIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 396

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+      D F     G R+AT L
Sbjct: 397 AQVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRSHEQDAFKRLGTGNRVATFL 456

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 457 NYMSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGD 497

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 498 YRTRHAACPVLVGCKWVSNKWFH 520


>gi|395814850|ref|XP_003780953.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Otolemur
           garnettii]
          Length = 544

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 67/214 (31%), Positives = 111/214 (51%), Gaps = 28/214 (13%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP   +YH+F+S +E + + +LA+P++ +S V   +     D R+  S   +LK 
Sbjct: 340 EVIHLEPFVALYHDFVSDSEAQKIRELAEPWLQRSVVASGEKQLQVDYRI--SKSAWLKD 397

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++  ++ RIA  T + ++  + E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 398 TVDPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 457

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F  AN                    SV   +  AL +W++
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYAN-------------------FSVPVVKNAALFWWNL 498

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
             +   D  +LH GCPV+ G+KW A KW  +HEY
Sbjct: 499 HRNGEGDSDTLHAGCPVLVGDKWVANKW--IHEY 530


>gi|195390831|ref|XP_002054071.1| GJ22995 [Drosophila virilis]
 gi|194152157|gb|EDW67591.1| GJ22995 [Drosophila virilis]
          Length = 485

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 68/206 (33%), Positives = 102/206 (49%), Gaps = 35/206 (16%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSR-VRTSSGTFLK 136
           E++  +P    +H+ LS  E   L  LA P + ++TV DS  G     +  RTS G +L 
Sbjct: 298 EVLVVKPFIVAFHDVLSPHEIGELQQLAMPLLKRTTVYDSNAGLHGSVKGTRTSKGIWLS 357

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMA 196
           R  + + + I +RI+D T   +E    +QV++Y +   Y  H DYF    NT        
Sbjct: 358 RSHNNLTKRIGRRISDMTGFHLEGSTSLQVMNYGLSGHYALHTDYF----NTAE------ 407

Query: 197 TLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
                LSDVE+GG+TVFP     F                   KP+RG ALL++++  + 
Sbjct: 408 -----LSDVEQGGDTVFPRIEQAF-------------------KPERGKALLWYNLHRNG 443

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWMH 282
           T D  + HG CPV+ G+KW  T+W++
Sbjct: 444 TGDKRTEHGACPVLVGSKWIMTQWIN 469


>gi|221126103|ref|XP_002165259.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
           magnipapillata]
          Length = 533

 Score =  112 bits (280), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 61/210 (29%), Positives = 106/210 (50%), Gaps = 20/210 (9%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  +P   +Y+  ++  E +++I  AKP + ++ V D  TG    +  R S  T++  
Sbjct: 327 EVLHHDPYIELYYELITDDEAKHIIKFAKPLLRRAFVHDMVTGDLIYADYRVSKNTWIAE 386

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL---DEFNTKNGGQR 194
             D I   I +R+ D T + M + E +QV +Y +  +Y+ H+D+      +   + GG R
Sbjct: 387 DMDVIAAKIIRRVGDVTGLNMRYAEHLQVANYGIAGQYEPHFDHSTGTRPKHFDRWGGNR 446

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +AT+L+YLSDV+ GG TVF                      G+   P +G  + ++++  
Sbjct: 447 IATMLLYLSDVDWGGRTVFTNT-----------------APGVGTDPIKGAGVFWYNLLR 489

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMHVH 284
           +   +P + H GCPV+ G KW A  W+H H
Sbjct: 490 NGKSNPKTQHAGCPVVLGQKWVANLWIHEH 519


>gi|195110925|ref|XP_002000030.1| GI22756 [Drosophila mojavensis]
 gi|193916624|gb|EDW15491.1| GI22756 [Drosophila mojavensis]
          Length = 533

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 101/201 (50%), Gaps = 20/201 (9%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P    YH+ LS  +   L  +A P+M +STV     GQ   S  R S   +L      +
Sbjct: 326 DPLVVSYHDMLSPQQIGELRAMAVPHMQRSTVNPLSGGQRMKSAFRVSKNAWLPYSTHPM 385

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNG-GQRMATLLMY 201
           +  + + + D T + M + E +QV +Y VG  Y+ H+D+F D  +     G R+AT + Y
Sbjct: 386 MGRMLRDVGDATGLDMTYCEQLQVANYGVGGHYEPHWDFFRDSRHYPAAEGNRIATAIFY 445

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           LSDVE+GG T FP  N                    +V+P+ G+ L ++++   +  D  
Sbjct: 446 LSDVEQGGATAFPFLN-------------------FAVRPQLGNILFWYNLHRSSDEDYR 486

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H GCPV++G+KW A  W+H
Sbjct: 487 TKHAGCPVLKGSKWIANIWIH 507


>gi|255607134|ref|XP_002538686.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223510975|gb|EEF23697.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 318

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 72/239 (30%), Positives = 117/239 (48%), Gaps = 31/239 (12%)

Query: 50  PPNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYM 109
           PP+    F        ++IA    +    +V   PR  ++ + LS AEC+ LI  ++  +
Sbjct: 92  PPSTAGGFLDPVEIAGNAIALSDRDIKVVMVCTAPRIALFDDVLSDAECDALIAASRSRL 151

Query: 110 VKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHY 169
            +S VV ++         RTS G +  +G++ ++  I++RIA+ T  P+ H E +Q+L+Y
Sbjct: 152 QRSKVVANRGSGEFVDDTRTSYGAYFNKGENSLVATIQRRIAELTRWPLTHAEPLQILNY 211

Query: 170 EVGQKYDAHYDYFLDE-----FNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVR 224
            +G +Y  H+DYF  +        ++GGQR+AT++MYL+DVE GG T+FP  N       
Sbjct: 212 GLGGEYLPHFDYFEPQQPGLPSPLESGGQRIATVVMYLNDVEAGGGTIFPHLN------- 264

Query: 225 WWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIR---GNKWSATKW 280
                       L  +P++G A+ F      A     S+   C   R     KW AT+W
Sbjct: 265 ------------LETRPRKGGAIYFSYQLAVA----RSIRSRCMAARRIARRKWIATQW 307


>gi|350014318|dbj|GAA37183.1| prolyl 4-hydroxylase [Clonorchis sinensis]
          Length = 595

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 65/209 (31%), Positives = 109/209 (52%), Gaps = 24/209 (11%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  +PR  ++++ +  +E   + +LA P + ++TV +  TG+ +++  RTS   +L+ 
Sbjct: 384 EVLYPDPRIVMWYDVIHPSEVGRIQELALPRLRRATVKNPVTGKLENAYYRTSKSAWLQD 443

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQ 193
           G D +   + +RI   T + ME  E +QV +Y +G  Y  H+D+      D F  +N G 
Sbjct: 444 GLDEVTHRLNQRIHALTGLAMETAEDLQVGNYGIGGYYAPHFDFGRKREKDAFEVEN-GN 502

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+AT++ YL+DV+ GG TVF                    + G SVKP RG A  ++++ 
Sbjct: 503 RIATIIFYLTDVKAGGATVF-------------------NRFGASVKPVRGAAGFWYNLH 543

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           P    D  + H  CPV+ G+KW    W H
Sbjct: 544 PSGEGDLRTRHVACPVLVGSKWVMNVWFH 572


>gi|126327904|ref|XP_001367838.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Monodelphis
           domestica]
          Length = 559

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 65/210 (30%), Positives = 108/210 (51%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP   +YH+F+S +E + +   A P++ +S V   +  Q  + R+  S+  +LK 
Sbjct: 355 EVLHLEPYIVLYHDFVSDSEAQKIRGFAAPWLQRSVVASGEKQQQVEYRISKSA--WLKD 412

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++  ++ RIA  T + ++  + E +QV++Y +G  Y+ H+D+     +     N G
Sbjct: 413 TVDPMLVSLDHRIAALTGLNVQPPYAEHLQVVNYGIGGHYEPHFDHATSPSSPLYRMNSG 472

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F  AN                    SV   +  AL +W++
Sbjct: 473 NRVATFMIYLSSVEAGGSTAFIYAN-------------------FSVPVVKNAALFWWNL 513

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH GCPV+ G+KW A KW+H
Sbjct: 514 HRSGEGDGDTLHAGCPVLVGDKWVANKWIH 543


>gi|405964866|gb|EKC30308.1| KRR1 small subunit processome component-like protein [Crassostrea
           gigas]
          Length = 885

 Score =  112 bits (279), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 71/246 (28%), Positives = 112/246 (45%), Gaps = 45/246 (18%)

Query: 60  RAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKT 119
           R F + + I   K ++  E+V +EPR  ++H+ +S    E+L  +A   + +STV    T
Sbjct: 637 RCFLRDTVIPYYKAKE--EVVNYEPRIAIFHDVISSTSIEHLKSIASKGLTRSTVFLENT 694

Query: 120 GQS--------KDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPME------HGEGIQ 165
           G +        K   +R S   +++  +   +  +E RI   T +  E      H E  Q
Sbjct: 695 GPNGQVTITYGKQDNIRVSQTCWIRTDEYPELLRLENRIQLITGLSAEYKPVRSHSEKFQ 754

Query: 166 VLHYEVGQKYDAHYDYF----------LDEFNTKNGGQRMATLLMYLSDVEEGGETVFPA 215
           V++Y VG  Y AH+DY           +D  +    G RMAT + Y++D + GG TVFP 
Sbjct: 755 VVNYGVGGMYTAHHDYTGYKLGIISNPMDSEDISTSGDRMATWMFYMNDAKAGGATVFPE 814

Query: 216 ANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKW 275
                   +                   G A  ++++RP    DP +LHGGCPV+ G+KW
Sbjct: 815 VRTRIPVAK-------------------GGAAFWFNLRPSGATDPRTLHGGCPVLVGSKW 855

Query: 276 SATKWM 281
              KW+
Sbjct: 856 VTNKWI 861


>gi|301754231|ref|XP_002912939.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Ailuropoda
           melanoleuca]
          Length = 535

 Score =  111 bits (278), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 62/203 (30%), Positives = 102/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             +  R+   T + ++  E +QV +Y +G +Y+ H+D+      D F     G R+AT L
Sbjct: 396 ARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKNEQDAFKRLGTGNRVATFL 455

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 456 NYMSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFH 519


>gi|324511726|gb|ADY44875.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
          Length = 550

 Score =  111 bits (278), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 67/210 (31%), Positives = 107/210 (50%), Gaps = 23/210 (10%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
            EI+ + P A ++ + +S  E + +  +A P + ++TV +SKTG+ + +  R S   +LK
Sbjct: 322 VEILRFNPLAVLFVDIISDEEAKMIQQIATPRLKRATVQNSKTGELETAAYRISKSAWLK 381

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGG 192
            G   +I  I +RI   T +  E  E +Q+ +Y VG  YD H+D+   E    F +   G
Sbjct: 382 GGDHELIDRINRRIELMTNLIQETSEELQIANYGVGGHYDPHFDFARKEEPKAFESLGTG 441

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT+L YL++ E GG TV       FT +R             +V P +  AL ++++
Sbjct: 442 NRLATVLFYLTEPEIGGGTV-------FTELR------------TAVMPSKNGALFWYNL 482

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  + H  CPV+ G KW A KW+H
Sbjct: 483 YRSGEGDLRTRHAACPVLVGIKWVANKWIH 512


>gi|240974259|ref|XP_002401836.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215491070|gb|EEC00711.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 490

 Score =  111 bits (278), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 62/209 (29%), Positives = 109/209 (52%), Gaps = 23/209 (11%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  +PR  +YH+ +SK E + +  LA+P + ++TV + K+G+ + +  R S   +L+ 
Sbjct: 285 EVMFPKPRIVIYHDVMSKHEMDVVKLLAQPRLKRATVQNYKSGELEVANYRISKSAWLRN 344

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQ 193
            +  +I  + +RI   T +  +  E +QV++Y +G  Y+ H+D+   E    F +   G 
Sbjct: 345 EEHGVIARVTRRIEHITGLSADTAEELQVVNYGIGGHYEPHFDFARREEKNAFQSLGTGN 404

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+AT L Y+SDV  GG TVFP                      L++ P++G A  ++++ 
Sbjct: 405 RIATWLNYMSDVPAGGATVFPQLR-------------------LTLWPEKGAAAFWYNLH 445

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                D  + H  CPV+ G+KW + KW H
Sbjct: 446 RSGEGDMLTRHAACPVLAGSKWVSNKWFH 474


>gi|449673565|ref|XP_002167120.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
           magnipapillata]
          Length = 571

 Score =  111 bits (277), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 62/211 (29%), Positives = 107/211 (50%), Gaps = 22/211 (10%)

Query: 75  QWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTF 134
           Q  E V  +P  F+  N +S+ +   + + A P + ++T+ D  TG+ + +  R S   +
Sbjct: 364 QKVERVWVDPEIFILRNIISEKQINLIKEAASPMLRRATIQDPITGKLRHADYRISKSAW 423

Query: 135 LKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL---DEFNTKNG 191
           L   +   ++ +E R    T + + + E +QV +Y +G  Y+ H+D+     D F     
Sbjct: 424 LSTNKYNFLQALEARTQATTGLDLSYAEQLQVANYGLGGHYEPHFDHSRENEDRFTDLGM 483

Query: 192 GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWS 251
           G R+AT+L YLSDVE GG TVF                   GK   +V P +GDA+ +++
Sbjct: 484 GNRIATVLFYLSDVEAGGATVFT-----------------VGKT--AVFPSKGDAVFWFN 524

Query: 252 MRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           ++ +   +P++ H  CPV+ G KW +  W+H
Sbjct: 525 LKRNGKGNPNTRHAACPVLVGQKWVSNWWIH 555


>gi|351696981|gb|EHA99899.1| Prolyl 4-hydroxylase subunit alpha-3 [Heterocephalus glaber]
          Length = 572

 Score =  111 bits (277), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 65/210 (30%), Positives = 106/210 (50%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP   +YH+F+S  E + +  LA+P++ +S V   +     + R+  S   +LK 
Sbjct: 368 EVIHLEPYVALYHDFVSDPEAQKIRKLAEPWLQRSVVASGEKQLQVEYRI--SKSAWLKD 425

Query: 138 GQDRIIRGIEKRIADFTFIPMEH--GEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++  ++ RIA  T + ++H   E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 426 TADPVLVTLDHRIAALTGLDVQHPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 485

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F  AN                    SV   +  AL +W++
Sbjct: 486 NRVATFMIYLSSVEAGGATAFIYAN-------------------FSVPVVKNAALFWWNL 526

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH GCPV+ G+KW A KW+H
Sbjct: 527 HRSGEGDGDTLHAGCPVLVGDKWVANKWIH 556


>gi|426229219|ref|XP_004008688.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
           [Ovis aries]
          Length = 535

 Score =  111 bits (277), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 62/203 (30%), Positives = 102/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             +  R+   T + ++  E +QV +Y +G +Y+ H+D+      D F     G R+AT L
Sbjct: 396 ARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEQDAFKRLGTGNRVATFL 455

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 456 NYMSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFH 519


>gi|348688210|gb|EGZ28024.1| hypothetical protein PHYSODRAFT_321730 [Phytophthora sojae]
          Length = 487

 Score =  111 bits (277), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 74/221 (33%), Positives = 110/221 (49%), Gaps = 16/221 (7%)

Query: 72  KGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRV-RTS 130
           +G+   E ++  P  F    FL   E + +++L+ P++  S V      +++ +   RTS
Sbjct: 263 RGDLVMETISMTPLVFSVEEFLRDDEIDVVLELSMPHLAPSGVTLQDGHENRPATDWRTS 322

Query: 131 SGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-------- 182
           +  +L+     +++ I+KR AD   +P+ H E +QVL YE  Q YD H DYF        
Sbjct: 323 TTYWLESSSHPVVQDIDKRTADLVKVPISHQESVQVLRYEHTQHYDQHLDYFSVKRHRNS 382

Query: 183 LDEFNTKNGG--QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVK 240
            D       G   RM T+  Y+SDV +GG T F  A             ++   QGLSV 
Sbjct: 383 ADVLKKIEHGYKNRMITVFWYMSDVAKGGHTNFARAGGLPPP-----PTNKGCTQGLSVV 437

Query: 241 PKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           PK+   ++F+SM P+   DP SLH GCPV  G K S  KW+
Sbjct: 438 PKKRKVVVFYSMLPNGEGDPMSLHAGCPVEEGIKMSGNKWV 478


>gi|226874889|ref|NP_001152881.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Bos
           taurus]
 gi|296485624|tpg|DAA27739.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Bos taurus]
          Length = 535

 Score =  111 bits (277), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 62/203 (30%), Positives = 102/203 (50%), Gaps = 23/203 (11%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             +  R+   T + ++  E +QV +Y +G +Y+ H+D+      D F     G R+AT L
Sbjct: 396 ARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKDEQDAFKRLGTGNRVATFL 455

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 456 NYMSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGD 496

Query: 260 PSSLHGGCPVIRGNKWSATKWMH 282
             + H  CPV+ G KW + KW H
Sbjct: 497 YRTRHAACPVLVGCKWVSNKWFH 519


>gi|297301157|ref|XP_001103971.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Macaca
           mulatta]
          Length = 512

 Score =  111 bits (277), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 63/200 (31%), Positives = 99/200 (49%), Gaps = 37/200 (18%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           +  I  RI D T + +   E +QV +Y VG +Y+ H+D+                    +
Sbjct: 394 VSRINMRIQDLTGLDVSTAEELQVANYGVGGQYEPHFDF------------------ARM 435

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           SDV  GG TVFP                     G SV PK+G A+ ++++      D S+
Sbjct: 436 SDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEGDYST 476

Query: 263 LHGGCPVIRGNKWSATKWMH 282
            H  CPV+ GNKW + KW+H
Sbjct: 477 RHAACPVLVGNKWVSNKWLH 496


>gi|348505573|ref|XP_003440335.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oreochromis
           niloticus]
          Length = 517

 Score =  111 bits (277), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 67/211 (31%), Positives = 109/211 (51%), Gaps = 28/211 (13%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+V+ +P   +YH+F++  E E +  LA P + +S V   +   + D R+  S+  +LK 
Sbjct: 313 ELVSLQPYVVLYHDFVTDTEAEDIKSLAHPGLRRSVVAAGEKQATADYRISKSA--WLKG 370

Query: 138 GQDRIIRGIEKRIADFTFIPMEH--GEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNG 191
               I+  +++RI+  T + ++H  GE +QV++Y +G  Y+ H+D+        F  K G
Sbjct: 371 SAQSIVGKLDQRISLLTGLNVKHPYGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKLKTG 430

Query: 192 GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWS 251
             R+AT ++YLS VE GG T F  AN                    SV      A+ +W+
Sbjct: 431 -NRVATFMIYLSPVEAGGSTAFIYAN-------------------FSVPVVEKAAIFWWN 470

Query: 252 MRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           +  +   D  +LH GCPV+ G+KW A KW+H
Sbjct: 471 LHRNGEGDDDTLHAGCPVLIGDKWVANKWIH 501


>gi|410860761|ref|YP_006975995.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii AltDE1]
 gi|410818023|gb|AFV84640.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii AltDE1]
          Length = 376

 Score =  111 bits (277), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 70/199 (35%), Positives = 100/199 (50%), Gaps = 25/199 (12%)

Query: 88  VYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ-DRIIRGI 146
           VY + LS+ EC YLI      +  S VVD  TG+ K   VRTS    ++    D I R +
Sbjct: 180 VYESILSEYECRYLIAKFSALLKPSMVVDPVTGRGKIDSVRTSYVAVIEPTHCDWITRKL 239

Query: 147 EKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT----KNGGQRMATLLMYL 202
           +K I+  T    ++GE + +L Y  GQ+Y  HYD  L+E N     K+G QR+ T L+YL
Sbjct: 240 DKIISQITHTLRQNGEALNLLRYSPGQQYKPHYD-GLNEINDALMFKDGKQRIKTALVYL 298

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           + + EGGET+FP                   K  + + PK G  ++F +   +  L  +S
Sbjct: 299 NTINEGGETLFP-------------------KLDIRIAPKSGTMVVFSNSDENGKLLLNS 339

Query: 263 LHGGCPVIRGNKWSATKWM 281
            H G P +  NKW  TKW+
Sbjct: 340 YHAGAPTVSENKWLVTKWI 358


>gi|198429625|ref|XP_002128613.1| PREDICTED: similar to procollagen-proline, 2-oxoglutarate
           4-dioxygenase (proline 4-hydroxylase), alpha 1
           polypeptide [Ciona intestinalis]
          Length = 195

 Score =  111 bits (277), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 64/198 (32%), Positives = 102/198 (51%), Gaps = 32/198 (16%)

Query: 93  LSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIAD 152
           +S  E   +  LAKP + ++TV +  TG  + +  R S   +LK     +I+ + +RI+D
Sbjct: 1   MSDKEMAMIKSLAKPRLRRATVQNPVTGVLEFAHYRVSKSAWLKDEDHPVIKRVCQRISD 60

Query: 153 FTFIPMEHGEGIQVLHYEVGQKYDAHYDY--------FLDEFNTKNGGQRMATLLMYLSD 204
            T + ME  E +Q+ +Y VG +Y+ H+DY        F DE      G R+AT L Y+S+
Sbjct: 61  VTGLSMETAEELQIANYGVGGQYEPHFDYSRKSDFGKFDDEV-----GNRIATFLTYMSN 115

Query: 205 VEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLH 264
           VE+GG TVF                      G++V+P +G A+ ++++ P    D  + H
Sbjct: 116 VEQGGSTVFL-------------------HPGIAVRPIKGSAVFWYNLLPSGAGDERTRH 156

Query: 265 GGCPVIRGNKWSATKWMH 282
             CPV+ G KW + KW+H
Sbjct: 157 AACPVLTGVKWVSNKWIH 174


>gi|195505209|ref|XP_002099405.1| GE10885 [Drosophila yakuba]
 gi|194185506|gb|EDW99117.1| GE10885 [Drosophila yakuba]
          Length = 473

 Score =  111 bits (277), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 65/208 (31%), Positives = 110/208 (52%), Gaps = 22/208 (10%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+++ +P   ++H+ +S  +   + +LAK  +V++  V +K G  ++   RT+ GT+L  
Sbjct: 277 ELLSLDPYMVLFHDVVSDKDITSIRNLAKGGLVRAVTV-TKDGSYEEDPARTTKGTWLVE 335

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMAT 197
              ++I+ + +   D T + +   +  QVL+Y +G  Y  H+D FL +    N   R+AT
Sbjct: 336 NS-KLIQRLSQLAQDMTNLDIRDADPFQVLNYGIGGYYGTHFD-FLADTEMGNFSNRIAT 393

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
            + YLSDV +GG T+FP                   K GLSV PK+G ALL++++     
Sbjct: 394 AVFYLSDVPQGGATIFP-------------------KLGLSVFPKKGSALLWYNLDHKGD 434

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHE 285
            D  + H  CP I G++W  TKW++  E
Sbjct: 435 GDNRTAHSACPTIVGSRWVMTKWINERE 462


>gi|326435474|gb|EGD81044.1| hypothetical protein PTSG_10986 [Salpingoeca sp. ATCC 50818]
          Length = 264

 Score =  110 bits (276), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 69/212 (32%), Positives = 114/212 (53%), Gaps = 33/212 (15%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD-SRVRTSSGTFLKR---G 138
           +P    ++NF+S+   + ++  AKP   +ST     +G  ++ S  RTSS  ++     G
Sbjct: 68  DPPVIQFNNFISQERIDAILHFAKPKFARST-----SGIEREVSNYRTSSTAWMLPDVLG 122

Query: 139 QDRI---IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRM 195
            D +   ++ +E+ IA    +P+E+ E  QVL Y+  Q Y  H DY ++E   +  G R+
Sbjct: 123 NDPMQAHLKDMEEEIARIVRLPVENQEHFQVLQYQKNQYYKVHSDY-IEEQRQQPCGIRV 181

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           AT  +YL+DVEEGG T FP  N                   L+V+P +G+A+L++S  P+
Sbjct: 182 ATFFLYLNDVEEGGGTRFPNLN-------------------LTVQPAKGNAVLWYSAYPN 222

Query: 256 AT-LDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            T +D  + H   PV +G K+ A KW+H+H++
Sbjct: 223 TTRMDSRTDHEAMPVAKGMKYGANKWIHIHDF 254


>gi|301104296|ref|XP_002901233.1| conserved hypothetical protein [Phytophthora infestans T30-4]
 gi|262101167|gb|EEY59219.1| conserved hypothetical protein [Phytophthora infestans T30-4]
          Length = 535

 Score =  110 bits (276), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 81/245 (33%), Positives = 112/245 (45%), Gaps = 43/245 (17%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLI------DLAKPYMVKSTV-VDSKTGQSKDSRVRTS 130
           E ++  PR F  HNF S  E + LI      D     + +STV  +    + K S+ RTS
Sbjct: 182 ESISESPRTFRLHNFFSGEEADKLIKRTLEIDDPSNKLQQSTVGANDNKNKKKKSKHRTS 241

Query: 131 SGTFLKRGQDRIIRGIEKRIADFTFI---PMEHGEGIQVLHYEVGQKYDAHYDYF----L 183
              F    +  +   I KR+ D   +     +  +G+Q+L Y+  Q Y AH DYF     
Sbjct: 242 ENAFDTVSEAAV--DIRKRVFDVLSLGEFQADMADGLQLLRYQQKQAYIAHEDYFPVGAA 299

Query: 184 DEFN---TKNGGQRMATLLMYLSDVEEGGETVFPAANAN--------------------- 219
            +FN    K G  R AT+ +YLSDV  GG+TVFP A                        
Sbjct: 300 KDFNFDPHKGGSNRFATVFLYLSDVPRGGQTVFPLAEMPEGLPTEYQHPPNSAQDYEAIG 359

Query: 220 ---FTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWS 276
              F    W  ++       L+  P +G A+LF+S +P+  LDP SLHGGCPV+ G KW 
Sbjct: 360 AELFEPGSWEMDMVRKCSTKLASYPSKGGAVLFYSQKPNGELDPKSLHGGCPVLEGTKWG 419

Query: 277 ATKWM 281
           A  W+
Sbjct: 420 ANLWV 424


>gi|313229039|emb|CBY18191.1| unnamed protein product [Oikopleura dioica]
          Length = 522

 Score =  110 bits (276), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 63/206 (30%), Positives = 108/206 (52%), Gaps = 20/206 (9%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
            E +A +P    + + L+  E   L  L +  + ++TV D  T +  ++  R S   +LK
Sbjct: 316 VEEIAKQPYVVRFFDILNDNEINSLERLGEEKLARATVFDPATHKLVNADYRVSKSAWLK 375

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMA 196
                 +    +RI+  T + +E+ E +Q+ +Y +G +Y+ HYDY   E++  N  +R+A
Sbjct: 376 DEDSDTVEKYNRRISRLTGLDLEYAEQLQMSNYGIGGQYEPHYDYSRREWDIYN-NRRIA 434

Query: 197 TLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
           T L YL+ VE+GG TVF              EL      GL ++  +G A+ ++++ P+ 
Sbjct: 435 TWLSYLTTVEQGGGTVF-------------TEL------GLHIRSIKGSAVFWYNLLPNG 475

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWMH 282
           + D  + H  CPV+RGNKW + KW+H
Sbjct: 476 SGDERTRHAACPVLRGNKWVSNKWIH 501


>gi|426365135|ref|XP_004049642.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Gorilla gorilla
           gorilla]
          Length = 500

 Score =  110 bits (276), Expect = 5e-22,   Method: Compositional matrix adjust.
 Identities = 73/212 (34%), Positives = 106/212 (50%), Gaps = 40/212 (18%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTF------LK 136
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S  T       LK
Sbjct: 301 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKRTICLLYINLK 360

Query: 137 RGQDRIIRGIEKRIADFT--FIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKN 190
           R   R+  G    + + T  F+P       QV +Y VG +Y+ H+D+      D F    
Sbjct: 361 RYYTRL--GFLFLLYNTTCPFVP-------QVANYGVGGQYEPHFDFARKDEPDAFKELG 411

Query: 191 GGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFW 250
            G R+AT L Y+SDV  GG TVFP                     G SV PK+G A+ ++
Sbjct: 412 TGNRIATWLFYMSDVSAGGATVFPEV-------------------GASVWPKKGTAVFWY 452

Query: 251 SMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           ++      D S+ H  CPV+ GNKW + KW+H
Sbjct: 453 NLFASGEGDYSTRHAACPVLVGNKWVSNKWLH 484


>gi|332211329|ref|XP_003254773.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Nomascus
           leucogenys]
          Length = 544

 Score =  110 bits (276), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 66/210 (31%), Positives = 107/210 (50%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP   +YH+F+S +E + + +LA+P++ +S V   +     + R+  S   +LK 
Sbjct: 340 EVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVASGEKQLQVEYRI--SKSAWLKD 397

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++  +  RIA  T + +   + E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 398 TVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 457

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F  AN                   LSV   R  AL +W++
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYAN-------------------LSVPVVRNAALFWWNL 498

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH GCPV+ G+KW A KW+H
Sbjct: 499 HRSGEGDSDTLHAGCPVLVGDKWVANKWIH 528


>gi|403263105|ref|XP_003923900.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-3, partial [Saimiri boliviensis boliviensis]
          Length = 534

 Score =  110 bits (276), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 66/210 (31%), Positives = 107/210 (50%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP   +YH+F+S +E + + +LA+P++ +S V   +     + R+  S   +LK 
Sbjct: 330 EVLHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVASGEKQLQVEYRI--SKSAWLKD 387

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++  +  RIA  T + +   + E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 388 TVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 447

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F  AN                   LSV   +  AL +W++
Sbjct: 448 NRVATFMIYLSSVEAGGATAFIYAN-------------------LSVPVVKNAALFWWNL 488

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH GCPV+ GNKW A KW+H
Sbjct: 489 HRSGEGDSDTLHAGCPVLVGNKWVANKWIH 518


>gi|297689698|ref|XP_002822285.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pongo abelii]
          Length = 544

 Score =  110 bits (276), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 66/210 (31%), Positives = 107/210 (50%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP   +YH+F+S +E + + +LA+P++ +S V   +     + R+  S   +LK 
Sbjct: 340 EVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVASGEKQLQVEYRI--SKSAWLKD 397

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++  +  RIA  T + +   + E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 398 TVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 457

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F  AN                   LSV   R  AL +W++
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYAN-------------------LSVPVVRNAALFWWNL 498

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH GCPV+ G+KW A KW+H
Sbjct: 499 HRSGEGDSDTLHAGCPVLVGDKWVANKWIH 528


>gi|219124513|ref|XP_002182546.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217405892|gb|EEC45833.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 193

 Score =  110 bits (275), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 69/207 (33%), Positives = 105/207 (50%), Gaps = 22/207 (10%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++  PRAF   NFL+  E ++++ L +    K   +   +     S  RTSS T+L R  
Sbjct: 4   LSCAPRAFQVENFLTDVEADHIVGLVQ----KKNDMQRSSTNGHISETRTSSTTWLARHS 59

Query: 140 DRIIRGIEKRIADFTFI--PMEH---GEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQR 194
           D +I  I +R+AD   +   M H    E +Q++HY VGQ+Y AH+D+   + +      R
Sbjct: 60  DPVIDSIFRRVADTLKMDEAMLHRRINEDLQIVHYGVGQQYTAHHDFGYPKGD-PGSPSR 118

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
                MYL+DV  GG+T FP         RW N  +      L+V PK+G A++F+ + P
Sbjct: 119 SINFCMYLNDVPAGGQTSFP---------RWRNAET---NGALNVVPKKGTAMIFYMVNP 166

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWM 281
           D  LD  + H   PVI G K+ +  W+
Sbjct: 167 DGNLDDLTHHAALPVIEGEKFFSNLWI 193


>gi|148701598|gb|EDL33545.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_c [Mus
           musculus]
 gi|149052607|gb|EDM04424.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide (predicted),
           isoform CRA_d [Rattus norvegicus]
          Length = 189

 Score =  110 bits (275), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 60/192 (31%), Positives = 101/192 (52%), Gaps = 21/192 (10%)

Query: 93  LSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIAD 152
           +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++  + +R+  
Sbjct: 1   MSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQH 60

Query: 153 FTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMYLSDVEEGGE 210
            T + ++  E +QV +Y +G +Y+ H+D+    F++  K  G R+AT L Y+SDVE GG 
Sbjct: 61  ITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGA 120

Query: 211 TVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVI 270
           TVFP                     G ++ PK+G A+ ++++      D  + H  CPV+
Sbjct: 121 TVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVL 161

Query: 271 RGNKWSATKWMH 282
            G KW + KW H
Sbjct: 162 VGCKWVSNKWFH 173


>gi|270001038|gb|EEZ97485.1| hypothetical protein TcasGA2_TC011322 [Tribolium castaneum]
          Length = 509

 Score =  110 bits (275), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 61/200 (30%), Positives = 103/200 (51%), Gaps = 21/200 (10%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P   ++HN LS  E E +  LA+  +V +   +  + Q +    R S   +L+  + + 
Sbjct: 320 DPDILIFHNVLSDCEIETMKQLAQSRLVTAVFENPHSKQLELFPFRISKVAWLEDQEHQH 379

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           +  + +R+A  T + +   E  QV++Y +G  Y+ H+D+          G R+ T+L YL
Sbjct: 380 LAVVAQRVAHMTGLTLSTAEEFQVVNYGIGGHYEPHFDF--QSTVDPAIGSRIETVLFYL 437

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           SDVE+GG TVFP                      +SV P++G A++++++ P    D  +
Sbjct: 438 SDVEQGGATVFPEIQ-------------------VSVWPQKGSAVVWFNLHPSGDGDQRT 478

Query: 263 LHGGCPVIRGNKWSATKWMH 282
            H GCPV+ G+KW ATKW+H
Sbjct: 479 KHAGCPVLIGSKWIATKWIH 498


>gi|189241578|ref|XP_969458.2| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
           putative [Tribolium castaneum]
          Length = 515

 Score =  110 bits (275), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 61/200 (30%), Positives = 103/200 (51%), Gaps = 21/200 (10%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P   ++HN LS  E E +  LA+  +V +   +  + Q +    R S   +L+  + + 
Sbjct: 326 DPDILIFHNVLSDCEIETMKQLAQSRLVTAVFENPHSKQLELFPFRISKVAWLEDQEHQH 385

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           +  + +R+A  T + +   E  QV++Y +G  Y+ H+D+          G R+ T+L YL
Sbjct: 386 LAVVAQRVAHMTGLTLSTAEEFQVVNYGIGGHYEPHFDF--QSTVDPAIGSRIETVLFYL 443

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           SDVE+GG TVFP                      +SV P++G A++++++ P    D  +
Sbjct: 444 SDVEQGGATVFPEIQ-------------------VSVWPQKGSAVVWFNLHPSGDGDQRT 484

Query: 263 LHGGCPVIRGNKWSATKWMH 282
            H GCPV+ G+KW ATKW+H
Sbjct: 485 KHAGCPVLIGSKWIATKWIH 504


>gi|323454062|gb|EGB09933.1| hypothetical protein AURANDRAFT_14928, partial [Aureococcus
           anophagefferens]
          Length = 182

 Score =  110 bits (275), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 71/203 (34%), Positives = 101/203 (49%), Gaps = 30/203 (14%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P  +   NFL++ EC+ LID AK +M  + VV    G+   SR  TSS  +L R     +
Sbjct: 1   PPIYTVQNFLTEEECDALIDSAKDHMTPAPVVGPGNGEVSVSR--TSSTCYLARED---L 55

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN-----TKNGGQRMATL 198
             +  ++   T  P+EH E  QV  Y  G+ Y  HYD F           +NGGQR+AT+
Sbjct: 56  PSVCTKVCALTGKPLEHLELPQVGRYRGGEFYKPHYDAFDTSSADGRRFAQNGGQRVATV 115

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L+YL+DVE GGET F                    K G+ +KP++G+AL+F+    D  L
Sbjct: 116 LVYLNDVERGGETSF-------------------SKLGVRIKPRKGNALIFFPATLDGVL 156

Query: 259 DPSSLHGGCPVIRGNKWSATKWM 281
           D + LH   P +   KW +  W+
Sbjct: 157 DQNYLHAAEPAV-DPKWVSQIWI 178


>gi|449467908|ref|XP_004151664.1| PREDICTED: uncharacterized protein LOC101218099, partial [Cucumis
           sativus]
          Length = 122

 Score =  110 bits (275), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 61/127 (48%), Positives = 81/127 (63%), Gaps = 13/127 (10%)

Query: 3   KLRHSRLQAKKWSTLTLVLSMLFMLTIVLLMLLAMGIFYIPIGDDDSPP----NDLTSFR 58
           K ++ +LQ +KWST  L         +++ ++LA+G F +      SPP    +  +S R
Sbjct: 5   KGKYIKLQGRKWSTFQLS-------KMIMALVLALGFFMLIALRFLSPPETSHHRFSSVR 57

Query: 59  RRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSK 118
             AF   S    ++G+QW E ++WEPRAFVYHNFLSK EC YLI LAKP+M KSTVVDSK
Sbjct: 58  HTAF--LSDGLGKRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSK 115

Query: 119 TGQSKDS 125
           TG+S DS
Sbjct: 116 TGESVDS 122


>gi|195391766|ref|XP_002054531.1| GJ24504 [Drosophila virilis]
 gi|194152617|gb|EDW68051.1| GJ24504 [Drosophila virilis]
          Length = 545

 Score =  110 bits (274), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 67/211 (31%), Positives = 110/211 (52%), Gaps = 27/211 (12%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
            E ++ +P   +YH+ + ++E + L  L K  + ++TV  +    S  S  RTS  TF+ 
Sbjct: 323 VEELSHDPLLVLYHDVIYQSEIDTLAKLTKNKIHRATVTGNNA--SVVSNARTSQFTFIP 380

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-DEFNTKN----- 190
           + + +++R I++R+AD T + M   E  Q+ +Y +G  Y  H D+F  + F TK      
Sbjct: 381 KTRHKVLRTIDQRVADMTDLNMVFAEDHQLANYGIGGHYAQHMDWFSPNAFETKQVANSE 440

Query: 191 GGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFW 250
            G R+AT+L YL+DVE+GG T FP                   KQ L  KPK+  A  ++
Sbjct: 441 MGNRIATVLFYLTDVEQGGGTAFPVL-----------------KQLL--KPKKYAAAFWY 481

Query: 251 SMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           ++      D  ++HG CP+I G+KW   +W+
Sbjct: 482 NLHASGAGDVRTMHGACPIIVGSKWVLNRWI 512


>gi|196011900|ref|XP_002115813.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
 gi|190581589|gb|EDV21665.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
          Length = 581

 Score =  110 bits (274), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 65/215 (30%), Positives = 108/215 (50%), Gaps = 32/215 (14%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVV---DSKTGQSKDSRVRTSSGT 133
            E+++ +P   +YHN L+ +E   L  LA P + ++ VV   D + G+  ++  R S   
Sbjct: 345 VEVLSLQPYIVIYHNLLTNSEVVLLKTLASPLLKRAVVVGKPDKEYGE--ETTYRISKTA 402

Query: 134 FLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-------LDEF 186
           +L +     ++ I   I D   +  E  E +Q+ +Y +G  Y+ H D+        L E+
Sbjct: 403 WLDKEDHPAVKRITTLIGDIIGLTSETAEPLQIANYGIGGHYEPHLDFIESEDKEALSEY 462

Query: 187 NTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDA 246
            T   G R+AT+L+YLS+VE GG TVFP A                   G+ V+P++G A
Sbjct: 463 -TSRIGNRIATVLIYLSNVEAGGATVFPKA-------------------GVRVEPRQGSA 502

Query: 247 LLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
             +++M  +   +  S+H  CPV+ G+KW+A  W 
Sbjct: 503 AFWYNMHRNGEGNKLSVHAACPVLIGSKWAANLWF 537


>gi|426369750|ref|XP_004051847.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Gorilla
           gorilla gorilla]
          Length = 517

 Score =  110 bits (274), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 66/210 (31%), Positives = 106/210 (50%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP   +YH+F+S +E + + +LA+P++ +S V   +     + R+  S   +LK 
Sbjct: 313 EVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVASGEKQLQVEYRI--SKSAWLKD 370

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D  +  +  RIA  T + +   + E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 371 TVDPKLVALNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 430

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F  AN                   LSV   R  AL +W++
Sbjct: 431 NRVATFMIYLSSVEAGGATAFIYAN-------------------LSVPVVRNAALFWWNL 471

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH GCPV+ G+KW A KW+H
Sbjct: 472 HRSGEGDSDTLHAGCPVLVGDKWVANKWIH 501


>gi|194765168|ref|XP_001964699.1| GF22909 [Drosophila ananassae]
 gi|190614971|gb|EDV30495.1| GF22909 [Drosophila ananassae]
          Length = 525

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 62/211 (29%), Positives = 106/211 (50%), Gaps = 27/211 (12%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
            E ++ +P   +YH+ + ++E + +  L    + ++T+  + T +S  S VRTS  TFL 
Sbjct: 295 AEELSRDPLLILYHDVIYQSEIDTIRKLTTNKLKRATI--TSTNESVVSNVRTSQFTFLP 352

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE------FNTKN 190
             +D+++  I++R+AD T   M + E  Q  +Y +G  Y  H D+F          ++  
Sbjct: 353 VTEDKVLATIDRRVADMTNFNMRYAEDHQFANYGIGGHYGQHMDWFYQPSFDAGLVSSPE 412

Query: 191 GGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFW 250
            G R+AT+L YLSDV +GG T FP                      + +KPK+  A  ++
Sbjct: 413 MGNRIATVLFYLSDVTQGGGTAFPHLR-------------------VLLKPKKYAAAFWY 453

Query: 251 SMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           ++      DP + HG CP+I G+KW   +W+
Sbjct: 454 NLHASGVGDPRTQHGACPIISGSKWVQNRWI 484


>gi|348555277|ref|XP_003463450.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cavia porcellus]
          Length = 584

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 67/211 (31%), Positives = 107/211 (50%), Gaps = 28/211 (13%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP   +YH+F+S  E + + +LA+P++ +S V  +  G+      R S   +LK 
Sbjct: 380 EVIHLEPYVALYHDFVSDPEAQKIRELAEPWLQRSVV--ASGGKQLQVEYRISKSAWLKD 437

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNG 191
             D ++  +  RIA  T + +   + E +QV++Y +G  Y+ H+D+        F  K+G
Sbjct: 438 TVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLFRMKSG 497

Query: 192 GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWS 251
             R+AT ++YLS VE GG T F  AN                    SV   +  AL +W+
Sbjct: 498 -NRVATFMIYLSSVEAGGATAFIYAN-------------------FSVPVVKNAALFWWN 537

Query: 252 MRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           +      D  +LH GCPV+ G+KW A KW+H
Sbjct: 538 LHRSGEGDGDTLHAGCPVLVGDKWVANKWIH 568


>gi|195159150|ref|XP_002020445.1| GL13509 [Drosophila persimilis]
 gi|194117214|gb|EDW39257.1| GL13509 [Drosophila persimilis]
          Length = 554

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 71/212 (33%), Positives = 104/212 (49%), Gaps = 26/212 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E ++ +P   VYHN LS AE   +  + +P + +S V D K  +   S+ RT+ G +L  
Sbjct: 344 EELSLDPYIVVYHNVLSDAEIAEVERVTEPLLKRSVVFDGKENKMSTSKKRTALGAWLPD 403

Query: 138 GQ-----DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNTKNG 191
                    +I+ I +RI + T + M   + +Q++ Y  G  YD H+DYF      TK  
Sbjct: 404 DNMDVSGRAVIQRILRRIHELTGLIMNDRQDMQLIKYGYGGHYDIHFDYFNTSSPITKAR 463

Query: 192 GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWS 251
           G RMAT+L YL+DV+ GG T F                       L V  +RG  L +++
Sbjct: 464 GDRMATVLFYLNDVKHGGSTAFTDLQ-------------------LKVPSERGKVLFWYN 504

Query: 252 MRPDA-TLDPSSLHGGCPVIRGNKWSATKWMH 282
           MR +   LD  +LHG CPVI G K   + W+H
Sbjct: 505 MRGETHDLDSRTLHGACPVIDGTKSILSCWIH 536


>gi|440899661|gb|ELR50930.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Bos grunniens mutus]
          Length = 478

 Score =  109 bits (273), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 64/210 (30%), Positives = 106/210 (50%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP   +YH+F+S AE + +  LA+P++ +S V   +     + R+  S   +LK 
Sbjct: 274 EVIHLEPYVVLYHDFVSDAEAQTIRGLAEPWLQRSVVASGEKQLPVEYRI--SKSAWLKD 331

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++  ++ RIA  T + ++  + E +QV++Y +G  Y+ H+D+     +     N G
Sbjct: 332 TVDPVLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMNSG 391

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F   N                    SV   +  AL +W++
Sbjct: 392 NRVATFMIYLSSVEAGGATAFIYGN-------------------FSVPVVKNAALFWWNL 432

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH  CPV+ G+KW A KW+H
Sbjct: 433 HRSGEGDGDTLHAACPVLVGDKWVANKWIH 462


>gi|426245942|ref|XP_004016760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Ovis
           aries]
          Length = 514

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 64/210 (30%), Positives = 106/210 (50%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP   +YH+F+S AE + +  LA+P++ +S V   +     + R+  S   +LK 
Sbjct: 310 EVIHLEPYVVLYHDFVSDAEAQKIRGLAEPWLQRSVVASGEKQLPVEYRI--SKSAWLKD 367

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++  ++ RIA  T + ++  + E +QV++Y +G  Y+ H+D+     +     N G
Sbjct: 368 TVDPVLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMNSG 427

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F   N                    SV   +  AL +W++
Sbjct: 428 NRVATFMIYLSSVEAGGATAFIYGN-------------------FSVPVVKNAALFWWNL 468

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH  CPV+ G+KW A KW+H
Sbjct: 469 HRSGEGDGDTLHAACPVLVGDKWVANKWIH 498


>gi|410632646|ref|ZP_11343301.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
 gi|410147883|dbj|GAC20168.1| prolyl 4-hydroxylase [Glaciecola arctica BSs20135]
          Length = 480

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 68/197 (34%), Positives = 102/197 (51%), Gaps = 26/197 (13%)

Query: 91  NFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRI 150
           +FL   EC+ LI+L +     ST+    T ++ D + RTSS   L   QD +IR I+ +I
Sbjct: 103 DFLLPQECQALIELIEQAKQPSTI----TSENPDQQFRTSSTCHLGNMQDPVIRKIDLQI 158

Query: 151 ADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD-EFNTKNG--GQRMATLLMYLSDVEE 207
             +  I   + E IQ  HY++GQ++  H DYF   E     G  GQR  T ++YL++VE+
Sbjct: 159 CQYLGIDPSYSEVIQGQHYQLGQQFKPHTDYFEPYELAHYGGIQGQRTYTFMIYLNEVEQ 218

Query: 208 GGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGC 267
           GG+TVFP     F                   K K+G A+++ ++ PD +++  +LH G 
Sbjct: 219 GGDTVFPELAIGF-------------------KAKKGMAVIWNNINPDGSVNYQTLHQGM 259

Query: 268 PVIRGNKWSATKWMHVH 284
           PV +G K   TKW   H
Sbjct: 260 PVQKGEKLIITKWFRQH 276


>gi|296217074|ref|XP_002754870.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Callithrix
           jacchus]
          Length = 544

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 66/210 (31%), Positives = 106/210 (50%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           EI+  EP   +YH+F+S +E + + + A+P++ +S V   +     + R+  S   +LK 
Sbjct: 340 EILHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVASGEKQLQVEYRI--SKSAWLKD 397

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++  +  RIA  T + +   + E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 398 TVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 457

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F  AN                   LSV   +  AL +W++
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYAN-------------------LSVPVVKNAALFWWNL 498

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH GCPV+ GNKW A KW+H
Sbjct: 499 HRSGEGDSDTLHAGCPVLVGNKWVANKWIH 528


>gi|48675383|ref|NP_001001598.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
 gi|75053350|sp|Q75UG4.1|P4HA3_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|47115494|dbj|BAD18888.1| Collagen prolyl 4-hydroxylase alpha III subunit [Bos taurus]
 gi|296479828|tpg|DAA21943.1| TPA: prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
          Length = 544

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 64/210 (30%), Positives = 107/210 (50%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP   +YH+F+S AE + +  LA+P++ +S V   +     + R+  S+  +LK 
Sbjct: 340 EVIHLEPYVVLYHDFVSDAEAQTIRGLAEPWLQRSVVASGEKQLPVEYRISKSA--WLKD 397

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++  ++ RIA  T + ++  + E +QV++Y +G  Y+ H+D+     +     N G
Sbjct: 398 TVDPVLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMNSG 457

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F   N                    SV   +  AL +W++
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYGN-------------------FSVPVVKNAALFWWNL 498

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH  CPV+ G+KW A KW+H
Sbjct: 499 HRSGEGDGDTLHAACPVLVGDKWVANKWIH 528


>gi|194213450|ref|XP_001495951.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Equus
           caballus]
          Length = 548

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 64/210 (30%), Positives = 108/210 (51%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP   +YH+F+S +E + +  LA+P++ +S V   +     + R+  S+  +LK 
Sbjct: 344 EVIHLEPYVVLYHDFVSDSEAQKIRGLAEPWLQRSVVASGEKQLPVEYRISKSA--WLKD 401

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++  ++ RIA  T + ++  + E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 402 TVDPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPTSPLYRMKSG 461

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F  AN                    SV   +  AL +W++
Sbjct: 462 NRVATFMIYLSSVEAGGATAFIYAN-------------------FSVPVVKNAALFWWNL 502

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH GCPV+ G+KW A KW+H
Sbjct: 503 HRSGEGDSDTLHAGCPVLVGDKWVANKWIH 532


>gi|195172672|ref|XP_002027120.1| GL20071 [Drosophila persimilis]
 gi|194112933|gb|EDW34976.1| GL20071 [Drosophila persimilis]
          Length = 455

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 70/213 (32%), Positives = 109/213 (51%), Gaps = 24/213 (11%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
            E ++ +P   +YH+ +  +E E L D A P M +S V       SK++  RTS   F  
Sbjct: 252 VEPLSQDPYIAMYHDVIYDSEIEELKDNAFPDMERSKVYTYSDEDSKNTG-RTSMSAFQT 310

Query: 137 RGQDRIIRGIEKRIADFT-FIPMEHG--EGIQVLHYEVGQKYDAHYDYFLDEFNTK-NGG 192
             Q + +  + +R+   T F  +  G  + + VL+Y    +Y  H DYF   ++     G
Sbjct: 311 DHQYKAVTKVNRRVMHMTGFEVLADGSSDELLVLNYATAAQYLTHSDYFGPAYSEYIQRG 370

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT+L YL+DVE+GG+TVFP                   + G+   P +G A++F++M
Sbjct: 371 DRIATVLFYLNDVEQGGKTVFP-------------------RLGIFRSPMKGSAVVFYNM 411

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
                 DP + HGGCPV+ G KW+ATKW++  E
Sbjct: 412 NSSLQGDPRTEHGGCPVLVGTKWAATKWIYSAE 444


>gi|215697788|dbj|BAG91981.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 225

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 62/190 (32%), Positives = 103/190 (54%), Gaps = 19/190 (10%)

Query: 97  ECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFI 156
           EC++L+ + +  M  S         S  + +           +D ++  IE RI+ ++F+
Sbjct: 2   ECDHLVSMGRGNMESSLAFTDGDRNSSYNNI-----------EDIVVSKIEDRISLWSFL 50

Query: 157 PMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAA 216
           P E+GE IQVL Y V +          +E  + +G  R+AT+LMYLSDV++GGETVFP +
Sbjct: 51  PKENGESIQVLKYGVNRS-----GSIKEEPKSSSGAHRLATILMYLSDVKQGGETVFPRS 105

Query: 217 NANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWS 276
                  +     S+C   G +V+P +G+A+L +++RPD   D  S +  CPV+ G KW 
Sbjct: 106 EMKDAQAKE-GAPSQC--SGYAVRPAKGNAILLFNLRPDGETDKDSQYEECPVLEGEKWL 162

Query: 277 ATKWMHVHEY 286
           A K +++ ++
Sbjct: 163 AIKHINLRKF 172


>gi|33589818|ref|NP_878907.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Homo sapiens]
 gi|114639354|ref|XP_001174896.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan troglodytes]
 gi|397487266|ref|XP_003814725.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan paniscus]
 gi|74738714|sp|Q7Z4N8.1|P4HA3_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|33188232|gb|AAP97874.1| prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
 gi|36962719|gb|AAQ87603.1| collagen prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
 gi|37182165|gb|AAQ88885.1| GPGA711 [Homo sapiens]
 gi|109658570|gb|AAI17334.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
 gi|119595341|gb|EAW74935.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide III, isoform CRA_b
           [Homo sapiens]
 gi|410219716|gb|JAA07077.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
 gi|410248278|gb|JAA12106.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
 gi|410336087|gb|JAA36990.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
          Length = 544

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 66/210 (31%), Positives = 106/210 (50%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP   +YH+F+S +E + + +LA+P++ +S V   +     + R+  S   +LK 
Sbjct: 340 EVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVASGEKQLQVEYRI--SKSAWLKD 397

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D  +  +  RIA  T + +   + E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 398 TVDPKLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 457

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F  AN                   LSV   R  AL +W++
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYAN-------------------LSVPVVRNAALFWWNL 498

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH GCPV+ G+KW A KW+H
Sbjct: 499 HRSGEGDSDTLHAGCPVLVGDKWVANKWIH 528


>gi|59809017|gb|AAH89446.1| P4HA3 protein [Homo sapiens]
          Length = 528

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 66/210 (31%), Positives = 106/210 (50%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP   +YH+F+S +E + + +LA+P++ +S V   +     + R+  S   +LK 
Sbjct: 324 EVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVASGEKQLQVEYRI--SKSAWLKD 381

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D  +  +  RIA  T + +   + E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 382 TVDPKLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 441

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F  AN                   LSV   R  AL +W++
Sbjct: 442 NRVATFMIYLSSVEAGGATAFIYAN-------------------LSVPVVRNAALFWWNL 482

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH GCPV+ G+KW A KW+H
Sbjct: 483 HRSGEGDSDTLHAGCPVLVGDKWVANKWIH 512


>gi|339236271|ref|XP_003379690.1| prolyl 4-hydroxylase subunit alpha-1 [Trichinella spiralis]
 gi|316977627|gb|EFV60702.1| prolyl 4-hydroxylase subunit alpha-1 [Trichinella spiralis]
          Length = 558

 Score =  109 bits (272), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 71/265 (26%), Positives = 118/265 (44%), Gaps = 51/265 (19%)

Query: 43  PIGDDDSPPNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLI 102
           PI D D         R R + K + I         E++ W+P+   +   +S  E   + 
Sbjct: 302 PISDKDRAKLYCYYKRNRPYLKLAPIK-------VEVMHWKPKIVYFRGVISDEEIAVIK 354

Query: 103 DLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGE 162
            LA P + ++TV ++ TGQ + +  R S   +LK  +  +++ I  RI   T + ME  E
Sbjct: 355 QLASPLLKRATVHNADTGQLETASYRISKSAWLKDTEHEVVKRISDRIDMMTDLTMETAE 414

Query: 163 GIQVLHYEVGQKYDAHYD----------------------YFLDE---FNTKNGGQRMAT 197
            +Q+ +Y +G  YD H+D                      ++ ++   F + N G R+AT
Sbjct: 415 LLQIANYGIGGHYDPHFDMSTRGESDPYEEGTGNRIATVLFYTNDPYSFESLNAGNRIAT 474

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           +L Y+S  E GG TVF +                     ++V+P + DA  ++++     
Sbjct: 475 VLFYISQPEAGGGTVFTS-------------------HKITVEPSKYDAAFWFNVLQGGE 515

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMH 282
            D S+ H  CPV+ G KW A KW+H
Sbjct: 516 PDMSTRHAACPVLAGTKWVANKWIH 540


>gi|412986386|emb|CCO14812.1| predicted protein [Bathycoccus prasinos]
          Length = 337

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 77/228 (33%), Positives = 114/228 (50%), Gaps = 33/228 (14%)

Query: 69  AEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVR 128
           A +    W E V+WEPR FVYHNFLS+ E +YL D  K           K  ++ D    
Sbjct: 94  ASDGNTPWIEHVSWEPRVFVYHNFLSEKEAKYLRDAHK-----------KASKAMDDE-- 140

Query: 129 TSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYD---AHYDYFLDE 185
            S  T  KRGQD I+  IE+R++ F  +P  HGE + +   + G        ++D   D+
Sbjct: 141 -SMKTTFKRGQDPIVNVIEQRLSAFVMLPETHGENMFIEKIKKGYPKRLELLNFDDEKDK 199

Query: 186 FNTKNGGQRMATLLMYLSDVEE--GGETVFP--------AANANFTSVRWWNELSEC-GK 234
            + KNGGQR AT  ++L+ + E  GGE VFP         +N ++TS       S C GK
Sbjct: 200 EDLKNGGQRFATTALFLNTISEGKGGELVFPLGTERLYDDSNDSYTSTP-----SACAGK 254

Query: 235 QGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
             L+V+P+ GDA++++S   +   D +S    C  +   +   T + H
Sbjct: 255 YTLAVEPRVGDAVVWFSTHHNGNDDLNSASMRCDAVADGEEMLTAYKH 302


>gi|219113023|ref|XP_002186095.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
 gi|209582945|gb|ACI65565.1| predicted protein [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 508

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 70/218 (32%), Positives = 108/218 (49%), Gaps = 33/218 (15%)

Query: 79  IVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDS--KTGQSKDSRVRTSSGTFLK 136
           +++  PR F   +FLS  E E+L+++A    +K + + +   +  + +   RTS+  ++ 
Sbjct: 283 VLSCVPRVFEVKDFLSDMEVEHLLNIASKRKLKRSTMHAGGSSEATTNDDTRTSTNDWIP 342

Query: 137 RGQDRIIRGIEKRIAD-------------------FTFIPMEHGEGIQVLHYEVGQKYDA 177
           R QD I   I +R AD                   FT   +   E +Q+++Y+VGQ+Y  
Sbjct: 343 RHQDLITDTIYRRAADLLQMDEALLRWRRKSEIPEFTESHISISERLQLVNYQVGQQYTP 402

Query: 178 HYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGL 237
           H+D+ +          R ATLL YL+D  +GGET FP         RW +   E G   L
Sbjct: 403 HHDFTMPGL-VNMQPSRFATLLFYLNDDMDGGETAFP---------RWLHADEEGG--SL 450

Query: 238 SVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKW 275
            VKP++G A+LF+++ PD   D  S H   PV RG KW
Sbjct: 451 KVKPEKGKAILFYNLLPDGNYDERSEHAALPVRRGEKW 488


>gi|443705944|gb|ELU02240.1| hypothetical protein CAPTEDRAFT_227850 [Capitella teleta]
          Length = 475

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 64/212 (30%), Positives = 102/212 (48%), Gaps = 30/212 (14%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
           TE++   P  +++H+F+S +E + L D+A+P    S V+D   G+S     R SS  F+ 
Sbjct: 177 TELLHANPEIYLFHDFISDSEIQRLKDMAEPQFQSSAVLDDTGGESFFDVSRLSSTAFVN 236

Query: 137 RGQDRIIRGIEKRIADFTFIPME------HGEGIQVLHYEVGQKYDAHYDYFLDEFN--- 187
              D ++  + +R++  T +  E        E +QVL Y  G  Y  HYD    E +   
Sbjct: 237 DSND-LVASLNRRVSKLTGLQTEVLDSFSESESLQVLRYGPGGLYTPHYDTLGSEADLPP 295

Query: 188 -TKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDA 246
             ++ G R+AT ++YL     GG TVFP                      +S+  ++G A
Sbjct: 296 YIQHTGDRIATFILYLDIATAGGATVFPLL-------------------PMSIPIQKGAA 336

Query: 247 LLFWSMRPDATLDPSSLHGGCPVIRGNKWSAT 278
             ++++ PD +LD  +LH  CPVIRG KW   
Sbjct: 337 AFWFNLHPDGSLDRRTLHAACPVIRGTKWECV 368


>gi|432891690|ref|XP_004075614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oryzias
           latipes]
          Length = 517

 Score =  108 bits (271), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 66/215 (30%), Positives = 110/215 (51%), Gaps = 30/215 (13%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+++ +P   +YHNF++  E E +   A+P + +S V   +   + + R+  S   +LK 
Sbjct: 313 EVLSLQPYVVIYHNFITDREAEEIKGFAQPALRRSVVASGENQATVEYRI--SKSAWLKG 370

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNG 191
            +  I+  +++RI+  T + +   + E +QV++Y +G  Y+ H+D+        F  K G
Sbjct: 371 SESCIVGKLDQRISMLTGLNVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPVFKLKTG 430

Query: 192 GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWS 251
             R+AT ++YLS VE GG T F  AN                    SV   +  A+ +W+
Sbjct: 431 -NRVATFMIYLSSVEAGGSTAFIYAN-------------------FSVPVLKKAAIFWWN 470

Query: 252 MRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           +  +   D  +LH GCPV+ G+KW A KW  VHEY
Sbjct: 471 LHRNGRGDAETLHAGCPVLIGDKWVANKW--VHEY 503


>gi|390989473|ref|ZP_10259770.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
           pv. punicae str. LMG 859]
 gi|372555742|emb|CCF66745.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
           pv. punicae str. LMG 859]
          Length = 152

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 65/160 (40%), Positives = 85/160 (53%), Gaps = 26/160 (16%)

Query: 127 VRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDE 185
            RTS    L+ GQD + + IE RIA     P++HGEG+QVL Y  G +Y  HYDYF  D 
Sbjct: 5   ARTSDSMCLRVGQDALCQRIEARIARLFDWPVDHGEGLQVLRYATGAEYRPHYDYFDPDA 64

Query: 186 FNT----KNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKP 241
             T    + GGQR+A+L+MYL+  E GG T FP A+ +  +V                  
Sbjct: 65  AGTPILLQAGGQRVASLVMYLNTPERGGATRFPDAHLDVAAV------------------ 106

Query: 242 KRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
            +G+A+ F   RP       SLH G PV+ G KW ATKW+
Sbjct: 107 -KGNAVFFSYDRPHPMT--RSLHAGAPVLTGEKWVATKWL 143


>gi|299115443|emb|CBN75608.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 548

 Score =  108 bits (270), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 76/237 (32%), Positives = 115/237 (48%), Gaps = 37/237 (15%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLA-----KPYMVKSTVVDSKTGQSKDSRVRTSSG 132
           E ++  PR F  +NF+   E + +I+ A     + Y +K +   +K G++  S+ RTS  
Sbjct: 208 ETLSHSPRVFSLYNFMDMEEADSIIEDALGMTQEAYRLKRSSTGTK-GKAI-SKTRTSDN 265

Query: 133 TFLKRGQDRIIRGIEKRIADFTFIPMEH---GEGIQVLHYEVGQKYDAHYDYFLD----E 185
            F+        + +++RI     I   H    +G+QVL Y   Q Y AH+DY       +
Sbjct: 266 AFVT--HTNTAQALKRRIFQLLGIEEYHETWADGLQVLRYNESQAYVAHFDYLESAEGHD 323

Query: 186 FNTKN-GGQRMATLLMYLSDVEEGGETVFPAA--------------------NANFTSVR 224
           F ++  G  R AT+++Y +DV EGGETVF  A                    N +     
Sbjct: 324 FKSEGLGTNRFATVVLYFNDVREGGETVFTHAPGIDHHLVPDTKVPVREVLENLDLPRSG 383

Query: 225 WWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           W  +L    ++ + V PKRG A+LF++  PD   D SS HG CPVI G KW+A  W+
Sbjct: 384 WEEKLLLQCRRHMVVAPKRGQAVLFYNQHPDGRKDLSSEHGACPVIDGQKWAANLWV 440


>gi|74216495|dbj|BAE25162.1| unnamed protein product [Mus musculus]
          Length = 187

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 59/188 (31%), Positives = 99/188 (52%), Gaps = 21/188 (11%)

Query: 97  ECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFI 156
           E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++  + +R+   T +
Sbjct: 3   EIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVVARVNRRMQHITGL 62

Query: 157 PMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT--KNGGQRMATLLMYLSDVEEGGETVFP 214
            ++  E +QV +Y +G +Y+ H+D+    F++  K  G R+AT L Y+SDVE GG TVFP
Sbjct: 63  TVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATVFP 122

Query: 215 AANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNK 274
                                G ++ PK+G A+ ++++      D  + H  CPV+ G K
Sbjct: 123 -------------------DLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCK 163

Query: 275 WSATKWMH 282
           W + KW H
Sbjct: 164 WVSNKWFH 171


>gi|390459659|ref|XP_002806656.2| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-2 [Callithrix jacchus]
          Length = 579

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 65/223 (29%), Positives = 105/223 (47%), Gaps = 43/223 (19%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 360 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 419

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDY------------------FL-- 183
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+                  FL  
Sbjct: 420 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 479

Query: 184 ----DEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSV 239
               D F     G R+AT L Y+SDVE GG TVFP                     G ++
Sbjct: 480 NDERDAFKHLGTGNRVATFLNYMSDVEAGGATVFP-------------------DLGAAI 520

Query: 240 KPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
            PK+G A+ ++++      D  + H  CPV+ G KW + KW H
Sbjct: 521 WPKKGTAVFWYNLLRSGXGDYRTRHAACPVLVGCKWVSNKWFH 563


>gi|443712762|gb|ELU05926.1| hypothetical protein CAPTEDRAFT_153364 [Capitella teleta]
          Length = 491

 Score =  108 bits (270), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 68/217 (31%), Positives = 106/217 (48%), Gaps = 34/217 (15%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+   +P   ++++ +S AE +++I  A P M +  V +S   QS D R+  S   +L  
Sbjct: 276 EVHNVDPHVAIFYDVISDAEADHIIRHAFPGMFRGLVGNSTLRQSSDQRI--SKVGWLFD 333

Query: 138 GQDRIIRGIEKRIADFT-----FIPMEHG-EGIQVLHYEVGQKYDAHYDYFLDEFNTKN- 190
             D +I+ +  RI D T     + P+    E +QV++Y +G +Y+ H D++ D    KN 
Sbjct: 334 NVDTLIKKLSARIGDVTGLNTVYTPVRSPVEAMQVVNYGIGGQYEPHLDFYEDPEMLKNV 393

Query: 191 ------GGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRG 244
                  G R++T L YLS V  GG TVFP  N                   + V P + 
Sbjct: 394 NPSLQDTGDRISTFLFYLSRVHLGGATVFPKLN-------------------VRVPPVKN 434

Query: 245 DALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
            A  +++ RP+   D  +LH GCPV+ G KW A KW+
Sbjct: 435 GAAFWYNARPNGEHDKRTLHAGCPVVLGEKWVANKWI 471


>gi|167045848|gb|ABZ10515.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Callithrix jacchus]
          Length = 555

 Score =  108 bits (269), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 65/223 (29%), Positives = 105/223 (47%), Gaps = 43/223 (19%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDY------------------FL-- 183
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+                  FL  
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455

Query: 184 ----DEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSV 239
               D F     G R+AT L Y+SDVE GG TVFP                     G ++
Sbjct: 456 NDERDAFKHLGTGNRVATFLNYMSDVEAGGATVFP-------------------DLGAAI 496

Query: 240 KPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
            PK+G A+ ++++      D  + H  CPV+ G KW + KW H
Sbjct: 497 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 539


>gi|325920649|ref|ZP_08182559.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
           [Xanthomonas gardneri ATCC 19865]
 gi|325548839|gb|EGD19783.1| 2OG-Fe(II) oxygenase superfamily enzyme,Sel1 repeat protein
           [Xanthomonas gardneri ATCC 19865]
          Length = 422

 Score =  108 bits (269), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 67/200 (33%), Positives = 95/200 (47%), Gaps = 23/200 (11%)

Query: 92  FLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRG-QDRIIRGIEKRI 150
            LS  EC  L+ LA+P++  S VVD     +  + +RTS G  L    +D   R  + R+
Sbjct: 240 VLSADECRLLMLLARPHLRASQVVDPNDASTHRTPIRTSRGATLDPILEDFAARAAQARV 299

Query: 151 ADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD---EFNTKNGGQRMATLLMYLSDVEE 207
           A    +P+ H E + VL Y  G+ Y AH DY        +    G R+ T  +YL+DV+ 
Sbjct: 300 AACAQLPLTHAEALSVLCYAPGEHYRAHRDYLPPGTIAADRPGAGNRLRTACVYLNDVDA 359

Query: 208 GGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGC 267
           GGET FP A                   G+ V+P+ G  + F +++ D   DP SLH G 
Sbjct: 360 GGETEFPVA-------------------GIRVQPRAGSVVCFDNLQADGCPDPDSLHAGL 400

Query: 268 PVIRGNKWSATKWMHVHEYK 287
           PV  G+KW  T W     Y+
Sbjct: 401 PVTTGSKWLGTLWFRQQRYR 420


>gi|291404186|ref|XP_002718473.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 3
           [Oryctolagus cuniculus]
          Length = 516

 Score =  108 bits (269), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 67/200 (33%), Positives = 98/200 (49%), Gaps = 33/200 (16%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           +  I  RI D T        G+ V   E  QK +       D F     G R+AT L Y+
Sbjct: 394 VSRINMRIQDLT--------GLDVSTAEELQKDEP------DAFKELGTGNRIATWLFYM 439

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           SDV  GG TVFP                     G SV PK+G A+ ++++      D S+
Sbjct: 440 SDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEGDYST 480

Query: 263 LHGGCPVIRGNKWSATKWMH 282
            H  CPV+ GNKW + KW+H
Sbjct: 481 RHAACPVLVGNKWVSNKWLH 500


>gi|170649696|gb|ACB21278.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Callicebus moloch]
          Length = 555

 Score =  108 bits (269), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 65/223 (29%), Positives = 105/223 (47%), Gaps = 43/223 (19%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDY------------------FL-- 183
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+                  FL  
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455

Query: 184 ----DEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSV 239
               D F     G R+AT L Y+SDVE GG TVFP                     G ++
Sbjct: 456 NDERDAFKHLGTGNRVATFLNYMSDVEAGGATVFP-------------------DLGAAI 496

Query: 240 KPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
            PK+G A+ ++++      D  + H  CPV+ G KW + KW H
Sbjct: 497 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 539


>gi|194905376|ref|XP_001981185.1| GG11927 [Drosophila erecta]
 gi|190655823|gb|EDV53055.1| GG11927 [Drosophila erecta]
          Length = 539

 Score =  108 bits (269), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 65/211 (30%), Positives = 110/211 (52%), Gaps = 30/211 (14%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD----SRVRTSSG 132
           TEI++ +P   + H+ +S  E   +   +K     ST+  S+T  + +    S+ RTS  
Sbjct: 324 TEILSIDPFVVLLHDMVSPKEAALIRSSSK-----STIFPSETVNAANDFVVSKFRTSKS 378

Query: 133 TFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGG 192
            +L R  +     + +R+AD T + ++H E  QV++Y +G  +++H+D  L++ N   GG
Sbjct: 379 VWLDRDANEATVKLTQRLADATGLDVKHSEHFQVINYGIGGVFESHFDTTLEDTNRFVGG 438

Query: 193 --QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFW 250
              R+AT L YL+DV +GG T FP  N                   ++V P+ G AL ++
Sbjct: 439 FIDRIATTLFYLNDVPQGGATHFPGLN-------------------ITVFPRLGAALFWY 479

Query: 251 SMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           ++     L   ++H GCPVI G+KW  +KW+
Sbjct: 480 NLDTQGMLQVRTMHTGCPVIVGSKWVVSKWI 510


>gi|395820528|ref|XP_003783616.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Otolemur
           garnettii]
          Length = 516

 Score =  108 bits (269), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 67/200 (33%), Positives = 98/200 (49%), Gaps = 33/200 (16%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           +  I  RI D T        G+ V   E  QK +       D F     G R+AT L Y+
Sbjct: 394 VSRINMRIQDLT--------GLDVSTAEELQKDEP------DAFKELGTGNRIATWLFYM 439

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           SDV  GG TVFP                     G SV PK+G A+ ++++      D S+
Sbjct: 440 SDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEGDYST 480

Query: 263 LHGGCPVIRGNKWSATKWMH 282
            H  CPV+ GNKW + KW+H
Sbjct: 481 RHAACPVLVGNKWVSNKWLH 500


>gi|217272851|ref|NP_001136068.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Homo
           sapiens]
 gi|114631189|ref|XP_001140871.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 10 [Pan
           troglodytes]
          Length = 516

 Score =  108 bits (269), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 67/200 (33%), Positives = 98/200 (49%), Gaps = 33/200 (16%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           +  I  RI D T        G+ V   E  QK +       D F     G R+AT L Y+
Sbjct: 394 VSRINMRIQDLT--------GLDVSTAEELQKDEP------DAFKELGTGNRIATWLFYM 439

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           SDV  GG TVFP                     G SV PK+G A+ ++++      D S+
Sbjct: 440 SDVSAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEGDYST 480

Query: 263 LHGGCPVIRGNKWSATKWMH 282
            H  CPV+ GNKW + KW+H
Sbjct: 481 RHAACPVLVGNKWVSNKWLH 500


>gi|449485593|ref|XP_004175686.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-3 [Taeniopygia guttata]
          Length = 567

 Score =  108 bits (269), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 64/212 (30%), Positives = 105/212 (49%), Gaps = 26/212 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+V  +P   +YH+F++ AE E +  LA P++ +S V   +  Q  +  +  S  T+LK 
Sbjct: 363 EMVWIQPHVALYHDFITDAEAETIKGLAGPWLQRSVVASGEKQQKAEYWI--SKSTWLKD 420

Query: 138 GQDRIIRGIEKRIADFTFIPM--EHGEGIQVLHYEVGQKYDAHYDYFLD---EFNTKNGG 192
             D ++  +++RI   T + +   + E +QV++Y +G  Y+ H+D+             G
Sbjct: 421 TVDPVVHALDQRIIAVTGLDLWPPYAEYLQVVNYGLGGHYEPHFDHATSTKSPLYRMKSG 480

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R AT+++YLS VE GG T     N                    SV   +  AL +W++
Sbjct: 481 NRNATVMIYLSAVEAGGSTALIYTN-------------------FSVPVVKNAALFWWNL 521

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMHVH 284
           R +   D  +LH GCPV+ G+KW   KW+H H
Sbjct: 522 RRNGNGDGDTLHAGCPVLAGDKWVVNKWIHEH 553


>gi|405965633|gb|EKC30995.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
          Length = 617

 Score =  107 bits (268), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 61/209 (29%), Positives = 105/209 (50%), Gaps = 23/209 (11%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E V   P   +YH+ +S  E + +  +A P + ++TV + +TG+ + +  R S   +LK 
Sbjct: 412 EEVYLNPWIVIYHDVVSDKEIDTIKRIATPLLSRATVHNPRTGKLETAEYRVSKSAWLKD 471

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNGGQ 193
           G D +I  +  RI+D T + M   E +Q+ +Y +G +Y+ H+D+   E    F     G 
Sbjct: 472 GDDPVIHNVNNRISDITGLSMATAEELQIANYGLGGQYEPHFDFARREETEAFRDLGSGN 531

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+AT L Y+++V+ GG TV       FT +            G+ + P +G A  ++++ 
Sbjct: 532 RIATWLTYMTNVDAGGATV-------FTHI------------GVKLFPIKGAAAFWYNLY 572

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                   + H  CPV+ G KW + KW+H
Sbjct: 573 RSGDGIFDTRHAACPVLVGQKWVSNKWIH 601


>gi|381200649|ref|ZP_09907785.1| Prolyl 4-hydroxylase alpha subunit [Sphingobium yanoikuyae XLDN2-5]
          Length = 305

 Score =  107 bits (268), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 66/201 (32%), Positives = 102/201 (50%), Gaps = 27/201 (13%)

Query: 82  WEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTS-SGTFLKRGQD 140
           W+ R F    FL+  EC ++I   +  +  + V+D ++G+     VRTS  G F    +D
Sbjct: 121 WDVRLF--RQFLTGDECHHVISEGQALLEPAMVIDPRSGRPMPHPVRTSDGGIFGPARED 178

Query: 141 RIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLM 200
            +I+ I +RIA  +   +  GE + +L Y VGQ+Y  H+D      N     QR  T+L+
Sbjct: 179 LVIQAINRRIAAASGTMLSGGEPLTLLRYAVGQQYRQHHDCLPHVRN-----QRAWTMLI 233

Query: 201 YLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDP 260
           YL++   GGET+FP                   + GLSVK ++GDALLF +         
Sbjct: 234 YLNEGYAGGETIFP-------------------RLGLSVKGRKGDALLFRNTDAQGQAAE 274

Query: 261 SSLHGGCPVIRGNKWSATKWM 281
           +++H G PV+ G KW  T+W+
Sbjct: 275 AAVHLGAPVMAGQKWLCTRWI 295


>gi|195055767|ref|XP_001994784.1| GH14132 [Drosophila grimshawi]
 gi|193892547|gb|EDV91413.1| GH14132 [Drosophila grimshawi]
          Length = 537

 Score =  107 bits (268), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 67/202 (33%), Positives = 107/202 (52%), Gaps = 25/202 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P   +YH+ + ++E + L  L +    ++ VV + T  S  S+ RTS   F+   + +++
Sbjct: 335 PLLVLYHDVIYQSEIDVLNKLNRKRYERAGVVINST--STVSKKRTSQHIFIAATRHKVL 392

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF--LDEFNTKNG--GQRMATLL 199
           R I++R+AD T + M++ E  Q+  Y +G  Y  H+D+F   D  N+K    G R+AT+L
Sbjct: 393 RTIDQRVADMTNLNMQYAEDHQLADYGIGGHYSQHFDWFGNSDLANSKCDEMGNRIATVL 452

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            YLSDV +GG T FP                   KQ L  KPK+  A  ++++      D
Sbjct: 453 FYLSDVAQGGGTAFPIL-----------------KQLL--KPKKYAAAFWYNLHASGKGD 493

Query: 260 PSSLHGGCPVIRGNKWSATKWM 281
             +LHGGCP+I G+KW   +W+
Sbjct: 494 WRNLHGGCPIIVGSKWVLNRWI 515


>gi|402894624|ref|XP_003910453.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-3 [Papio anubis]
          Length = 535

 Score =  107 bits (268), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 64/210 (30%), Positives = 107/210 (50%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP   +YH+F+S +E + + + A+P++ +S V   +     + R+  S+  +LK 
Sbjct: 331 EVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVASGEKQLQVEYRISKSA--WLKD 388

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++  +  RIA  T + +   + E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 389 TVDPMLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 448

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F  AN                   LSV   +  AL +W++
Sbjct: 449 NRVATFMIYLSSVEAGGATAFIYAN-------------------LSVPVVKNAALFWWNL 489

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH GCPV+ G+KW A KW+H
Sbjct: 490 HRSGEGDSDTLHAGCPVLVGDKWVANKWIH 519


>gi|354504916|ref|XP_003514519.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cricetulus
           griseus]
          Length = 509

 Score =  107 bits (268), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 64/210 (30%), Positives = 107/210 (50%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++   P   +YH+F+S AE + + +LA+P++ +S V   +     + R+  S   +LK 
Sbjct: 305 EVIHLRPFVALYHDFVSDAEAQKIRELAEPWLQRSVVASGEKQLPVEYRI--SKSAWLKD 362

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++  ++ RIA  T + ++  + E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 363 TVDPMLGTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 422

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F  AN                    SV   +  AL +W++
Sbjct: 423 NRVATFMIYLSAVEAGGATAFIYAN-------------------FSVPVVKNAALFWWNL 463

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH GCPV+ G+KW A KW+H
Sbjct: 464 HRSGEGDGDTLHAGCPVLVGDKWVANKWIH 493


>gi|344274276|ref|XP_003408943.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3
           [Loxodonta africana]
          Length = 516

 Score =  107 bits (268), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 67/200 (33%), Positives = 98/200 (49%), Gaps = 33/200 (16%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 334 KPRIVRFHDIISDAEIEVVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           +  I  RI D T        G+ V   E  QK +       D F     G R+AT L Y+
Sbjct: 394 VSRINMRIQDLT--------GLDVSTAEELQKDEP------DAFKELGTGNRIATWLFYM 439

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           SDV  GG TVFP                     G SV PK+G A+ ++++      D S+
Sbjct: 440 SDVSAGGATVFPDV-------------------GASVWPKKGTAVFWYNLFASGEGDYST 480

Query: 263 LHGGCPVIRGNKWSATKWMH 282
            H  CPV+ GNKW + KW+H
Sbjct: 481 RHAACPVLVGNKWVSNKWLH 500


>gi|291387302|ref|XP_002710242.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
           precursor (predicted)-like isoform 2 [Oryctolagus
           cuniculus]
 gi|217273039|gb|ACK28132.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Oryctolagus cuniculus]
          Length = 555

 Score =  107 bits (267), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 64/223 (28%), Positives = 107/223 (47%), Gaps = 43/223 (19%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDY------------------FLDE 185
             I +R+   T + ++  E +QV +Y +G +Y+ H+D+                  FL+ 
Sbjct: 396 ARINRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455

Query: 186 FNTKNG------GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSV 239
            N ++       G R+AT L Y+SDVE GG TVFP                     G ++
Sbjct: 456 NNERDAFKRLGTGNRVATFLNYMSDVEAGGATVFP-------------------DLGAAI 496

Query: 240 KPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
            PK+G A+ ++++      D  + H  CPV+ G KW + KW H
Sbjct: 497 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 539


>gi|38454288|ref|NP_942070.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Rattus norvegicus]
 gi|81870816|sp|Q6W3E9.1|P4HA3_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|36962768|gb|AAQ87605.1| collagen prolyl 4-hydroxylase alpha III subunit [Rattus norvegicus]
          Length = 544

 Score =  107 bits (267), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 63/210 (30%), Positives = 105/210 (50%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++   P   +YH+F+S  E + + +LA+P++ +S V   +     + R+  S   +LK 
Sbjct: 340 EVIHLRPLVALYHDFVSDEEAQKIRELAEPWLQRSVVASGEKQLQVEYRI--SKSAWLKD 397

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDE---FNTKNGG 192
             D ++  +++RIA  T + ++  + E +QV++Y +G  Y+ H+D+             G
Sbjct: 398 TVDPVLVTLDRRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYKMKSG 457

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R ATL++YLS VE GG T F   N                    SV   +  AL +W++
Sbjct: 458 NRAATLMIYLSSVEAGGATAFIYGN-------------------FSVPVVKNAALFWWNL 498

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH GCPV+ G+KW A KW+H
Sbjct: 499 HRSGEGDDDTLHAGCPVLVGDKWVANKWIH 528


>gi|116496629|gb|AAI26171.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
          Length = 544

 Score =  107 bits (267), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 65/210 (30%), Positives = 106/210 (50%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP   +YH+F+S +E + + +LA+P++ +S V   +     + R+  S   +LK 
Sbjct: 340 EVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVASGEKQLQVEYRI--SKSAWLKD 397

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             +  +  +  RIA  T + +   + E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 398 TVNPKLVTLNHRIAALTGLDVRPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 457

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F  AN                   LSV   R  AL +W++
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYAN-------------------LSVPVVRNAALFWWNL 498

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH GCPV+ G+KW A KW+H
Sbjct: 499 HRSGEGDSDTLHAGCPVLVGDKWVANKWIH 528


>gi|344296798|ref|XP_003420090.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Loxodonta
           africana]
          Length = 544

 Score =  107 bits (267), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 65/214 (30%), Positives = 109/214 (50%), Gaps = 28/214 (13%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP   +YH+F++  E + +  LA+P++ +S V   +     D R+  S+  +LK 
Sbjct: 340 EVIHLEPYVVLYHDFVNDMEAQKIKGLAEPWLQRSVVASGEKQLQVDYRISKSA--WLKD 397

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++  ++ RIA  T + ++  + E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 398 SVDPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 457

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F  AN +   V+                     AL +W++
Sbjct: 458 NRVATFMIYLSAVEAGGATAFIYANFSMPVVK-------------------NAALFWWNL 498

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
                 D  +LH GCPV+ G+KW A KW  +HEY
Sbjct: 499 HRSGEGDGDTLHAGCPVLVGDKWVANKW--IHEY 530


>gi|195505202|ref|XP_002099402.1| GE23382 [Drosophila yakuba]
 gi|194185503|gb|EDW99114.1| GE23382 [Drosophila yakuba]
          Length = 537

 Score =  107 bits (267), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 101/201 (50%), Gaps = 20/201 (9%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P    YH+ LS  +   L ++A P M +STV     GQ K S  R S   +L       
Sbjct: 330 DPYVATYHDMLSPRKISQLREMAVPRMRRSTVNPLPGGQHKKSAFRVSKNAWLAYESHPT 389

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN-TKNGGQRMATLLMY 201
           + G+ + + + T +   + E +QV +Y VG  Y+ H+D+F D  +  +  G R+AT + Y
Sbjct: 390 MVGMLRDLKEATGLDTTYCEQLQVANYGVGGHYEPHWDFFRDPNHYPEEEGNRIATAIFY 449

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           LS+VE+GG T FP  +                   ++VKP+ G+ L ++++      D  
Sbjct: 450 LSEVEQGGATAFPFLD-------------------IAVKPQLGNVLFWYNLHRSLDKDYR 490

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H GCPV++G+KW    W+H
Sbjct: 491 TKHAGCPVLKGSKWIGNVWIH 511


>gi|298712929|emb|CBJ26831.1| conserved unknown protein [Ectocarpus siliculosus]
          Length = 294

 Score =  107 bits (267), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 74/210 (35%), Positives = 104/210 (49%), Gaps = 34/210 (16%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P  +V  +F S  EC+ LI LA  YM+ S VV +  G+  +SR  TSS  FL R +D  +
Sbjct: 101 PPLYVVDDFFSGPECDALIALAGNYMIVSPVVGAGAGEVSESR--TSSSCFLAR-ED--L 155

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNT-------KNGGQRMA 196
             +  ++   T  P+EH E  QV  Y   QKY  H+D F  + NT       +NGGQR+ 
Sbjct: 156 PTVCHKVMALTGKPIEHLELPQVGRYYTSQKYANHWDAF--DLNTEDGRRFAQNGGQRVC 213

Query: 197 TLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
           T+L+YL+DV  GG T FP                   + G+ V+P++G A++F+    D 
Sbjct: 214 TVLVYLNDVPSGGCTAFP-------------------QLGMKVQPRKGMAVVFFPATLDG 254

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
            LD   LH   P I   KW +  W+    Y
Sbjct: 255 VLDSRLLHAAEPAID-TKWVSQIWIRQGAY 283


>gi|390178148|ref|XP_001358756.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
 gi|388859341|gb|EAL27899.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
          Length = 498

 Score =  107 bits (267), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 60/207 (28%), Positives = 115/207 (55%), Gaps = 28/207 (13%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGT-FLK 136
           E ++ +P   ++H+ + ++E ++L++  +   +K+++V    GQ + S VRTS    F++
Sbjct: 297 EELSHDPYMVLFHDVVYESEIDFLLNATQ---LKASLV----GQYQYSPVRTSKEQHFVE 349

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYD-YFLDEFNTKNGGQRM 195
                +++ + +R+ D T + M   + + +++Y +G  YD HYD +   E N    G R+
Sbjct: 350 YNDTAVVKTLHRRLNDMTGLDMIESDALTLINYGMGGHYDVHYDSHNYSEANRLILGDRI 409

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           AT+L Y+ +V+ GG T FP  N                   +SV PK+G A+L++++   
Sbjct: 410 ATVLFYVGEVDSGGATTFPYIN-------------------VSVTPKKGSAVLWYNLDNA 450

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMH 282
             ++P ++H GCPVI G+K+  TKW++
Sbjct: 451 GQMNPKAIHAGCPVIVGSKYVLTKWIN 477


>gi|355709028|gb|AES03457.1| prolyl 4-hydroxylase, alpha polypeptide III [Mustela putorius furo]
          Length = 477

 Score =  107 bits (266), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 63/210 (30%), Positives = 106/210 (50%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP   +YH+F+S  E + +  LA+P++ +S V   +     + R+  S+  +LK 
Sbjct: 274 EVIHLEPYVVLYHDFVSDMEAQKIRGLAEPWLQRSVVASGEKQLPVEYRISKSA--WLKD 331

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++  ++ RI   T + ++  + E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 332 TVDPLLVNLDHRIGALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPTSPLYRMKSG 391

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F  AN                    SV   +  AL +W++
Sbjct: 392 NRVATFMIYLSSVEAGGATAFIYAN-------------------FSVPVVKNAALFWWNL 432

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH GCPV+ G+KW A KW+H
Sbjct: 433 HRSGEGDGDTLHAGCPVLVGDKWVANKWIH 462


>gi|229084249|ref|ZP_04216532.1| 2OG-Fe(II) oxygenase [Bacillus cereus Rock3-44]
 gi|228699049|gb|EEL51751.1| 2OG-Fe(II) oxygenase [Bacillus cereus Rock3-44]
          Length = 235

 Score =  107 bits (266), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 66/205 (32%), Positives = 98/205 (47%), Gaps = 27/205 (13%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y   +++ EC  LIDLA+  +  S V+ +   + K S VRTS     +     + 
Sbjct: 47  PFIGCYEKVVTQTECHQLIDLARHGLQPSKVIGN--SEQKTSAVRTSDTIGFQHHLTELT 104

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-----LDEFNTKNGGQRMATL 198
             I KRIA    +P+ + E +Q+  Y+VG K++AH+D F     L +      GQR+ T 
Sbjct: 105 LQICKRIASIVELPLNYAEHLQIARYQVGGKFNAHFDTFNPSTELGKMYLSENGQRIITA 164

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L+YL++V  GGET FP  N                   + V P  G  L+F + + ++  
Sbjct: 165 LLYLNNVSAGGETSFPLLN-------------------IQVAPSEGTLLVFENCKKNSNE 205

Query: 259 DPS-SLHGGCPVIRGNKWSATKWMH 282
             + S+H GC V  G KW AT W H
Sbjct: 206 RHALSIHEGCAVHEGEKWIATLWFH 230


>gi|297803562|ref|XP_002869665.1| ShTK domain-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297315501|gb|EFH45924.1| ShTK domain-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 290

 Score =  107 bits (266), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 65/206 (31%), Positives = 109/206 (52%), Gaps = 26/206 (12%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           ++W+PR F+Y  FLS+ E ++LI L                  KD+   TS     K   
Sbjct: 61  LSWQPRVFLYRGFLSEEESDHLISL-----------------RKDTSEVTSGDADGKTQL 103

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D ++ GIE++I+ +TF+P E+G  I+V  Y   +K     DYF +E ++      +AT++
Sbjct: 104 DPVVAGIEEKISAWTFLPRENGGSIKVRSY-TSEKSGKKLDYFGEEPSSVLRESLLATVV 162

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           +YLS+  +GGE +FP +               C + G  ++P +G+A+LF+S   +A+LD
Sbjct: 163 LYLSNTTQGGELLFPNSEVKPKK--------SCSEDGNILRPVKGNAVLFFSRLLNASLD 214

Query: 260 PSSLHGGCPVIRGNKWSATKWMHVHE 285
            +S H  CPV++G    ATK ++  +
Sbjct: 215 ETSTHLICPVVKGELLVATKLIYAKK 240


>gi|195145084|ref|XP_002013526.1| GL24185 [Drosophila persimilis]
 gi|194102469|gb|EDW24512.1| GL24185 [Drosophila persimilis]
          Length = 229

 Score =  107 bits (266), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 60/207 (28%), Positives = 115/207 (55%), Gaps = 28/207 (13%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGT-FLK 136
           E ++ +P   ++H+ + ++E ++L++  +   +K+++V    GQ + S VRTS    F++
Sbjct: 28  EELSHDPYMVLFHDVVYESEIDFLLNATQ---LKASLV----GQYQYSPVRTSKEQHFVE 80

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYD-YFLDEFNTKNGGQRM 195
                +++ + +R+ D T + M   + + +++Y +G  YD HYD +   E N    G R+
Sbjct: 81  YNDTAVVKTLHRRLNDMTGLDMIESDTLTLINYGMGGHYDVHYDSHNYSEANRLILGDRI 140

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
           AT+L Y+ +V+ GG T FP  N                   +SV PK+G A+L++++   
Sbjct: 141 ATVLFYVGEVDSGGATTFPYIN-------------------VSVTPKKGSAVLWYNLDNS 181

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMH 282
             ++P ++H GCPVI G+K+  TKW++
Sbjct: 182 GQMNPKAIHAGCPVIVGSKYVLTKWIN 208


>gi|195159146|ref|XP_002020443.1| GL13510 [Drosophila persimilis]
 gi|194117212|gb|EDW39255.1| GL13510 [Drosophila persimilis]
          Length = 527

 Score =  107 bits (266), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 69/212 (32%), Positives = 104/212 (49%), Gaps = 26/212 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E ++ +P   VYHN LS AE   +  + +P + +S V D K  +   S+ RT+ G +L  
Sbjct: 317 EELSLDPYIVVYHNVLSDAEIAEVERVTEPLLKRSVVFDGKENKMSTSKKRTALGAWLPD 376

Query: 138 GQ-----DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN-TKNG 191
                    +I+ I +RI + T + +   + +Q++ Y  G  YD H+DYF      TK  
Sbjct: 377 DNMDVSGRAVIQRIFRRIHELTGLIINDRQDMQLIKYGYGGHYDIHFDYFNTSTPITKAR 436

Query: 192 GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWS 251
           G RMAT+L YL+D++ GG T F                       L V  +RG  L +++
Sbjct: 437 GDRMATVLFYLNDMKHGGSTAFTDLQ-------------------LKVPSERGKVLFWYN 477

Query: 252 MRPDA-TLDPSSLHGGCPVIRGNKWSATKWMH 282
           MR +   LD  +LHG CPVI G K   + W+H
Sbjct: 478 MRGETHDLDSRTLHGACPVINGTKTILSCWIH 509


>gi|197215651|gb|ACH53042.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Otolemur garnettii]
          Length = 555

 Score =  107 bits (266), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 65/223 (29%), Positives = 104/223 (46%), Gaps = 43/223 (19%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDY------------------FL-- 183
             +  R+   T + ++  E +QV +Y VG +Y+ H+D+                  FL  
Sbjct: 396 ARVNHRMQHITGLSVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRVATFLNY 455

Query: 184 ----DEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSV 239
               D F     G R+AT L Y+SDVE GG TVFP                     G ++
Sbjct: 456 NHERDAFKRLGTGNRVATFLNYMSDVEAGGATVFP-------------------DLGAAI 496

Query: 240 KPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
            PK+G A+ ++++      D  + H  CPV+ G KW + KW H
Sbjct: 497 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 539


>gi|198459366|ref|XP_002138685.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
 gi|198136669|gb|EDY69243.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
          Length = 448

 Score =  107 bits (266), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 69/213 (32%), Positives = 107/213 (50%), Gaps = 24/213 (11%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
            E ++ +P   +YH+ +  +E E L D A P M +S V        KD+  RTS   F  
Sbjct: 245 VEPLSQDPYIAMYHDVIYDSEIEELKDNAFPDMERSKVYTYSDKDGKDTG-RTSMSAFQT 303

Query: 137 RGQDRIIRGIEKRIADFT-FIPMEHG--EGIQVLHYEVGQKYDAHYDYFLDEFNTK-NGG 192
             Q   +  + +R+   T F  +  G  + + VL+Y    +Y  H DYF   ++     G
Sbjct: 304 DHQYTAVTKVNRRVMHMTGFEVLADGSSDELLVLNYATAAQYLTHSDYFGPAYSEYIQRG 363

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT+L YL+DVE+GG+TVFP                   + G+   P +G A++F+++
Sbjct: 364 DRIATVLFYLNDVEQGGKTVFP-------------------RLGIFRSPMKGSAVVFYNL 404

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
                 DP + HGGCPV+ G KW+ATKW++  E
Sbjct: 405 NSSLQGDPRTEHGGCPVLVGTKWAATKWIYSAE 437


>gi|198449635|ref|XP_001357660.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
 gi|198130694|gb|EAL26794.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
          Length = 549

 Score =  107 bits (266), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 66/211 (31%), Positives = 107/211 (50%), Gaps = 29/211 (13%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E ++ +P   +YH+ + ++E + +  L    M ++ V  + T QS  S VRTS  TF+ +
Sbjct: 322 EELSHDPLLVLYHDVIYQSEIDVIRQLTTNRMARAMV--TLTNQSTVSNVRTSQITFIAK 379

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNG------ 191
            +  +++ I++R+AD T + M++ E  Q  +Y +G  Y  H D+F  E    NG      
Sbjct: 380 TEHEVLQTIDRRVADMTNLNMDYAEDHQFANYGIGGHYGQHMDWF-TETTFDNGLVSSTE 438

Query: 192 -GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFW 250
            G R+AT+L YLSDV +GG T FP                   KQ L  +PK+  A  + 
Sbjct: 439 MGNRIATVLFYLSDVAQGGGTAFPYL-----------------KQHL--RPKKYAAAFWH 479

Query: 251 SMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           ++      D  + HG CP+I G+KW   +W+
Sbjct: 480 NLHAAGRGDARTQHGACPIIAGSKWVLNRWI 510


>gi|90023340|ref|YP_529167.1| response regulator receiver domain-containing protein
           [Saccharophagus degradans 2-40]
 gi|89952940|gb|ABD82955.1| 2OG-Fe(II) oxygenase [Saccharophagus degradans 2-40]
          Length = 269

 Score =  107 bits (266), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 67/211 (31%), Positives = 102/211 (48%), Gaps = 31/211 (14%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P   +  +FL++AE   +I  A   M ++ V   K G   +S  RT S  ++    +++ 
Sbjct: 63  PSVTICEDFLTQAEVFQIIKAAGDKMQRARVSSGKEG--IESAGRTGSNCWVAHDHNKVT 120

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTK-------NGGQRMA 196
             + KRI+    I +++ E  QV+HY V Q+Y +H+D +  EFNT+        GGQR+ 
Sbjct: 121 HALAKRISKLVGISLQNAESFQVIHYGVSQEYSSHFDAW--EFNTERGERCMARGGQRLV 178

Query: 197 TLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
           T L+YL+DV  GG T FP  +                   L V+ K+G  ++F +  P  
Sbjct: 179 TCLIYLNDVPAGGGTGFPELD-------------------LEVQAKKGRMVIFHNCYPGT 219

Query: 257 TL-DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
               P SLHGG PV  G KW+   W    +Y
Sbjct: 220 NYRHPHSLHGGLPVEEGEKWAVNLWFREADY 250


>gi|426255748|ref|XP_004021510.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Ovis
           aries]
          Length = 516

 Score =  107 bits (266), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 67/200 (33%), Positives = 98/200 (49%), Gaps = 33/200 (16%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +PR   +H+ +S AE E + DLAKP + ++TV D +TG+   ++ R S   +L   ++ +
Sbjct: 334 KPRIIRFHDIISDAEIEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGYENPV 393

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLMYL 202
           +  I  RI D T        G+ V   E  QK +       D F     G R+AT L Y+
Sbjct: 394 VSRINMRIQDLT--------GLDVSTAEELQKDEP------DAFKELGTGNRIATWLFYM 439

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSS 262
           SDV  GG TVFP                     G SV PK+G A+ ++++      D S+
Sbjct: 440 SDVLAGGATVFPEV-------------------GASVWPKKGTAVFWYNLFASGEGDYST 480

Query: 263 LHGGCPVIRGNKWSATKWMH 282
            H  CPV+ GNKW + KW+H
Sbjct: 481 RHAACPVLVGNKWVSNKWLH 500


>gi|195452746|ref|XP_002073482.1| GK14141 [Drosophila willistoni]
 gi|194169567|gb|EDW84468.1| GK14141 [Drosophila willistoni]
          Length = 541

 Score =  107 bits (266), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 63/204 (30%), Positives = 111/204 (54%), Gaps = 27/204 (13%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P   +YH+ + ++E + + +L +  + ++TV+ +K   S+ S+VRTS  TF+ + + +++
Sbjct: 330 PLLVLYHDVIYQSEIDVIRNLTENEISRATVIGAKG--SEVSKVRTSQFTFIPKTRHKVL 387

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNTK-----NGGQRMAT 197
           + I++R+AD + + M++ E  Q  +Y +G  Y  H D+F  D F+ +       G R+AT
Sbjct: 388 QTIDQRVADMSNLNMDYAELHQFANYGIGGHYAQHNDWFGQDAFDNELVSSPEMGNRIAT 447

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
           +L YLSDV +GG T FP                   KQ L  +PK+  A  + ++     
Sbjct: 448 VLFYLSDVAQGGGTAFPHL-----------------KQLL--QPKKYAAAFWHNLHASGV 488

Query: 258 LDPSSLHGGCPVIRGNKWSATKWM 281
            D  +LHG CP+I G+KW   +W+
Sbjct: 489 GDLRTLHGACPIIAGSKWVQNRWI 512


>gi|301759032|ref|XP_002915381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Ailuropoda
           melanoleuca]
          Length = 539

 Score =  107 bits (266), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 63/210 (30%), Positives = 105/210 (50%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP   +YH+F+S  E + +  LA+P++ +S V   +     + R+  S   +LK 
Sbjct: 335 EVIHLEPYVVLYHDFVSDGEAQKIRGLAEPWLQRSVVASGEKQLPVEYRI--SKSAWLKD 392

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++  ++ RI   T + ++  + E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 393 TVDPLLVTLDHRIGALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPTSPLYRMKSG 452

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F  AN                    SV   +  AL +W++
Sbjct: 453 NRVATFMIYLSSVEAGGATAFIYAN-------------------FSVPVVKNAALFWWNL 493

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH GCPV+ G+KW A KW+H
Sbjct: 494 HRSGEGDGDTLHAGCPVLVGDKWVANKWIH 523


>gi|229368743|gb|ACQ63024.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Dasypus novemcinctus]
          Length = 556

 Score =  107 bits (266), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 64/223 (28%), Positives = 105/223 (47%), Gaps = 43/223 (19%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 337 PHIVRYYDIMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEENDDPVV 396

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDY------------------FL-- 183
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+                  FL  
Sbjct: 397 AQVNRRMEHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 456

Query: 184 ----DEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSV 239
               D F     G R+AT L Y+SDVE GG TVFP                     G ++
Sbjct: 457 NHEQDVFKHLGTGNRVATFLNYMSDVEAGGATVFP-------------------DLGAAI 497

Query: 240 KPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
            PK+G A+ ++++      D  + H  CPV+ G KW + KW H
Sbjct: 498 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 540


>gi|431838427|gb|ELK00359.1| Prolyl 4-hydroxylase subunit alpha-3 [Pteropus alecto]
          Length = 483

 Score =  106 bits (265), Expect = 9e-21,   Method: Compositional matrix adjust.
 Identities = 63/210 (30%), Positives = 105/210 (50%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP   +YH+F+S  E + +  LA+P++ +S V   +     + R+  S   +LK 
Sbjct: 279 EVIHLEPYVVLYHDFVSDLEAQKIRGLAEPWLQRSVVASGEKQLPVEYRI--SKSAWLKD 336

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++  ++ RIA  T + ++  + E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 337 TADPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 396

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F  AN                    SV   +  AL +W++
Sbjct: 397 NRVATFMIYLSSVEAGGATAFIYAN-------------------FSVPVVKNAALFWWNL 437

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH  CPV+ G+KW A KW+H
Sbjct: 438 HRSGEGDSDTLHAACPVLVGDKWVANKWIH 467


>gi|281183175|ref|NP_001162504.1| prolyl 4-hydroxylase subunit alpha-2 [Papio anubis]
 gi|159461520|gb|ABW96795.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase, alpha
           polypeptide II, isoform 1 (predicted) [Papio anubis]
          Length = 578

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 64/223 (28%), Positives = 105/223 (47%), Gaps = 43/223 (19%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 359 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 418

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDY---------------------F 182
             + +R+   T + ++  E +QV +Y VG +Y+ H+D+                     +
Sbjct: 419 ARVNRRMQHITGLTVKTAELLQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 478

Query: 183 LDE---FNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSV 239
            DE   F     G R+AT L Y+SDVE GG TVFP                     G ++
Sbjct: 479 NDERHTFKHLGTGNRVATFLNYMSDVEAGGATVFP-------------------DLGAAI 519

Query: 240 KPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
            PK+G A+ ++++      D  + H  CPV+ G KW + KW H
Sbjct: 520 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 562


>gi|381173085|ref|ZP_09882194.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
 gi|380686458|emb|CCG38681.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas citri pv.
           mangiferaeindicae LMG 941]
          Length = 418

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 79/260 (30%), Positives = 110/260 (42%), Gaps = 41/260 (15%)

Query: 37  MGIFYIPIGDDDSP-PNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSK 95
           +G+  +P  D   P P D T+  R AF  R                  PR   Y   LS 
Sbjct: 189 LGMTALPAVDIAPPDPADDTADHRIAFAPRVGPVRRH---------TAPRIEEYAAVLSA 239

Query: 96  AECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTF 155
            EC  L+ LA+P++  S V+D     ++ + +RTS G  L    D II     R A    
Sbjct: 240 DECRLLMLLARPHLRASKVIDPNDASTQRAPIRTSRGATL----DPIIEDFAARAAQARL 295

Query: 156 -----IPMEHGEGIQVLHYEVGQKYDAHYDYFLD---EFNTKNGGQRMATLLMYLSDVEE 207
                +P+ H E + VL Y  G++Y AH DY        +    G R  T+ +YL+DV  
Sbjct: 296 AACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQRTVCVYLNDVGA 355

Query: 208 GGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGC 267
           GG+T FP A                   G+ V+P+ G  + F ++  D   D  SLH G 
Sbjct: 356 GGDTEFPIA-------------------GVRVRPRPGTLVCFDNLHADGRPDADSLHAGL 396

Query: 268 PVIRGNKWSATKWMHVHEYK 287
           PV  G+KW  T W     Y+
Sbjct: 397 PVTAGSKWLGTLWFRQQRYR 416


>gi|344199983|ref|YP_004784309.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrivorans SS3]
 gi|343775427|gb|AEM47983.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrivorans SS3]
          Length = 212

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 95/201 (47%), Gaps = 20/201 (9%)

Query: 89  YHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEK 148
           +   LS  EC  LI     +   S V+   +  S ++  R S+       +  II+ + +
Sbjct: 17  FSGLLSPEECTELIAAGGSHAKPSEVIYGVSDVSHETSGRRSTVASPSADKYPIIKAVRR 76

Query: 149 RIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD-EFNTKNGGQRMATLLMYLSDVEE 207
           RI+ F  +  E+ E +QVLHY  G +YD HYD FL+     +NGG RM T+L+YL+DVE+
Sbjct: 77  RISLFIGVAEENQEPLQVLHYTRGGRYDIHYDSFLEGSPQLENGGNRMLTVLLYLNDVEQ 136

Query: 208 GGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGC 267
           GG T FP   AN                   + P  G  +LF +          SLH G 
Sbjct: 137 GGWTQFPHIMAN-------------------IVPNVGTGILFRNTDAQNLQLRESLHAGL 177

Query: 268 PVIRGNKWSATKWMHVHEYKA 288
           PVI G KW A+ W+    Y A
Sbjct: 178 PVIDGEKWIASIWIREKSYTA 198


>gi|224008853|ref|XP_002293385.1| hypothetical protein THAPSDRAFT_264010 [Thalassiosira pseudonana
           CCMP1335]
 gi|220970785|gb|EED89121.1| hypothetical protein THAPSDRAFT_264010 [Thalassiosira pseudonana
           CCMP1335]
          Length = 248

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 69/221 (31%), Positives = 106/221 (47%), Gaps = 37/221 (16%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLA-KPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRG 138
           V+  PR F   +F+S  E ++++ L  + + +  +     +  S     RTS  T++ R 
Sbjct: 40  VSCSPRIFELEHFISDVEADHILMLTNRTHELHRSSTGDSSHHSDHDSTRTSMNTWIYRE 99

Query: 139 QDRIIRGIEKRIADFTFI---------PMEH---------GEGIQVLHYEVGQKYDAHYD 180
           +  II  I +R+AD   I         P EH          E +Q++HY+ G++Y AH+D
Sbjct: 100 ETAIIDTIYRRVADVLRIDEALLRRRQPDEHPRLGTRSSIAEPLQMVHYDPGEEYTAHHD 159

Query: 181 YFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVK 240
           +     +  +   R   +L+YL+DVEEGGET FP         RW          GL VK
Sbjct: 160 FGYTHMSAPHQPSRSINMLLYLNDVEEGGETSFP---------RW---------GGLDVK 201

Query: 241 PKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           P +G A+LF+ +  D   D  S H   PVI+G KW +  W+
Sbjct: 202 PVKGKAVLFYMLTADGNSDDLSQHAALPVIKGEKWMSNLWI 242


>gi|194751829|ref|XP_001958226.1| GF23628 [Drosophila ananassae]
 gi|190625508|gb|EDV41032.1| GF23628 [Drosophila ananassae]
          Length = 484

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 64/208 (30%), Positives = 105/208 (50%), Gaps = 30/208 (14%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E V+ +P   +YHN +S  E E +  L          +D+      +SR   S   +L +
Sbjct: 288 EEVSLDPYIVLYHNVISDREIEEMKGLIDE-------MDNGWTDLNESREIVSRLVWLTK 340

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE---FNTKNGGQR 194
            + R  + +  RI D T   ++   G+Q+ ++ VG ++  HYDYF +     N    G R
Sbjct: 341 -ESRFRKRLNLRIRDITGFNVDEIRGLQIANFGVGGQFKPHYDYFTERILRLNNTILGDR 399

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +A+++ Y+ DV  GG+TVFP                      ++VKP++G +L +++   
Sbjct: 400 IASIIFYVGDVVHGGQTVFPDIQ-------------------IAVKPQKGSSLFWFNTFD 440

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMH 282
           DAT DP SLH  CPV+ G++W+ TKW+H
Sbjct: 441 DATPDPRSLHSVCPVLIGDRWTITKWLH 468


>gi|194905419|ref|XP_001981192.1| GG11932 [Drosophila erecta]
 gi|190655830|gb|EDV53062.1| GG11932 [Drosophila erecta]
          Length = 535

 Score =  106 bits (265), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 69/235 (29%), Positives = 107/235 (45%), Gaps = 27/235 (11%)

Query: 51  PNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMV 110
           P+     R R +  R   A  K E+    +  +P     H  +   + E L   A+P + 
Sbjct: 297 PSKQRDLRCRLWRSRLGYAPFKLEE----LHLDPPVVQLHQVIGSKDAESLQRTARPRIK 352

Query: 111 KSTVVD-SKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHY 169
           +STV   +  G S  +  RTS G      ++   + +   + DF+ + ME+ E +QV +Y
Sbjct: 353 RSTVYSLAGNGDSTAAAFRTSQGASFNYSRNAATKLLSHHVGDFSGLNMEYAEDLQVANY 412

Query: 170 EVGQKYDAHYDYFLDEFNTKNG---GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWW 226
            +G  Y+ H+D F D    + G   G R+AT + YLSDVE GG T FP            
Sbjct: 413 GIGGHYEPHWDSFPDNHVYQEGDLHGNRIATAIYYLSDVEAGGGTAFPFL---------- 462

Query: 227 NELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
                     L V P+RG  L ++++ P    D  + H  CPV++G+KW A  W+
Sbjct: 463 ---------PLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKWIANVWI 508


>gi|418521653|ref|ZP_13087695.1| hypothetical protein WS7_11622 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
 gi|410702188|gb|EKQ60697.1| hypothetical protein WS7_11622 [Xanthomonas axonopodis pv.
           malvacearum str. GSPB2388]
          Length = 418

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 79/260 (30%), Positives = 110/260 (42%), Gaps = 41/260 (15%)

Query: 37  MGIFYIPIGDDDSP-PNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSK 95
           +G+  +P  D   P P D T+  R AF  R                  PR   Y   LS 
Sbjct: 189 LGMTALPAVDIAPPDPADDTADHRIAFAPRVGPVRRH---------TAPRIEEYAAVLSA 239

Query: 96  AECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTF 155
            EC  L+ LA+P++  S V+D     ++ + +RTS G  L    D II     R A    
Sbjct: 240 DECRLLMLLARPHLRASKVIDPNDASTQRAPIRTSRGATL----DPIIEDFAARAAQARL 295

Query: 156 -----IPMEHGEGIQVLHYEVGQKYDAHYDYFLD---EFNTKNGGQRMATLLMYLSDVEE 207
                +P+ H E + VL Y  G++Y AH DY        +    G R  T+ +YL+DV  
Sbjct: 296 AACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQRTVCVYLNDVGA 355

Query: 208 GGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGC 267
           GG+T FP A                   G+ V+P+ G  + F ++  D   D  SLH G 
Sbjct: 356 GGDTEFPIA-------------------GVRVRPRPGTLVCFDNLHADGRPDADSLHAGL 396

Query: 268 PVIRGNKWSATKWMHVHEYK 287
           PV  G+KW  T W     Y+
Sbjct: 397 PVTAGSKWLGTLWFRQQRYR 416


>gi|77748579|ref|NP_641686.2| hypothetical protein XAC1351 [Xanthomonas axonopodis pv. citri str.
           306]
          Length = 418

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 79/260 (30%), Positives = 110/260 (42%), Gaps = 41/260 (15%)

Query: 37  MGIFYIPIGDDDSP-PNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSK 95
           +G+  +P  D   P P D T+  R AF  R                  PR   Y   LS 
Sbjct: 189 LGMTALPAVDIAPPDPADDTADHRIAFAPRVGPVRRH---------TAPRIEEYAAVLSA 239

Query: 96  AECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTF 155
            EC  L+ LA+P++  S V+D     ++ + +RTS G  L    D II     R A    
Sbjct: 240 DECRLLMLLARPHLRASKVIDPNDASTQRAPIRTSRGATL----DPIIEDFAARAAQARL 295

Query: 156 -----IPMEHGEGIQVLHYEVGQKYDAHYDYFLD---EFNTKNGGQRMATLLMYLSDVEE 207
                +P+ H E + VL Y  G++Y AH DY        +    G R  T+ +YL+DV  
Sbjct: 296 AACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQRTVCVYLNDVGA 355

Query: 208 GGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGC 267
           GG+T FP A                   G+ V+P+ G  + F ++  D   D  SLH G 
Sbjct: 356 GGDTEFPIA-------------------GVRVRPRPGTLVCFDNLHADGRPDADSLHAGL 396

Query: 268 PVIRGNKWSATKWMHVHEYK 287
           PV  G+KW  T W     Y+
Sbjct: 397 PVTAGSKWLGTLWFRQQRYR 416


>gi|335294484|ref|XP_003357239.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Sus scrofa]
          Length = 545

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 64/210 (30%), Positives = 104/210 (49%), Gaps = 25/210 (11%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP   +YH+F++ AE + +  LA+P++    +V S   Q      R S   +LK 
Sbjct: 340 EVIHLEPYVVLYHDFVTDAEAQKIRGLAEPWVTAEILVASGEKQLP-VEYRISKSAWLKD 398

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++  ++ RIA  T + ++  + E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 399 TVDPMLVTLDHRIAALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 458

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F   N                    SV   +  AL +W++
Sbjct: 459 NRVATFMIYLSSVEAGGATAFIYGN-------------------FSVPVVKNAALFWWNL 499

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH GCPV+ G+KW A KW+H
Sbjct: 500 HRSGEGDGDTLHAGCPVLVGDKWVANKWIH 529


>gi|417402564|gb|JAA48127.1| Putative prolyl 4-hydroxylase alpha subunit [Desmodus rotundus]
          Length = 544

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 65/214 (30%), Positives = 108/214 (50%), Gaps = 28/214 (13%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+V  EP   +YH+F++  E + +   A+P++ +S V   +     + R+  S+  +LK 
Sbjct: 340 EVVHLEPYVVLYHDFVNDLEAQKIRGFAEPWLQRSVVASGEKQLPVEYRISKSA--WLKD 397

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++  +++RIA  T +  +  + E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 398 TVDPMLVTLDRRIAALTGLDTQPPYAEHLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 457

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F  AN                    SV   +  AL +W++
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYAN-------------------FSVPVVKNAALFWWNL 498

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
                 D  +LH GCPV+ G+KW A KW  +HEY
Sbjct: 499 HRSGEGDGDTLHAGCPVLVGDKWVANKW--IHEY 530


>gi|303273602|ref|XP_003056161.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226462245|gb|EEH59537.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 750

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 83/234 (35%), Positives = 111/234 (47%), Gaps = 55/234 (23%)

Query: 88  VYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK--RGQDRIIRG 145
           V+ +FLS  EC+ L+ +A P + +S V D K  +      RTSS TFL   + ++ ++R 
Sbjct: 533 VFDHFLSAVECDDLVAIAAPDLRRSRVTDGKLSEG-----RTSSSTFLTGCKQEEPLVRA 587

Query: 146 IEKR----IADFTFIPME---------HG--------------------EGIQVLHYEVG 172
           IE+R    +   T I  +         HG                    E +QV+ Y  G
Sbjct: 588 IEQRLLRAVQSATLIAAQPNVYDSNERHGQPYRGSTSRFSQRPNLLQGAEPMQVVRYTEG 647

Query: 173 QKYDAHYDYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSEC 232
           Q Y AHYD      N +   +R AT +MYL+DV  GG T FP A     S+R       C
Sbjct: 648 QMYTAHYD------NKQGCLRRTATFMMYLTDVHSGGATHFPRAVP--VSMR-----DGC 694

Query: 233 GKQ-GLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
           G   G+ + PKRG AL+FWS+      D  SLH   PVI G KW ATKW+   E
Sbjct: 695 GDAAGIRIWPKRGRALVFWSVS-GGIEDVRSLHEAEPVIEGEKWIATKWLREDE 747


>gi|449469338|ref|XP_004152378.1| PREDICTED: uncharacterized protein LOC101218968 [Cucumis sativus]
          Length = 311

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 66/204 (32%), Positives = 108/204 (52%), Gaps = 8/204 (3%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQ 139
           V+W PR F+Y  FLS  EC++LI LA       +   + +G +  + +  SSG  L    
Sbjct: 59  VSWRPRVFLYKGFLSDEECDHLISLASNSEDNPSRNSAGSGITVSTELLNSSGVILNTTD 118

Query: 140 DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLL 199
           D + R IE R+A +T +P +H    Q++ Y  G++    Y Y        +    MAT++
Sbjct: 119 DIVAR-IENRLAIWTLLPKDHSMPFQIMQYR-GEEAKHKYFYGNRSAMLPSSEPLMATVV 176

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
           +YLSD   GGE +FP +       ++W   S   K+   ++P +G+A+LF+S+  +A+ D
Sbjct: 177 LYLSDSASGGEILFPESKVK---SKFW---SGRRKKNNFLRPVKGNAILFFSVHLNASPD 230

Query: 260 PSSLHGGCPVIRGNKWSATKWMHV 283
            SS H   P+  G  W ATK++++
Sbjct: 231 KSSYHIRSPIRDGELWVATKFLYL 254


>gi|194764881|ref|XP_001964556.1| GF23245 [Drosophila ananassae]
 gi|190614828|gb|EDV30352.1| GF23245 [Drosophila ananassae]
          Length = 460

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 62/206 (30%), Positives = 103/206 (50%), Gaps = 27/206 (13%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E ++ +P   VYH+ + + E  +L+D        S    S  G+S+ S +RTS       
Sbjct: 263 EEISTDPYMVVYHDVIYENEINWLLD-------NSDFRTSLVGESQISTLRTSQDMPFGA 315

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL-DEFNTKNGGQRMA 196
               ++R IEKRI D T + M+  E   +++Y +G  Y  HYD+++  E      G+R+ 
Sbjct: 316 NSGEVMRNIEKRIKDMTGLSMDLSEDFMLINYGIGGTYKMHYDFYVYSEPLRFLRGERIV 375

Query: 197 TLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
           T+L YL DVE  G TVFP  N                   +S+ PK+G A++++++    
Sbjct: 376 TVLFYLGDVELSGSTVFPFLN-------------------ISITPKKGSAVMWYNLHNSG 416

Query: 257 TLDPSSLHGGCPVIRGNKWSATKWMH 282
            +   + H  CPV+ G+K+  TKW++
Sbjct: 417 DVHQKTQHCACPVVVGSKYVLTKWIN 442


>gi|198449508|ref|XP_002136911.1| GA26875 [Drosophila pseudoobscura pseudoobscura]
 gi|198130638|gb|EDY67469.1| GA26875 [Drosophila pseudoobscura pseudoobscura]
          Length = 516

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 69/212 (32%), Positives = 103/212 (48%), Gaps = 26/212 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E ++ +P   VYHN L  AE   +  + +P + +S V D K  +   S+ RT+ G +L  
Sbjct: 306 EELSLDPYIVVYHNVLCDAEIAEVERVTEPLLKRSVVFDGKENKMSTSKKRTALGAWLPD 365

Query: 138 GQ-----DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-LDEFNTKNG 191
                    +I+ I +RI + T + +   + +Q++ Y  G  YD H+DYF      TK  
Sbjct: 366 DNMDVSGRAVIQRIFRRIHELTGLIINDRQDMQLIKYGYGGHYDIHFDYFNTSSPITKAR 425

Query: 192 GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWS 251
           G RMAT+L YL+DV+ GG T F                       L V  +RG  L +++
Sbjct: 426 GDRMATVLFYLNDVKHGGSTAFTDLQ-------------------LKVPSERGKVLFWYN 466

Query: 252 MRPDA-TLDPSSLHGGCPVIRGNKWSATKWMH 282
           MR +   LD  +LHG CPVI G K   + W+H
Sbjct: 467 MRGETHDLDSRTLHGACPVIDGTKTILSCWIH 498


>gi|116008128|ref|NP_001036776.1| CG15539, isoform B [Drosophila melanogaster]
 gi|113194857|gb|ABI31220.1| CG15539, isoform B [Drosophila melanogaster]
          Length = 509

 Score =  106 bits (264), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 63/208 (30%), Positives = 107/208 (51%), Gaps = 22/208 (10%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+++ +P   ++H+ +S  +   + +L K  + ++  V SK G   +   RT+ GT+L  
Sbjct: 313 ELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLARTVTV-SKDGNYTEDPDRTTKGTWLVE 371

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMAT 197
             + +I+ + +   D T   +   +  QVL+Y +G  Y  H+D FL++    N   R+AT
Sbjct: 372 N-NALIQRLSQLTQDMTNFDIHDADPFQVLNYGIGGFYGIHFD-FLEDAELDNFSDRIAT 429

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
            + YLSDV +GG T+FP                   K GLSV PK+G ALL++++     
Sbjct: 430 AVFYLSDVPQGGATIFP-------------------KLGLSVFPKKGSALLWYNLDHKGD 470

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHE 285
            D  + H  CP + G++W  TKW++  E
Sbjct: 471 GDNRTAHSACPTVVGSRWVMTKWINERE 498


>gi|116008432|ref|NP_651804.2| CG15539, isoform A [Drosophila melanogaster]
 gi|66772391|gb|AAY55507.1| IP10910p [Drosophila melanogaster]
 gi|66772535|gb|AAY55579.1| IP10810p [Drosophila melanogaster]
 gi|113194858|gb|AAF57060.2| CG15539, isoform A [Drosophila melanogaster]
          Length = 386

 Score =  106 bits (264), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 63/208 (30%), Positives = 107/208 (51%), Gaps = 22/208 (10%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+++ +P   ++H+ +S  +   + +L K  + ++  V SK G   +   RT+ GT+L  
Sbjct: 190 ELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLARTVTV-SKDGNYTEDPDRTTKGTWLVE 248

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMAT 197
             + +I+ + +   D T   +   +  QVL+Y +G  Y  H+D FL++    N   R+AT
Sbjct: 249 N-NALIQRLSQLTQDMTNFDIHDADPFQVLNYGIGGFYGIHFD-FLEDAELDNFSDRIAT 306

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
            + YLSDV +GG T+FP                   K GLSV PK+G ALL++++     
Sbjct: 307 AVFYLSDVPQGGATIFP-------------------KLGLSVFPKKGSALLWYNLDHKGD 347

Query: 258 LDPSSLHGGCPVIRGNKWSATKWMHVHE 285
            D  + H  CP + G++W  TKW++  E
Sbjct: 348 GDNRTAHSACPTVVGSRWVMTKWINERE 375


>gi|256083648|ref|XP_002578053.1| prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
 gi|360044447|emb|CCD81995.1| putative prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
          Length = 584

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 65/209 (31%), Positives = 104/209 (49%), Gaps = 24/209 (11%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E +  +PR  ++++ +  +E E + +LA P + ++TV +  TG  + +  RTS   +L  
Sbjct: 373 ETLNPDPRIVMWYDLIFPSEIEKIKELATPRLRRATVKNPVTGILEIAFYRTSKSAWLPH 432

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFL----DEFNTKNGGQ 193
               I   I +RI   T + +E  E +QV +Y +G  Y  H+D+      D F  KN G 
Sbjct: 433 SMSEITDQISQRIRAVTGLSLETAEDLQVGNYGLGGHYAPHFDFGRKREKDAFEVKN-GN 491

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+AT++ YLSDV+ GG TVF                    + G  V PK+G A  ++++ 
Sbjct: 492 RIATIIFYLSDVQAGGATVF-------------------NRIGTRVVPKKGAAGFWFNLL 532

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           P+   D  + H  CPV+ G+KW    W H
Sbjct: 533 PNGEGDLRTRHAACPVLAGSKWVMNLWFH 561


>gi|194905410|ref|XP_001981191.1| GG11931 [Drosophila erecta]
 gi|190655829|gb|EDV53061.1| GG11931 [Drosophila erecta]
          Length = 537

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 98/201 (48%), Gaps = 20/201 (9%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P    +H+ LS  +   L ++A P M +STV     GQ K S  R S   +L       
Sbjct: 330 DPYVASFHDMLSPRKISQLREMAVPRMQRSTVNPRPGGQHKKSAFRVSKNAWLAYEAHPT 389

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNG-GQRMATLLMY 201
           + G+ + + D T +     E +QV +Y VG  Y+ H+D+F D  +     G R+AT + Y
Sbjct: 390 MAGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPSHYPAAEGNRIATAIFY 449

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           LS+VE+GG T FP  +                    +VKP+ G+ L ++++      D  
Sbjct: 450 LSEVEQGGATAFPFLD-------------------FAVKPQLGNVLFWYNLHRSLDKDYR 490

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H GCPV++G+KW    W+H
Sbjct: 491 TKHAGCPVLKGSKWIGNVWIH 511


>gi|397615311|gb|EJK63351.1| hypothetical protein THAOC_15991 [Thalassiosira oceanica]
          Length = 463

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 75/209 (35%), Positives = 103/209 (49%), Gaps = 39/209 (18%)

Query: 106 KPYMVKSTVVDSKTGQSKDSR---------VRTSSGTFLKRGQDRIIRGIEKRIADFTFI 156
           +PYM  +  V S +  + D+R          RTS  T++ R +D II  I +R AD   I
Sbjct: 267 EPYMNMTMKVLSWSTTAGDNREEARHDIRETRTSLNTWVYREKDLIIDAIYRRAADLLRI 326

Query: 157 ------PMEHGE------------GIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATL 198
                 P   GE             +Q++HYEVGQ+Y AH+D+    F+ K+   R ATL
Sbjct: 327 DEALLRPRSAGEVPEMKNTRGLAEALQLVHYEVGQEYTAHHDFGYAPFDRKDQPARFATL 386

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L+YL++   GGET FP         RW N  +   + GL V+PK G A+LF+S  PD  +
Sbjct: 387 LLYLNEGMVGGETQFP---------RWANAET---RAGLDVEPKIGKAVLFYSQLPDGNM 434

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEYK 287
           D  S H   PV  G KW    W+   EY+
Sbjct: 435 DDLSQHAARPVKIGEKWLMNLWVWDPEYQ 463


>gi|24651420|ref|NP_733374.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
 gi|7301952|gb|AAF57058.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
 gi|363987308|gb|AEW43896.1| FI16820p1 [Drosophila melanogaster]
          Length = 537

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 99/201 (49%), Gaps = 20/201 (9%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P    +H+ LS  +   L ++A P M +STV     GQ K S  R S   +L       
Sbjct: 330 DPYVATFHDILSPGKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWLAYESHPT 389

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN-TKNGGQRMATLLMY 201
           + G+ + + D T +     E +QV +Y VG  Y+ H+D+F D  +     G R+AT + Y
Sbjct: 390 MVGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFY 449

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           LS+VE+GG T FP  +                   ++VKP+ G+ L ++++      D  
Sbjct: 450 LSEVEQGGATAFPFLD-------------------IAVKPQLGNVLFWYNLHRSLDKDYR 490

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H GCPV++G+KW    W+H
Sbjct: 491 TKHAGCPVLKGSKWIGNVWIH 511


>gi|427410797|ref|ZP_18900999.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
           51230]
 gi|425710785|gb|EKU73805.1| hypothetical protein HMPREF9718_03473 [Sphingobium yanoikuyae ATCC
           51230]
          Length = 322

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 64/201 (31%), Positives = 102/201 (50%), Gaps = 27/201 (13%)

Query: 82  WEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTS-SGTFLKRGQD 140
           W+ R F    FL+  EC ++I   +  +  + V+D ++G+     +RTS  G F    +D
Sbjct: 138 WDVRLF--RQFLTGDECHHVISEGQALLEPAMVIDPRSGRPMPHPIRTSDGGIFGPARED 195

Query: 141 RIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATLLM 200
            +I+ I +RIA  +   +  GE + +L Y VGQ+Y  H+D      N     QR  T+L+
Sbjct: 196 LVIQAINRRIAAASGTMLSGGEPLTLLRYAVGQQYRQHHDCLPHVRN-----QRAWTMLI 250

Query: 201 YLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDP 260
           YL++   GGET+FP                   + GLSVK ++G+ALLF +         
Sbjct: 251 YLNEGYAGGETIFP-------------------RLGLSVKGRKGNALLFRNTDAQGQAAE 291

Query: 261 SSLHGGCPVIRGNKWSATKWM 281
           +++H G PV+ G KW  T+W+
Sbjct: 292 AAVHLGAPVMAGQKWLCTRWI 312


>gi|281362877|ref|NP_733393.3| CG31016, isoform B [Drosophila melanogaster]
 gi|442621939|ref|NP_001263119.1| CG31016, isoform C [Drosophila melanogaster]
 gi|272477249|gb|AAF57071.5| CG31016, isoform B [Drosophila melanogaster]
 gi|440218076|gb|AGB96498.1| CG31016, isoform C [Drosophila melanogaster]
          Length = 536

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 67/213 (31%), Positives = 102/213 (47%), Gaps = 28/213 (13%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E ++ +P    YHN LS AE E L  + KP++ ++ V   + G  +    R++ G +L  
Sbjct: 317 EELSLDPYVIFYHNVLSDAEIEKLKPMGKPFLERAKVFRVEKGSDEIDPSRSADGAWLPH 376

Query: 138 GQD-----RIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE-FNTKNG 191
                    ++  I +RI D T +    G  +Q L Y  G  +  HYDYF  + F+ +  
Sbjct: 377 QNIDPDDLEVLNRIGRRIEDMTGLNTRSGSKMQFLKYGFGGHFVPHYDYFNSKTFSLETV 436

Query: 192 GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWS 251
           G R+AT+L YL++V+ GG TVFP  N                   L+V  ++G A LFW 
Sbjct: 437 GDRIATVLFYLNNVDHGGATVFPKLN-------------------LAVPTQKGSA-LFWH 476

Query: 252 M--RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
              R     D  + HG CP+I G K   T+W++
Sbjct: 477 NIDRKSYDYDTRTFHGACPLISGTKLVMTRWIY 509


>gi|21358233|ref|NP_651814.1| prolyl-4-hydroxylase-alpha NE3 [Drosophila melanogaster]
 gi|20269810|gb|AAM18060.1|AF495538_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE3
           [Drosophila melanogaster]
 gi|15291443|gb|AAK92990.1| GH21465p [Drosophila melanogaster]
 gi|23172714|gb|AAN14251.1| prolyl-4-hydroxylase-alpha NE3 [Drosophila melanogaster]
 gi|220945610|gb|ACL85348.1| PH4alphaNE3-PA [synthetic construct]
 gi|220955396|gb|ACL90241.1| PH4alphaNE3-PA [synthetic construct]
          Length = 481

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 61/210 (29%), Positives = 101/210 (48%), Gaps = 29/210 (13%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E ++ EP   VYH+ L   + + LI LA+P +  + + D    +++ S  RT  G  L  
Sbjct: 299 EEISLEPHIVVYHDILPDKDIQQLITLAEPLLKPTEMFDDNKNEARSS-YRTPLGGPL-- 355

Query: 138 GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNG-GQRMA 196
                +  + +R+ D T + +  G  I ++ Y  G  Y  +YD+F    +   G G RMA
Sbjct: 356 -----LDSLTQRMRDITGLQIRQGNPINIIKYGFGAPYTNYYDFFKKRNSESKGFGDRMA 410

Query: 197 TLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDA 256
           T + YL+D   GG TVFP  N                   + V  +RG  L ++++  D 
Sbjct: 411 TFMFYLNDAPYGGATVFPRLN-------------------VKVPAERGKVLFWYNLNGDT 451

Query: 257 -TLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
             ++P+++H  CPV  G+KW  T W+H ++
Sbjct: 452 HDMEPTTMHAACPVFHGSKWVMTAWIHEYD 481


>gi|289662828|ref|ZP_06484409.1| hypothetical protein XcampvN_06993, partial [Xanthomonas campestris
           pv. vasculorum NCPPB 702]
          Length = 301

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 65/203 (32%), Positives = 95/203 (46%), Gaps = 23/203 (11%)

Query: 89  YHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRG-QDRIIRGIE 147
           Y   LS  EC  L+ LA+P++  S V+D     ++ + VRTS G  L    +D   R  +
Sbjct: 116 YAGVLSADECRLLMLLARPHLRDSQVIDPNDASTQRAPVRTSRGATLDPIIEDFAARVAQ 175

Query: 148 KRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD---EFNTKNGGQRMATLLMYLSD 204
            R+A    + + H E + VL Y  G++Y AH DY        +  N G R  T+ +YL+ 
Sbjct: 176 ARLAACAQLTLTHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADHPNAGNRQRTVCVYLNV 235

Query: 205 VEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLH 264
           V+ GGET FP A                   G+ V+P+ G  + F ++  D   +  SLH
Sbjct: 236 VDAGGETEFPLA-------------------GVRVQPRPGALVCFDNLHADGRPNADSLH 276

Query: 265 GGCPVIRGNKWSATKWMHVHEYK 287
            G PV  G+KW  T W     Y+
Sbjct: 277 AGLPVTAGSKWLGTLWFRQQRYR 299


>gi|312092237|ref|XP_003147267.1| hypothetical protein LOAG_11701 [Loa loa]
          Length = 553

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 62/208 (29%), Positives = 101/208 (48%), Gaps = 21/208 (10%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
            EIV   P A ++H+ +S  E   +  LA P + ++TV + +TG  + +  R S   +L+
Sbjct: 320 VEIVYQNPLAVLFHDIMSDEESRIIEMLAVPKLDRATVHNVETGNLETASYRISKSAWLR 379

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE--FNTKNGGQR 194
             +  ++  I +R+   T + +   E +QV +Y +G  Y+ H D   DE  F     G R
Sbjct: 380 STEHEVVNRINRRLDLATNLEIATAEELQVQNYGIGGHYEPHLDCSRDEDAFERTGTGNR 439

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
           +AT+L+Y+++ E GG TVF    A                   SV   +  AL ++++  
Sbjct: 440 IATILIYMTEPEIGGRTVFINLKA-------------------SVPCTKNAALFWYNLMR 480

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWMH 282
              +D  S H  CPV+ G KW+A KW H
Sbjct: 481 SGAVDMRSYHAACPVLTGTKWTANKWFH 508


>gi|227553849|gb|ACP40552.1| IP22178p [Drosophila melanogaster]
          Length = 467

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 99/201 (49%), Gaps = 20/201 (9%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P    +H+ LS  +   L ++A P M +STV     GQ K S  R S   +L       
Sbjct: 260 DPYVATFHDILSPGKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWLAYESHPT 319

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN-TKNGGQRMATLLMY 201
           + G+ + + D T +     E +QV +Y VG  Y+ H+D+F D  +     G R+AT + Y
Sbjct: 320 MVGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFY 379

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           LS+VE+GG T FP  +                   ++VKP+ G+ L ++++      D  
Sbjct: 380 LSEVEQGGATAFPFLD-------------------IAVKPQLGNVLFWYNLHRSLDKDYR 420

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H GCPV++G+KW    W+H
Sbjct: 421 TKHAGCPVLKGSKWIGNVWIH 441


>gi|159884097|gb|ABX00727.1| IP12176p [Drosophila melanogaster]
          Length = 538

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 67/213 (31%), Positives = 102/213 (47%), Gaps = 28/213 (13%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E ++ +P    YHN LS AE E L  + KP++ ++ V   + G  +    R++ G +L  
Sbjct: 319 EELSLDPYVIFYHNVLSDAEIEKLKPMGKPFLERAKVFRVEKGSDEIDPSRSADGAWLPH 378

Query: 138 GQD-----RIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDE-FNTKNG 191
                    ++  I +RI D T +    G  +Q L Y  G  +  HYDYF  + F+ +  
Sbjct: 379 QNIDPDDLEVLNRIGRRIEDMTGLNTRSGSKMQFLKYGFGGHFVPHYDYFNSKTFSLETV 438

Query: 192 GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWS 251
           G R+AT+L YL++V+ GG TVFP  N                   L+V  ++G A LFW 
Sbjct: 439 GDRIATVLFYLNNVDHGGATVFPKLN-------------------LAVPTQKGSA-LFWH 478

Query: 252 M--RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
              R     D  + HG CP+I G K   T+W++
Sbjct: 479 NIDRKSYDYDTRTFHGACPLISGTKLVMTRWIY 511


>gi|292621357|ref|XP_691737.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Danio rerio]
          Length = 538

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 61/213 (28%), Positives = 108/213 (50%), Gaps = 26/213 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           EI++ +P   ++H F+++AE + +   A P + +S V       + + R+  S   +LK 
Sbjct: 334 EIISLQPYVVLFHGFVTQAEAKNIRKYAMPGLRRSVVASGMNQATAEYRI--SKSAWLKE 391

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDE---FNTKNGG 192
               ++  +++RI   T + ++  + E +QV++Y +G  Y+ H+D+   +         G
Sbjct: 392 SAHEVVGKLDQRITLVTGLNVQPPYAEYLQVVNYGIGGHYEPHFDHATSDSSPLYRLKTG 451

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT+++YLS V+ GG T F  AN                    SV   +  AL +W++
Sbjct: 452 NRVATIMIYLSPVQAGGSTAFIYAN-------------------FSVPVVQNAALFWWNL 492

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
             +   +  +LH GCPVI GNKW A KW+H +E
Sbjct: 493 HKNGQGNVDTLHAGCPVIVGNKWVANKWVHEYE 525


>gi|47227817|emb|CAG08980.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 285

 Score =  105 bits (263), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 66/215 (30%), Positives = 109/215 (50%), Gaps = 30/215 (13%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E ++ +P   +YH+F+S  E E +   A+  + +S V       + + R+  S   +LK 
Sbjct: 81  ETLSLQPYVVLYHDFISDTEAEEIKHHAQLGLRRSVVATRDKQVTAEYRI--SKSAWLKG 138

Query: 138 GQDRIIRGIEKRIADFTFIPMEH--GEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNG 191
                +  +++RI+  T + ++H  GE +QV++Y +G  Y+ H+D+        F  K G
Sbjct: 139 SAQSAVSRLDQRISMLTGLNVQHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKLKTG 198

Query: 192 GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWS 251
             R+AT+++YLS VE GG T F  AN                    SV   +  A+ +W+
Sbjct: 199 -NRVATVMIYLSSVEAGGSTAFIYAN-------------------FSVPVMKNAAIFWWN 238

Query: 252 MRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           +  +   DP +LH GCPV+ G+KW A KW  +HEY
Sbjct: 239 LHRNGRGDPDTLHAGCPVLIGDKWVANKW--IHEY 271


>gi|73988166|ref|XP_851718.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Canis lupus
           familiaris]
          Length = 544

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 62/210 (29%), Positives = 106/210 (50%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP   +YH+F++  E + +  LA+P++ +S V   +     + R+  S+  +LK 
Sbjct: 340 EVIHLEPYVVLYHDFVNDVEAQKIRGLAEPWLQRSVVASGEKQLPVEYRISKSA--WLKD 397

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++  ++ RI   T + ++  + E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 398 TVDPLLVTLDHRIGALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPTSPLYRMKSG 457

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F  AN                    SV   +  AL +W++
Sbjct: 458 NRVATFMIYLSSVEAGGATAFIYAN-------------------FSVPVVKNAALFWWNL 498

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH GCPV+ G+KW A KW+H
Sbjct: 499 HRSGEGDGDTLHAGCPVLVGDKWVANKWIH 528


>gi|47218149|emb|CAG10069.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 595

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 66/225 (29%), Positives = 104/225 (46%), Gaps = 45/225 (20%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVV----------------------DSKTGQ 121
           P    Y + +S  E E +  LAKP + ++T+                       D +TG+
Sbjct: 374 PYIVRYLDIISDKEIELVKQLAKPRLRRATISNPITGVLETASYRISKRRATVHDPQTGK 433

Query: 122 SKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDY 181
              ++ R S   +L   +  +I  I +RI D T + ++  E +QV +Y VG +Y+ H+D+
Sbjct: 434 LTTAQYRVSKSAWLTGYEHPVIETINQRIEDLTGLEVDTAEELQVANYGVGGQYEPHFDF 493

Query: 182 FL----DEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGL 237
                 D F     G R+AT L Y+SDV  GG TVFP                     G 
Sbjct: 494 GRKDEPDAFKELGTGNRIATWLFYMSDVAAGGATVFPDV-------------------GA 534

Query: 238 SVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           +V P++G A+ ++++      D S+ H  CPV+ GNKW + KW+H
Sbjct: 535 AVWPQKGSAVFWYNLFTSGEGDYSTRHAACPVLVGNKWVSNKWIH 579


>gi|393718270|ref|ZP_10338197.1| putative oxygenase [Sphingomonas echinoides ATCC 14820]
          Length = 226

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 69/203 (33%), Positives = 101/203 (49%), Gaps = 29/203 (14%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           E + F + +FL  A C+ L+ L      +STV+  ++ Q      RTS    + R    +
Sbjct: 39  EAQIFYHPDFLDAATCDRLVALIDANRRRSTVLAEESVQD----FRTSDSCDMDRWSPDV 94

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN-----TKNGGQRMAT 197
            R  ++ IAD   I   HGE +Q   Y VGQ + AH+DYF +         + GGQR  T
Sbjct: 95  -RPTDEAIADLLGIDPVHGETMQGQRYAVGQHFRAHFDYFNEAQAYWPKMVETGGQRTWT 153

Query: 198 LLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDAT 257
            ++YL+DVEEGG T FP                     G+ V PK+G  L + +M+PD  
Sbjct: 154 AMIYLNDVEEGGATWFPTI-------------------GIRVAPKKGLLLTWNNMKPDGD 194

Query: 258 LDPSSLHGGCPVIRGNKWSATKW 280
            + ++LH G PV++G K+  TKW
Sbjct: 195 RNTATLHEGMPVVQGTKYIVTKW 217


>gi|198449504|ref|XP_002136909.1| GA26876 [Drosophila pseudoobscura pseudoobscura]
 gi|198130636|gb|EDY67467.1| GA26876 [Drosophila pseudoobscura pseudoobscura]
          Length = 527

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 68/212 (32%), Positives = 104/212 (49%), Gaps = 26/212 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E ++ +P   VYHN LS AE   +  + +P + +S V D K  +   S+ RT+ G +L  
Sbjct: 317 EELSLDPYIVVYHNVLSDAEIAEVERVTEPLLKRSVVFDGKGNKMSTSKRRTALGAWLPD 376

Query: 138 GQ-----DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN-TKNG 191
                    +I+ I +RI + T + +   + +Q++ Y  G  YD H+DYF      TK  
Sbjct: 377 DNMDVSGRAVIQRIFRRIHELTGLIINDRQDMQLIKYGYGGHYDIHFDYFNTSTPITKAR 436

Query: 192 GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWS 251
           G RMAT+L YL+D++ GG T F                       L V  +RG  L +++
Sbjct: 437 GDRMATVLFYLNDMKHGGSTAFTDLQ-------------------LKVPSERGKVLFWYN 477

Query: 252 MRPDA-TLDPSSLHGGCPVIRGNKWSATKWMH 282
           MR +   +D  +LHG CPVI G K   + W+H
Sbjct: 478 MRGETHDVDSRTLHGACPVINGTKTILSCWIH 509


>gi|195145080|ref|XP_002013524.1| GL24183 [Drosophila persimilis]
 gi|194102467|gb|EDW24510.1| GL24183 [Drosophila persimilis]
          Length = 296

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 67/210 (31%), Positives = 104/210 (49%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTS-SGTFLK 136
           EI + +P   +YH+ L  AE + LID  +  M +S +V  +  Q + S  RTS    F +
Sbjct: 76  EIFSHDPYVVIYHDVLYDAEMQGLIDSTRRRMSRS-MVQYEIRQIEISEQRTSKEAPFTE 134

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF-----LDEFNTKNG 191
           +   ++++ I  R+ D T   M   E + +L Y+ G  +D H DY        E+     
Sbjct: 135 KNDPQLLKRIYDRLKDMTGCDMLRSEHLSILLYDQGGHHDPHVDYHDLYWHPQEYEYHPF 194

Query: 192 GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWS 251
           G R A+++ YL+DVE+GGETVFP                   K  L + P +G AL++ +
Sbjct: 195 GDRQASVVFYLNDVEDGGETVFP-------------------KLQLVIPPTKGSALMWHN 235

Query: 252 MRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
           +RP    DP + H  CPV+ G K  A +W+
Sbjct: 236 LRPWGEGDPRTQHASCPVLSGYKQVAIQWI 265


>gi|20269818|gb|AAM18064.1| prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE1
           [Drosophila melanogaster]
          Length = 286

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 99/201 (49%), Gaps = 20/201 (9%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P    +H+ LS  +   L ++A P M +STV     GQ K S  R S   +L       
Sbjct: 79  DPYVATFHDILSPGKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWLAYESHPT 138

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN-TKNGGQRMATLLMY 201
           + G+ + + D T +     E +QV +Y VG  Y+ H+D+F D  +     G R+AT + Y
Sbjct: 139 MVGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFY 198

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           LS+VE+GG T FP  +                   ++VKP+ G+ L ++++      D  
Sbjct: 199 LSEVEQGGATAFPFLD-------------------IAVKPQLGNVLFWYNLHRSLDKDYR 239

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H GCPV++G+KW    W+H
Sbjct: 240 TKHAGCPVLKGSKWIGNVWIH 260


>gi|195064500|ref|XP_001996577.1| GH12091 [Drosophila grimshawi]
 gi|193895397|gb|EDV94263.1| GH12091 [Drosophila grimshawi]
          Length = 521

 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 63/215 (29%), Positives = 107/215 (49%), Gaps = 38/215 (17%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVR------TSS 131
           E +  +P   +YHN +  +E E +  L+ P M    +   K  Q+K + +         +
Sbjct: 320 EEINHDPYVVMYHNVIYDSEIEEMKRLS-PQMQNGYIHGYKANQTKVTDIAARVNWLVEN 378

Query: 132 GTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNG 191
             FL+R        + +RI D T   ++    +QV ++ +G  ++AHYDY   +   K  
Sbjct: 379 TPFLER--------MNQRITDMTGFDLKEFPSVQVANFGIGNNFEAHYDYIFGKRVRKED 430

Query: 192 ----GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDAL 247
               G R+A+++ Y SDV  GG TVFP                      ++V+P++G++L
Sbjct: 431 VGDLGDRLASIIFYSSDVPLGGATVFPDIQ-------------------VAVQPQKGNSL 471

Query: 248 LFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           L++++  D T DP SLH  CPV+ G++W+ TKW+H
Sbjct: 472 LWYNLFDDGTPDPRSLHSVCPVVVGSRWTLTKWLH 506


>gi|21107513|gb|AAM36222.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
           str. 306]
          Length = 273

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 79/260 (30%), Positives = 110/260 (42%), Gaps = 41/260 (15%)

Query: 37  MGIFYIPIGDDDSP-PNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSK 95
           +G+  +P  D   P P D T+  R AF  R                  PR   Y   LS 
Sbjct: 44  LGMTALPAVDIAPPDPADDTADHRIAFAPRVGPVRRH---------TAPRIEEYAAVLSA 94

Query: 96  AECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTF 155
            EC  L+ LA+P++  S V+D     ++ + +RTS G  L    D II     R A    
Sbjct: 95  DECRLLMLLARPHLRASKVIDPNDASTQRAPIRTSRGATL----DPIIEDFAARAAQARL 150

Query: 156 -----IPMEHGEGIQVLHYEVGQKYDAHYDYFLD---EFNTKNGGQRMATLLMYLSDVEE 207
                +P+ H E + VL Y  G++Y AH DY        +    G R  T+ +YL+DV  
Sbjct: 151 AACAQLPLAHAEPLSVLCYAPGEQYRAHRDYLPPGTIAADRPTAGNRQRTVCVYLNDVGA 210

Query: 208 GGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGC 267
           GG+T FP A                   G+ V+P+ G  + F ++  D   D  SLH G 
Sbjct: 211 GGDTEFPIA-------------------GVRVRPRPGTLVCFDNLHADGRPDADSLHAGL 251

Query: 268 PVIRGNKWSATKWMHVHEYK 287
           PV  G+KW  T W     Y+
Sbjct: 252 PVTAGSKWLGTLWFRQQRYR 271


>gi|224007761|ref|XP_002292840.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220971702|gb|EED90036.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 490

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 77/231 (33%), Positives = 110/231 (47%), Gaps = 33/231 (14%)

Query: 71  EKGEQWTE--IVAWEPRAFVYHNFLSKAECEYLIDLA--KPYMVKSTVVDSKTGQSKDSR 126
           EKG + TE   ++  P    + NFL+  EC  +I L     Y     V + +   S DS 
Sbjct: 273 EKGRELTEENDMSQPPWIITFDNFLTDEECNQMIQLGYKAKYERSKDVGEMQIDGSYDSV 332

Query: 127 V---RTSSGTFL----KRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHY 179
           V   RTS   +     K       + I  RI+  T IP  H E  Q+L YE GQ Y +H+
Sbjct: 333 VSKGRTSENAWCSFRDKCRNTTTAQLIHDRISTVTGIPANHSEDFQILKYEKGQFYRSHH 392

Query: 180 DYFLDEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSV 239
           DY ++    +  G R+ T  +YLSDVEEGG+T FP                   K  ++V
Sbjct: 393 DY-IEHQEKRRCGPRVLTFFLYLSDVEEGGDTNFP-------------------KLSIAV 432

Query: 240 KPKRGDALLFWSM--RPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEYKA 288
           KPK+G A+L+ S+     +  DP + H    V+ G K+ A  W+H+H+Y A
Sbjct: 433 KPKKGSAVLWPSVLDSNPSMKDPRTDHEAQEVVNGTKFGANAWLHLHDYMA 483


>gi|407699315|ref|YP_006824102.1| prolyl 4-hydroxylase subunit alpha [Alteromonas macleodii str.
           'Black Sea 11']
 gi|407248462|gb|AFT77647.1| Prolyl 4-hydroxylase alpha subunit [Alteromonas macleodii str.
           'Black Sea 11']
          Length = 354

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 77/227 (33%), Positives = 107/227 (47%), Gaps = 24/227 (10%)

Query: 59  RRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSK 118
           R A E+ SS A  K    TE++       +Y + LS+ EC YLI      +  S VVD  
Sbjct: 130 RLAVEEISS-ALGKIYAPTEVLDQTLPVELYVDVLSEYECAYLITKFSSLLQPSMVVDPL 188

Query: 119 TGQSKDSRVRTSSGTFLKRGQ-DRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDA 177
           TG  K   VRTS    +     D I R ++K I+  T  P  +GE + +L Y  GQ+Y  
Sbjct: 189 TGNGKVDNVRTSYVAIIAPSYCDWITRKLDKVISQVTHTPRCNGEALNLLRYTPGQQYKP 248

Query: 178 HYDYFLDEFNT---KNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGK 234
           HYD   ++ +    K+G QR+ T L+YL+ V +GGET FP                   K
Sbjct: 249 HYDALNEDHDGSMYKDGKQRIKTALVYLNTVRQGGETRFP-------------------K 289

Query: 235 QGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWM 281
             +SV P  G+ ++F +      L  +S H G P    NKW  TKW+
Sbjct: 290 LDISVSPTLGNMVVFSNSDESGKLLLNSYHLGAPTFSENKWLVTKWI 336


>gi|198452400|ref|XP_002137470.1| GA26529 [Drosophila pseudoobscura pseudoobscura]
 gi|198131917|gb|EDY68028.1| GA26529 [Drosophila pseudoobscura pseudoobscura]
          Length = 348

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 67/207 (32%), Positives = 104/207 (50%), Gaps = 23/207 (11%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTS-SGTFLK 136
           EI + +P   +YH+ L  AE + LID  +  M +S +V  +  Q + S  RTS    F +
Sbjct: 144 EIFSHDPYVVIYHDVLYDAEMQGLIDSTRRRMSRS-MVQYEIRQIEISEQRTSKEAPFTE 202

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDY--FLDEFNTKNGGQR 194
           +   ++++ I  R+ D T   M   E + +L Y+ G  +D H DY     E+     G R
Sbjct: 203 KNDPQLLKRIYDRLKDMTGCDMLRSEHLSILLYDQGGHHDPHVDYHDLYWEYEYHPFGDR 262

Query: 195 MATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRP 254
            A+++ YL+DVE+GGETVFP                   K  L + P +G AL++ ++RP
Sbjct: 263 QASVVFYLNDVEDGGETVFP-------------------KLQLVIPPTKGSALMWHNLRP 303

Query: 255 DATLDPSSLHGGCPVIRGNKWSATKWM 281
               DP + H  CPV+ G K  A +W+
Sbjct: 304 WGEGDPRTQHASCPVLSGYKQVAIQWI 330


>gi|410972729|ref|XP_003992809.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Felis catus]
          Length = 533

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/210 (29%), Positives = 105/210 (50%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E++  EP   +YH+F++  E + +  LA+P++ +S V   +     + R+  S   +LK 
Sbjct: 329 EVIHLEPYVVLYHDFVNDLEAQKIRGLAEPWLQRSVVASGEKQLPVEYRI--SKSAWLKD 386

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++  ++ RI   T + ++  + E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 387 TVDPLLVTLDHRIGALTGLDVQPPYAEYLQVVNYGIGGHYEPHFDHATSPTSPLYRMKSG 446

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F  AN                    SV   +  AL +W++
Sbjct: 447 NRVATFMIYLSSVEAGGATAFIYAN-------------------FSVPVVKNAALFWWNL 487

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH GCPV+ G+KW A KW+H
Sbjct: 488 HRSGEGDGDTLHAGCPVLVGDKWVANKWIH 517


>gi|289526401|gb|ADD01323.1| FI13021p [Drosophila melanogaster]
 gi|373432715|gb|AEY70761.1| FI17809p1 [Drosophila melanogaster]
          Length = 193

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 61/183 (33%), Positives = 93/183 (50%), Gaps = 32/183 (17%)

Query: 110 VKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHY 169
           +K+T +  +    K +R RT+ G +LK+  + + + I +RI D T   +   EG QV++Y
Sbjct: 10  MKNTKIHKERAVPKKNRGRTAKGFWLKKESNELTKRITRRIMDMTGFDLADSEGFQVINY 69

Query: 170 EVGQKYDAHYDYFLDEFNTKNG-----------GQRMATLLMYLSDVEEGGETVFPAANA 218
            +G  Y  H DYF  +F + N            G R+AT+L YL+DVE+GG TVF     
Sbjct: 70  GIGGHYFLHMDYF--DFASSNHTDTRSRYSIDLGDRIATVLFYLTDVEQGGATVF----- 122

Query: 219 NFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSAT 278
                         G  G  V P+ G A+ ++++  D   DP + H  CPVI G+KW  T
Sbjct: 123 --------------GDVGYYVSPQAGTAIFWYNLDTDGNGDPRTRHAACPVIVGSKWVMT 168

Query: 279 KWM 281
           +W+
Sbjct: 169 EWI 171


>gi|81870817|sp|Q6W3F0.1|P4HA3_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|36962749|gb|AAQ87604.1| collagen prolyl 4-hydroxylase alpha III subunit [Mus musculus]
          Length = 542

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 63/210 (30%), Positives = 105/210 (50%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+V   P   +YH+F+S  E + + +LA+P++ +S V   +     + R+  S   +LK 
Sbjct: 338 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVASGEKQLQVEYRI--SKSAWLKD 395

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++  ++ RIA  T + ++  + E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 396 TVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 455

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F   N                    SV   +  AL +W++
Sbjct: 456 NRVATFMIYLSSVEAGGATAFIYGN-------------------FSVPVVKNAALFWWNL 496

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH GCPV+ G+KW A KW+H
Sbjct: 497 HRSGEGDGDTLHAGCPVLVGDKWVANKWIH 526


>gi|195575099|ref|XP_002105517.1| GD17024 [Drosophila simulans]
 gi|194201444|gb|EDX15020.1| GD17024 [Drosophila simulans]
          Length = 537

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/201 (30%), Positives = 99/201 (49%), Gaps = 20/201 (9%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P    +H+ LS  +   L ++A P M +STV     GQ K S  R S   +L       
Sbjct: 330 DPYVATFHDMLSPRKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWLAYESHPT 389

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN-TKNGGQRMATLLMY 201
           + G+ + + D T +     E +QV +Y VG  Y+ H+D+F D  +     G R+AT + Y
Sbjct: 390 MVGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFY 449

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           LS+VE+GG T FP  +                   ++VKP+ G+ L ++++      D  
Sbjct: 450 LSEVEQGGATAFPFLD-------------------IAVKPQLGNVLFWYNLHRSLDKDYR 490

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H GCPV++G+KW    W+H
Sbjct: 491 TKHAGCPVLKGSKWIGNVWIH 511


>gi|52139015|gb|AAH82538.1| P4ha3 protein [Mus musculus]
          Length = 404

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 63/210 (30%), Positives = 106/210 (50%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+V   P   +YH+F+S  E + + +LA+P++ +S V   +     + R+  S+  +LK 
Sbjct: 200 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVASGEKQLQVEYRISKSA--WLKD 257

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++  ++ RIA  T + ++  + E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 258 TVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 317

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F   N                    SV   +  AL +W++
Sbjct: 318 NRVATFMIYLSSVEAGGATAFIYGN-------------------FSVPVVKNAALFWWNL 358

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH GCPV+ G+KW A KW+H
Sbjct: 359 HRSGEGDGDTLHAGCPVLVGDKWVANKWIH 388


>gi|195159313|ref|XP_002020526.1| GL14040 [Drosophila persimilis]
 gi|194117295|gb|EDW39338.1| GL14040 [Drosophila persimilis]
          Length = 549

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 62/207 (29%), Positives = 106/207 (51%), Gaps = 22/207 (10%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
            E ++ +P    +H+ LS  E E +I+  K  + +S +   +TG S  S +RTS  T+L 
Sbjct: 339 VEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQVTRSEI--GQTGNSTVSEIRTSQNTWLW 396

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD-EFNTKNGGQRM 195
              +  +  I++R+ D T +  +  E +Q+++Y +G +Y+ H+D+  D E N    G R+
Sbjct: 397 YENNPWLADIKQRLEDITGLSTDTAEPLQLVNYGIGGQYEPHFDFMDDAEKNFGWKGNRL 456

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
            T L YL+DV  GG T FP  +                   L+V P +G  L+++++   
Sbjct: 457 LTALFYLNDVPLGGATAFPFLH-------------------LAVPPVKGSLLVWYNLHRS 497

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMH 282
              D  + H GCPV++G+KW   +W H
Sbjct: 498 LHKDFRTKHAGCPVLKGSKWICNEWFH 524


>gi|30686940|ref|NP_194290.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
 gi|26451153|dbj|BAC42680.1| unknown protein [Arabidopsis thaliana]
 gi|29893542|gb|AAP06823.1| unknown protein [Arabidopsis thaliana]
 gi|332659681|gb|AEE85081.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
          Length = 291

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 65/207 (31%), Positives = 109/207 (52%), Gaps = 28/207 (13%)

Query: 80  VAWEPRAFVYHNFLSKAECEYLIDLAKPYM-VKSTVVDSKTGQSKDSRVRTSSGTFLKRG 138
           ++W PR F+Y  FLS+ EC++LI L K    V S   D KT                   
Sbjct: 62  LSWLPRVFLYRGFLSEEECDHLISLRKETTEVYSVDADGKTQL----------------- 104

Query: 139 QDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQRMATL 198
            D ++ GIE++++ +TF+P E+G  I+V  Y   +K     DYF +E ++      +AT+
Sbjct: 105 -DPVVAGIEEKVSAWTFLPGENGGSIKVRSY-TSEKSGKKLDYFGEEPSSVLHESLLATV 162

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           ++YLS+  +GGE +FP +             + C + G  ++P +G+A+LF++   +A+L
Sbjct: 163 VLYLSNTTQGGELLFPNSE--------MKPKNSCLEGGNILRPVKGNAILFFTRLLNASL 214

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHE 285
           D  S H  CPV++G    ATK ++  +
Sbjct: 215 DGKSTHLRCPVVKGELLVATKLIYAKK 241


>gi|227908832|ref|NP_796135.3| prolyl 4-hydroxylase subunit alpha-3 precursor [Mus musculus]
          Length = 542

 Score =  105 bits (261), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 63/210 (30%), Positives = 105/210 (50%), Gaps = 26/210 (12%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+V   P   +YH+F+S  E + + +LA+P++ +S V   +     + R+  S   +LK 
Sbjct: 338 EVVHLRPLIALYHDFVSDEEAQKIRELAEPWLQRSVVASGEKQLQVEYRI--SKSAWLKD 395

Query: 138 GQDRIIRGIEKRIADFTFIPME--HGEGIQVLHYEVGQKYDAHYDYFLDEFNT---KNGG 192
             D ++  ++ RIA  T + ++  + E +QV++Y +G  Y+ H+D+     +       G
Sbjct: 396 TVDPMLVTLDHRIAALTGLDIQPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSG 455

Query: 193 QRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSM 252
            R+AT ++YLS VE GG T F   N                    SV   +  AL +W++
Sbjct: 456 NRVATFMIYLSSVEAGGATAFIYGN-------------------FSVPVVKNAALFWWNL 496

Query: 253 RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                 D  +LH GCPV+ G+KW A KW+H
Sbjct: 497 HRSGEGDGDTLHAGCPVLVGDKWVANKWIH 526


>gi|198449643|ref|XP_001357664.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
 gi|198130698|gb|EAL26798.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
          Length = 549

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 62/207 (29%), Positives = 106/207 (51%), Gaps = 22/207 (10%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLK 136
            E ++ +P    +H+ LS  E E +I+  K  + +S +   +TG S  S +RTS  T+L 
Sbjct: 339 VEQLSGDPYVAYFHDVLSDKESEQIIEHGKGQVTRSEI--GQTGNSTVSDIRTSQNTWLW 396

Query: 137 RGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD-EFNTKNGGQRM 195
              +  +  I++R+ D T +  +  E +Q+++Y +G +Y+ H+D+  D E N    G R+
Sbjct: 397 YENNPWLADIKQRLEDITGLSTDTAEPLQLVNYGIGGQYEPHFDFMDDAEKNFGWKGNRL 456

Query: 196 ATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPD 255
            T L YL+DV  GG T FP  +                   L+V P +G  L+++++   
Sbjct: 457 LTALFYLNDVPLGGATAFPFLH-------------------LAVPPVKGSLLVWYNLHRS 497

Query: 256 ATLDPSSLHGGCPVIRGNKWSATKWMH 282
              D  + H GCPV++G+KW   +W H
Sbjct: 498 LHKDFRTKHAGCPVLKGSKWICNQWFH 524


>gi|195441323|ref|XP_002068462.1| GK20483 [Drosophila willistoni]
 gi|194164547|gb|EDW79448.1| GK20483 [Drosophila willistoni]
          Length = 550

 Score =  104 bits (260), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 68/214 (31%), Positives = 102/214 (47%), Gaps = 32/214 (14%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E ++ +P    YH+ LS  E E L    K   +K T+++  T     +     S T + R
Sbjct: 342 EEISLDPYIVQYHDVLSDNEIEDL----KREGIKGTMINGWTSLKSSNATENESRTIVAR 397

Query: 138 -----GQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNT 188
                    I++ I +RI D T   +E  + IQ+  + VG  +  HYDY     LD    
Sbjct: 398 VAIMSPSLEIVQRINRRIIDMTGFNIEESKTIQLAAFSVGGFFMPHYDYLYDRLLDTDVL 457

Query: 189 KNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALL 248
           K  G R+A+++ Y  DV EGG T FP                   +  L V+PK+G AL 
Sbjct: 458 KKLGDRVASVIFYAGDVTEGGATNFP-------------------RNQLVVQPKKGSALF 498

Query: 249 FWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           +++   D + DP SLH  CPV+ G++W+ TKW+H
Sbjct: 499 WYNKFDDGSPDPRSLHSICPVVVGSRWTITKWIH 532


>gi|195505214|ref|XP_002099407.1| GE23379 [Drosophila yakuba]
 gi|194185508|gb|EDW99119.1| GE23379 [Drosophila yakuba]
          Length = 547

 Score =  104 bits (259), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 63/208 (30%), Positives = 105/208 (50%), Gaps = 22/208 (10%)

Query: 77  TEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKD--SRVRTSSGTF 134
           TEI++ +P   ++H+ +S+ E   +   +K +M+ S   D     S+D  +  RTS   +
Sbjct: 330 TEILSIDPFVLLFHDMISQKESTLIRSSSKEHMLPSATTDVDASGSEDHVATFRTSKSVW 389

Query: 135 LKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQ- 193
                +   + I +R+ D T + M   E  QV++Y +G  ++ H D  L + +  NG + 
Sbjct: 390 YSSTSNDTTKRITERLGDATGLDMNFTEYFQVINYGLGGFFETHLDMLLSDRSRFNGTRD 449

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+AT L YL++V +GG T FP  N                   L+V P+ G AL ++++ 
Sbjct: 450 RLATTLFYLNEVRQGGGTHFPRLN-------------------LTVFPQPGSALFWYNLD 490

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWM 281
                  S+LH GCPVI G+KW  +KW+
Sbjct: 491 TRGNDHTSTLHTGCPVIVGSKWVMSKWV 518


>gi|432109537|gb|ELK33711.1| Prolyl 4-hydroxylase subunit alpha-2 [Myotis davidii]
          Length = 555

 Score =  104 bits (259), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 63/223 (28%), Positives = 105/223 (47%), Gaps = 43/223 (19%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E + + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 336 PHIVRYYDVMSDEEIQRIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDY------------------FL-- 183
             + +R+   T + ++  E +QV +Y +G +Y+ H+D+                  FL  
Sbjct: 396 ARVNRRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNY 455

Query: 184 ----DEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSV 239
               D F     G R+AT L Y+SDVE GG TVFP                     G ++
Sbjct: 456 NDEQDVFKHLGTGNRVATFLNYMSDVEAGGATVFP-------------------DLGAAI 496

Query: 240 KPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
            PK+G A+ ++++      D  + H  CPV+ G KW + KW H
Sbjct: 497 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 539


>gi|184185444|gb|ACC68850.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Rhinolophus ferrumequinum]
          Length = 555

 Score =  104 bits (259), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 64/223 (28%), Positives = 105/223 (47%), Gaps = 43/223 (19%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+  +D ++
Sbjct: 336 PHIVRYYDVMSDEEIEKIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEETEDPVV 395

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDY------------------FL-- 183
             +  R+   T + ++  E +QV +Y +G +Y+ H+D+                  FL  
Sbjct: 396 ARLNLRMQHITGLSVKTAELLQVANYGMGGQYEPHFDFSRRPFDNGLKTEGNRLATFLNY 455

Query: 184 ----DEFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSV 239
               D F     G R+AT L Y+SDVE GG TVFP                     G ++
Sbjct: 456 NDEHDVFKHLGTGNRVATFLNYMSDVEAGGATVFP-------------------DLGAAI 496

Query: 240 KPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
            PK+G A+ ++++      D  + H  CPV+ G KW + KW H
Sbjct: 497 WPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWVSNKWFH 539


>gi|156333122|ref|XP_001619372.1| hypothetical protein NEMVEDRAFT_v1g151555 [Nematostella vectensis]
 gi|156202442|gb|EDO27272.1| predicted protein [Nematostella vectensis]
          Length = 144

 Score =  104 bits (259), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 51/149 (34%), Positives = 83/149 (55%), Gaps = 19/149 (12%)

Query: 134 FLKRGQDRIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQ 193
           +L+  +D +++ I  R+  ++ + M   E +QV++Y +G  Y+ HYD+  D+F +   G 
Sbjct: 5   WLRDEEDELVKRISYRVQAYSGLNMTTSEDLQVVNYGIGGHYEPHYDFARDKFTSLGTGN 64

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMR 253
           R+AT L YLSDVE GG TVF                    + G +V P++GDA  +++++
Sbjct: 65  RIATFLSYLSDVEAGGGTVFT-------------------RVGATVWPQKGDAAFWYNLK 105

Query: 254 PDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                D S+ H  CPV+ G+KW A KW+H
Sbjct: 106 RSGDGDSSTRHAACPVLVGSKWVANKWIH 134


>gi|281348666|gb|EFB24250.1| hypothetical protein PANDA_000722 [Ailuropoda melanoleuca]
          Length = 505

 Score =  104 bits (259), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 61/202 (30%), Positives = 101/202 (50%), Gaps = 25/202 (12%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRII 143
           P    Y++ +S  E E + ++AKP + ++TV D KTG    +  R S  ++L+   D ++
Sbjct: 325 PHIVRYYDVMSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEDDDPVV 384

Query: 144 RGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF----LDEFNTKNGGQRMATLL 199
             +  R+   T + ++  E +QV +Y +G +Y+ H+D+      D F     G R+AT L
Sbjct: 385 ARVNLRMQHITGLTVKTAELLQVANYGMGGQYEPHFDFSRKNEQDAFKRLGTGNRVATFL 444

Query: 200 MYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD 259
            Y+SDVE GG TVFP                     G ++ PK+G A+ ++++      D
Sbjct: 445 NYMSDVEAGGATVFP-------------------DLGAAIWPKKGTAVFWYNLLRSGEGD 485

Query: 260 PSSLHGGCPVIRGNKWSATKWM 281
             + H  CPV+ G KW   KW+
Sbjct: 486 YRTRHAACPVLVGCKWG--KWL 505


>gi|359401514|ref|ZP_09194482.1| 2OG-Fe(II) oxygenase [Novosphingobium pentaromativorans US6-1]
 gi|357597189|gb|EHJ58939.1| 2OG-Fe(II) oxygenase [Novosphingobium pentaromativorans US6-1]
          Length = 232

 Score =  103 bits (258), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 68/206 (33%), Positives = 104/206 (50%), Gaps = 29/206 (14%)

Query: 87  FVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGI 146
           +  ++FL++A+C +LI L       S +++    +  D    + SG       DRI+R +
Sbjct: 51  YTSNSFLNEADCAHLIALIDTCACPSRLLEE---EGWDGYRTSYSGDIDT--HDRIVRDL 105

Query: 147 EKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN-----TKNGGQRMATLLMY 201
           E R++DFT I    GE  Q   YE GQ ++ H D+F  E        + GGQR  T ++Y
Sbjct: 106 ELRLSDFTGIAPSCGESAQGQRYECGQYFNEHCDWFDTEAGYWRQERRCGGQRSWTAMIY 165

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           L+ VEEGG T       +FT +            GLS+ P+ G  LL+ +  PD T +P 
Sbjct: 166 LNAVEEGGRT-------DFTHI------------GLSIPPEPGCLLLWNNALPDGTPNPL 206

Query: 262 SLHGGCPVIRGNKWSATKWMHVHEYK 287
           ++H   PV+RG K+  TKW  V  ++
Sbjct: 207 TMHAARPVVRGVKYVVTKWFRVRNWQ 232


>gi|321458081|gb|EFX69155.1| hypothetical protein DAPPUDRAFT_228756 [Daphnia pulex]
          Length = 570

 Score =  103 bits (258), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 72/216 (33%), Positives = 106/216 (49%), Gaps = 34/216 (15%)

Query: 84  PRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSK-TGQSK-DSRVRTSSGTFLKRGQDR 141
           P   V+H+F+S AE E    LA    ++ +   SK  GQ    S  RTS  ++++ G   
Sbjct: 347 PYIAVFHDFMSDAETEIFKSLAMAERLERSAHGSKRPGQGGVTSDKRTSKQSWVEDGSHH 406

Query: 142 IIRGIEKRIADFTFI---PMEHG-EGIQVLHYEVGQKYDAHYDYFL--------DEFNTK 189
           ++  I KRI+D   +   P   G E  QV +Y +G +Y  H D+ +         EF+  
Sbjct: 407 VVDQISKRISDSVGLNSQPSNVGSEHYQVANYGIGGRYTPHTDHGVLSKSMGGPSEFDLF 466

Query: 190 NGGQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLF 249
            G  R+ T + YL DVE GG TVF  A                   G+ V+PK+G A+ +
Sbjct: 467 RG-DRILTFMTYLDDVEAGGATVFTHA-------------------GVVVRPKKGMAVFW 506

Query: 250 WSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
           W+++ D+  D  + HGGCPV+ G+KW   KW H  E
Sbjct: 507 WNLKSDSNGDTLTRHGGCPVLHGSKWICNKWFHSGE 542


>gi|195341544|ref|XP_002037366.1| GM12151 [Drosophila sechellia]
 gi|194131482|gb|EDW53525.1| GM12151 [Drosophila sechellia]
          Length = 537

 Score =  103 bits (258), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 61/201 (30%), Positives = 99/201 (49%), Gaps = 20/201 (9%)

Query: 83  EPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRI 142
           +P    +H+ L+  +   L ++A P M +STV     GQ K S  R S   +L       
Sbjct: 330 DPYVATFHDMLNPRKISQLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWLAYESHPT 389

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFN-TKNGGQRMATLLMY 201
           + G+ + + D T +     E +QV +Y VG  Y+ H+D+F D  +     G R+AT + Y
Sbjct: 390 MVGMLRDLKDATGLDTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGNRIATAIFY 449

Query: 202 LSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPS 261
           LS+VE+GG T FP  +                   ++VKP+ G+ L ++++      D  
Sbjct: 450 LSEVEQGGATAFPFLD-------------------IAVKPQLGNVLFWYNLHRSLDKDYR 490

Query: 262 SLHGGCPVIRGNKWSATKWMH 282
           + H GCPV++G+KW    W+H
Sbjct: 491 TKHAGCPVLKGSKWIGNVWIH 511


>gi|195341584|ref|XP_002037386.1| GM12898 [Drosophila sechellia]
 gi|194131502|gb|EDW53545.1| GM12898 [Drosophila sechellia]
          Length = 536

 Score =  103 bits (258), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 67/217 (30%), Positives = 105/217 (48%), Gaps = 36/217 (16%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E ++ +P   +YHN LS  E E L  ++KP++ ++ V   + G  + +  R++ G +L  
Sbjct: 317 EELSLDPYVVLYHNVLSDPEIEKLKPMSKPFLERAKVFRVEKGSDEIAPSRSADGAWLPH 376

Query: 138 GQD------RIIRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNG 191
            QD       ++R I +RI D T +    G  +Q L Y  G  +  HYDY    FN+K  
Sbjct: 377 -QDTDPDDLEVLRRIGRRIKDLTGLNTRSGSQMQFLKYGFGGHFVPHYDY----FNSKTS 431

Query: 192 -----GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDA 246
                G R+AT+L YL++V+ GG T FP  N                   L V  ++G A
Sbjct: 432 YLERVGDRIATVLFYLNNVDHGGATAFPKLN-------------------LVVPTQKGSA 472

Query: 247 LLFWSM-RPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
           L + ++ R     D  + HG CP+I G K   T+W++
Sbjct: 473 LFWHNLDRKSYDYDTCTFHGACPLISGTKLVMTRWIY 509


>gi|412986224|emb|CCO17424.1| predicted protein [Bathycoccus prasinos]
          Length = 557

 Score =  103 bits (258), Expect = 7e-20,   Method: Compositional matrix adjust.
 Identities = 74/229 (32%), Positives = 113/229 (49%), Gaps = 37/229 (16%)

Query: 79  IVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFL--K 136
            V+  P  FV+ NFL ++ECE+L  LA   + +S V D K      S  RTSS  FL   
Sbjct: 318 CVSLSPLLFVFENFLHESECEFLRTLADKDLKRSRVTDGKL-----SNGRTSSSCFLIGA 372

Query: 137 RGQDRIIRGIEKRIAD------------FTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLD 184
           +G++ +++ IE+R+ D            F  + ++  E +Q++ Y   +KY +H+D    
Sbjct: 373 KGKEDVVKTIERRMLDAIRSTPVLTTRRFDTLKLKGSEPMQIVRYGKNEKYTSHFD---- 428

Query: 185 EFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANF--TSVRWWNELSECGKQ------- 235
             N     +R+AT + YLSD  EGG T FP A   F   S          G++       
Sbjct: 429 --NKAGSFRRVATFMCYLSDQCEGGCTNFPKAEPLFLEPSFDEHGAFKPFGRKKKTVASE 486

Query: 236 --GLSVKPKRGDALLFWSMRPDA-TLDPSSLHGGCPVIRGNKWSATKWM 281
             G+ + PK G A+LF+S+  +    +P SLH G  V +G K+  TKW+
Sbjct: 487 QHGVKIHPKLGRAILFFSISEEPFRENPLSLHEGQTVRKGEKFICTKWL 535


>gi|390989336|ref|ZP_10259634.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
           pv. punicae str. LMG 859]
 gi|372555840|emb|CCF66609.1| 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas axonopodis
           pv. punicae str. LMG 859]
          Length = 228

 Score =  103 bits (257), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 75/245 (30%), Positives = 104/245 (42%), Gaps = 40/245 (16%)

Query: 51  PNDLTSFRRRAFEKRSSIAEEKGEQWTEIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMV 110
           P D T+  R AF  R                  PR   Y   LS  EC  L+ LA+P++ 
Sbjct: 14  PADDTADHRIAFAPRVGPVRRHTA---------PRIEEYAAVLSADECRLLMLLARPHLR 64

Query: 111 KSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIEKRIADFTF-----IPMEHGEGIQ 165
            S V+D     ++ + +RTS G  L    D II     R A         +P+ H E + 
Sbjct: 65  ASKVIDPNDASTQRAPIRTSRGATL----DPIIEDFAARAAQARLAACAQLPLAHAEPLS 120

Query: 166 VLHYEVGQKYDAHYDYFLD---EFNTKNGGQRMATLLMYLSDVEEGGETVFPAANANFTS 222
           VL Y  G++Y AH DY        + +  G R  T+ +YL+DV  GG+T FP A      
Sbjct: 121 VLCYAPGEQYRAHRDYLPPGTIAADRRTAGNRQRTVCVYLNDVGAGGDTEFPIA------ 174

Query: 223 VRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLDPSSLHGGCPVIRGNKWSATKWMH 282
                        G+ V+P+ G  + F ++  D   D  SLH G PV  G+KW  T W  
Sbjct: 175 -------------GVRVRPRPGTLVCFDNLHADGRPDADSLHAGLPVTAGSKWLGTLWFR 221

Query: 283 VHEYK 287
              Y+
Sbjct: 222 QQRYR 226


>gi|37912909|gb|AAR05245.1| conserved hypothetical protein [uncultured marine proteobacterium
           ANT32C12]
          Length = 186

 Score =  103 bits (257), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 68/201 (33%), Positives = 99/201 (49%), Gaps = 29/201 (14%)

Query: 88  VYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKRGQDRIIRGIE 147
           +Y  F        L+ L +  + ++TV+     Q  DSR  T+S  +++     II  + 
Sbjct: 2   LYQIFYPLMSARPLLRLDQARVERATVITDSEHQFHDSR--TNSYAWIQHDASEIIHEVS 59

Query: 148 KRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYF--LDEFNTKN---GGQRMATLLMYL 202
           KR +    +P+ + E  Q++HY  G +Y  H+D F    E    N   GGQRM T L YL
Sbjct: 60  KRFSILVKMPINNAEQFQLVHYGPGTEYKPHFDAFDKSTEEGRNNWFPGGQRMVTALAYL 119

Query: 203 SDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATLD--P 260
           +DVE+GG T FP  +                   +SVKP +GD ++F + + D T D  P
Sbjct: 120 NDVEDGGATDFPDIH-------------------VSVKPNKGDVVVFHNCK-DGTSDINP 159

Query: 261 SSLHGGCPVIRGNKWSATKWM 281
           +SLHGG PVI G KW+   W 
Sbjct: 160 NSLHGGSPVISGEKWAVNLWF 180


>gi|145344669|ref|XP_001416850.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144577076|gb|ABO95143.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 225

 Score =  103 bits (257), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 71/216 (32%), Positives = 102/216 (47%), Gaps = 40/216 (18%)

Query: 87  FVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFL--KRGQDRIIR 144
           FV  +FLS+ E + LI++A+P M +S V D K  +      RTS+ TFL   R  D ++ 
Sbjct: 31  FVLEDFLSEEEGDQLIEIARPSMQRSRVTDGKLSEG-----RTSTSTFLTGARAHDDLVL 85

Query: 145 GIEKRIADFTFIP-----------MEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNGGQ 193
            IE+RI     +P           M   E +Q++ Y   ++Y AHYD      N     +
Sbjct: 86  EIERRIQAAIRLPLIVERRKNVKVMYQHEPMQIVQYGPTERYTAHYD------NRAGSLK 139

Query: 194 RMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQ----GLSVKPKRGDALLF 249
           R  T + YL + EEGG T FP              +  CG      G+ V PKRG A+LF
Sbjct: 140 RSMTFMCYLQEPEEGGATFFPKC------------VPLCGCDSTTLGIRVFPKRGRAILF 187

Query: 250 WSMRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHE 285
           W++  +      SLH   PV+ G K   T+W+ + E
Sbjct: 188 WNVGENGQEAMRSLHEAQPVVSGKKAIFTQWLSISE 223


>gi|410910256|ref|XP_003968606.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Takifugu
           rubripes]
          Length = 540

 Score =  103 bits (257), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 64/215 (29%), Positives = 109/215 (50%), Gaps = 30/215 (13%)

Query: 78  EIVAWEPRAFVYHNFLSKAECEYLIDLAKPYMVKSTVVDSKTGQSKDSRVRTSSGTFLKR 137
           E+++  P   +YH+F+S +E E +   A+  + +S V       + + R+  S+  +LK 
Sbjct: 336 EVLSLRPYVVLYHDFISDSESEEIKQHAQLGLRRSVVATGDKQATAEYRISKSA--WLKG 393

Query: 138 GQDRIIRGIEKRIADFTFIPMEH--GEGIQVLHYEVGQKYDAHYDYFLDE----FNTKNG 191
                +  ++++I+  T + ++H  GE +QV++Y +G  Y+ H+D+        F  K G
Sbjct: 394 SAHSTVSRLDQKISMLTGLNVQHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKLKTG 453

Query: 192 GQRMATLLMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWS 251
             R+AT ++YLS VE GG T F  AN                    SV   +  A+ +W+
Sbjct: 454 -NRVATFMIYLSSVEAGGSTAFIYAN-------------------FSVPVMKNAAIFWWN 493

Query: 252 MRPDATLDPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           +  +   D  +LH GCPV+ G+KW A KW  +HEY
Sbjct: 494 LHRNGEGDADTLHAGCPVLIGDKWVANKW--IHEY 526


>gi|156398644|ref|XP_001638298.1| predicted protein [Nematostella vectensis]
 gi|156225417|gb|EDO46235.1| predicted protein [Nematostella vectensis]
          Length = 495

 Score =  103 bits (257), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 67/208 (32%), Positives = 103/208 (49%), Gaps = 30/208 (14%)

Query: 88  VYHNFLSKAECEYLIDLAK----PYMVKSTVVDSKTGQSKDSRVRTSSGTFLK-RGQDRI 142
           V+ NF    E +    L +    P + ++TV +  TG  + +  R S   +L  R    +
Sbjct: 295 VFENFNWHVERDIYKRLCRGEKLPTLNRATVHNPITGHLETAHYRISKNCWLSGREHGEV 354

Query: 143 IRGIEKRIADFTFIPMEHGEGIQVLHYEVGQKYDAHYDYFLDEFNTKNG----GQRMATL 198
           I  +E+RIA  T + +E  EG QV +Y +  +YD H+D+  D  N+  G    G R+AT+
Sbjct: 355 IDRVERRIAAMTRLNLETAEGFQVQNYGLAGQYDPHFDFSRDLANSSLGSLGTGNRIATV 414

Query: 199 LMYLSDVEEGGETVFPAANANFTSVRWWNELSECGKQGLSVKPKRGDALLFWSMRPDATL 258
           L+++S VE GG TVFP                     G  + P++GDA+ + ++      
Sbjct: 415 LVWMSQVESGGATVFPYV-------------------GARILPQKGDAVFWHNLLRSGDG 455

Query: 259 DPSSLHGGCPVIRGNKWSATKWMHVHEY 286
           D  + H GCPV+ G KW A KW  +HEY
Sbjct: 456 DFRTRHAGCPVLSGIKWVANKW--IHEY 481


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.321    0.135    0.418 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,761,437,270
Number of Sequences: 23463169
Number of extensions: 197626890
Number of successful extensions: 462467
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1549
Number of HSP's successfully gapped in prelim test: 455
Number of HSP's that attempted gapping in prelim test: 457010
Number of HSP's gapped (non-prelim): 2156
length of query: 288
length of database: 8,064,228,071
effective HSP length: 141
effective length of query: 147
effective length of database: 9,050,888,538
effective search space: 1330480615086
effective search space used: 1330480615086
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)