BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy6259
         (227 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|347972274|ref|XP_001237637.3| AGAP004611-PA [Anopheles gambiae str. PEST]
 gi|333469330|gb|EAU76664.3| AGAP004611-PA [Anopheles gambiae str. PEST]
          Length = 514

 Score =  179 bits (453), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 95/227 (41%), Positives = 131/227 (57%), Gaps = 16/227 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           YP  C+G+   P    + L+C YE     FL+I PLK++E+  DP +V  HD I + EI+
Sbjct: 272 YPSLCRGDDQRPAKELAKLRCRYEHNRTPFLRISPLKLQEVNHDPMIVMYHDVISNKEID 331

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
            II +SK  + R  V +  +     TR S   +L   +   HP +  +  R +DMTNL +
Sbjct: 332 AIISISKPLMHRSMVGDDHEKAVSKTRTSSNAWLDDVM---HPVVRTLSQRTEDMTNLAM 388

Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW---------RLASFMFYLTDVELGG 173
              ER    LQ+ NYG+GGHY  H D    +EG           R+A+ M+YL+DV +GG
Sbjct: 389 TAAER----LQVGNYGIGGHYLPHYDYAVAEEGKEVYPSIGKGNRIATVMYYLSDVAIGG 444

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           AT+FP L L VFP+KGSA+FWYN HAN  +D+R  H  CPV +G+KW
Sbjct: 445 ATVFPQLGLGVFPQKGSAIFWYNLHANGTVDHRTLHGACPVFVGSKW 491


>gi|307190793|gb|EFN74662.1| Prolyl 4-hydroxylase subunit alpha-2 [Camponotus floridanus]
          Length = 476

 Score =  176 bits (445), Expect = 8e-42,   Method: Compositional matrix adjust.
 Identities = 89/230 (38%), Positives = 135/230 (58%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y + C+G +S+P +++ NLKC Y      FLKI PLK EE YLDPR+V  H+ IYD E
Sbjct: 223 ERYEMLCRGEVSIPREVEKNLKCRYVDRGIPFLKIAPLKEEEAYLDPRIVVYHNVIYDEE 282

Query: 61  INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  I  +++ + +R  V NY  G     + R+SK  +L      +H  +  +  R++ MT
Sbjct: 283 IETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEH---EHKHVAAVSKRVEHMT 339

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
           ++ I   E     LQ+ NYG+GGHY+ H D   ++E           R+A+ ++Y++DVE
Sbjct: 340 SMSIETAEE----LQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVE 395

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GG T+F ++N++++P KGSA FWYN   N   D++  H+ CPV  G+KW
Sbjct: 396 QGGGTVFTAINISLWPRKGSAAFWYNLKPNGEGDFKTRHAACPVLTGSKW 445


>gi|347964867|ref|XP_309164.4| AGAP000971-PA [Anopheles gambiae str. PEST]
 gi|333466515|gb|EAA04901.5| AGAP000971-PA [Anopheles gambiae str. PEST]
          Length = 553

 Score =  174 bits (442), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 91/230 (39%), Positives = 133/230 (57%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           ++Y   C+G    P +++S L C Y + ++ FL+IGPLK+EE YL P +V  HD + D E
Sbjct: 303 KLYEQLCRGEQQPPIELRSQLVCRYTTNSSPFLRIGPLKLEEAYLRPYIVIYHDVMSDRE 362

Query: 61  INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I RI   ++ +  R  V NY  G+  + + R+SK  +L      +   +  I  R++DMT
Sbjct: 363 IERIKHYARPRFRRATVQNYKTGELEFANYRISKSAWLKD---AEDEMIRTISQRVEDMT 419

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GGHY+ H D   R+E           R+A+ +FY++DV 
Sbjct: 420 GLTMETAEE----LQVVNYGIGGHYEPHFDFARREERNAFKSLGTGNRIATVLFYMSDVT 475

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FPSLNL ++P KG+A FW+N HA+   DY   H+ CPV  G KW
Sbjct: 476 QGGATVFPSLNLALWPRKGTAAFWFNLHASGRGDYATRHAACPVLTGTKW 525


>gi|345481336|ref|XP_001600680.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Nasonia
           vitripennis]
          Length = 556

 Score =  173 bits (439), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 89/230 (38%), Positives = 132/230 (57%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y + C+G + +P  I+  L+C Y      FLKI P K EE YLDPR+V  HD IYD E
Sbjct: 303 ERYEMLCRGEIKMPLSIQKELRCRYVDRGIPFLKIAPFKEEEAYLDPRIVIYHDVIYDDE 362

Query: 61  INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  I  +++ + +R  V NY  G+    + R+SK  +L      +H  +  +  R++ MT
Sbjct: 363 IETIKRMAQPRFKRATVQNYKTGELEIANYRISKSAWLQEH---EHKHVRAVSQRVEHMT 419

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
           ++ I   E     LQ+ NYG+GGHY+ H D   R+E           R+A+ ++Y++DVE
Sbjct: 420 SMSIETAE----ELQVVNYGIGGHYEPHFDFARREEKNAFKSLGTGNRIATVLYYMSDVE 475

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GG T+F  +N++++P+KGSA FWYN   N   DY+  H+ CPV  G+KW
Sbjct: 476 QGGGTVFTKINISLWPKKGSAAFWYNLKPNGEGDYKTRHAACPVLTGSKW 525


>gi|91091610|ref|XP_969386.1| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
           putative [Tribolium castaneum]
 gi|270001037|gb|EEZ97484.1| hypothetical protein TcasGA2_TC011321 [Tribolium castaneum]
          Length = 536

 Score =  173 bits (438), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 92/230 (40%), Positives = 130/230 (56%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y   C+G +S+P +  S LKCFY S N  FLKI P KVEE +  P +    D + DSE
Sbjct: 286 EFYEQLCRGEISLPVEKASKLKCFYLSRNQPFLKIAPFKVEEAHHRPDIFIFRDVLADSE 345

Query: 61  INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  I  +++ + +R  V N   G+      R+SK  +L  E   +H  +  +  R+ DMT
Sbjct: 346 IATIKRMAQPRFKRATVQNTDTGELEIAQYRISKSAWLKEE---EHKHIADVSQRVSDMT 402

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GGHY+ H D   RDE           R+A+ +FY++DVE
Sbjct: 403 GLTMSTAEE----LQVVNYGIGGHYEPHFDFARRDERNAFKSLGTGNRIATVLFYMSDVE 458

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FPS+ ++++P+KGSA FWYN H +   D    H+ CPV  G+KW
Sbjct: 459 QGGATVFPSIQVSLWPQKGSAAFWYNLHPSGDGDKMTRHAACPVLTGSKW 508


>gi|328790718|ref|XP_392392.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Apis mellifera]
          Length = 415

 Score =  172 bits (437), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 86/230 (37%), Positives = 134/230 (58%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y + C+G +++P +++ NLKC Y      FLKI P K EE YLDPR+V  H+ IYD E
Sbjct: 162 ERYEMLCRGEVTIPPEVQKNLKCRYVDRGIPFLKIAPFKEEEAYLDPRIVVYHNVIYDDE 221

Query: 61  INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  I  +++ + +R  V NY  G     + R+SK  +L      +H  +  +  R++ MT
Sbjct: 222 IETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEH---EHKHVAAVSRRVEHMT 278

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
           ++ +   E     LQ+ NYG+GGHY+ H D   ++E           R+A+ ++Y++DVE
Sbjct: 279 SMTVDTAEE----LQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVE 334

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GG T+F ++N+ ++P+KGSA FWYN   N   D++  H+ CPV  G+KW
Sbjct: 335 QGGGTVFTAINIALWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLTGSKW 384


>gi|380025232|ref|XP_003696381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Apis florea]
          Length = 537

 Score =  172 bits (437), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 86/230 (37%), Positives = 134/230 (58%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y + C+G +++P +++ NLKC Y      FLKI P K EE YLDPR+V  H+ IYD E
Sbjct: 284 ERYEMLCRGEVTIPPEVQKNLKCRYVDRGIPFLKIAPFKEEEAYLDPRIVVYHNVIYDDE 343

Query: 61  INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  I  +++ + +R  V NY  G     + R+SK  +L      +H  +  +  R++ MT
Sbjct: 344 IETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEH---EHKHVAAVSRRVEHMT 400

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
           ++ +   E     LQ+ NYG+GGHY+ H D   ++E           R+A+ ++Y++DVE
Sbjct: 401 SMTVDTAEE----LQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVE 456

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GG T+F ++N+ ++P+KGSA FWYN   N   D++  H+ CPV  G+KW
Sbjct: 457 QGGGTVFTAINIALWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLTGSKW 506


>gi|383864775|ref|XP_003707853.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Megachile
           rotundata]
          Length = 550

 Score =  172 bits (437), Expect = 8e-41,   Method: Compositional matrix adjust.
 Identities = 89/230 (38%), Positives = 133/230 (57%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y + C+G +S+P +I+ NLKC Y      FLKI P K EE YLDPR+V  H+ IYD E
Sbjct: 297 ERYEMLCRGEVSIPPEIQKNLKCRYVDRGIPFLKIAPFKEEEAYLDPRIVIYHNVIYDEE 356

Query: 61  INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  I  +++ + +R  V NY  G     + R+SK  +L      +H  +  +  R++ MT
Sbjct: 357 IETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEH---EHKHVAAVSKRVEHMT 413

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
           +L +   E     LQ+ NYG+GGHY+ H D   ++E           R+A+ ++Y++DVE
Sbjct: 414 SLNVETAEE----LQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVE 469

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GG T+F ++N++++P KGSA FW+N   N   D R  H+ CPV  G+KW
Sbjct: 470 QGGGTVFTAINISLWPRKGSAAFWFNLKPNGEGDLRTRHAACPVLTGSKW 519


>gi|350416719|ref|XP_003491070.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
           impatiens]
          Length = 557

 Score =  172 bits (436), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 87/230 (37%), Positives = 134/230 (58%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y + C+G +S+P +I+ NL C Y      FLKI P K EE YLDPR+V  H+ IYD E
Sbjct: 304 ERYEMLCRGEVSIPPEIQKNLVCRYVDRGIPFLKIAPFKEEEAYLDPRIVVYHNVIYDEE 363

Query: 61  INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  I  +++ + +R  V NY  G     + R+SK  +L      +H  +  +  R++ MT
Sbjct: 364 IETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEH---EHEHVAAVSRRVEHMT 420

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
           ++ +   E     LQ+ NYG+GGHY+ H D   ++E           R+A+ ++Y++DVE
Sbjct: 421 SMTVDTAEE----LQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVE 476

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GG T+F ++N++++P+KGSA FWYN   N   D++  H+ CPV  G+KW
Sbjct: 477 QGGGTVFTAINISLWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLTGSKW 526


>gi|340722330|ref|XP_003399560.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Bombus
           terrestris]
          Length = 557

 Score =  172 bits (435), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 87/230 (37%), Positives = 134/230 (58%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y + C+G +S+P +I+ NL C Y      FLKI P K EE YLDPR+V  H+ IYD E
Sbjct: 304 ERYEMLCRGEVSIPPEIQKNLVCRYVDRGIPFLKIAPFKEEEAYLDPRIVVYHNVIYDEE 363

Query: 61  INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  I  +++ + +R  V NY  G     + R+SK  +L      +H  +  +  R++ MT
Sbjct: 364 IETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEH---EHEHVAAVSRRVEHMT 420

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
           ++ +   E     LQ+ NYG+GGHY+ H D   ++E           R+A+ ++Y++DVE
Sbjct: 421 SMTVDTAEE----LQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVE 476

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GG T+F ++N++++P+KGSA FWYN   N   D++  H+ CPV  G+KW
Sbjct: 477 QGGGTVFTAINISLWPKKGSAAFWYNLKPNGEGDFKTRHAACPVLTGSKW 526


>gi|307211752|gb|EFN87747.1| Prolyl 4-hydroxylase subunit alpha-1 [Harpegnathos saltator]
          Length = 415

 Score =  172 bits (435), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 87/230 (37%), Positives = 134/230 (58%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y + C+G +S+P +++ NLKC Y      FLKI P K EE YLDPR+V  H+ IYD E
Sbjct: 162 ERYEMLCRGEVSIPLEVEKNLKCRYVDRGIPFLKIAPFKEEEAYLDPRIVFYHNVIYDEE 221

Query: 61  INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  I  +++ + +R  V NY  G     + R+SK  +L      +H  +  +  R++ MT
Sbjct: 222 IETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEH---EHKHVAAVSKRVEHMT 278

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
           ++ +   E     LQ+ NYG+GGHY+ H D   ++E           R+A+ ++Y++DVE
Sbjct: 279 SMSVETAEE----LQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVE 334

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GG T+F ++N++++P KGSA FWYN   N   D++  H+ CPV  G+KW
Sbjct: 335 QGGGTVFTAINISLWPRKGSAAFWYNLKPNGEGDFKTRHAACPVLTGSKW 384


>gi|332026992|gb|EGI67088.1| Prolyl 4-hydroxylase subunit alpha-1 [Acromyrmex echinatior]
          Length = 415

 Score =  171 bits (434), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 86/230 (37%), Positives = 134/230 (58%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y + C+G +S+P +++ NLKC Y      FLKI P K EE YLDPR+V  H+ IYD E
Sbjct: 162 ERYEMLCRGEVSIPPEVEKNLKCRYVDRGIPFLKIAPFKEEEAYLDPRIVVYHNVIYDEE 221

Query: 61  INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  I  +++ + +R  V NY  G     + R+SK  +L      +H  +  +  R++ MT
Sbjct: 222 IETIKRMAQPRFKRATVQNYKTGALEIANYRISKSAWLQEH---EHKHVAAVSKRVEHMT 278

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
           ++ +   E     LQ+ NYG+GGHY+ H D   ++E           R+A+ ++Y++DVE
Sbjct: 279 SMSVETAEE----LQVVNYGIGGHYEPHFDFARKEETNAFKSLGTGNRIATVLYYMSDVE 334

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GG T+F ++N++++P KGSA FW+N   N   D++  H+ CPV  G+KW
Sbjct: 335 QGGGTVFTAINISLWPRKGSAAFWHNLKPNGEGDFKTRHAACPVLTGSKW 384


>gi|157114985|ref|XP_001658091.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
 gi|108877086|gb|EAT41311.1| AAEL007038-PA [Aedes aegypti]
          Length = 545

 Score =  167 bits (424), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 90/230 (39%), Positives = 130/230 (56%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           ++Y   C+G         S LKC Y +  + FLKI PLK+EE  L P +V  HD I ++E
Sbjct: 296 KLYEQLCRGEAERSVAETSKLKCRYVTNKSPFLKIAPLKLEEANLKPYIVIYHDVISEAE 355

Query: 61  INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           +  +  L+K +  R  V NY  G+    + R+SK  +L      +HP++  I  R++DMT
Sbjct: 356 MELVKRLAKPRFRRATVQNYKTGELEVANYRISKSAWLKDH---EHPYIKAIGERVEDMT 412

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GGHY+ H D   R+E           R+A+ +FY++DV 
Sbjct: 413 GLTMSTAEE----LQVVNYGIGGHYEPHFDFARREETNAFKSLGTGNRIATVLFYMSDVT 468

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FPSL L ++P+KG+A FW+N HA+   DY   H+ CPV  G KW
Sbjct: 469 QGGATVFPSLRLALWPKKGAAAFWFNLHASGQGDYSTRHAACPVLTGTKW 518


>gi|170064953|ref|XP_001867740.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
 gi|167882143|gb|EDS45526.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
          Length = 509

 Score =  167 bits (422), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 93/225 (41%), Positives = 133/225 (59%), Gaps = 14/225 (6%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           + L C+G+   P    S+L C Y +  ++FL++ PLK E L LDP +   HD   D EI+
Sbjct: 265 HELLCRGDYQRPASETSHLYCRYHTGTSSFLRLAPLKEEVLNLDPFITVYHDVASDREIS 324

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
           ++IEL+K ++ R  + + G+    + R S+  +L     GD   +  +  R+ DMT  + 
Sbjct: 325 KLIELAKSRISRATIRDDGEPQVSNARTSQNAWLDA---GDDRVVTTLDRRVGDMTGGL- 380

Query: 123 GREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLW---RLASFMFYLTDVELGGAT 175
            R++ Y+  LQ+NNYG+GGHY  H D    A P   GL    R+A+ MFYL+DVE+GGAT
Sbjct: 381 -RQQSYE-MLQVNNYGVGGHYVAHHDWAMEAVPY-AGLRVGNRIATVMFYLSDVEIGGAT 437

Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +FP L L VFP KGSA+ WYN + N   D R  H+ CPV  G+KW
Sbjct: 438 VFPQLGLAVFPRKGSAILWYNLYRNGKGDRRTLHAACPVLSGSKW 482


>gi|170064951|ref|XP_001867739.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
 gi|167882142|gb|EDS45525.1| prolyl 4-hydroxylase subunit alpha-2 [Culex quinquefasciatus]
          Length = 516

 Score =  166 bits (421), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 91/226 (40%), Positives = 130/226 (57%), Gaps = 14/226 (6%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           +Y   C+G+   P    SNL C Y    + FL++ PLK E + LDP V   HDA  D+EI
Sbjct: 275 LYEPLCRGDHQRPPSETSNLYCRYHMSTSPFLRLAPLKQEVVNLDPFVAVYHDAASDAEI 334

Query: 62  NRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
           N++IEL + ++ R  V +        +R S+  +L      DHP +  +  R +DM    
Sbjct: 335 NKVIELGRPQINRSMVGDAAKKEVSKSRTSQNSWL---TDYDHPVVAALSRRTKDM---A 388

Query: 122 IGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-------RLASFMFYLTDVELGGA 174
           +G +E     LQ+NNYG+GGHY  H D + R+E  +       R+A+ MFYL+DVE GGA
Sbjct: 389 LGLDETAYESLQVNNYGIGGHYLPHYDWS-REENPYPELNTGNRIATLMFYLSDVEEGGA 447

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T+FP L + VFP+KG+A+FWYN  A+   D +  H  CPV +G+KW
Sbjct: 448 TVFPHLGVGVFPKKGTAIFWYNLRASGKGDEKTLHGACPVLIGSKW 493


>gi|195505207|ref|XP_002099404.1| GE23380 [Drosophila yakuba]
 gi|194185505|gb|EDW99116.1| GE23380 [Drosophila yakuba]
          Length = 540

 Score =  166 bits (420), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 90/227 (39%), Positives = 134/227 (59%), Gaps = 15/227 (6%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           ++Y   C+G L      + NL+C+       + ++ P K+E+L LDP V  +H+ ++DSE
Sbjct: 290 KLYTQVCRGELHQTPREQRNLRCWLTHQGVPYYRLAPFKIEQLNLDPYVAYVHEVLWDSE 349

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           I+ I+E  KG ++R  V   G++   + R S+  +L+   +  +P+L KI+ R++D+T L
Sbjct: 350 IDMIMEHGKGNMKRSMVGQSGNSTTTEIRTSQNTWLW---YDANPWLAKIKQRLEDVTGL 406

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL----W---RLASFMFYLTDVELGG 173
                E    PLQ+ NYG+GG Y+ H D    D+G     W   RLA+ +FYL DV LGG
Sbjct: 407 STESAE----PLQLVNYGIGGQYEPHFDFM-EDDGQKVFGWKGNRLATALFYLNDVALGG 461

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           AT FP L L V P KGS + WYN H++T  D+R  H+GCPV  G+KW
Sbjct: 462 ATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKW 508


>gi|195341548|ref|XP_002037368.1| GM12149 [Drosophila sechellia]
 gi|194131484|gb|EDW53527.1| GM12149 [Drosophila sechellia]
          Length = 537

 Score =  166 bits (419), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 89/226 (39%), Positives = 131/226 (57%), Gaps = 13/226 (5%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           ++Y   C+G L      + NL+C+       +  + P K+E+L +DP V  +H+ ++DSE
Sbjct: 287 KLYTQVCRGELHQSPREQRNLRCWLSHQGVLYYHLSPFKIEQLNIDPYVAYVHEVLWDSE 346

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           I+ IIE  KG +ER KV    ++   + R+S+  +L+   +  +P+L KI+ R++D+T L
Sbjct: 347 IDTIIEHGKGNMERSKVGQIENSTTTEVRISRNTWLW---YDANPWLSKIKQRLEDVTGL 403

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL---W---RLASFMFYLTDVELGGA 174
                E    PLQ+ NYG+GG Y+ H D    D      W   RL + +FYL DV LGGA
Sbjct: 404 STESAE----PLQLVNYGIGGQYEPHFDFVEDDGKTVFSWKGNRLLTALFYLNDVALGGA 459

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T FP L L V P KGS + WYN H++T  D+R  H+GCPV  G+KW
Sbjct: 460 TAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKW 505


>gi|170064960|ref|XP_001867743.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
 gi|167882146|gb|EDS45529.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
          Length = 545

 Score =  165 bits (417), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 88/229 (38%), Positives = 131/229 (57%), Gaps = 17/229 (7%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           +Y   C+G     E   + L+C Y + ++ FLKI PLK+EE +L+P +V  H+ + D+EI
Sbjct: 297 LYEQLCRGEAHRAEADLAKLRCRYVTNSSPFLKIAPLKLEEAHLEPYIVIYHEVMSDAEI 356

Query: 62  NRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
             I  L+K +  R  V NY  G+    + R+SK  +L  E   +H  +  +  R++DMT 
Sbjct: 357 EVIKRLAKPRFRRATVQNYKTGELEVANYRISKSAWLKDE---EHSVVRTVGQRVEDMTG 413

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVEL 171
           L +   E     LQ+ NYG+GGHY+ H D   R+E           R+A+ +FY++DV  
Sbjct: 414 LTMTTAEE----LQVVNYGIGGHYEPHFDFARREEKNAFKSLGTGNRIATVLFYMSDVSQ 469

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GGAT+FPS+ + + P+KG+A FWYN HA+   DY   H+ CPV  G KW
Sbjct: 470 GGATVFPSIRVALRPKKGTAAFWYNLHASGHGDYATRHAACPVLTGTKW 518


>gi|270001038|gb|EEZ97485.1| hypothetical protein TcasGA2_TC011322 [Tribolium castaneum]
          Length = 509

 Score =  164 bits (416), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 85/224 (37%), Positives = 136/224 (60%), Gaps = 11/224 (4%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E+Y   C+  +S+PE   S LKCFY++ N+ FL+I P KVE+ +LDP ++  H+ + D E
Sbjct: 275 EVYKKLCRAEISLPEAKSSKLKCFYQNSNHPFLRIAPFKVEQAHLDPDILIFHNVLSDCE 334

Query: 61  INRIIELSKGKVERGKVVN-YGDTIYV-DTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L++ ++      N +   + +   R+SKV +L  +   +H  L  +  R+  MT
Sbjct: 335 IETMKQLAQSRLVTAVFENPHSKQLELFPFRISKVAWLEDQ---EHQHLAVVAQRVAHMT 391

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGL-WRLASFMFYLTDVELGGATI 176
            L +   E +    Q+ NYG+GGHY+ H D  +  D  +  R+ + +FYL+DVE GGAT+
Sbjct: 392 GLTLSTAEEF----QVVNYGIGGHYEPHFDFQSTVDPAIGSRIETVLFYLSDVEQGGATV 447

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FP + ++V+P+KGSAV W+N H +   D R  H+GCPV +G+KW
Sbjct: 448 FPEIQVSVWPQKGSAVVWFNLHPSGDGDQRTKHAGCPVLIGSKW 491


>gi|189241578|ref|XP_969458.2| PREDICTED: similar to prolyl 4-hydroxylase alpha subunit 1,
           putative [Tribolium castaneum]
          Length = 515

 Score =  164 bits (416), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 85/224 (37%), Positives = 133/224 (59%), Gaps = 11/224 (4%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E+Y   C+  +S+PE   S LKCFY++ N+ FL+I P KVE+ +LDP ++  H+ + D E
Sbjct: 281 EVYKKLCRAEISLPEAKSSKLKCFYQNSNHPFLRIAPFKVEQAHLDPDILIFHNVLSDCE 340

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L++ ++      N           R+SKV +L  +   +H  L  +  R+  MT
Sbjct: 341 IETMKQLAQSRLVTAVFENPHSKQLELFPFRISKVAWLEDQ---EHQHLAVVAQRVAHMT 397

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGL-WRLASFMFYLTDVELGGATI 176
            L +   E +    Q+ NYG+GGHY+ H D  +  D  +  R+ + +FYL+DVE GGAT+
Sbjct: 398 GLTLSTAEEF----QVVNYGIGGHYEPHFDFQSTVDPAIGSRIETVLFYLSDVEQGGATV 453

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FP + ++V+P+KGSAV W+N H +   D R  H+GCPV +G+KW
Sbjct: 454 FPEIQVSVWPQKGSAVVWFNLHPSGDGDQRTKHAGCPVLIGSKW 497


>gi|24651424|ref|NP_733376.1| prolyl-4-hydroxylase-alpha SG1 [Drosophila melanogaster]
 gi|23172697|gb|AAF57059.2| prolyl-4-hydroxylase-alpha SG1 [Drosophila melanogaster]
 gi|66772443|gb|AAY55533.1| IP03659p [Drosophila melanogaster]
 gi|220951214|gb|ACL88150.1| PH4alphaSG1-PA [synthetic construct]
 gi|220959938|gb|ACL92512.1| PH4alphaSG1-PA [synthetic construct]
          Length = 540

 Score =  164 bits (415), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 89/227 (39%), Positives = 134/227 (59%), Gaps = 15/227 (6%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           ++Y   C+G L      + NL+C+       + ++ P K+E+L +DP V  +H+ ++DSE
Sbjct: 290 KLYTQVCRGELHQSPREQRNLRCWLYHQGVPYYRLSPFKIEQLNVDPYVAYVHEVLWDSE 349

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           I+ I+E  KG +ER KV    ++   + R+S+  +L+   +  +P+L KI+ R++D+T L
Sbjct: 350 IDTIMEHGKGNMERSKVGQSENSTTSEVRISRNTWLW---YDANPWLSKIKQRLEDVTGL 406

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL----W---RLASFMFYLTDVELGG 173
                E    PLQ+ NYG+GG Y+ H D    D+G     W   RL + +FYL DV LGG
Sbjct: 407 STESAE----PLQLVNYGIGGQYEPHFDFV-EDDGQSVFSWKGNRLLTALFYLNDVALGG 461

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           AT FP L L V P KGS + WYN H++T  D+R  H+GCPV  G+KW
Sbjct: 462 ATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKW 508


>gi|66772331|gb|AAY55477.1| IP03959p [Drosophila melanogaster]
 gi|66772361|gb|AAY55492.1| IP03859p [Drosophila melanogaster]
          Length = 541

 Score =  164 bits (415), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 89/227 (39%), Positives = 134/227 (59%), Gaps = 15/227 (6%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           ++Y   C+G L      + NL+C+       + ++ P K+E+L +DP V  +H+ ++DSE
Sbjct: 291 KLYTQVCRGELHQSPREQRNLRCWLYHQGVPYYRLSPFKIEQLNVDPYVAYVHEVLWDSE 350

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           I+ I+E  KG +ER KV    ++   + R+S+  +L+   +  +P+L KI+ R++D+T L
Sbjct: 351 IDTIMEHGKGNMERSKVGQSENSTTSEVRISRNTWLW---YDANPWLSKIKQRLEDVTGL 407

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL----W---RLASFMFYLTDVELGG 173
                E    PLQ+ NYG+GG Y+ H D    D+G     W   RL + +FYL DV LGG
Sbjct: 408 STESAE----PLQLVNYGIGGQYEPHFDFV-EDDGQSVFSWKGNRLLTALFYLNDVALGG 462

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           AT FP L L V P KGS + WYN H++T  D+R  H+GCPV  G+KW
Sbjct: 463 ATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKW 509


>gi|20269816|gb|AAM18063.1|AF495541_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]SG1
           [Drosophila melanogaster]
          Length = 540

 Score =  164 bits (414), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 89/227 (39%), Positives = 134/227 (59%), Gaps = 15/227 (6%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           ++Y   C+G L      + NL+C+       + ++ P K+E+L +DP V  +H+ ++DSE
Sbjct: 290 KLYTEVCRGELHQSPREQRNLRCWLSHQGVPYYRLFPFKIEQLNIDPYVAYVHEVLWDSE 349

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           I+ I+E  KG +ER KV    ++   + R+S+  +L+   +  +P+L KI+ R++D+T L
Sbjct: 350 IDTIMEHGKGNMERSKVGQSENSTTSEVRISRNTWLW---YDANPWLSKIKQRLEDVTGL 406

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL----W---RLASFMFYLTDVELGG 173
                E    PLQ+ NYG+GG Y+ H D    D+G     W   RL + +FYL DV LGG
Sbjct: 407 STESAE----PLQLVNYGIGGQYEPHFDFV-EDDGQSVFSWKGNRLLTALFYLNDVALGG 461

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           AT FP L L V P KGS + WYN H++T  D+R  H+GCPV  G+KW
Sbjct: 462 ATAFPFLRLAVPPVKGSLLIWYNLHSSTHKDFRTKHAGCPVLQGSKW 508


>gi|242018356|ref|XP_002429643.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
           humanus corporis]
 gi|212514628|gb|EEB16905.1| Prolyl 4-hydroxylase alpha-1 subunit precursor, putative [Pediculus
           humanus corporis]
          Length = 534

 Score =  161 bits (408), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 84/229 (36%), Positives = 129/229 (56%), Gaps = 17/229 (7%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           +Y   C+  + + E +K+ LKC Y  +   FL +  +K EE +LDPR+V  HD + D EI
Sbjct: 289 MYEKLCRNEVGLSEKMKAKLKCRYVDFGRPFLMLAKVKEEEAFLDPRIVLYHDVLSDREI 348

Query: 62  NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
             I +L+  + +R  V N   G       R+SK  +L      DHP++ K+  R++D+T 
Sbjct: 349 KTIQQLAVPRFKRATVQNSETGKLEVAHYRISKSAWLED---VDHPYVAKVSQRVEDITG 405

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVEL 171
           L +   E     LQ+ NYG+GGHY+ H D   ++E           R+A+ +FY++DV  
Sbjct: 406 LNMATAE----SLQVVNYGIGGHYEPHFDFARKEEKNAFQSLGTGNRIATILFYMSDVSQ 461

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GGAT+FP + ++++P+KG+A FWYN   N   DY   H+ CPV  G+KW
Sbjct: 462 GGATVFPGIKVSLWPKKGTAAFWYNLRKNGEGDYLTRHAACPVLTGSKW 510


>gi|194765174|ref|XP_001964702.1| GF23328 [Drosophila ananassae]
 gi|190614974|gb|EDV30498.1| GF23328 [Drosophila ananassae]
          Length = 542

 Score =  161 bits (407), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 85/225 (37%), Positives = 126/225 (56%), Gaps = 12/225 (5%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           ++Y   C+G L      +  L+C +   N  F ++ P KVE+L LDP V   H+AI  SE
Sbjct: 293 QLYKRVCRGELRQSPRQQRKLRCLFSHQNVAFYRLAPFKVEQLNLDPYVAYFHEAINSSE 352

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           + +IIE   G +ER +V    +    + R S   +L+   + ++P+L KI+ R++D+T L
Sbjct: 353 MEQIIEKGLGSMERSRVGQSQNATTSEIRTSANTWLW---YNENPWLSKIKQRLEDITGL 409

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGGAT 175
                E    PLQ+ NYG+GG Y+ H D     + ++     R+ + +FY+ DV LGGAT
Sbjct: 410 STESAE----PLQLVNYGIGGQYEPHFDFVEEPQKVFGWKGNRMLTALFYINDVALGGAT 465

Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            FP L L V P KGS + WYN H +   D+R  H+GCPV  G+KW
Sbjct: 466 AFPFLQLAVPPVKGSLLVWYNLHRSLHKDFRTKHAGCPVIKGSKW 510


>gi|195110931|ref|XP_002000033.1| GI24862 [Drosophila mojavensis]
 gi|193916627|gb|EDW15494.1| GI24862 [Drosophila mojavensis]
          Length = 549

 Score =  160 bits (405), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 88/231 (38%), Positives = 128/231 (55%), Gaps = 19/231 (8%)

Query: 1   EIYPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           E+Y   C G++   P +++  L+C Y +  + FL + PLKVEEL  DP +V  HD IY S
Sbjct: 288 ELYRHTCNGHIRPTPSELR-QLRCGYMTETHPFLLLAPLKVEELSHDPLLVLFHDVIYQS 346

Query: 60  EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
           EI+ ++ L+K K+ R  V  +  ++  + R S+  FL P+    H  L  I  R+ DMT+
Sbjct: 347 EIDTLMRLAKNKIHRATVTGHNSSVVSNARTSQFTFL-PKT--RHKVLRTIDQRVADMTD 403

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD----------ATPRDEGLWRLASFMFYLTDV 169
           L +     Y    Q+ NYG+GGHY  H D               E   R+ + +FYL+DV
Sbjct: 404 LHL----EYAEDHQLANYGIGGHYAQHMDWFYPITFETKQVSNPEMGNRIGTVLFYLSDV 459

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           E GGAT FP+L   + P+K +A FWYN HA+ + D R  H  CP+ +G+KW
Sbjct: 460 EQGGATAFPALKQLLRPKKHAAAFWYNLHASGVGDARTMHGACPIIVGSKW 510


>gi|194905372|ref|XP_001981184.1| GG11758 [Drosophila erecta]
 gi|190655822|gb|EDV53054.1| GG11758 [Drosophila erecta]
          Length = 550

 Score =  159 bits (402), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 87/230 (37%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           + Y L C G+  +    +S+L+C Y +  + FL I PLK EEL+ DP +V  HD IY SE
Sbjct: 284 QAYSLTCSGHWRLTPKEQSHLRCGYVTETHPFLWIAPLKAEELFQDPLLVLYHDVIYQSE 343

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           I+ I +L++ ++ R  V  + +++  + R S+  F+       H  L  I  R+ DMTNL
Sbjct: 344 IDVIRKLTENRLMRATVTGHNESLVSNVRTSQFTFIPASA---HKVLSTIDQRVADMTNL 400

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLW-------RLASFMFYLTDVE 170
            +    +Y    Q  NYG+GGHY  H D    T  D GL        R+A+ +FYL+DV 
Sbjct: 401 NM----KYAEDHQFANYGIGGHYGQHMDWFYQTTFDAGLVSSPEMGNRIATVLFYLSDVS 456

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GG T FP L   + P+K +A FW+N HA+ + D R  H  CP+  G+KW
Sbjct: 457 QGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKW 506


>gi|195390835|ref|XP_002054073.1| GJ22993 [Drosophila virilis]
 gi|194152159|gb|EDW67593.1| GJ22993 [Drosophila virilis]
          Length = 525

 Score =  159 bits (401), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 94/227 (41%), Positives = 124/227 (54%), Gaps = 20/227 (8%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G        KS L C Y S N+ FL++ PLK E L LDP +V  HD I  SEI 
Sbjct: 289 YERGCRGQFPT----KSKLHCVYNSTNSPFLRLAPLKTELLALDPYMVLYHDVITPSEIR 344

Query: 63  RIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            +  L+   ++R  V N   G    V TR SKV +L   +   +P   ++  RI DMT  
Sbjct: 345 ELQYLAVPTLKRATVFNQKMGRNTVVKTRTSKVTWLTDSL---NPLTVRLNRRISDMTGF 401

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLW---RLASFMFYLTDVELGG 173
            +   E     LQ+ NYGLGGHYDLH D       +D       R+A+ +FYLTDVE GG
Sbjct: 402 DLYGSEM----LQVMNYGLGGHYDLHFDYFNATIAKDLTKLNGDRIATVLFYLTDVEQGG 457

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           AT+FP++   +FP+KG+AV WYN   N   D +  H+ CPV +G+KW
Sbjct: 458 ATVFPNIKQAIFPKKGTAVMWYNLRHNNDGDPQTLHAACPVIVGSKW 504


>gi|170064956|ref|XP_001867741.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
 gi|167882144|gb|EDS45527.1| prolyl 4-hydroxylase alpha subunit 1 [Culex quinquefasciatus]
          Length = 520

 Score =  159 bits (401), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 89/228 (39%), Positives = 134/228 (58%), Gaps = 16/228 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           ++Y   C+G+   P ++ S L C YE+    FL++ PLK+E + L+P +V  H+A+ D E
Sbjct: 278 KLYEKLCRGDYERPGEVTSQLFCRYETSATPFLRLAPLKLEVVNLEPLIVVYHEAVSDRE 337

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDH--PFLYKIQTRIQDMT 118
           I ++IEL++  ++R  V   GDT     ++SK+       F +   P +  +  R +DM 
Sbjct: 338 IAKLIELARPLIKRSAV---GDT--RSEQISKIRISQNAWFENEHDPIVETLNQRARDMA 392

Query: 119 NLVIGREERYKGPLQINNYGLGG----HYDLHCDATP-RDEGLW-RLASFMFYLTDVELG 172
               G  E     LQ+NNYGLGG    HYD    A P  ++G+  R+A+ MFYL+DV+ G
Sbjct: 393 G---GLNEPSYELLQVNNYGLGGFYSIHYDWSTSANPFPNKGMGNRIATLMFYLSDVQEG 449

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           G+T+FP LNL V P KG+A+FWYN H N   + +  H+ CPV +G+KW
Sbjct: 450 GSTVFPRLNLAVRPRKGTAIFWYNLHRNGKGNKKTLHAACPVLIGSKW 497


>gi|170029530|ref|XP_001842645.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
 gi|167863229|gb|EDS26612.1| prolyl 4-hydroxylase subunit alpha-1 [Culex quinquefasciatus]
          Length = 522

 Score =  159 bits (401), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 89/225 (39%), Positives = 130/225 (57%), Gaps = 16/225 (7%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           +Y   C+G +    D  S L+C  ++    FL++ PLKVEE+ L+P +   H  I D EI
Sbjct: 273 LYEPLCRGEVHRFADELSKLRCRLDTKTTPFLRLAPLKVEEVSLEPPIYLYHKVISDEEI 332

Query: 62  NRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
           +++IEL K ++ R  V      +    R+S+  +L  E+    P L  +Q R  DM+   
Sbjct: 333 DKLIELGKARLNRATV----GQMVSQVRISQNVWLSEEV---DPLLGVLQRRTYDMSR-- 383

Query: 122 IGREERYKGPLQINNYGLGGHYDLH--CDAT----PRDEGLWRLASFMFYLTDVELGGAT 175
            G   +    +Q+NNYG+GGH   H  CD+     P+     RLA+ M+YL+DVE+GG T
Sbjct: 384 -GLSMQGFDMVQVNNYGIGGHNIPHYDCDSEYPPFPQFNMGNRLATLMYYLSDVEVGGGT 442

Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +FP L+L VFP KGSA+FW+N H N  +D RM H+GCP  +G+KW
Sbjct: 443 VFPRLSLGVFPIKGSAIFWHNVHHNGNVDERMLHAGCPTLIGSKW 487


>gi|195505218|ref|XP_002099409.1| GE10887 [Drosophila yakuba]
 gi|194185510|gb|EDW99121.1| GE10887 [Drosophila yakuba]
          Length = 521

 Score =  159 bits (401), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 86/230 (37%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           + Y L C G+  +    + +L+C Y +  + FL I PLK EEL+ DP +V  HD IY SE
Sbjct: 256 QAYSLTCSGHWRLTPKEQRHLRCGYVTETHPFLWIAPLKAEELFQDPLLVLYHDVIYQSE 315

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           I+ I +L++ +++R  V  + +++  + R S+  F+       H  L  I  R+ DMTNL
Sbjct: 316 IDVIRKLTENRLKRATVTGHNESVVSNVRTSQFTFI---PVSAHKVLSTIDQRVADMTNL 372

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLW-------RLASFMFYLTDVE 170
            +    +Y    Q  NYG+GGHY  H D    T  D GL        R+A+ +FYL+DV 
Sbjct: 373 NM----KYAEDHQFANYGIGGHYGQHMDWFYQTTIDAGLISSPEMGNRIATVLFYLSDVS 428

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GG T FP L   + P+K +A FW+N HA+ + D R  H  CP+  G+KW
Sbjct: 429 QGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKW 478


>gi|194905436|ref|XP_001981196.1| GG11753 [Drosophila erecta]
 gi|190655834|gb|EDV53066.1| GG11753 [Drosophila erecta]
          Length = 550

 Score =  158 bits (400), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 92/230 (40%), Positives = 132/230 (57%), Gaps = 21/230 (9%)

Query: 3   YPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           Y + C+G L   P D++S L+C Y +    FL++GPLK+EE + DP +V  HDA+YD EI
Sbjct: 302 YEMLCRGELKPSPSDLRS-LRCRYVTNGVPFLRLGPLKLEEAHADPYIVIFHDAMYDGEI 360

Query: 62  NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFL-YPEIFGDHPFLYKIQTRIQDMT 118
           + I  +++ +  R  V N   G     + R+SK  +L  PE    H  +  +  R  DMT
Sbjct: 361 DLIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTPE----HRVIETVVQRTADMT 416

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-----GLW---RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GGHY+ H D   ++E     GL    R+A+ +FY++DVE
Sbjct: 417 GLDMDSAEE----LQVVNYGIGGHYEPHFDFARKEEQRAFEGLNLGNRIATVLFYMSDVE 472

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+F SL+  +FP+KG+A FW N H +   D R  H+ CPV  G KW
Sbjct: 473 QGGATVFTSLHTALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKW 522


>gi|194905397|ref|XP_001981189.1| GG11929 [Drosophila erecta]
 gi|190655827|gb|EDV53059.1| GG11929 [Drosophila erecta]
          Length = 538

 Score =  158 bits (400), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 83/225 (36%), Positives = 129/225 (57%), Gaps = 12/225 (5%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           ++Y   C+G L      + NL+C+       + ++ P K E+L LDP V  +H  ++DSE
Sbjct: 289 QLYTQLCRGELHQSPREQRNLRCWLSHQGVPYYRLSPFKFEQLNLDPYVALVHHVLWDSE 348

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           +  I++  +G +ER KV    ++   D R S+  +L+ ++   +P+L +I+ R++D+T L
Sbjct: 349 MEMIMQHGRGSMERSKVGQSENSKIADRRTSQNTWLWYDV---NPWLSRIKQRLEDVTGL 405

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGGAT 175
                E    PLQ+ NYG+GG Y+ H D     E ++     RL + +FY+ DV LGGAT
Sbjct: 406 STESAE----PLQLLNYGIGGQYEPHFDFVEDAEKIFGWQDDRLMTAIFYINDVALGGAT 461

Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            FP L L V PEKGS + W N H++   DYR  H+GCP+  G+KW
Sbjct: 462 AFPFLRLAVPPEKGSLLMWNNLHSSLHKDYRSKHAGCPILQGSKW 506


>gi|195575089|ref|XP_002105512.1| GD21521 [Drosophila simulans]
 gi|194201439|gb|EDX15015.1| GD21521 [Drosophila simulans]
          Length = 550

 Score =  158 bits (399), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 133/229 (58%), Gaps = 19/229 (8%)

Query: 3   YPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           Y + C+G L   P D++S L+C Y +    FL++GPLK+EE++ DP +V  HDA+YDSEI
Sbjct: 302 YEMLCRGELKPSPSDLRS-LRCRYVTNRVPFLRLGPLKLEEVHADPYIVIYHDAMYDSEI 360

Query: 62  NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
           + I  +++ +  R  V N   G     + R+SK  +L  +   +   +  +  R  DMT 
Sbjct: 361 DLIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTQ---EDRVIETVVQRTADMTG 417

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-----GLW---RLASFMFYLTDVEL 171
           L +   E     LQ+ NYG+GGHY+ H D   ++E     GL    R+A+ +FY++DVE 
Sbjct: 418 LDMDSAEE----LQVVNYGIGGHYEPHFDFARKEEERAFEGLNLGNRIATVLFYMSDVEQ 473

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GGAT+F SL+  +FP+KG+A FW N H +   D R  H+ CPV  G KW
Sbjct: 474 GGATVFTSLHTALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKW 522


>gi|195452742|ref|XP_002073480.1| GK13123 [Drosophila willistoni]
 gi|194169565|gb|EDW84466.1| GK13123 [Drosophila willistoni]
          Length = 540

 Score =  158 bits (399), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 84/225 (37%), Positives = 126/225 (56%), Gaps = 12/225 (5%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           ++Y   C+G L      +  L+CFY      F ++GP KVE+L LDP V   H+ I D E
Sbjct: 286 DLYQRVCRGELRQSPRQQRKLRCFYSDRGVAFYRLGPFKVEQLNLDPYVAYFHNVISDDE 345

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            + +IE   G+V+R +V   G++   + R S+  +L+   +   P+L  ++ R++D+T L
Sbjct: 346 TDDLIEHGMGQVKRSRVGTVGNSTVSEVRTSQNTWLW---YEQQPWLKNLKLRLEDITGL 402

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGGAT 175
            +   E    PLQ+ NYG+GGHY+ H D        +     RL + + YL +V +GGAT
Sbjct: 403 GMESAE----PLQLVNYGIGGHYEPHYDFVEDKVTTFGWKGNRLLTALLYLNEVPMGGAT 458

Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            FP L L V P KGS + WYN H +   D+R  H+GCPV +G+KW
Sbjct: 459 AFPYLKLAVPPVKGSLLVWYNLHRSLDPDFRTKHAGCPVLMGSKW 503


>gi|24651407|ref|NP_733371.1| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
 gi|20269806|gb|AAM18058.1|AF495536_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]EFB
           [Drosophila melanogaster]
 gi|15292529|gb|AAK93533.1| SD05564p [Drosophila melanogaster]
 gi|23172692|gb|AAF57053.2| prolyl-4-hydroxylase-alpha EFB [Drosophila melanogaster]
 gi|220946562|gb|ACL85824.1| PH4alphaEFB-PA [synthetic construct]
          Length = 550

 Score =  158 bits (399), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 133/229 (58%), Gaps = 19/229 (8%)

Query: 3   YPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           Y + C+G L   P D++S L+C Y +    FL++GPLK+EE++ DP +V  HDA+YDSEI
Sbjct: 302 YEMLCRGELKPSPSDLRS-LRCRYVTNRVPFLRLGPLKLEEVHADPYIVIYHDAMYDSEI 360

Query: 62  NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
           + I  +++ +  R  V N   G     + R+SK  +L  +   +   +  +  R  DMT 
Sbjct: 361 DLIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTQ---EDRVIETVVQRTADMTG 417

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-----GLW---RLASFMFYLTDVEL 171
           L +   E     LQ+ NYG+GGHY+ H D   ++E     GL    R+A+ +FY++DVE 
Sbjct: 418 LDMDSAEE----LQVVNYGIGGHYEPHFDFARKEEQRAFEGLNLGNRIATVLFYMSDVEQ 473

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GGAT+F SL+  +FP+KG+A FW N H +   D R  H+ CPV  G KW
Sbjct: 474 GGATVFTSLHTALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKW 522


>gi|195341536|ref|XP_002037362.1| GM12882 [Drosophila sechellia]
 gi|194131478|gb|EDW53521.1| GM12882 [Drosophila sechellia]
          Length = 550

 Score =  158 bits (399), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 88/229 (38%), Positives = 131/229 (57%), Gaps = 19/229 (8%)

Query: 3   YPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           Y + C+G L   P D++S L+C Y +    FL++GPLK+EE++ DP +V  HDA+YDSEI
Sbjct: 302 YEMLCRGELKPSPSDLRS-LRCRYVTNRVPFLRLGPLKLEEVHADPYIVIYHDAMYDSEI 360

Query: 62  NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
           + I  +++ +  R  V N   G     + R+SK  +L  +   +   +  +  R  DMT 
Sbjct: 361 DLIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTQ---EDRVIETVVQRTADMTG 417

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVEL 171
           L +   E     LQ+ NYG+GGHY+ H D   ++E           R+A+ +FY++DVE 
Sbjct: 418 LDMDSAEE----LQVVNYGIGGHYEPHFDFARKEEERAFEGINLGNRIATVLFYMSDVEQ 473

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GGAT+F SL+  +FP+KG+A FW N H +   D R  H+ CPV  G KW
Sbjct: 474 GGATVFTSLHTALFPKKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKW 522


>gi|195391754|ref|XP_002054525.1| GJ24502 [Drosophila virilis]
 gi|194152611|gb|EDW68045.1| GJ24502 [Drosophila virilis]
          Length = 487

 Score =  158 bits (399), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 90/231 (38%), Positives = 133/231 (57%), Gaps = 19/231 (8%)

Query: 1   EIYPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           + Y + C+G L   P +++  L+C Y + N  FL++ PLK+EE Y+DP +V  HDA+YDS
Sbjct: 237 KAYEMLCRGELKPSPSELRP-LRCRYVNNNVAFLRLAPLKLEEAYMDPYIVIYHDAMYDS 295

Query: 60  EINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
           EI  I  +++ +  R  V N   G     + R+SK  +L      +H  +  +  R  DM
Sbjct: 296 EIEIIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKT---AEHRVIGTVVQRTADM 352

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-----GLW---RLASFMFYLTDV 169
           T L +   E     LQ+ NYG+GGHY+ H D   R+E     GL    R+A+ +FY++DV
Sbjct: 353 TGLDMDSAEE----LQVVNYGIGGHYEPHFDFARREEKRAFEGLNLGNRIATMLFYMSDV 408

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           E GGAT+F SL+  ++P+KG+A FW N H +   D R  H+ CPV  G+KW
Sbjct: 409 EQGGATVFTSLHAALWPKKGTAAFWMNLHRSGEGDVRTRHAACPVLTGSKW 459


>gi|195110919|ref|XP_002000027.1| GI24860 [Drosophila mojavensis]
 gi|193916621|gb|EDW15488.1| GI24860 [Drosophila mojavensis]
          Length = 487

 Score =  157 bits (398), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 88/230 (38%), Positives = 130/230 (56%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           + Y + C+G L +   +   L+C Y S N  FL++ PLK+EE +LDP +V  HDA++DSE
Sbjct: 237 KAYEMLCRGELKLSPSVLRPLRCRYVSNNVPFLRLAPLKLEEAFLDPYIVIYHDAMFDSE 296

Query: 61  INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  +  +++ +  R  V N   G     + R+SK  +L      +H  +  +  R  DMT
Sbjct: 297 IEVLKRMARPRFRRATVQNAVTGALETANYRISKSAWLKT---AEHRVIGTVVQRTADMT 353

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-----GLW---RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GGHY+ H D   R+E     GL    R+A+ +FY++DVE
Sbjct: 354 GLDMDSAEE----LQVVNYGIGGHYEPHFDFARREEIRAFEGLNLGNRIATVLFYMSDVE 409

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+F SL+  + P+KG+A FW N H +   D R  H+ CPV  G+KW
Sbjct: 410 QGGATVFTSLHAVLKPKKGTAAFWMNLHRSGEGDVRTRHAACPVLTGSKW 459


>gi|157111033|ref|XP_001651361.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
 gi|108878552|gb|EAT42777.1| AAEL005714-PA, partial [Aedes aegypti]
          Length = 522

 Score =  157 bits (397), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 86/218 (39%), Positives = 126/218 (57%), Gaps = 11/218 (5%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           C+G +       S+LKC Y S  + F KIGP K+EE++L P++V  HD + D+EI  +  
Sbjct: 289 CRGEIQRNVSETSHLKCRYVSNLSAFSKIGPFKLEEMHLKPKIVIFHDVLSDTEIELLKR 348

Query: 67  LSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGR 124
           L+K  +ER  + N   G       R+SK  + +P+ +  H  +  I  R+ DMT L +  
Sbjct: 349 LAKPILERATIANQQTGKAERSKDRVSKSSW-FPDEY--HSTIRTITKRVADMTGLSMDT 405

Query: 125 EERYKGPLQINNYGLGGHYDLHCD--ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL 182
            E     LQ+ NYGLGG YD H D     + + + R+A+ +FY++DV +GGAT+FP L +
Sbjct: 406 AEE----LQVVNYGLGGQYDPHFDFFHWGKLKEVNRIATVLFYMSDVSIGGATVFPKLGV 461

Query: 183 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T+   KG+A FWYN H++  LDY   H  CPV +G KW
Sbjct: 462 TLEARKGTAAFWYNLHSSGELDYSTLHGACPVLIGEKW 499


>gi|195452726|ref|XP_002073473.1| GK14136 [Drosophila willistoni]
 gi|194169558|gb|EDW84459.1| GK14136 [Drosophila willistoni]
          Length = 550

 Score =  157 bits (397), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 90/229 (39%), Positives = 133/229 (58%), Gaps = 19/229 (8%)

Query: 3   YPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           Y + C+G L   P D++  L+C Y + N  FL++GPLK+EE ++DP +V  HDA+YDSE+
Sbjct: 302 YEMLCRGELKPSPADLRP-LRCRYVTNNVPFLRLGPLKLEEAHMDPYIVIYHDAMYDSEM 360

Query: 62  NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
           + I  +++ +  R  V N   G     + R+SK  +L  E   +   +  +  R  DMT 
Sbjct: 361 DLIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTE---EDQVIGTVVQRTADMTG 417

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-----GLW---RLASFMFYLTDVEL 171
           L +   E     LQ+ NYG+GGHY+ H D   R+E     GL    R+A+ +FY++DVE 
Sbjct: 418 LDMDSAEE----LQVVNYGIGGHYEPHFDFARREEKRAFEGLNLGNRIATVLFYMSDVEQ 473

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GGAT+F SL+  ++P+KG+A FW N H +   D R  H+ CPV  G KW
Sbjct: 474 GGATVFTSLHAALWPKKGTAAFWMNLHRDGEGDVRTRHAACPVLTGTKW 522


>gi|195055779|ref|XP_001994790.1| GH14110 [Drosophila grimshawi]
 gi|193892553|gb|EDV91419.1| GH14110 [Drosophila grimshawi]
          Length = 487

 Score =  157 bits (396), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 89/230 (38%), Positives = 131/230 (56%), Gaps = 21/230 (9%)

Query: 3   YPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           Y + C+G L   P +I+  L+C Y + N  FL++ PLK+EE ++DP +V  HDA+YDSEI
Sbjct: 239 YEMLCRGELKPSPAEIRP-LRCRYVNNNVDFLRLAPLKLEEAFMDPYIVIYHDAMYDSEI 297

Query: 62  NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFL-YPEIFGDHPFLYKIQTRIQDMT 118
             +  +++ +  R  V N   G     + R+SK  +L  PE    H  +  +  R  DMT
Sbjct: 298 EVLKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTPE----HEIIGTVVQRTADMT 353

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GGHY+ H D   R+E L         R+A+ +FY++DV+
Sbjct: 354 GLDMDSAEE----LQVVNYGIGGHYEPHFDFARREEKLAFEGLNLGNRIATMLFYMSDVQ 409

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+F SL   ++P+KG+A FW N H +   D R  H+ CPV  G+KW
Sbjct: 410 QGGATVFTSLRTALWPKKGTAAFWMNLHRSGEGDARTRHAACPVLTGSKW 459


>gi|297515507|gb|ADI44133.1| RT08151p [Drosophila melanogaster]
          Length = 546

 Score =  157 bits (396), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           + Y L C G+  +    + +L+C Y +  + FL I PLK EEL+ DP +V  HD IY SE
Sbjct: 287 QAYSLTCSGHWRLTPKEQRHLRCGYVTETHPFLWIAPLKAEELFQDPLLVLYHDVIYQSE 346

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           I+ I +L++ ++ R  + ++ +++  + R S+  F+       H  L  I  R+ DMTNL
Sbjct: 347 IDVIRKLTENRLMRATITSHNESVVSNVRTSQFTFI---PVTAHKVLSTIDQRVADMTNL 403

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLW-------RLASFMFYLTDVE 170
            +    +Y    Q  NYG+GGHY  H D    T  D GL        R+A+ +FYL+DV 
Sbjct: 404 NM----KYAEDHQFANYGIGGHYGQHMDWFYQTTFDAGLVSSPEMGNRIAAVLFYLSDVA 459

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GG T FP L   + P+K +A FW+N HA+ + D R  H  CP+  G+KW
Sbjct: 460 QGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKW 509


>gi|194765194|ref|XP_001964712.1| GF22904 [Drosophila ananassae]
 gi|190614984|gb|EDV30508.1| GF22904 [Drosophila ananassae]
          Length = 547

 Score =  156 bits (395), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 93/229 (40%), Positives = 132/229 (57%), Gaps = 19/229 (8%)

Query: 3   YPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           Y + C+G L   P D++  L+C Y + N  FL++GPLK+EE + +P +V  HDA+YDSEI
Sbjct: 299 YEMLCRGELKPSPADLRP-LRCRYVTNNVPFLRLGPLKLEEAHQEPYIVIYHDAMYDSEI 357

Query: 62  NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
             I  +++ +  R  V N   G     + R+SK  +L  E   DH     +Q R  DMT 
Sbjct: 358 ELIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTE--EDHVIGTVVQ-RTADMTG 414

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-----GLW---RLASFMFYLTDVEL 171
           L +   E     LQ+ NYG+GGHY+ H D   ++E     GL    R+A+ +FY++DVE 
Sbjct: 415 LDMDSAEE----LQVVNYGIGGHYEPHFDFARKEEKRAFEGLNLGNRIATVLFYMSDVEQ 470

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GGAT+F SL+  +FP+KG+A FW N H +   D R  H+ CPV  G KW
Sbjct: 471 GGATVFTSLHTALFPKKGTAAFWMNLHRDGEGDVRTRHAACPVLTGTKW 519


>gi|116008434|ref|NP_651806.2| CG9698 [Drosophila melanogaster]
 gi|113194862|gb|AAF57062.2| CG9698 [Drosophila melanogaster]
          Length = 547

 Score =  156 bits (395), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           + Y L C G+  +    + +L+C Y +  + FL I PLK EEL+ DP +V  HD IY SE
Sbjct: 287 QAYSLTCSGHWRLTPKEQRHLRCGYVTETHPFLWIAPLKAEELFQDPLLVLYHDVIYQSE 346

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           I+ I +L++ ++ R  + ++ +++  + R S+  F+       H  L  I  R+ DMTNL
Sbjct: 347 IDVIRKLTENRLMRATITSHNESVVSNVRTSQFTFI---PVTAHKVLSTIDQRVADMTNL 403

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLW-------RLASFMFYLTDVE 170
            +    +Y    Q  NYG+GGHY  H D    T  D GL        R+A+ +FYL+DV 
Sbjct: 404 NM----KYAEDHQFANYGIGGHYGQHMDWFYQTTFDAGLVSSPEMGNRIATVLFYLSDVA 459

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GG T FP L   + P+K +A FW+N HA+ + D R  H  CP+  G+KW
Sbjct: 460 QGGGTAFPQLRTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKW 509


>gi|198449643|ref|XP_001357664.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
 gi|198130698|gb|EAL26798.2| GA15938 [Drosophila pseudoobscura pseudoobscura]
          Length = 549

 Score =  156 bits (395), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 86/225 (38%), Positives = 126/225 (56%), Gaps = 12/225 (5%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E+Y   C+G L      +  L+C+    +  + ++ P KVE+L  DP V   HD + D E
Sbjct: 300 ELYQRVCRGELRQSPKEQRYLRCWLSHQDVPYQRLSPFKVEQLSGDPYVAYFHDVLSDKE 359

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
             +IIE  KG+V R ++   G++   D R S+  +L+   + ++P+L  I+ R++D+T L
Sbjct: 360 SEQIIEHGKGQVTRSEIGQTGNSTVSDIRTSQNTWLW---YENNPWLADIKQRLEDITGL 416

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGGAT 175
                E    PLQ+ NYG+GG Y+ H D     E  +     RL + +FYL DV LGGAT
Sbjct: 417 STDTAE----PLQLVNYGIGGQYEPHFDFMDDAEKNFGWKGNRLLTALFYLNDVPLGGAT 472

Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            FP L+L V P KGS + WYN H +   D+R  H+GCPV  G+KW
Sbjct: 473 AFPFLHLAVPPVKGSLLVWYNLHRSLHKDFRTKHAGCPVLKGSKW 517


>gi|125772807|ref|XP_001357662.1| GA15946 [Drosophila pseudoobscura pseudoobscura]
 gi|54637394|gb|EAL26796.1| GA15946 [Drosophila pseudoobscura pseudoobscura]
          Length = 549

 Score =  155 bits (392), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 87/228 (38%), Positives = 128/228 (56%), Gaps = 17/228 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y + C+G L        +L+C Y + N  FL++GPLK+EE + DP +V  HDA+YDSE++
Sbjct: 301 YEMLCRGELKPSPTYMRSLRCRYVTNNVPFLRLGPLKLEEAHKDPYIVIYHDAMYDSEMD 360

Query: 63  RIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            I  +++ +  R  V N   G     + R+SK  +L  E   +   + K+  R  DMT L
Sbjct: 361 LIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTE---EDSVIAKVVQRTADMTGL 417

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-----GLW---RLASFMFYLTDVELG 172
            +   E     LQ+ NYG+GGHY  H D   R+E     GL    R+A+ +FY++DVE G
Sbjct: 418 DMESAEE----LQVVNYGIGGHYAPHFDFARREEKRAFEGLNLGNRIATVLFYMSDVEQG 473

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GAT+F +L   ++P++G+A FW N H +   D R  H+ CPV  G KW
Sbjct: 474 GATVFTTLRTALWPKRGTAAFWMNLHRDGEGDKRTQHAACPVLTGTKW 521


>gi|195159323|ref|XP_002020531.1| GL13463 [Drosophila persimilis]
 gi|194117300|gb|EDW39343.1| GL13463 [Drosophila persimilis]
          Length = 487

 Score =  155 bits (391), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 87/228 (38%), Positives = 128/228 (56%), Gaps = 17/228 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y + C+G L        +L+C Y + N  FL++GPLK+EE + DP +V  HDA+YDSE++
Sbjct: 239 YEMLCRGELKPSPTYMRSLRCRYVTNNVPFLRLGPLKLEEAHKDPYIVIYHDAMYDSEMD 298

Query: 63  RIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            I  +++ +  R  V N   G     + R+SK  +L  E   +   + K+  R  DMT L
Sbjct: 299 LIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTE---EDSVIAKVVQRTADMTGL 355

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-----GLW---RLASFMFYLTDVELG 172
            +   E     LQ+ NYG+GGHY  H D   R+E     GL    R+A+ +FY++DVE G
Sbjct: 356 DMESAEE----LQVVNYGIGGHYAPHFDFARREEKRAFEGLNLGNRIATVLFYMSDVEQG 411

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GAT+F +L   ++P++G+A FW N H +   D R  H+ CPV  G KW
Sbjct: 412 GATVFTTLRTALWPKRGTAAFWMNLHRDGEGDKRTQHAACPVLTGTKW 459


>gi|112984520|ref|NP_001037195.1| prolyl 4-hydroxylase alpha subunit precursor [Bombyx mori]
 gi|37543673|gb|AAM21932.1| prolyl 4-hydroxylase alpha subunit [Bombyx mori]
          Length = 550

 Score =  155 bits (391), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 83/228 (36%), Positives = 129/228 (56%), Gaps = 15/228 (6%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           ++Y   C+G + +P +I   LKC+Y +  + FLK+ P+KVE++Y+ P +   H+ + D E
Sbjct: 291 KVYESLCRGEMEIPHEITKRLKCWYVTDTHPFLKLAPIKVEQMYVKPDIFMFHEVMTDDE 350

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  I + +K + +R  V +   G+      R+SK  +L  E   + P + +I  R+ DMT
Sbjct: 351 IEFIKKRAKPRFKRAVVHDPKTGELTPAHYRISKSSWLRDE---ESPVIARITQRVTDMT 407

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE------GLWRLASFMFYLTDVELG 172
            L +   E     LQ+ NYG+GGHY+ H D   + E      G  R+A+ +FY++DV  G
Sbjct: 408 GLSMLHAEE----LQVVNYGIGGHYEPHFDFARKRENPFTKFGGNRIATVLFYMSDVAQG 463

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GAT+F  L L++FP K +A FW N HA+   D    H+ CPV  G+KW
Sbjct: 464 GATVFTELGLSLFPIKRAAAFWLNLHASGEGDLATRHAACPVLRGSKW 511


>gi|195159313|ref|XP_002020526.1| GL14040 [Drosophila persimilis]
 gi|194117295|gb|EDW39338.1| GL14040 [Drosophila persimilis]
          Length = 549

 Score =  154 bits (390), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 85/225 (37%), Positives = 126/225 (56%), Gaps = 12/225 (5%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E+Y   C+G L      +  L+C+    +  + ++ P KVE+L  DP V   HD + D E
Sbjct: 300 ELYQRVCRGELRQSPKEQRYLRCWLSHQDVPYQRLSPFKVEQLSGDPYVAYFHDVLSDKE 359

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
             +IIE  KG+V R ++   G++   + R S+  +L+   + ++P+L  I+ R++D+T L
Sbjct: 360 SEQIIEHGKGQVTRSEIGQTGNSTVSEIRTSQNTWLW---YENNPWLADIKQRLEDITGL 416

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGGAT 175
                E    PLQ+ NYG+GG Y+ H D     E  +     RL + +FYL DV LGGAT
Sbjct: 417 STDTAE----PLQLVNYGIGGQYEPHFDFMDDAEKNFGWKGNRLLTALFYLNDVPLGGAT 472

Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            FP L+L V P KGS + WYN H +   D+R  H+GCPV  G+KW
Sbjct: 473 AFPFLHLAVPPVKGSLLVWYNLHRSLHKDFRTKHAGCPVLKGSKW 517


>gi|321474875|gb|EFX85839.1| hypothetical protein DAPPUDRAFT_309105 [Daphnia pulex]
          Length = 545

 Score =  154 bits (389), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 85/228 (37%), Positives = 131/228 (57%), Gaps = 15/228 (6%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           ++Y   C+G   +   ++  L+C Y + N  +  I P+K+EE  L PR+V  HD I D E
Sbjct: 300 DVYEQLCRGEKLMDPKLEGRLRCRYVTNNVPYFYIQPIKMEEALLKPRIVVYHDIISDEE 359

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  I  L++ + ER  V     G+  +   R++K  +L  E   +H ++  I  R+ D+T
Sbjct: 360 IETIKRLAQPRFERATVQKKESGEREFSRYRIAKSAWLKHE---EHDYVSDINFRVGDIT 416

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL----W--RLASFMFYLTDVELG 172
            L +   E     LQ+ NYG+GGHY+ H D   + E      W  R+A+++FY++DVE G
Sbjct: 417 GLDMATSE----DLQVCNYGIGGHYEPHYDYARKGEVQQDFGWGGRIATWLFYMSDVEAG 472

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GAT+FP LNL+++P+KGSA FW+N + N   +    H+GCPV  G+KW
Sbjct: 473 GATVFPKLNLSLWPQKGSAAFWFNLYPNGEGNEMTQHAGCPVLTGSKW 520


>gi|312383453|gb|EFR28539.1| hypothetical protein AND_03427 [Anopheles darlingi]
          Length = 341

 Score =  153 bits (387), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 87/253 (34%), Positives = 129/253 (50%), Gaps = 40/253 (15%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           ++Y   C+G    P +++S L C Y +  + FL++ PLK+EE Y  P +V  HD + D E
Sbjct: 68  KLYEQLCRGEQEPPIELRSQLVCRYATNRSPFLRLAPLKLEEAYRQPDIVIYHDVMSDRE 127

Query: 61  INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  I   ++ +  R  V NY  G+  + + R+SK  +L      +H  +  +  R++DMT
Sbjct: 128 IELIKHYARPRFRRATVQNYKTGELEFANYRISKSAWLKD---TEHEVIRTVNQRVEDMT 184

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFY----- 165
            L +   E     LQ+ NYG+GGHY+ H D   R+E           R+A+ +FY     
Sbjct: 185 GLTMATAEE----LQVVNYGIGGHYEPHFDFARREERNAFKSLGTGNRIATVLFYVSDLC 240

Query: 166 ------------------LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM 207
                             ++DV  GGAT+FPSLNL + P KG+A FW+N HA+   DY  
Sbjct: 241 LCHTSHTNADFRFLSVGQMSDVTQGGATVFPSLNLALRPRKGTAAFWHNLHASGNGDYAT 300

Query: 208 YHSGCPVALGNKW 220
            H+ CPV  G KW
Sbjct: 301 RHAACPVLTGTKW 313


>gi|195061074|ref|XP_001995919.1| GH14105 [Drosophila grimshawi]
 gi|193891711|gb|EDV90577.1| GH14105 [Drosophila grimshawi]
          Length = 513

 Score =  153 bits (387), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 91/227 (40%), Positives = 126/227 (55%), Gaps = 21/227 (9%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G  +      S L C Y S N+ FL++ PLK+E L LDP +V  HDAI   EI 
Sbjct: 279 YEKGCRGQYAPA--TSSRLHCVYNSTNSAFLRLAPLKMELLQLDPYMVLYHDAISPREIE 336

Query: 63  RIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGD--HPFLYKIQTRIQDMT 118
            +  L+  +++R KVV+      + V  R SKV +L     GD  + F  ++  RI+DM+
Sbjct: 337 DLQFLAMPRLKRAKVVDQVTHRNMMVKERTSKVTWL-----GDATNAFTMRLNKRIEDMS 391

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLWRLASFMFYLTDVELGG 173
              +   E     LQ+ NYGLGGHY  H D     +  R  G  R+A+ MFYL+DVE GG
Sbjct: 392 GFTMYGSEM----LQVMNYGLGGHYASHYDFLNATSKTRLNGD-RIATVMFYLSDVEQGG 446

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           AT+FP +   VFP++G+A+ WYN   N   D    H+ CPV +G+KW
Sbjct: 447 ATVFPKIQKAVFPQRGTAIIWYNLKENGDFDTNTIHAACPVIVGSKW 493


>gi|195391766|ref|XP_002054531.1| GJ24504 [Drosophila virilis]
 gi|194152617|gb|EDW68051.1| GJ24504 [Drosophila virilis]
          Length = 545

 Score =  152 bits (385), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 87/231 (37%), Positives = 126/231 (54%), Gaps = 19/231 (8%)

Query: 1   EIYPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           ++Y   C G++   P +++  L+C Y +  + FL + PLKVEEL  DP +V  HD IY S
Sbjct: 284 DLYRYTCNGHIKPTPAELR-QLRCGYMTETHPFLLLAPLKVEELSHDPLLVLYHDVIYQS 342

Query: 60  EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
           EI+ + +L+K K+ R  V     ++  + R S+  F+ P+    H  L  I  R+ DMT+
Sbjct: 343 EIDTLAKLTKNKIHRATVTGNNASVVSNARTSQFTFI-PKT--RHKVLRTIDQRVADMTD 399

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD----------ATPRDEGLWRLASFMFYLTDV 169
           L +   E +    Q+ NYG+GGHY  H D               E   R+A+ +FYLTDV
Sbjct: 400 LNMVFAEDH----QLANYGIGGHYAQHMDWFSPNAFETKQVANSEMGNRIATVLFYLTDV 455

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           E GG T FP L   + P+K +A FWYN HA+   D R  H  CP+ +G+KW
Sbjct: 456 EQGGGTAFPVLKQLLKPKKYAAAFWYNLHASGAGDVRTMHGACPIIVGSKW 506


>gi|195505190|ref|XP_002099397.1| GE10881 [Drosophila yakuba]
 gi|194185498|gb|EDW99109.1| GE10881 [Drosophila yakuba]
          Length = 487

 Score =  151 bits (381), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 88/229 (38%), Positives = 129/229 (56%), Gaps = 19/229 (8%)

Query: 3   YPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           Y + C+G L   P +++  L+C Y +    FL++GPLK+EE + DP +V  HDA+YDSEI
Sbjct: 239 YEMLCRGELKPSPSELRP-LRCRYVTNGVPFLRLGPLKLEEAHADPYIVIYHDAMYDSEI 297

Query: 62  NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
           + I  +++ +  R  V N   G     + R+SK  +L      +   +  +  R  DMT 
Sbjct: 298 DVIKRMARPRFRRATVQNSVTGALETANYRISKSAWLKTH---EDRVIGTVVQRTADMTG 354

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-----GLW---RLASFMFYLTDVEL 171
           L +   E     LQ+ NYG+GGHY+ H D   ++E     GL    R+A+ +FY++DVE 
Sbjct: 355 LDMESAEE----LQVVNYGIGGHYEPHFDFARKEEERAFEGLNLGNRIATVLFYMSDVEQ 410

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GGAT+F SL+  +FP KG+A FW N H +   D R  H+ CPV  G KW
Sbjct: 411 GGATVFTSLHTALFPRKGTAAFWMNLHRDGQGDVRTRHAACPVLTGTKW 459


>gi|194765138|ref|XP_001964684.1| GF23317 [Drosophila ananassae]
 gi|190614956|gb|EDV30480.1| GF23317 [Drosophila ananassae]
          Length = 520

 Score =  150 bits (380), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 85/226 (37%), Positives = 129/226 (57%), Gaps = 16/226 (7%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           +Y + C+G    P    S L C Y S    FL + PLK+E + L+P +V  HD +  +EI
Sbjct: 284 LYEMGCRG--MYPASTDSKLVCRYNSTTTPFLTLAPLKMEIVGLNPYMVIYHDVLSSAEI 341

Query: 62  NRIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
           + + E++   ++R  V   + G    V TR SKV + +P+ +  +    ++  RI DMT 
Sbjct: 342 DEMKEMATPSLKRATVYKASLGKNEVVKTRTSKVAW-FPDSY--NSLTLRLNARIHDMTG 398

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLW--RLASFMFYLTDVELGGA 174
             +   E     LQ+ NYGLGGHYD H D   AT +   L   R+A+ +FY++DVE GGA
Sbjct: 399 FDLSGSEM----LQLMNYGLGGHYDKHYDFFNATEKSSSLTGDRIATVLFYMSDVEQGGA 454

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T+FP++  TV+P++G+AV WYN   +   D +  H+ CPV +G+KW
Sbjct: 455 TVFPNIYKTVYPQRGTAVMWYNLKDDGQPDEQTLHAACPVLVGSKW 500


>gi|321474898|gb|EFX85862.1| hypothetical protein DAPPUDRAFT_309117 [Daphnia pulex]
          Length = 541

 Score =  150 bits (380), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 84/226 (37%), Positives = 126/226 (55%), Gaps = 15/226 (6%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G   +   I++ L+C Y + N  +  I P+K+E   L PR+V  H+ + D EI 
Sbjct: 298 YERLCRGEKLMDPKIEARLRCRYVTNNVPYFFIQPIKMELASLKPRLVIYHNVVTDEEIE 357

Query: 63  RIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
              +L++ ++ R  V N   G +     R++K  FL      +H  + K+  RI D+T L
Sbjct: 358 TAKKLAQSRLRRSTVQNSLTGASEPTKYRIAKAAFLQN---SEHDHIVKMTRRIGDVTGL 414

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL----W--RLASFMFYLTDVELGGA 174
            +   E     LQ+ NYG+GGHY+ H D   + E      W  R+A++MFY++DVE GGA
Sbjct: 415 DMTTAEE----LQVCNYGIGGHYEPHYDHARKGEVQKDFGWGNRIATWMFYMSDVEAGGA 470

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T+FP +NL ++P+KGSA FW+N H N   D    H+ CPV  G+KW
Sbjct: 471 TVFPQINLALWPQKGSAAFWFNLHPNGEGDDLTQHAACPVLTGSKW 516


>gi|321474953|gb|EFX85917.1| hypothetical protein DAPPUDRAFT_309108 [Daphnia pulex]
          Length = 549

 Score =  150 bits (379), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 84/229 (36%), Positives = 130/229 (56%), Gaps = 18/229 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G   +   I+ +L+C Y + N  +  I PLK+EE +L P +V  HD I+D EI 
Sbjct: 303 YEKLCRGEKLMDPKIEGHLRCRYVTNNEPYFFIQPLKMEEAFLKPLLVIYHDVIFDEEIE 362

Query: 63  RIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            + +L+  + +R  V+N   G       R+SK  FL  +   +H  + K+  R+  +T L
Sbjct: 363 TVKKLAHPRFKRTTVMNSATGKLETAKYRISKAAFLKNK---EHHHVLKMSRRVGAITGL 419

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL-------WR--LASFMFYLTDVEL 171
            +   E     LQ+ NYG+GGHY+ H D   ++E +       WR  +A+++FY++DVE 
Sbjct: 420 DMSTAE----DLQVCNYGIGGHYEPHFDYARKNETIGFNKDSGWRNRIATWLFYMSDVEA 475

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GGAT+FP+LN+ ++P+KGSA FWYN   N   +    H+ CPV  G+KW
Sbjct: 476 GGATVFPALNVALWPQKGSAAFWYNLFPNGEGNELTRHAACPVLTGSKW 524


>gi|195055767|ref|XP_001994784.1| GH14132 [Drosophila grimshawi]
 gi|193892547|gb|EDV91413.1| GH14132 [Drosophila grimshawi]
          Length = 537

 Score =  150 bits (379), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 88/228 (38%), Positives = 123/228 (53%), Gaps = 15/228 (6%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           EIY   C G +      + NL+C Y S  + FL + PLKVEEL  +P +V  HD IY SE
Sbjct: 289 EIYRYTCNGYIKKTPPEERNLRCGYMSETHPFLLLAPLKVEELNRNPLLVLYHDVIYQSE 348

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           I+ + +L++ + ER  VV    +     R S+  F+       H  L  I  R+ DMTNL
Sbjct: 349 IDVLNKLNRKRYERAGVVINSTSTVSKKRTSQHIFIAA---TRHKVLRTIDQRVADMTNL 405

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD--------ATPRDEGLWRLASFMFYLTDVELG 172
            +    +Y    Q+ +YG+GGHY  H D         +  DE   R+A+ +FYL+DV  G
Sbjct: 406 NM----QYAEDHQLADYGIGGHYSQHFDWFGNSDLANSKCDEMGNRIATVLFYLSDVAQG 461

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           G T FP L   + P+K +A FWYN HA+   D+R  H GCP+ +G+KW
Sbjct: 462 GGTAFPILKQLLKPKKYAAAFWYNLHASGKGDWRNLHGGCPIIVGSKW 509


>gi|334311009|ref|XP_001371555.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Monodelphis
           domestica]
          Length = 534

 Score =  150 bits (378), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 87/230 (37%), Positives = 130/230 (56%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           E+Y   C+G  + +    +  L C Y   N T  L I P K E+ +  P +V+ +D + D
Sbjct: 289 EVYEALCRGEGIKLTPQRRKRLFCRYHDSNKTPQLLIAPFKEEDEWDSPHIVRYYDVLSD 348

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI +I E+SK K+ R  V +   G  I V  R+SK  +L  +   D P + ++  R+Q 
Sbjct: 349 EEIEKIKEISKPKLSRATVRDPKTGHLIVVSYRISKSSWLKED---DDPIIAQVNRRMQY 405

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ++NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 406 ITGLSVKTAEL----LQVSNYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP     ++P+KG++VFWYN   +   DYR  H+ CPV +G+KW
Sbjct: 462 AGGATVFPDFGAAIWPKKGTSVFWYNLFRSGECDYRTRHAACPVLVGSKW 511


>gi|194765168|ref|XP_001964699.1| GF22909 [Drosophila ananassae]
 gi|190614971|gb|EDV30495.1| GF22909 [Drosophila ananassae]
          Length = 525

 Score =  150 bits (378), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 85/228 (37%), Positives = 120/228 (52%), Gaps = 17/228 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y L C G+       + +L+C Y    + FL I PLK EEL  DP ++  HD IY SEI+
Sbjct: 258 YMLTCSGHFRPTPREQRDLRCGYMDETHPFLWIAPLKAEELSRDPLLILYHDVIYQSEID 317

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
            I +L+  K++R  + +  +++  + R S+  FL      +   L  I  R+ DMTN  +
Sbjct: 318 TIRKLTTNKLKRATITSTNESVVSNVRTSQFTFL---PVTEDKVLATIDRRVADMTNFNM 374

Query: 123 GREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLW-------RLASFMFYLTDVELG 172
               RY    Q  NYG+GGHY  H D       D GL        R+A+ +FYL+DV  G
Sbjct: 375 ----RYAEDHQFANYGIGGHYGQHMDWFYQPSFDAGLVSSPEMGNRIATVLFYLSDVTQG 430

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           G T FP L + + P+K +A FWYN HA+ + D R  H  CP+  G+KW
Sbjct: 431 GGTAFPHLRVLLKPKKYAAAFWYNLHASGVGDPRTQHGACPIISGSKW 478


>gi|321474877|gb|EFX85841.1| hypothetical protein DAPPUDRAFT_208740 [Daphnia pulex]
          Length = 545

 Score =  150 bits (378), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 132/230 (57%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y   C+G   +   I+  L+C Y + N  +L I P+K+EE +  P +V  H+ I D E
Sbjct: 298 ENYEKLCRGEKLMDPKIEGRLRCRYVTNNVPYLYIQPVKMEEAFHKPLIVIYHNVINDDE 357

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + ++++ + +R  V N   G+    + R+SK  +L  E   +H  ++K+  R+ D+T
Sbjct: 358 IETVKKMAQPRFKRATVQNSVTGNLEPANYRISKSAWLKSE---EHDHVFKVTRRVGDVT 414

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL------W--RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GGHY+ H D   ++E        W  R+A+++FY+++VE
Sbjct: 415 GLDMATAE----DLQVVNYGIGGHYEPHFDYARKEEVNAFKDLGWGNRVATWLFYMSEVE 470

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP LNL ++P+KGSA FWYN H N   +    H+ CPV  G+KW
Sbjct: 471 AGGATVFPKLNLALWPQKGSAAFWYNLHPNGEGNELTRHAACPVLTGSKW 520


>gi|198449635|ref|XP_001357660.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
 gi|198130694|gb|EAL26794.2| GA21971 [Drosophila pseudoobscura pseudoobscura]
          Length = 549

 Score =  149 bits (377), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 86/230 (37%), Positives = 120/230 (52%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y L C G+  +    + +L+C Y +  + FL + PLK EEL  DP +V  HD IY SE
Sbjct: 282 EAYRLTCSGHSRLTAREERHLRCGYMTETHPFLLLAPLKAEELSHDPLLVLYHDVIYQSE 341

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           I+ I +L+  ++ R  V     +   + R S++ F+      +H  L  I  R+ DMTNL
Sbjct: 342 IDVIRQLTTNRMARAMVTLTNQSTVSNVRTSQITFIAK---TEHEVLQTIDRRVADMTNL 398

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLW-------RLASFMFYLTDVE 170
            +   E +    Q  NYG+GGHY  H D    T  D GL        R+A+ +FYL+DV 
Sbjct: 399 NMDYAEDH----QFANYGIGGHYGQHMDWFTETTFDNGLVSSTEMGNRIATVLFYLSDVA 454

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GG T FP L   + P+K +A FW+N HA    D R  H  CP+  G+KW
Sbjct: 455 QGGGTAFPYLKQHLRPKKYAAAFWHNLHAAGRGDARTQHGACPIIAGSKW 504


>gi|21711777|gb|AAM75079.1| RE70601p [Drosophila melanogaster]
          Length = 316

 Score =  149 bits (376), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 86/226 (38%), Positives = 122/226 (53%), Gaps = 17/226 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y + C+G    P    S L C Y    + FL + PLK+E + LDP +V  HD +   EI 
Sbjct: 78  YQIGCRGQF--PPSADSKLYCLYNRTTSPFLILAPLKMELVGLDPYMVLYHDVLSPKEIK 135

Query: 63  RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            +  ++   ++R  V   + G    V TR SKV + +P+  G +P   ++  RI DMT  
Sbjct: 136 ELQGMATPSLKRATVYQASSGRNEVVKTRTSKVAW-FPD--GYNPLTVRLNARISDMTGF 192

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGA 174
            +   E     LQ+ NYGLGGHYD H D   +           R+A+ +FYLTDVE GGA
Sbjct: 193 NLYGSEM----LQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDRIATVLFYLTDVEQGGA 248

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T+FP++   VFP++GS V WYN   N  +D +  H+ CPV +G+KW
Sbjct: 249 TVFPNIRKAVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKW 294


>gi|195061068|ref|XP_001995918.1| GH14106 [Drosophila grimshawi]
 gi|193891710|gb|EDV90576.1| GH14106 [Drosophila grimshawi]
          Length = 511

 Score =  149 bits (375), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 89/223 (39%), Positives = 125/223 (56%), Gaps = 16/223 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y L C+G   VP+   SNL C Y+   + FL++ PLK+E + L+P +V  HDA+   EI+
Sbjct: 277 YSLGCRGQF-VPQ---SNLHCEYKMKTSPFLRLAPLKMEIVLLNPFIVVFHDALSPQEID 332

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
            +  L++  ++R  V   G  +    R SK  +L  ++   +    +I+ R+ DMT L +
Sbjct: 333 YLQNLARPLLKRTTVHVNGKYVSRRVRTSKGAWLERDL---NNLTRRIERRVVDMTELSM 389

Query: 123 GREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLWRLASFMFYLTDVELGGATIF 177
              E Y     I NYGLGGHY  H D          E   R+A+ +FYL+DVE GGAT+F
Sbjct: 390 QGSEAY----NIMNYGLGGHYAAHYDFFNTTKQQTSETGDRIATVLFYLSDVEQGGATVF 445

Query: 178 PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           P+L L V PE+G A+FWYN   N   D R  H GCPV +G+KW
Sbjct: 446 PNLKLAVSPERGMALFWYNLLDNGTGDTRTLHGGCPVLVGSKW 488


>gi|195575145|ref|XP_002105540.1| GD16902 [Drosophila simulans]
 gi|194201467|gb|EDX15043.1| GD16902 [Drosophila simulans]
          Length = 525

 Score =  149 bits (375), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 86/226 (38%), Positives = 122/226 (53%), Gaps = 17/226 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y + C+G    P    S L C Y    + FL + PLK+E + LDP +V  HD +   EI 
Sbjct: 287 YQMGCRGQF--PPSADSKLYCLYNRTTSPFLILAPLKMELVGLDPYMVLYHDVLSPKEIT 344

Query: 63  RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            +  ++   ++R  V   + G    V TR SKV + +P+  G +P   ++  RI DMT  
Sbjct: 345 ELQGMATPGLKRATVYQASSGRNEVVKTRTSKVAW-FPD--GYNPLTVRLNARISDMTGF 401

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGA 174
            +   E     LQ+ NYGLGGHYD H D   +           R+A+ +FYLTDVE GGA
Sbjct: 402 NLYGSEM----LQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDRIATVLFYLTDVEQGGA 457

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T+FP++   VFP++GS V WYN   N  +D +  H+ CPV +G+KW
Sbjct: 458 TVFPNIRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGSKW 503


>gi|291230950|ref|XP_002735430.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saccoglossus
           kowalevskii]
          Length = 533

 Score =  148 bits (374), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 82/231 (35%), Positives = 126/231 (54%), Gaps = 18/231 (7%)

Query: 1   EIYPLACQG-NLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           E Y   C+G  + +    +  LKC    YN  FL + P K E ++  P+++  HDAI  +
Sbjct: 286 EAYEALCRGEQVKMSPQRQKKLKCRLRDYNRPFLILQPAKEEVVFDKPKLIIFHDAILTN 345

Query: 60  EINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
           EI ++  L+  ++ R  + N   G+  + + R+SK  +L  +   D   ++++  RI+  
Sbjct: 346 EIRKVKALASPRLRRATIQNSVTGNLEFAEYRISKSAWLSED---DGDVVHRLNHRIEQY 402

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE--------GLWRLASFMFYLTDV 169
           T L +   E     LQ+ NYGLGGHY+ H D   ++E           R+A+F+FY++DV
Sbjct: 403 TGLTMDTAEE----LQVANYGLGGHYEPHFDFARKEEINAFKSLNTGNRIATFLFYMSDV 458

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           E GGAT+FP +   + PEKGSA FWYN   N   DY   H+ CPV +G+KW
Sbjct: 459 EAGGATVFPQVGARLIPEKGSAAFWYNLLKNGEGDYSTRHAACPVLVGSKW 509


>gi|344274274|ref|XP_003408942.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
           [Loxodonta africana]
          Length = 534

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 129/230 (56%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+V+ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  + N   GD   V  R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEVVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPDVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|24651477|ref|NP_733395.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
 gi|20269812|gb|AAM18061.1|AF495539_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]PV [Drosophila
           melanogaster]
 gi|23172718|gb|AAN14252.1| prolyl-4-hydroxylase-alpha PV [Drosophila melanogaster]
          Length = 525

 Score =  148 bits (373), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 86/226 (38%), Positives = 122/226 (53%), Gaps = 17/226 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y + C+G    P    S L C Y    + FL + PLK+E + LDP +V  HD +   EI 
Sbjct: 287 YQIGCRGQF--PPSADSKLYCLYNRTTSPFLILAPLKMELVGLDPYMVLYHDVLSPKEIK 344

Query: 63  RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            +  ++   ++R  V   + G    V TR SKV + +P+  G +P   ++  RI DMT  
Sbjct: 345 ELQGMATPGLKRATVYQASSGRNEVVKTRTSKVAW-FPD--GYNPLTVRLNARISDMTGF 401

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGA 174
            +   E     LQ+ NYGLGGHYD H D   +           R+A+ +FYLTDVE GGA
Sbjct: 402 NLYGSEM----LQLMNYGLGGHYDQHYDFFNKTNSNMTAMSGDRIATVLFYLTDVEQGGA 457

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T+FP++   VFP++GS V WYN   N  +D +  H+ CPV +G+KW
Sbjct: 458 TVFPNIRKAVFPQRGSVVMWYNLKDNGQIDTQTLHAACPVIVGSKW 503


>gi|195341560|ref|XP_002037374.1| GM12888 [Drosophila sechellia]
 gi|194131490|gb|EDW53533.1| GM12888 [Drosophila sechellia]
          Length = 501

 Score =  147 bits (372), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 79/220 (35%), Positives = 118/220 (53%), Gaps = 18/220 (8%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           + Y L C G+  +    + +L+C Y +  + FL I PLK EEL+ DP +V  HD IY SE
Sbjct: 256 QAYSLTCSGHWRLTPKEQRHLRCGYVTETHPFLWIAPLKAEELFQDPLLVLYHDVIYQSE 315

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           I+ I +L+K ++ R  + ++ +++  + R S++ F+       H  L  I  R+ DMTNL
Sbjct: 316 IDVIRKLTKNRLMRATITSHNESVVSNVRTSQITFI---PVTAHKVLSTIDQRVADMTNL 372

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSL 180
            +    +Y    Q  NYG+GGHY  H D        W    +   L+DV  GG T FP L
Sbjct: 373 NM----KYAEDHQFANYGIGGHYGQHMD--------W---FYQTTLSDVAQGGGTAFPQL 417

Query: 181 NLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
              + P+K +A FW+N HA+ + D R  H  CP+  G+KW
Sbjct: 418 RTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKW 457


>gi|380813206|gb|AFE78477.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
           mulatta]
 gi|384947328|gb|AFI37269.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
           mulatta]
          Length = 534

 Score =  147 bits (372), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 129/230 (56%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  + N   GD   V  R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|190788|gb|AAA36535.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
          Length = 534

 Score =  147 bits (372), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 129/230 (56%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  + N   GD   V  R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|281350467|gb|EFB26051.1| hypothetical protein PANDA_009188 [Ailuropoda melanoleuca]
          Length = 511

 Score =  147 bits (372), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 129/230 (56%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  + N   GD   V  R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|355562502|gb|EHH19096.1| hypothetical protein EGK_19739 [Macaca mulatta]
 gi|355782842|gb|EHH64763.1| hypothetical protein EGM_18071 [Macaca fascicularis]
 gi|383418719|gb|AFH32573.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Macaca
           mulatta]
          Length = 534

 Score =  147 bits (371), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 129/230 (56%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  + N   GD   V  R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|291404184|ref|XP_002718472.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 2
           [Oryctolagus cuniculus]
          Length = 534

 Score =  147 bits (371), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 129/230 (56%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  + N   GD   V  R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|63252888|ref|NP_001017962.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
           sapiens]
 gi|197099666|ref|NP_001125733.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Pongo abelii]
 gi|217272849|ref|NP_001136067.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Homo
           sapiens]
 gi|114631177|ref|XP_001140234.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Pan
           troglodytes]
 gi|114631181|ref|XP_001140652.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 7 [Pan
           troglodytes]
 gi|2507090|sp|P13674.2|P4HA1_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|75061858|sp|Q5RAG8.1|P4HA1_PONAB RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|602675|gb|AAA59068.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
 gi|23271226|gb|AAH34998.1| Prolyl 4-hydroxylase, alpha polypeptide I [Homo sapiens]
 gi|55729010|emb|CAH91242.1| hypothetical protein [Pongo abelii]
 gi|56403853|emb|CAI29712.1| hypothetical protein [Pongo abelii]
 gi|119574854|gb|EAW54469.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_c [Homo
           sapiens]
 gi|119574855|gb|EAW54470.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_d [Homo
           sapiens]
 gi|123981532|gb|ABM82595.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [synthetic
           construct]
 gi|123996359|gb|ABM85781.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [synthetic
           construct]
 gi|261861532|dbj|BAI47288.1| prolyl 4-hydroxylase, alpha polypeptide I [synthetic construct]
 gi|410295852|gb|JAA26526.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
 gi|410349611|gb|JAA41409.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 534

 Score =  147 bits (371), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 129/230 (56%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  + N   GD   V  R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|395820526|ref|XP_003783615.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Otolemur
           garnettii]
          Length = 534

 Score =  147 bits (371), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 129/230 (56%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  + N   GD   V  R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|148233143|ref|NP_001090904.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Sus scrofa]
 gi|83778522|gb|ABC47142.1| procollagen-proline 2-oxoglutarate-4-dioxygenase [Sus scrofa]
          Length = 534

 Score =  147 bits (371), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 130/230 (56%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I+ + +L+K ++ R  + N   GD   V  R+SK  +L      ++P + ++  RIQD+T
Sbjct: 349 IDIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGY---ENPVVSRLNMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|410251926|gb|JAA13930.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 566

 Score =  147 bits (370), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 129/230 (56%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  + N   GD   V  R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|157114983|ref|XP_001658090.1| prolyl 4-hydroxylase alpha subunit 1, putative [Aedes aegypti]
 gi|108877085|gb|EAT41310.1| AAEL007032-PA, partial [Aedes aegypti]
          Length = 448

 Score =  147 bits (370), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 84/220 (38%), Positives = 121/220 (55%), Gaps = 24/220 (10%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           +Y   C+G         +NL+C YES N++FLKI P K+EE  LDP +V  H+AI D EI
Sbjct: 244 LYEPLCRGEYQRTPAQVANLRCRYESKNSSFLKIAPFKLEEASLDPLIVIYHNAISDKEI 303

Query: 62  NRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
           ++II++SK  ++R  V   G++   +    +  +       D   +  +  R +DMT   
Sbjct: 304 DQIIQVSKPMLKRSMV---GESFSKEVSNERTNY-------DFELVKVLSLRTEDMT--- 350

Query: 122 IGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLN 181
            G + +    LQ+NNYG+GG Y  H D    +E +          +DVE GGAT+FP + 
Sbjct: 351 -GLDRKSYESLQVNNYGIGGFYLPHFDWVRTNEPI----------SDVEQGGATVFPQIG 399

Query: 182 LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
           + VFP+KGSA+FWYN   +   D R  H  CPV LG+KWG
Sbjct: 400 VGVFPKKGSAIFWYNLLPDGTGDERTLHGACPVLLGSKWG 439


>gi|321474876|gb|EFX85840.1| hypothetical protein DAPPUDRAFT_309107 [Daphnia pulex]
          Length = 528

 Score =  147 bits (370), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 84/229 (36%), Positives = 129/229 (56%), Gaps = 19/229 (8%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G   +   ++  L+C Y + N  F  I P+K+EE  L P +V  H  I+D+EI+
Sbjct: 285 YEKLCRGEKLLDPKVEGRLRCRYVTNNVPFFFIQPVKMEEALLKPLLVIYHGVIFDAEID 344

Query: 63  RIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            + +L++ + +R  V +   G ++ V  R++K  FL      +H  + K+  R+ D+T L
Sbjct: 345 VVKKLAQPRFKRTGVTDRDTGRSMPVQYRIAKAAFLKD---SEHNLIVKMSRRVGDITGL 401

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDAT-------PRDEGLW--RLASFMFYLTDVEL 171
            +   E     LQ+ NYG+GGHY  H D         PRD   W  R+A+++FY++DVE 
Sbjct: 402 DMAASE----DLQVCNYGIGGHYVPHFDYARQGEIHGPRDLD-WGNRIATWLFYMSDVEA 456

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GGAT+FP++   ++P+KGSA FWYN   N   D    H+GCPV  G+KW
Sbjct: 457 GGATVFPAVGAALWPQKGSAAFWYNLRPNGNGDEDTLHAGCPVLTGSKW 505


>gi|115495019|ref|NP_001069238.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
 gi|122144801|sp|Q1RMU3.1|P4HA1_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|92097479|gb|AAI14709.1| Prolyl 4-hydroxylase, alpha polypeptide I [Bos taurus]
 gi|296472132|tpg|DAA14247.1| TPA: prolyl 4-hydroxylase subunit alpha-1 precursor [Bos taurus]
 gi|440892721|gb|ELR45796.1| Prolyl 4-hydroxylase subunit alpha-1 [Bos grunniens mutus]
          Length = 534

 Score =  146 bits (369), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 129/230 (56%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  + N   GD   V  R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEVVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVL 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|195341590|ref|XP_002037389.1| GM12139 [Drosophila sechellia]
 gi|194131505|gb|EDW53548.1| GM12139 [Drosophila sechellia]
          Length = 525

 Score =  146 bits (369), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 87/230 (37%), Positives = 123/230 (53%), Gaps = 25/230 (10%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y + C+G    P    S L C Y    + FL + PLK+E + L+P +V  HD +   EI 
Sbjct: 287 YQVGCRGQF--PPSADSKLYCLYNRTTSPFLILAPLKMELVGLEPYMVLYHDVLSPKEIT 344

Query: 63  RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            +  ++   ++R  V   + G    V TR SKV + +P+  G +P   ++  RI DMT  
Sbjct: 345 ELQGMATPGLKRATVYQASSGRNEVVKTRTSKVAW-FPD--GYNPLTVRLNARISDMTGF 401

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD----------ATPRDEGLWRLASFMFYLTDVE 170
            +   E     LQ+ NYGLGGHYD H D          A   D    R+A+ +FYLTDVE
Sbjct: 402 NLYGSEM----LQLMNYGLGGHYDQHYDFFNNTNSNMTAMSGD----RIATVLFYLTDVE 453

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP++   VFP++GS V WYN   N  +D +  H+ CPV +G+KW
Sbjct: 454 QGGATVFPNIRKAVFPQRGSVVMWYNLRDNGQIDTQTLHAACPVIVGSKW 503


>gi|195390833|ref|XP_002054072.1| GJ22994 [Drosophila virilis]
 gi|194152158|gb|EDW67592.1| GJ22994 [Drosophila virilis]
          Length = 496

 Score =  146 bits (368), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 87/209 (41%), Positives = 119/209 (56%), Gaps = 16/209 (7%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           SNL C Y+   + FL + P+K+E   L+P ++  HD +   EI+ + +L++  +ER  VV
Sbjct: 275 SNLYCVYKFGTSPFLLLAPIKMEIRLLNPFIIVFHDVLSPREIDELQKLARPLLERTTVV 334

Query: 79  NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYK-IQTRIQDMTNLVIGREERYKGPLQINNY 137
            +        R SK  +    I  DH  L K I+ RI DM  L +    RY  P Q+ NY
Sbjct: 335 KFKKYEKDSRRTSKGTW----IERDHNNLTKRIERRITDMVELDL----RYSEPFQVMNY 386

Query: 138 GLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSA 191
           GLGGHY  H D      A  ++E   R+A+ +FYLTDVE GGAT+F  LN  V P++G+A
Sbjct: 387 GLGGHYAAHEDFLGDTWADKKEEDD-RIATVLFYLTDVEQGGATVFTILNQAVSPKRGTA 445

Query: 192 VFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +FWYN H N   D R  H GCPV +G+KW
Sbjct: 446 LFWYNLHRNGTGDTRTLHGGCPVLVGSKW 474


>gi|195505255|ref|XP_002099425.1| GE23368 [Drosophila yakuba]
 gi|194185526|gb|EDW99137.1| GE23368 [Drosophila yakuba]
          Length = 528

 Score =  146 bits (368), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 86/230 (37%), Positives = 122/230 (53%), Gaps = 25/230 (10%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y + C+G  +   D  S L C Y    + FL + PLK+E + LDP +V  HD +   EI 
Sbjct: 290 YQMGCRGQFAPSAD--SKLHCLYNRTTSPFLMLAPLKMELVGLDPYMVLYHDVLSAKEIK 347

Query: 63  RIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            +  ++   ++R  V     G    V TR SKV + +P+  G  P   ++  RI DMT  
Sbjct: 348 ELQGMATPGLKRATVFQAASGRNEVVRTRTSKVAW-FPD--GYSPLTVRLNARITDMTGF 404

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD----------ATPRDEGLWRLASFMFYLTDVE 170
            +   E     LQ+ NYGLGGHYD H D          A   D    R+A+ +FYLTDVE
Sbjct: 405 NLHGSEM----LQLMNYGLGGHYDQHYDYFNTINSNLTAMSGD----RIATVLFYLTDVE 456

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP++   VFP++GS + WYN   +  +D +  H+ CPV +G+KW
Sbjct: 457 QGGATVFPNIRKAVFPQRGSVIMWYNLKDDGQIDTQTLHAACPVIVGSKW 506


>gi|426255744|ref|XP_004021508.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Ovis
           aries]
          Length = 534

 Score =  146 bits (368), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 129/230 (56%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  + N   GD   V  R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVL 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|56118630|ref|NP_001007975.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
           (Silurana) tropicalis]
 gi|51513259|gb|AAH80485.1| p4ha2 protein [Xenopus (Silurana) tropicalis]
          Length = 527

 Score =  146 bits (368), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 86/232 (37%), Positives = 133/232 (57%), Gaps = 17/232 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y + N + +L + P+KVE+ +  PR+V+  +A+ D
Sbjct: 290 DVYEALCRGEGVKMNPRRQRRLFCRYHNGNRSPYLILSPVKVEDEWDSPRIVRYLNALSD 349

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI +I EL+K K+ R  V +   G     + R+SK  +L      D P + ++  R+Q 
Sbjct: 350 EEIAKIKELAKPKLARATVRDPKTGVLSVANYRVSKSAWLEE---NDDPVIARVNLRMQA 406

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D  L     RLA+F+ Y++DVE
Sbjct: 407 ITGLTVDTAEL----LQVANYGMGGQYEPHFDFSRRPFDSNLKTDGNRLATFLNYMSDVE 462

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGK 222
            GGAT+FP     ++P+KG+AVFWYN   +   DYR  H+ CPV +G+KWGK
Sbjct: 463 AGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKWGK 514


>gi|195575115|ref|XP_002105525.1| GD21527 [Drosophila simulans]
 gi|194201452|gb|EDX15028.1| GD21527 [Drosophila simulans]
          Length = 495

 Score =  146 bits (368), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 79/220 (35%), Positives = 117/220 (53%), Gaps = 18/220 (8%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           + Y L C G+  +    + +L+C Y +  + FL I PLK EEL+ DP +V  HD IY SE
Sbjct: 250 QAYSLTCSGHWQLTPKEQRHLRCGYVTETHPFLWIAPLKAEELFQDPLLVLYHDVIYQSE 309

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           I+ I +L+K ++ R  + ++ +++  + R S+  F+       H  L  I  R+ DMTNL
Sbjct: 310 IDVIRKLTKNRLMRATITSHNESVVSNVRTSQFTFI---PVTAHKVLSTIDQRVADMTNL 366

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSL 180
            +    +Y    Q  NYG+GGHY  H D        W    +   L+DV  GG T FP L
Sbjct: 367 NM----KYAEDHQFANYGIGGHYGQHMD--------W---FYQTTLSDVAQGGGTAFPQL 411

Query: 181 NLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
              + P+K +A FW+N HA+ + D R  H  CP+  G+KW
Sbjct: 412 RTLLKPKKYAAAFWHNLHASGVGDVRTQHGACPIIAGSKW 451


>gi|194905290|ref|XP_001981166.1| GG11918 [Drosophila erecta]
 gi|190655804|gb|EDV53036.1| GG11918 [Drosophila erecta]
          Length = 525

 Score =  145 bits (366), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 120/230 (52%), Gaps = 25/230 (10%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y + C+G    P      L C Y    + FL + PLK+E + LDP +V  HD +   EI 
Sbjct: 287 YQMGCRGQF--PPSADGKLYCLYNRTTSAFLMLAPLKMELVGLDPYMVLYHDVLSAKEIK 344

Query: 63  RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            +  ++   + R  V   + G    V TR SKV + +P+ +  +P   ++  RI DMT  
Sbjct: 345 ELQGMATPGLTRATVFQASSGRNEVVKTRTSKVAW-FPDSY--NPLTVRLNARIADMTGF 401

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD----------ATPRDEGLWRLASFMFYLTDVE 170
            +   E     LQ+ NYGLGGHYD H D          A   D    R+A+ +FYLTDVE
Sbjct: 402 NLYGSEM----LQLMNYGLGGHYDQHYDFFNTINSNLTAMSGD----RIATVLFYLTDVE 453

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP++   VFP++GS + WYN   N   D +  H+ CPV +G+KW
Sbjct: 454 QGGATVFPNIRKAVFPQRGSVIMWYNLQDNGQTDNKTLHAACPVIVGSKW 503


>gi|239915958|ref|NP_001070123.2| prolyl 4-hydroxylase alpha II-like precursor [Danio rerio]
          Length = 490

 Score =  145 bits (366), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 84/220 (38%), Positives = 125/220 (56%), Gaps = 16/220 (7%)

Query: 3   YPLACQGNLSVPEDIKSN-LKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y   C+G +      +   L C Y +   N  L   P+K EEL+ +P++++ HD I D+E
Sbjct: 262 YEALCRGEVDERTSKRQRALSCRYSTGGGNPRLMYAPVKEEELWDEPKIIRYHDVISDTE 321

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           I  + ++++ ++ R +    G  +  D R S+  FL  E  G    + +I  RI D+T L
Sbjct: 322 IETLKDIARPELTRSQT---GWGVISDIRTSQSVFL--EEVGT---VARISQRIADITGL 373

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSL 180
            +   E+    L + NYG+GG Y  H D    DE   R A+F+ Y++DVE+GGAT+F ++
Sbjct: 374 SVESAEK----LHVQNYGIGGRYTPHFDTG--DEVNERTATFLIYMSDVEVGGATVFTNV 427

Query: 181 NLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            + V PEKGSAVFWYN H N  LD +  H+GCPV +GNKW
Sbjct: 428 GVAVKPEKGSAVFWYNLHKNGELDLKTKHAGCPVLVGNKW 467


>gi|410914996|ref|XP_003970973.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Takifugu
           rubripes]
          Length = 538

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 86/232 (37%), Positives = 128/232 (55%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
           E Y   C+G  L + E  +S L C Y+  N N  L + P+K E+ +  P +V+  D + +
Sbjct: 291 EAYEALCRGEGLQMNEARRSRLFCRYQDGNRNPHLLLKPIKEEDEWDSPNIVRYLDFLSN 350

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI +I EL+K K+ R  V +   G       R+SK  +L  E   + P + ++  RI+D
Sbjct: 351 EEIEKIKELAKPKLARATVRDPKSGVLTTASYRVSKSAWLEGE---EDPIIARVNQRIED 407

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D + +DE           R+A+F+ Y++D
Sbjct: 408 LTGLTVKTAEL----LQVANYGVGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSD 463

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP     ++P KG+AVFWYN   +   DYR  H+ CPV +GNKW
Sbjct: 464 VEAGGATVFPDFGAAIWPRKGTAVFWYNLFKSGEGDYRTRHAACPVLVGNKW 515


>gi|354483225|ref|XP_003503795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
           [Cricetulus griseus]
          Length = 534

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 128/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  + N   G+   V  R+SK  +L      + P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPITGNLETVHYRISKSAWLSGY---EDPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFQELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|260825355|ref|XP_002607632.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
 gi|229292980|gb|EEN63642.1| hypothetical protein BRAFLDRAFT_84679 [Branchiostoma floridae]
          Length = 519

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 81/234 (34%), Positives = 131/234 (55%), Gaps = 23/234 (9%)

Query: 2   IYPLACQGNLS-----VPEDIKSNLKCFYESYNN-TFLKIGPLKVEELYLDPRVVKIHDA 55
           +Y L CQGN        P  +K +LKC Y + NN   L + P+++E+++  P++  +H+ 
Sbjct: 271 VYELLCQGNQPEIFNITPSRVK-HLKCRYFTNNNHPRLLLAPIRLEQVFDKPKLWVLHNI 329

Query: 56  IYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR 113
           + D E+  I +L++ ++      N   G  +    R+SK  +LY   + +H  + +++ R
Sbjct: 330 LSDPEMEVIKKLAQPRLRPAATQNPTTGGAVLSSYRISKNAWLY---YWEHRLINRVKQR 386

Query: 114 IQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-------RLASFMFYL 166
           ++D T L +   E    PLQ+ NYG+GGHY+ H D   +DE          R+A+ +FY+
Sbjct: 387 VEDATGLTMETAE----PLQVINYGIGGHYEPHFDCATKDEEFALDPNEGDRIATMLFYM 442

Query: 167 TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +DVE GGAT+FP +   V PEKG+  FWYN   +   D    H+GCPV +G+KW
Sbjct: 443 SDVEAGGATVFPQVGARVVPEKGAGAFWYNLLKSGEGDMLTEHAGCPVLVGSKW 496


>gi|268536692|ref|XP_002633481.1| C. briggsae CBR-PHY-2 protein [Caenorhabditis briggsae]
 gi|94442973|emb|CAJ98659.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
          Length = 539

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 89/231 (38%), Positives = 123/231 (53%), Gaps = 19/231 (8%)

Query: 1   EIYPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           + Y   C+G +  V E  KS L+C+ +  +  FLKI P+KVE L  DP  V   + I DS
Sbjct: 279 DAYEALCRGEIPPVEEKWKSKLRCYLKR-DKPFLKIAPIKVEILRFDPLAVLFKNVISDS 337

Query: 60  EINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
           EI  I EL+  K++R  V N   G+  +   R+SK  +L  ++    P + ++  RI+D 
Sbjct: 338 EIEVIKELASPKLKRATVQNSKTGELEHATYRISKSAWLKGDL---DPVIDRVNRRIEDF 394

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDV 169
           T L     E     LQ+ NYGLGGHYD H D   ++E           R+A+ +FY++  
Sbjct: 395 TGLNQATSEE----LQVANYGLGGHYDPHFDFARKEEKNAFKTLNTGNRIATVLFYMSQP 450

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           E GGAT+F  L   VFP K  A+FWYN   +   D R  H+ CPV LG KW
Sbjct: 451 ERGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLLGVKW 501


>gi|92096574|gb|AAI15350.1| LOC557059 protein [Danio rerio]
          Length = 508

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 84/220 (38%), Positives = 125/220 (56%), Gaps = 16/220 (7%)

Query: 3   YPLACQGNLSVPEDIKSN-LKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y   C+G +      +   L C Y +   N  L   P+K EEL+ +P++++ HD I D+E
Sbjct: 280 YEALCRGEVDERTSKRQRALSCRYSTGGGNPRLMYAPVKEEELWDEPKIIRYHDVISDTE 339

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           I  + ++++ ++ R +    G  +  D R S+  FL  E  G    + +I  RI D+T L
Sbjct: 340 IETLKDIARPELTRSQT---GWGVISDIRTSQSVFL--EEVGT---VARISQRIADITGL 391

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSL 180
            +   E+    L + NYG+GG Y  H D    DE   R A+F+ Y++DVE+GGAT+F ++
Sbjct: 392 SVESAEK----LHVQNYGIGGRYTPHFDTG--DEVNERTATFLIYMSDVEVGGATVFTNV 445

Query: 181 NLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            + V PEKGSAVFWYN H N  LD +  H+GCPV +GNKW
Sbjct: 446 GVAVKPEKGSAVFWYNLHKNGELDLKTKHAGCPVLVGNKW 485


>gi|195159142|ref|XP_002020441.1| GL13994 [Drosophila persimilis]
 gi|194117210|gb|EDW39253.1| GL13994 [Drosophila persimilis]
          Length = 493

 Score =  145 bits (365), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 87/228 (38%), Positives = 126/228 (55%), Gaps = 19/228 (8%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G    P      L C Y S N+ FL++ PLK+E + LDP +V  HD I   EI+
Sbjct: 252 YERGCRGLFPSPSK-DGRLHCVYNSTNSAFLRLAPLKMELVGLDPYMVLYHDVISALEIS 310

Query: 63  RIIELSKGKVERGKVVNYG--DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           ++ +++   ++R  V       +  V TR SKV + +P+ F +     ++  RI DMTN 
Sbjct: 311 QLQDMATPGLKRATVYKASGRRSEVVKTRTSKVAW-FPDTFNE--LTERLNRRIADMTNF 367

Query: 121 -VIGREERYKGPLQINNYGLGGHYDLHCD-------ATPRDEGLWRLASFMFYLTDVELG 172
            ++G E      LQ  NYGLGGHYD H D       A        R+A+ +FYLTDVE G
Sbjct: 368 DLLGSEM-----LQAMNYGLGGHYDKHYDFFNASTAANLTQMNGDRIATVLFYLTDVEQG 422

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GAT+FP++   VFP++GSA+ WYN   +   + +  H+ CPV +G+KW
Sbjct: 423 GATVFPNIRKAVFPQRGSAIIWYNLKDDGDPNPQTLHAACPVLVGSKW 470


>gi|344254200|gb|EGW10304.1| Prolyl 4-hydroxylase subunit alpha-1 [Cricetulus griseus]
          Length = 507

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 128/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 262 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 321

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  + N   G+   V  R+SK  +L      + P + +I  RIQD+T
Sbjct: 322 IEIVKDLAKPRLRRATISNPITGNLETVHYRISKSAWLSGY---EDPVVSRINMRIQDLT 378

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 379 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFQELGTGNRIATWLFYMSDVS 434

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 435 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 484


>gi|198449500|ref|XP_001357604.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
 gi|198130634|gb|EAL26738.2| GA15939 [Drosophila pseudoobscura pseudoobscura]
          Length = 528

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 87/228 (38%), Positives = 128/228 (56%), Gaps = 19/228 (8%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G    P      L C Y S N+ FL++ PLK+E + LDP +V  HD I   EI+
Sbjct: 287 YERGCRGLFPSPSK-DGRLHCVYNSTNSAFLRLAPLKMELVGLDPYMVLYHDVISAPEIS 345

Query: 63  RIIELSKGKVERGKVVNYG--DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           ++ +++   ++R  V       +  V TR SKV + +P+ F +     ++  RI DMTN 
Sbjct: 346 QLQDMATPGLKRATVYKASGRRSEVVKTRTSKVAW-FPDTFNE--LTERLNRRIADMTNF 402

Query: 121 -VIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLW---RLASFMFYLTDVELG 172
            ++G E      LQ  NYGLGGHYD H D    +T  +       R+A+ +FYLTDVE G
Sbjct: 403 DLLGSEM-----LQAMNYGLGGHYDKHYDFFNASTATNLTQMNGDRIATVLFYLTDVEQG 457

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GAT+FP++   VFP++GSA+ WYN   +   + +  H+ CPV +G+KW
Sbjct: 458 GATVFPNIRKAVFPQRGSAIIWYNLKDDGDPNPQTLHAACPVLVGSKW 505


>gi|348557544|ref|XP_003464579.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
           [Cavia porcellus]
          Length = 533

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 87/230 (37%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           E+Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 288 EVYESLCRGEGIKLTPQRRKRLFCRYHHGNRAPELLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  E   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEE---DDPVVARVNRRMQQ 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAALWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|195113239|ref|XP_002001175.1| GI10638 [Drosophila mojavensis]
 gi|193917769|gb|EDW16636.1| GI10638 [Drosophila mojavensis]
          Length = 511

 Score =  144 bits (364), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 81/222 (36%), Positives = 123/222 (55%), Gaps = 18/222 (8%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           + + C+G        +S L C Y+S +  FL++ P+K+E L LDP VV  HD +   EI+
Sbjct: 279 FEIGCRGQYVQ----QSGLMCTYKSKSPAFLRLAPIKMEVLVLDPLVVIFHDVLSSREID 334

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
            + E+++  +ER  VV Y   +    R+S   ++  +    +   ++I+ RI DM +L +
Sbjct: 335 GLQEIARPHLERSMVVKYRANVQGKHRISAGTWVERKY---NNLTWRIERRIADMVDLNL 391

Query: 123 GREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGATIFP 178
              E    P  + NYG+GG Y  H D     T  D    RLA+ +FY+ DVE GGAT+FP
Sbjct: 392 EGSE----PFYVINYGIGGQYKAHWDFFGADTVEDN---RLATVLFYMNDVEQGGATVFP 444

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            L  TV  ++G+A+FWYN   N  +D R  H GCP+ +G+KW
Sbjct: 445 RLGQTVRAKRGNALFWYNMQHNGTVDDRTLHGGCPILVGSKW 486


>gi|312032360|ref|NP_001185667.1| prolyl 4-hydroxylase subunit alpha-1 isoform 4 precursor [Gallus
           gallus]
          Length = 536

 Score =  144 bits (363), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  L +    +  L C +Y+   N    +GP+K E+ +  PR+V+  D I D E
Sbjct: 291 YEMLCRGEGLKMTPRRQKRLFCRYYDGNRNPRYILGPVKQEDEWDKPRIVRFLDIISDEE 350

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + EL+K ++ R  + N   G       R+SK  +L      + P + +I TRIQD+T
Sbjct: 351 IETVKELAKPRLRRATISNPITGALETAHYRISKSAWLSGY---ESPVVSRINTRIQDLT 407

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 408 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 463

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN   +   DY   H+ CPV +GNKW
Sbjct: 464 AGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 513


>gi|149038788|gb|EDL93077.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_b
           [Rattus norvegicus]
          Length = 534

 Score =  144 bits (363), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  + N   G    V  R+SK  +L      + P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPVTGALETVHYRISKSAWLSGY---EDPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|74224984|dbj|BAE38205.1| unnamed protein product [Mus musculus]
          Length = 534

 Score =  144 bits (363), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  + N   G    V  R+SK  +L      + P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPVTGALETVHYRISKSAWLSGY---EDPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|74225936|dbj|BAE28745.1| unnamed protein product [Mus musculus]
          Length = 561

 Score =  144 bits (363), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  + N   G    V  R+SK  +L      + P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPVTGALETVHYRISKSAWLSGY---EDPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|33859596|ref|NP_035160.1| prolyl 4-hydroxylase subunit alpha-1 precursor [Mus musculus]
 gi|20455506|sp|Q60715.2|P4HA1_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|16307134|gb|AAH09654.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide [Mus musculus]
 gi|74144306|dbj|BAE36020.1| unnamed protein product [Mus musculus]
 gi|74146660|dbj|BAE41331.1| unnamed protein product [Mus musculus]
 gi|148700260|gb|EDL32207.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_a [Mus
           musculus]
          Length = 534

 Score =  144 bits (362), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  + N   G    V  R+SK  +L      + P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATISNPVTGALETVHYRISKSAWLSGY---EDPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|344274272|ref|XP_003408941.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
           [Loxodonta africana]
          Length = 534

 Score =  144 bits (362), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+V+ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEVVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPDVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|312032356|ref|NP_001185665.1| prolyl 4-hydroxylase subunit alpha-1 isoform 2 precursor [Gallus
           gallus]
          Length = 536

 Score =  143 bits (361), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  L +    +  L C +Y+   N    +GP+K E+ +  PR+V+  D I D E
Sbjct: 291 YEMLCRGEGLKMTPRRQKRLFCRYYDGNRNPRYILGPVKQEDEWDKPRIVRFLDIISDEE 350

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + EL+K ++ R  V +   G       R+SK  +L      + P + +I TRIQD+T
Sbjct: 351 IETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGY---ESPVVSRINTRIQDLT 407

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 408 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 463

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN   +   DY   H+ CPV +GNKW
Sbjct: 464 AGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 513


>gi|326923461|ref|XP_003207954.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
           [Meleagris gallopavo]
          Length = 536

 Score =  143 bits (361), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  L +    +  L C +Y+   N    +GP+K E+ +  PR+V+  D I D E
Sbjct: 291 YEMLCRGEGLKMTPRRQKRLFCRYYDGNRNPRYILGPVKQEDEWDKPRIVRFLDIISDEE 350

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + EL+K ++ R  + N   G       R+SK  +L      + P + +I TRIQD+T
Sbjct: 351 IETVKELAKPRLRRATISNPITGALETAHYRISKSAWLSGY---ESPVVSRINTRIQDLT 407

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 408 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 463

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN   +   DY   H+ CPV +GNKW
Sbjct: 464 AGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 513


>gi|195452746|ref|XP_002073482.1| GK14141 [Drosophila willistoni]
 gi|194169567|gb|EDW84468.1| GK14141 [Drosophila willistoni]
          Length = 541

 Score =  143 bits (361), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 122/230 (53%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           +IY   C G++      + +L+C Y +  + FL + PLKVEEL  +P +V  HD IY SE
Sbjct: 284 DIYRFTCSGHIKKTAREERHLRCGYLTETHPFLNLAPLKVEELNHNPLLVLYHDVIYQSE 343

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           I+ I  L++ ++ R  V+    +     R S+  F+ P+    H  L  I  R+ DM+NL
Sbjct: 344 IDVIRNLTENEISRATVIGAKGSEVSKVRTSQFTFI-PKT--RHKVLQTIDQRVADMSNL 400

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRD----------EGLWRLASFMFYLTDVE 170
            +   E +    Q  NYG+GGHY  H D   +D          E   R+A+ +FYL+DV 
Sbjct: 401 NMDYAELH----QFANYGIGGHYAQHNDWFGQDAFDNELVSSPEMGNRIATVLFYLSDVA 456

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GG T FP L   + P+K +A FW+N HA+ + D R  H  CP+  G+KW
Sbjct: 457 QGGGTAFPHLKQLLQPKKYAAAFWHNLHASGVGDLRTLHGACPIIAGSKW 506


>gi|281348666|gb|EFB24250.1| hypothetical protein PANDA_000722 [Ailuropoda melanoleuca]
          Length = 505

 Score =  143 bits (361), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 86/236 (36%), Positives = 129/236 (54%), Gaps = 19/236 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           +IY   C+G  + +    +  L C Y   N T  L I P K E+ +  P +V+ +D + D
Sbjct: 277 DIYESLCRGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSD 336

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 337 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNLRMQH 393

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D + ++E           R+A+F+ Y++D
Sbjct: 394 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRKNEQDAFKRLGTGNRVATFLNYMSD 449

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 224
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KWGK L
Sbjct: 450 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWGKWL 505


>gi|291190128|ref|NP_001167431.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
 gi|223649060|gb|ACN11288.1| Prolyl 4-hydroxylase subunit alpha-2 precursor [Salmo salar]
          Length = 538

 Score =  143 bits (361), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 86/232 (37%), Positives = 130/232 (56%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
           EIY   C+G  + +  + +S L C Y   N N  L + P+K E+ +  P +V+  +A+ D
Sbjct: 291 EIYEGLCRGEGVKMTSERRSRLYCRYHDGNRNPRLLLQPMKEEDEWDSPHIVRYLNALSD 350

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
           SEI +I EL+K ++ R  V +   G     + R+SK  +L  E   + P + ++  RI+D
Sbjct: 351 SEIEKIKELAKPRLARATVRDPKTGVLTTANYRVSKSAWLEGE---EDPVIERVNQRIED 407

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L     E     LQI NYG+GG Y+ H D + +DE           R+A+F+ Y++D
Sbjct: 408 ITGLTTQTAEL----LQIANYGVGGQYEPHFDFSRKDEPDAFKTLGTGNRVATFLNYMSD 463

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP     ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 464 VEAGGATVFPDFGAAIYPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 515


>gi|312032358|ref|NP_001185666.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Gallus
           gallus]
          Length = 536

 Score =  143 bits (361), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  L +    +  L C +Y+   N    +GP+K E+ +  PR+V+  D I D E
Sbjct: 291 YEMLCRGEGLKMTPRRQKRLFCRYYDGNRNPRYILGPVKQEDEWDKPRIVRFLDIISDEE 350

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + EL+K ++ R  + N   G       R+SK  +L      + P + +I TRIQD+T
Sbjct: 351 IETVKELAKPRLRRATISNPITGALETAHYRISKSAWLSGY---ESPVVSRINTRIQDLT 407

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 408 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 463

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN   +   DY   H+ CPV +GNKW
Sbjct: 464 AGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 513


>gi|395509387|ref|XP_003758979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
           [Sarcophilus harrisii]
          Length = 534

 Score =  143 bits (361), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 86/230 (37%), Positives = 128/230 (55%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N T  L I P K E+ +  P +V+ +D + D
Sbjct: 289 DVYEALCRGEGIKLTPRRQKRLFCRYHDGNRTPQLLIAPFKEEDEWDSPHIVRYYDVLSD 348

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI EL+K K+ R  V +   G     + R+SK  +L     GD P + ++  R+  
Sbjct: 349 EEIERIKELAKPKLARATVRDPKTGVLTVANYRVSKSSWLEE---GDDPVIAQLNRRMHY 405

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 406 ITGLSVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP    T++P+KG++VFWYN   +   DYR  H+ CPV +G+KW
Sbjct: 462 AGGATVFPDFGATIWPKKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKW 511


>gi|17541712|ref|NP_502317.1| Protein PHY-2 [Caenorhabditis elegans]
 gi|32171589|sp|Q20065.1|P4HA2_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|3876769|emb|CAA93469.1| Protein PHY-2 [Caenorhabditis elegans]
          Length = 539

 Score =  143 bits (360), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 87/231 (37%), Positives = 124/231 (53%), Gaps = 19/231 (8%)

Query: 1   EIYPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           + Y   C+G +  V    K+ L+C+ +  +  FLK+ P+KVE L  DP  V   + I+DS
Sbjct: 279 DAYEALCRGEIPPVEPKWKNKLRCYLKR-DKPFLKLAPIKVEILRFDPLAVLFKNVIHDS 337

Query: 60  EINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
           EI  I EL+  K++R  V N   G+  +   R+SK  +L  ++    P + ++  RI+D 
Sbjct: 338 EIEVIKELASPKLKRATVQNSKTGELEHATYRISKSAWLKGDL---DPVIDRVNRRIEDF 394

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDV 169
           TNL     E     LQ+ NYGLGGHYD H D   ++E           R+A+ +FY++  
Sbjct: 395 TNLNQATSEE----LQVANYGLGGHYDPHFDFARKEEKNAFKTLNTGNRIATVLFYMSQP 450

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           E GGAT+F  L   VFP K  A+FWYN   +   D R  H+ CPV LG KW
Sbjct: 451 ERGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLLGVKW 501


>gi|410948132|ref|XP_003980795.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Felis
           catus]
 gi|410948136|ref|XP_003980797.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Felis
           catus]
          Length = 533

 Score =  143 bits (360), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 87/230 (37%), Positives = 127/230 (55%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           +IY   C+G  + +    +  L C Y   N T  L I P K E+ +  P +V+ +D + D
Sbjct: 288 DIYESLCRGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|198418585|ref|XP_002122034.1| PREDICTED: similar to Prolyl 4-hydroxylase subunit alpha-1 (4-PH
           alpha-1)
           (Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1) [Ciona intestinalis]
          Length = 525

 Score =  143 bits (360), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 89/230 (38%), Positives = 124/230 (53%), Gaps = 18/230 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCF-YESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           Y   CQG   +P  +  NL+C+ Y + N+  L+I P+KVEEL   P +V+ +D I + +I
Sbjct: 276 YNQICQGKFKLPHKVSKNLRCYLYTNKNDPRLRIKPVKVEELCNSPHIVQFYDVINNDDI 335

Query: 62  NRIIELSKGKVERGKVVNYGDT-IYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
             I ++SK  + R  V    +T I  D R SKV +       D   + K+ TRI +MT L
Sbjct: 336 ETIKKMSKKHLSRALVTGPNNTGIVEDIRTSKVAWFKK---NDFTAVKKLYTRISEMTGL 392

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDAT------PRDEGLW---RLASFMFYLTDVEL 171
               EE ++  LQ+ NYGL G Y  H D T       R++G     R+A+ + YL DV+ 
Sbjct: 393 ---SEETFED-LQVANYGLAGEYQPHFDYTEDPSIYKREDGAEVGNRIATMLLYLNDVKE 448

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
           GG T F    +   P KGSAVFWYN + + L D R  H+ CPV +GNKW 
Sbjct: 449 GGRTAFIEPKIVAKPIKGSAVFWYNLYPSGLGDPRTRHASCPVVIGNKWA 498


>gi|321474952|gb|EFX85916.1| hypothetical protein DAPPUDRAFT_45616 [Daphnia pulex]
          Length = 537

 Score =  143 bits (360), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 83/228 (36%), Positives = 125/228 (54%), Gaps = 17/228 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G   +   I+ +L+C Y + N  F  I P+K+EE  L P +V  HD + D EI 
Sbjct: 292 YEKLCRGEKLMDPKIEGHLRCRYITNNVPFFFIQPIKMEEALLKPMIVVYHDVMSDDEIE 351

Query: 63  RIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            + +++K + +R  + N   G+    + R+SK  +L  E   +H  + K+  R+ D+T L
Sbjct: 352 TVKKMAKPRFKRATIRNSKTGELEPANYRISKSAWLKSE---EHDHILKVTRRVGDITGL 408

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD--ATPRDEGL----W--RLASFMFYLTDVELG 172
            +   E     LQ+ NYG+GGHY+ H D   T   E      W  R+A+++FY++DVE G
Sbjct: 409 DMSTAE----DLQVVNYGIGGHYEPHFDYARTETTEAFKELGWGNRIATWLFYMSDVEAG 464

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GAT+FP     V+P KGSA FWYN + N   +    H+ CPV  G+KW
Sbjct: 465 GATVFPPTGAAVWPRKGSAAFWYNLYPNGKGNELTRHAACPVLSGSKW 512


>gi|395820524|ref|XP_003783614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Otolemur
           garnettii]
          Length = 534

 Score =  143 bits (360), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|402593814|gb|EJW87741.1| hypothetical protein WUBG_01349 [Wuchereria bancrofti]
          Length = 541

 Score =  143 bits (360), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 86/230 (37%), Positives = 117/230 (50%), Gaps = 18/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           +IY   C+  + V   + S L C+Y+  +  FL++ P KVE L  +P  V   D I D E
Sbjct: 287 DIYEALCRNEIPVSIKVTSKLYCYYK-MDRPFLRLAPFKVEILRFNPLAVLFRDVITDEE 345

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  I  L+  ++ R  V N   G+      R SK  +L  E   +H  +++I  RI  MT
Sbjct: 346 ITMIQMLATPRLRRATVQNSITGELETASYRTSKSAWLKDE---EHEVVHRINKRIDLMT 402

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
           NL    E+     LQ+ NYG+GGHYD H D   R+E           RLA+ +FY+T  E
Sbjct: 403 NL----EQETSEELQVGNYGIGGHYDPHFDFARREEVNAFQSLNTGNRLATLLFYMTQPE 458

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+F  +  TV P K  A+FWYN   +   D R  H+ CPV  G KW
Sbjct: 459 SGGATVFTEVKTTVMPSKNDALFWYNLLRSGEGDLRTRHAACPVLTGTKW 508


>gi|397490069|ref|XP_003816032.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Pan paniscus]
          Length = 488

 Score =  143 bits (360), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 243 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 302

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 303 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 359

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 360 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 415

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 416 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 465


>gi|432106758|gb|ELK32410.1| Prolyl 4-hydroxylase subunit alpha-1 [Myotis davidii]
          Length = 534

 Score =  143 bits (360), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|349604936|gb|AEQ00344.1| Prolyl 4-hydroxylase subunit alpha-1-like protein, partial [Equus
           caballus]
          Length = 302

 Score =  143 bits (360), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 57  YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 116

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 117 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 173

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 174 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 229

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 230 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 279


>gi|301770069|ref|XP_002920453.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Ailuropoda
           melanoleuca]
          Length = 534

 Score =  143 bits (360), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|296220402|ref|XP_002756291.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Callithrix
           jacchus]
          Length = 534

 Score =  143 bits (360), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|190786|gb|AAA36534.1| prolyl 4-hydroxylase alpha subunit (EC 1.14.11.2) [Homo sapiens]
          Length = 534

 Score =  143 bits (360), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|380813208|gb|AFE78478.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
           mulatta]
 gi|384947330|gb|AFI37270.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
           mulatta]
          Length = 534

 Score =  143 bits (360), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|332244067|ref|XP_003271193.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-1 [Nomascus leucogenys]
          Length = 502

 Score =  143 bits (360), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 257 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 316

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 317 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 373

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 374 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 429

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 430 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 479


>gi|348576112|ref|XP_003473831.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cavia
           porcellus]
          Length = 534

 Score =  142 bits (359), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|291404182|ref|XP_002718471.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 1
           [Oryctolagus cuniculus]
          Length = 534

 Score =  142 bits (359), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|63252886|ref|NP_000908.2| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Homo
           sapiens]
 gi|114631173|ref|XP_508168.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 13 [Pan
           troglodytes]
 gi|602676|gb|AAA59069.1| alpha-subunit of prolyl 4-hydroxylase [Homo sapiens]
 gi|62897481|dbj|BAD96680.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I variant [Homo
           sapiens]
 gi|119574852|gb|EAW54467.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_a [Homo
           sapiens]
 gi|119574853|gb|EAW54468.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I, isoform CRA_b [Homo
           sapiens]
 gi|410349609|gb|JAA41408.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
 gi|410349613|gb|JAA41410.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 534

 Score =  142 bits (359), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|410295850|gb|JAA26525.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
 gi|410295854|gb|JAA26527.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 534

 Score =  142 bits (359), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLRRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|73952886|ref|XP_850682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Canis
           lupus familiaris]
          Length = 534

 Score =  142 bits (359), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|383418721|gb|AFH32574.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Macaca
           mulatta]
          Length = 534

 Score =  142 bits (359), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|312032354|ref|NP_001185664.1| prolyl 4-hydroxylase subunit alpha-1 isoform 1 precursor [Gallus
           gallus]
          Length = 536

 Score =  142 bits (359), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  L +    +  L C +Y+   N    +GP+K E+ +  PR+V+  D I D E
Sbjct: 291 YEMLCRGEGLKMTPRRQKRLFCRYYDGNRNPRYILGPVKQEDEWDKPRIVRFLDIISDEE 350

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + EL+K ++ R  V +   G       R+SK  +L      + P + +I TRIQD+T
Sbjct: 351 IETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGY---ESPVVSRINTRIQDLT 407

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 408 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 463

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN   +   DY   H+ CPV +GNKW
Sbjct: 464 AGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 513


>gi|402880501|ref|XP_003903839.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like, partial
           [Papio anubis]
          Length = 379

 Score =  142 bits (359), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 134 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 193

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 194 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 250

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 251 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 306

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 307 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 356


>gi|410251924|gb|JAA13929.1| prolyl 4-hydroxylase, alpha polypeptide I [Pan troglodytes]
          Length = 566

 Score =  142 bits (359), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|326923463|ref|XP_003207955.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
           [Meleagris gallopavo]
          Length = 536

 Score =  142 bits (359), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  L +    +  L C +Y+   N    +GP+K E+ +  PR+V+  D I D E
Sbjct: 291 YEMLCRGEGLKMTPRRQKRLFCRYYDGNRNPRYILGPVKQEDEWDKPRIVRFLDIISDEE 350

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + EL+K ++ R  V +   G       R+SK  +L      + P + +I TRIQD+T
Sbjct: 351 IETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGY---ESPVVSRINTRIQDLT 407

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 408 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 463

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN   +   DY   H+ CPV +GNKW
Sbjct: 464 AGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 513


>gi|129365|sp|P16924.1|P4HA1_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1
          Length = 516

 Score =  142 bits (359), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  L +    +  L C +Y+   N    +GP+K E+ +  PR+V+  D I D E
Sbjct: 271 YEMLCRGEGLKMTPRRQKRLFCRYYDGNRNPRYILGPVKQEDEWDKPRIVRFLDIISDEE 330

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + EL+K ++ R  V +   G       R+SK  +L      + P + +I TRIQD+T
Sbjct: 331 IETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGY---ESPVVSRINTRIQDLT 387

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 388 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 443

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN   +   DY   H+ CPV +GNKW
Sbjct: 444 AGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 493


>gi|474940|emb|CAA55546.1| gamma-butyrobetaine,2-oxoglutarate dioxygenase [Rattus norvegicus]
          Length = 534

 Score =  142 bits (359), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 126/230 (54%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      + P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---EDPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|170591592|ref|XP_001900554.1| prolyl 4-hydroxylase [Brugia malayi]
 gi|16415740|emb|CAC82616.1| prolyl 4-hydroxylase [Brugia malayi]
 gi|21425621|emb|CAD19314.1| prolyl 4-hydroxylase [Brugia malayi]
 gi|158592166|gb|EDP30768.1| prolyl 4-hydroxylase, putative [Brugia malayi]
          Length = 541

 Score =  142 bits (359), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 117/230 (50%), Gaps = 18/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           +IY   C+  + V   + S L C+Y+  +  FL++ P KVE L  +P  V   D I D E
Sbjct: 287 DIYEALCRNEIPVSIKVTSKLYCYYK-MDRPFLRLAPFKVEILRFNPLAVLFRDVITDEE 345

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           +  I  L+  ++ R  V N   G+      R SK  +L  E   +H  +++I  RI  MT
Sbjct: 346 VTMIQMLATPRLRRATVQNSITGELETASYRTSKSAWLKDE---EHEVVHRINKRIDLMT 402

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
           NL    E+     LQ+ NYG+GGHYD H D   R+E           RLA+ +FY+T  E
Sbjct: 403 NL----EQETSEELQVGNYGIGGHYDPHFDFARREEVNAFQSLNTGNRLATLLFYMTQPE 458

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+F  +  TV P K  A+FWYN   +   D R  H+ CPV  G KW
Sbjct: 459 SGGATVFTEVKTTVMPSKNDALFWYNLLRSGEGDLRTRHAACPVLTGTKW 508


>gi|212530|gb|AAA49002.1| prolyl 4-hydroxylase, alpha subunit (EC 1.14.11.2), partial [Gallus
           gallus]
          Length = 489

 Score =  142 bits (359), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  L +    +  L C +Y+   N    +GP+K E+ +  PR+V+  D I D E
Sbjct: 244 YEMLCRGEGLKMTPRRQKRLFCRYYDGNRNPRYILGPVKQEDEWDKPRIVRFLDIISDEE 303

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + EL+K ++ R  V +   G       R+SK  +L      + P + +I TRIQD+T
Sbjct: 304 IETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGY---ESPVVSRINTRIQDLT 360

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 361 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 416

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN   +   DY   H+ CPV +GNKW
Sbjct: 417 AGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 466


>gi|26336999|dbj|BAC32183.1| unnamed protein product [Mus musculus]
 gi|148700261|gb|EDL32208.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_b [Mus
           musculus]
          Length = 534

 Score =  142 bits (359), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 126/230 (54%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      + P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---EDPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|51036657|ref|NP_742059.2| prolyl 4-hydroxylase subunit alpha-1 precursor [Rattus norvegicus]
 gi|90111077|sp|P54001.2|P4HA1_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; Flags: Precursor
 gi|50927553|gb|AAH78703.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [Rattus norvegicus]
 gi|149038787|gb|EDL93076.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide, isoform CRA_a
           [Rattus norvegicus]
          Length = 534

 Score =  142 bits (358), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 126/230 (54%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      + P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---EDPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|354483223|ref|XP_003503794.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
           [Cricetulus griseus]
          Length = 534

 Score =  142 bits (358), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 126/230 (54%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      + P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---EDPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFQELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|73970649|ref|XP_850109.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Canis
           lupus familiaris]
          Length = 533

 Score =  142 bits (358), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 86/230 (37%), Positives = 127/230 (55%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N T  L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNLRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|355709025|gb|AES03456.1| prolyl 4-hydroxylase, alpha polypeptide II [Mustela putorius furo]
          Length = 532

 Score =  142 bits (358), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 86/230 (37%), Positives = 127/230 (55%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N T  L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNLRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|395501518|ref|XP_003755140.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Sarcophilus
           harrisii]
          Length = 385

 Score =  142 bits (358), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 126/230 (54%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  L +    +  L C Y   N N    + P K E+ +  PR+V+ H+ I D+E
Sbjct: 140 YEMLCRGEGLKMTPQRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRFHEIISDAE 199

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      + P + +I  RIQD+T
Sbjct: 200 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---EDPVVSRINMRIQDLT 256

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 257 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 312

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 313 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 362


>gi|449280261|gb|EMC87600.1| Prolyl 4-hydroxylase subunit alpha-1 [Columba livia]
          Length = 536

 Score =  142 bits (358), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 126/230 (54%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  L +    +  L C +Y+   N    +GP+K E+ +  PR+V+  D I D E
Sbjct: 291 YEMLCRGEGLKMTPRRQKRLFCRYYDGNRNPRYILGPVKQEDEWDKPRIVRFLDIISDEE 350

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + EL+K ++ R  V +   G       R+SK  +L      + P + +I TRIQD+T
Sbjct: 351 IETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGY---ESPVVSRINTRIQDLT 407

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 408 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 463

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P KG+AVFWYN   +   DY   H+ CPV +GNKW
Sbjct: 464 AGGATVFPEVGASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 513


>gi|224052167|ref|XP_002191912.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Taeniopygia
           guttata]
          Length = 536

 Score =  142 bits (358), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 126/230 (54%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  L +    +  L C +Y+   N    +GP+K E+ +  PR+V+  D I D E
Sbjct: 291 YEMLCRGEGLKMTPRRQKRLFCRYYDGNRNPRYILGPVKQEDEWDKPRIVRFLDIISDEE 350

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + EL+K ++ R  V +   G       R+SK  +L      + P + +I TRIQD+T
Sbjct: 351 IETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGY---ESPVVSRINTRIQDLT 407

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 408 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 463

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P KG+AVFWYN   +   DY   H+ CPV +GNKW
Sbjct: 464 AGGATVFPEVGASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 513


>gi|151556370|gb|AAI47868.1| P4HA1 protein [Bos taurus]
          Length = 534

 Score =  142 bits (357), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEVVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVL 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|326928728|ref|XP_003210527.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Meleagris
           gallopavo]
          Length = 535

 Score =  142 bits (357), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 85/231 (36%), Positives = 127/231 (54%), Gaps = 19/231 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
           +IY   C+G  + +    +  L C Y + N N  L I P K E+ +  P +V+ +D + D
Sbjct: 290 DIYEALCRGEGVKMTPRRQKRLFCRYHNGNRNPHLVIAPFKEEDEWDSPHIVRYYDVMSD 349

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI +I +L+K K+ R  V +   G       R+SK  +L  +   D P + K+  R+Q 
Sbjct: 350 EEIEKIKQLAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVAKVNQRMQQ 406

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCD-------ATPRDEGLWRLASFMFYLTDV 169
           +T L +   E     LQ+ NYG+GG Y+ H D       +T + EG  RLA+F+ Y++DV
Sbjct: 407 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSTLKSEGN-RLATFLNYMSDV 461

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           E GGAT+FP     ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 462 EAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 512


>gi|432949777|ref|XP_004084253.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Oryzias
           latipes]
          Length = 532

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 130/230 (56%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
           E Y   C+G  L + E  +S L C +++   +  L + P+K E+ + +P +V+  + + D
Sbjct: 287 ETYEALCRGEGLQLTEARRSRLFCRYHDGKRSPRLLLKPIKEEDEWDNPHIVRYLNILSD 346

Query: 59  SEINRIIELSKGKVERGKVVNYGDTIYVDT--RLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI +I EL+K ++ R  V +    +      R+SK  +L  E   D P + ++  RIQD
Sbjct: 347 QEIEKIKELAKPRLARATVRDPKTGVLTTAPYRVSKSAWLEGE---DDPVIDRVNQRIQD 403

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D  L     RLA+F+ Y++DVE
Sbjct: 404 ITGLTVETAEL----LQVANYGVGGQYEPHFDFSRRPFDSNLKVDGNRLATFLNYMSDVE 459

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP    +++P KG+AVFWYN   +   DYR  H+ CPV +G+KW
Sbjct: 460 AGGATVFPDFGASIWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKW 509


>gi|291387300|ref|XP_002710241.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
           precursor (predicted)-like isoform 1 [Oryctolagus
           cuniculus]
          Length = 533

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 86/230 (37%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHDGNGAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + +I  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARINRRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|426255746|ref|XP_004021509.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Ovis
           aries]
          Length = 534

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVL 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 462 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 511


>gi|195113237|ref|XP_002001174.1| GI10637 [Drosophila mojavensis]
 gi|193917768|gb|EDW16635.1| GI10637 [Drosophila mojavensis]
          Length = 529

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 89/230 (38%), Positives = 125/230 (54%), Gaps = 22/230 (9%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           + Y   C+G    P+++K  L C Y S  + FL++ PLK+E + LDP +V  HD I  SE
Sbjct: 288 QAYERGCRGQY--PQNLK--LYCVYNSTTSAFLRLAPLKMELISLDPYMVIYHDVISPSE 343

Query: 61  INRIIELSKGKVERGKVVNYGDTI--YVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I+ +  L+   ++R  V N        V TR SKV +L   +   +    ++  RI DMT
Sbjct: 344 ISELQSLAVPGLKRATVFNQQSMRNHVVKTRTSKVTWLLDTL---NQLTIRLNRRITDMT 400

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD--------ATPRDEGLWRLASFMFYLTDVE 170
              +   E     LQ+ NYGLGGHYD H D           R  G  R+A+ +FYLTDVE
Sbjct: 401 GFDMYGSEM----LQVMNYGLGGHYDKHYDYFNSSVAADLTRLNGD-RIATVLFYLTDVE 455

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP++   VFP+ G+AV WYN   +   D +  H+ CPV +G+KW
Sbjct: 456 QGGATVFPNIEKAVFPKSGTAVVWYNLRHDGNGDPQTLHAACPVIVGSKW 505


>gi|195055775|ref|XP_001994788.1| GH17428 [Drosophila grimshawi]
 gi|193892551|gb|EDV91417.1| GH17428 [Drosophila grimshawi]
          Length = 540

 Score =  141 bits (356), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 77/230 (33%), Positives = 129/230 (56%), Gaps = 19/230 (8%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           +Y   C+  L      +  L+C     N       P ++EEL+LDP V+++HD I   E 
Sbjct: 286 MYQQVCREELKPEPATQRKLRCRLHRGNGLRSSYQPYRLEELHLDPYVIQVHDIISAEET 345

Query: 62  NRIIELSKGKVERGKVVNYGDTIYVDT--RLSK-VYFLYPEIFGDHPFLYKIQTRIQDMT 118
             + +L++ +++R  V +  ++ ++ T  R+S+  +F Y E    HP + ++   +++++
Sbjct: 346 IVLQQLARPELQRSMVYSLSNSEHISTNFRISQGTFFEYHE----HPIMQRMSQHLENIS 401

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE--------GLWRLASFMFYLTDVE 170
            L +   E+    LQ+ NYG+GGHY+ H D+   +            R+A+ ++YL++VE
Sbjct: 402 GLDMRSAEQ----LQVANYGIGGHYEPHMDSFSENHNYGINTYMSTNRVATGIYYLSNVE 457

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GG T FP L L V PE+GS +FWYN H +  LDYR  H+GCPV +G+KW
Sbjct: 458 AGGGTAFPFLPLLVEPERGSLLFWYNLHRSGDLDYRTKHAGCPVLMGSKW 507


>gi|334314085|ref|XP_001363658.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1
           [Monodelphis domestica]
          Length = 537

 Score =  141 bits (355), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 126/230 (54%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  L +    +  L C Y   N N    + P K E+ +  PR+V+ H+ I D+E
Sbjct: 292 YEMLCRGEGLKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRFHEIISDAE 351

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      + P + +I  RIQD+T
Sbjct: 352 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---EDPVVSRINMRIQDLT 408

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 409 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 464

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 465 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 514


>gi|57525020|ref|NP_001006155.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Gallus gallus]
 gi|82082587|sp|Q5ZLK5.1|P4HA2_CHICK RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|53129464|emb|CAG31388.1| hypothetical protein RCJMB04_5l17 [Gallus gallus]
          Length = 534

 Score =  141 bits (355), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 85/231 (36%), Positives = 126/231 (54%), Gaps = 19/231 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
           +IY   C+G  + +    +  L C Y   N N  L I P K E+ +  P +V+ +D + D
Sbjct: 289 DIYEALCRGEGVKMTPRRQKRLFCRYHDGNRNPHLLIAPFKEEDEWDSPHIVRYYDVMSD 348

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI +I +L+K K+ R  V +   G       R+SK  +L  +   D P + K+  R+Q 
Sbjct: 349 EEIEKIKQLAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVAKVNQRMQQ 405

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCD-------ATPRDEGLWRLASFMFYLTDV 169
           +T L +   E     LQ+ NYG+GG Y+ H D       +T + EG  RLA+F+ Y++DV
Sbjct: 406 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSTLKSEGN-RLATFLNYMSDV 460

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           E GGAT+FP     ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 EAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 511


>gi|90085216|dbj|BAE91349.1| unnamed protein product [Macaca fascicularis]
          Length = 244

 Score =  141 bits (355), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 81/228 (35%), Positives = 126/228 (55%), Gaps = 19/228 (8%)

Query: 5   LACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+EI 
Sbjct: 1   MLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIE 60

Query: 63  RIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T L
Sbjct: 61  IVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLTGL 117

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVELG 172
            +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV  G
Sbjct: 118 DVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAG 173

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 174 GATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 221


>gi|54792285|emb|CAG28668.1| prolyl 4-hydroxylase alpha-2 subunit [Gallus gallus]
          Length = 538

 Score =  141 bits (355), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 83/232 (35%), Positives = 125/232 (53%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
           +IY   C+G  + +    +  L C Y   N N  L I P K E+ +  P +V+ +D + D
Sbjct: 292 DIYEALCRGEGVKMTPQRQKRLFCRYHDGNRNPHLLIAPFKEEDEWDSPHIVRYYDVMSD 351

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI +I +L+K K+ R  V +   G       R+SK  +L  +   D P + K+  R+Q 
Sbjct: 352 EEIEKIKQLAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVAKVNQRMQQ 408

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D + +DE           R+A+F+ Y++D
Sbjct: 409 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSD 464

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP     ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 465 VEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 516


>gi|148701600|gb|EDL33547.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_e [Mus
           musculus]
          Length = 593

 Score =  141 bits (355), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 348 DVYESLCRGEGVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 407

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 408 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 464

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 465 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 520

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 521 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 570


>gi|391342914|ref|XP_003745760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Metaseiulus
           occidentalis]
          Length = 525

 Score =  140 bits (354), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 126/230 (54%), Gaps = 19/230 (8%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           +Y   C+G       ++ NL C Y   N+ ++ + P K+E ++  P +   HD + D EI
Sbjct: 281 MYERLCRGEPVEKPFLRKNLHCTYFHNNHPYMILQPSKLEVIHERPYLALFHDIMSDDEI 340

Query: 62  NRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
             +IELS  +++R  V N   G+    + R+SK  +L      DH  + ++  R + +T 
Sbjct: 341 QTVIELSAPRLKRATVQNAKSGELEVANYRISKSAWLKNH---DHEVVERLSFRFEYLTG 397

Query: 120 LV-IGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
           L  +  EE     LQ+ NYG+GGHY+ H D   RDE           R+A+++ Y++DV+
Sbjct: 398 LTHLTAEE-----LQVVNYGIGGHYEAHFDFARRDEKDAFKQLGTGNRIATWINYMSDVK 452

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L LTV+PEKGSA FW+N H +   D    H+ CPV  G+KW
Sbjct: 453 AGGATVFPRLGLTVWPEKGSAAFWWNLHRSGEGDILTRHAACPVLAGSKW 502


>gi|57997558|emb|CAI46066.1| hypothetical protein [Homo sapiens]
          Length = 533

 Score =  140 bits (354), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLPIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|209862961|ref|NP_001129548.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Mus
           musculus]
 gi|17390970|gb|AAH18411.1| P4ha2 protein [Mus musculus]
 gi|18073922|emb|CAC85690.1| Prolyl 4-hydroxylase alpha IIa subunit [Mus musculus]
 gi|74211515|dbj|BAE26490.1| unnamed protein product [Mus musculus]
          Length = 535

 Score =  140 bits (354), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 290 DVYESLCRGEGVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 349

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 350 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 406

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 407 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 462

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 463 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|114601548|ref|XP_001162501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 9 [Pan
           troglodytes]
 gi|114601562|ref|XP_001162805.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 16 [Pan
           troglodytes]
 gi|114601564|ref|XP_517917.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 19 [Pan
           troglodytes]
 gi|397518354|ref|XP_003829356.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Pan
           paniscus]
 gi|397518356|ref|XP_003829357.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
           paniscus]
 gi|397518360|ref|XP_003829359.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Pan
           paniscus]
 gi|410215942|gb|JAA05190.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410255606|gb|JAA15770.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410331277|gb|JAA34585.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410331281|gb|JAA34587.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
          Length = 533

 Score =  140 bits (354), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 86/230 (37%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           +IY   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 288 DIYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|334314087|ref|XP_003339988.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2
           [Monodelphis domestica]
          Length = 537

 Score =  140 bits (354), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 126/230 (54%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  L +    +  L C Y   N N    + P K E+ +  PR+V+ H+ I D+E
Sbjct: 292 YEMLCRGEGLKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRFHEIISDAE 351

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  + N   G       R+SK  +L      + P + +I  RIQD+T
Sbjct: 352 IEIVKDLAKPRLRRATISNPITGVLETAHYRISKSAWLSGY---EDPVVSRINMRIQDLT 408

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 409 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 464

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 465 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 514


>gi|344264849|ref|XP_003404502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
           [Loxodonta africana]
          Length = 534

 Score =  140 bits (354), Expect = 3e-31,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 127/230 (55%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N T  L I P K E+ +  P +V+ +D + D
Sbjct: 289 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSD 348

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI +++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 349 EEIERIKQIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVAQVNRRMQH 405

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 406 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 461

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 462 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 511


>gi|345305838|ref|XP_001508476.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Ornithorhynchus
           anatinus]
          Length = 493

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 126/230 (54%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+V+ H+ I D+E
Sbjct: 248 YEMLCRGEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRYHEIISDAE 307

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      + P + +I  RIQD+T
Sbjct: 308 IETVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---EDPVVSRINMRIQDLT 364

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 365 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 420

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 421 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 470


>gi|348501574|ref|XP_003438344.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
           niloticus]
          Length = 615

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 83/232 (35%), Positives = 129/232 (55%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
           E Y + C+G  + +    +S L C +Y++  N  L + P+K ++ +  P +V+  D I D
Sbjct: 368 EKYEMLCRGEGIKMTPRRQSRLFCRYYDNNRNPSLLLAPVKQQDEWDRPYIVRYLDIISD 427

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
           +EI R+ +L+K ++ R  + N   G       R+SK  +L      D P + KI  RI+ 
Sbjct: 428 AEIERVKQLAKPRLRRATISNPITGVLETASYRISKSAWLTEY---DDPMIEKINDRIEG 484

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++D
Sbjct: 485 VTGLEMDTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSD 540

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           V  GGAT+FP +   V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 541 VSAGGATVFPDVGAAVWPQKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 592


>gi|836898|gb|AAC52197.1| prolyl 4-hydroxylase alpha(I)-subunit, partial [Mus musculus]
 gi|1096887|prf||2112362A Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=I
          Length = 526

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 125/230 (54%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 281 YEMLCRGEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 340

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  +  L+K ++ R  V +   G       R+SK  +L      + P + +I  RIQD+T
Sbjct: 341 IEIVKYLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---EDPVVSRINMRIQDLT 397

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 398 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVS 453

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 454 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 503


>gi|386368303|gb|AFJ06910.1| procollagen-proline dioxygenase [Mytilus galloprovincialis]
          Length = 535

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 82/229 (35%), Positives = 122/229 (53%), Gaps = 18/229 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G    P +  S + C Y   NN  L + P+K EE+Y D  +V  HD   D E+ 
Sbjct: 291 YKRLCKGLDVKPREKMSQVVCRYRHNNNPRLLLSPIKEEEVYRDANMVLFHDIASDKEMK 350

Query: 63  RIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            I  L+  K+ R  V +   G  I+   R++K  +L      DH  + ++Q RI+ +T L
Sbjct: 351 IIKSLAIPKLFRATVHDPTTGKLIHAKYRITKTAWLDDR---DHLVVDRVQNRIKAVTGL 407

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW---------RLASFMFYLTDVEL 171
            +   +     LQ+ NYG+GGHYD H D + RD+            R+A+F+ Y+TDV+ 
Sbjct: 408 DLDSAD----ALQVANYGIGGHYDPHYDFSTRDDDDTSETEKRDGNRIATFLLYMTDVDA 463

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GGAT+FP +++ V P+KG+AVFWYN   +        H+ CPV +G KW
Sbjct: 464 GGATVFPIIDVRVLPKKGTAVFWYNLRRSGKGIMETRHAACPVLVGTKW 512


>gi|63252891|ref|NP_001017973.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
           sapiens]
 gi|63252893|ref|NP_001017974.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
           sapiens]
 gi|217272861|ref|NP_001136070.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Homo
           sapiens]
 gi|18073925|emb|CAC85688.1| Prolyl 4-hydroxylase alpha IIa subunit [Homo sapiens]
 gi|23274221|gb|AAH35813.1| Prolyl 4-hydroxylase, alpha polypeptide II [Homo sapiens]
 gi|37183058|gb|AAQ89329.1| P4HA2 [Homo sapiens]
 gi|119582745|gb|EAW62341.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_a
           [Homo sapiens]
 gi|119582750|gb|EAW62346.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_a
           [Homo sapiens]
 gi|123983232|gb|ABM83357.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II [synthetic
           construct]
 gi|157928048|gb|ABW03320.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II [synthetic
           construct]
          Length = 533

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|292619367|ref|XP_001922562.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Danio rerio]
          Length = 541

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y   C+G  L +    + +L C Y + N + F  IGP+K E+ +  PR+++ H+ I + E
Sbjct: 296 YEKLCRGEGLRMTPRRQKHLFCRYFNGNRHPFYTIGPVKQEDEWDRPRIIRYHEIITEQE 355

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I +I ELSK ++ R  + N   G       R+SK  +L      +HP + +I  RI+D+T
Sbjct: 356 IEKIKELSKPRLRRATISNPITGVLETAHYRISKSAWLAAY---EHPVVDRINQRIEDIT 412

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 413 GLNVKTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVA 468

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +   V P KG+AVFWYN   +   DY   H+ CPV +GNKW
Sbjct: 469 AGGATVFPEVGAAVKPLKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 518


>gi|332221662|ref|XP_003259982.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Nomascus
           leucogenys]
          Length = 556

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 311 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 370

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 371 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 427

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 428 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 483

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 484 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 533


>gi|297675927|ref|XP_002815905.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pongo
           abelii]
 gi|395736137|ref|XP_003776704.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
          Length = 533

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|332221656|ref|XP_003259979.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Nomascus
           leucogenys]
 gi|332221658|ref|XP_003259980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Nomascus
           leucogenys]
          Length = 535

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 290 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 349

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 350 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 406

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 407 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 462

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 463 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|403255937|ref|XP_003920661.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Saimiri
           boliviensis boliviensis]
 gi|403255939|ref|XP_003920662.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Saimiri
           boliviensis boliviensis]
 gi|403255943|ref|XP_003920664.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 4 [Saimiri
           boliviensis boliviensis]
          Length = 533

 Score =  140 bits (353), Expect = 4e-31,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|449267219|gb|EMC78185.1| Prolyl 4-hydroxylase subunit alpha-2 [Columba livia]
          Length = 538

 Score =  140 bits (352), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 83/232 (35%), Positives = 125/232 (53%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
           +IY   C+G  + +    +  L C Y   N N  L I P K E+ +  P +V+ +D + D
Sbjct: 291 DIYEALCRGEGVKMTPRRQKRLFCRYHDGNRNPHLLIAPFKEEDEWDSPHIVRYYDVMSD 350

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI +I +L+K K+ R  V +   G       R+SK  +L  +   D P + K+  R+Q 
Sbjct: 351 EEIEKIKQLAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVAKVNQRMQQ 407

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D + +DE           R+A+F+ Y++D
Sbjct: 408 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSD 463

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP     ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 464 VEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 515


>gi|194905376|ref|XP_001981185.1| GG11927 [Drosophila erecta]
 gi|190655823|gb|EDV53055.1| GG11927 [Drosophila erecta]
          Length = 539

 Score =  140 bits (352), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 82/224 (36%), Positives = 114/224 (50%), Gaps = 15/224 (6%)

Query: 4   PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
           P  C G    P  +K  L C Y      FL++ P+K E L +DP VV +HD +   E   
Sbjct: 289 PPCCSGRCEGPRKLK-RLYCVYNCATAAFLRLAPIKTEILSIDPFVVLLHDMVSPKEAAL 347

Query: 64  IIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
           I   SK  +   + VN  +   V   R SK  +L  +    +    K+  R+ D T L +
Sbjct: 348 IRSSSKSTIFPSETVNAANDFVVSKFRTSKSVWLDRDA---NEATVKLTQRLADATGLDV 404

Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGATI 176
              E +    Q+ NYG+GG ++ H D T  D   +      R+A+ +FYL DV  GGAT 
Sbjct: 405 KHSEHF----QVINYGIGGVFESHFDTTLEDTNRFVGGFIDRIATTLFYLNDVPQGGATH 460

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FP LN+TVFP  G+A+FWYN     +L  R  H+GCPV +G+KW
Sbjct: 461 FPGLNITVFPRLGAALFWYNLDTQGMLQVRTMHTGCPVIVGSKW 504


>gi|426349879|ref|XP_004042513.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Gorilla gorilla
           gorilla]
          Length = 565

 Score =  140 bits (352), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 320 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 379

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 380 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 436

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 437 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 492

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 493 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 542


>gi|386780652|ref|NP_001247763.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Macaca mulatta]
 gi|383422579|gb|AFH34503.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
           mulatta]
 gi|384939466|gb|AFI33338.1| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Macaca
           mulatta]
          Length = 533

 Score =  140 bits (352), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|195452734|ref|XP_002073476.1| GK13124 [Drosophila willistoni]
 gi|194169561|gb|EDW84462.1| GK13124 [Drosophila willistoni]
          Length = 536

 Score =  140 bits (352), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 82/226 (36%), Positives = 125/226 (55%), Gaps = 16/226 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G +      +  L+C Y   N+ + ++ PLK+EE  LDP VV  HD +  ++I 
Sbjct: 286 YEKVCRGEVEPSPAQQRPLRCRYSQGNHPYRQLAPLKMEEHSLDPFVVTYHDMLSPNKIA 345

Query: 63  RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           ++ E++   + R  V  +  G       R+SK  +L    +  HP + K+   + D T L
Sbjct: 346 QLREMAVPHMRRSTVNPLPGGQNKKSSFRVSKNAWL---AYETHPTMGKMLRDLSDTTGL 402

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
            +     Y   LQ+ NYG+GGHY+ H D        P +EG  R+A+ ++YL++VE GGA
Sbjct: 403 DMT----YCEQLQVANYGVGGHYEPHWDFFRNPDHYPAEEGN-RIATAIYYLSEVEQGGA 457

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T FP LN  V P+ G+ +FWYN H ++ +DYR  H+GCPV  G+KW
Sbjct: 458 TAFPFLNFAVRPQLGNVLFWYNLHRSSDMDYRTKHAGCPVLKGSKW 503


>gi|119582748|gb|EAW62344.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_c
           [Homo sapiens]
          Length = 565

 Score =  140 bits (352), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 320 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 379

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 380 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 436

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 437 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 492

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 493 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 542


>gi|74353841|gb|AAI03334.1| Prolyl 4-hydroxylase, alpha polypeptide II [Bos taurus]
          Length = 487

 Score =  140 bits (352), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 242 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 301

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 302 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNLRMQH 358

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 359 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 414

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 415 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 464


>gi|440912197|gb|ELR61789.1| Prolyl 4-hydroxylase subunit alpha-2, partial [Bos grunniens mutus]
          Length = 535

 Score =  140 bits (352), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 290 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 349

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 350 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNLRMQH 406

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 407 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 462

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 463 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|395736139|ref|XP_003776705.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
          Length = 575

 Score =  140 bits (352), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 330 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 389

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 390 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 446

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 447 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 502

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 503 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 552


>gi|226874885|ref|NP_001029465.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Bos
           taurus]
 gi|296485623|tpg|DAA27738.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Bos taurus]
          Length = 533

 Score =  140 bits (352), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNLRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|116283554|gb|AAH17062.1| P4HA2 protein [Homo sapiens]
          Length = 504

 Score =  140 bits (352), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 259 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 318

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 319 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 375

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 376 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 431

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 432 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 481


>gi|426229221|ref|XP_004008689.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
           [Ovis aries]
          Length = 487

 Score =  140 bits (352), Expect = 5e-31,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 242 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 301

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 302 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNLRMQH 358

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 359 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 414

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 415 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 464


>gi|119582749|gb|EAW62345.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_d
           [Homo sapiens]
          Length = 488

 Score =  140 bits (352), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 243 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 302

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 303 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 359

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 360 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 415

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 416 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 465


>gi|393909803|gb|EFO21561.2| prolyl 4-hydroxylase 2 [Loa loa]
          Length = 542

 Score =  139 bits (351), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 117/230 (50%), Gaps = 18/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           ++Y   C+  + V     S L C+Y+  +  FL++ P KVE L   P  V   D I D E
Sbjct: 288 DMYEALCRNEVPVSVKATSKLYCYYK-MDRPFLRLAPFKVEILRFSPLAVFFRDVITDEE 346

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           +  I  L+  ++ R  V N   G+      R SK  +L  E   +H  +++I  RI  MT
Sbjct: 347 VTIIQMLATPRLRRATVQNSITGELETASYRTSKSAWLKDE---EHEIVHRINRRIDLMT 403

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
           NL    E+     LQ+ NYG+GGHYD H D   R+E           RLA+ +FY+T  E
Sbjct: 404 NL----EQETSEELQVGNYGIGGHYDPHFDFARREEVNAFQSLNTGNRLATLLFYMTQPE 459

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+F  +  TV P K  A+FWYN   +   D R  H+ CPV +G+KW
Sbjct: 460 SGGATVFTEVKTTVMPSKNDALFWYNLLRSGEGDLRTRHAACPVLIGSKW 509


>gi|335283456|ref|XP_003354320.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Sus scrofa]
          Length = 535

 Score =  139 bits (351), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 83/232 (35%), Positives = 126/232 (54%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N T  L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D + +DE           R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRKDEQDAFKRLGTGNRVATFLNYMSD 460

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|281361323|ref|NP_652183.2| CG15864 [Drosophila melanogaster]
 gi|272476864|gb|AAF54202.3| CG15864 [Drosophila melanogaster]
          Length = 490

 Score =  139 bits (351), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 86/221 (38%), Positives = 119/221 (53%), Gaps = 22/221 (9%)

Query: 5   LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
           L+ + N +V     S L C Y +    F +I PLK+EEL LDP +V  HD IYD+EI+ +
Sbjct: 258 LSTKQNCAVVVQKPSRLHCRYNTTTTPFTRIAPLKMEELGLDPYMVVFHDVIYDTEIDGM 317

Query: 65  IELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGR 124
           +  S          N+G ++    + S+V         D   L     R+ DMT    G 
Sbjct: 318 LNSS----------NFGLSLTDSGQKSEVRTSKDSYIVDAKTL---NERVTDMT----GF 360

Query: 125 EERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLWRLASFMFYLTDVELGGATIFPS 179
                 P  + NYGLGGHY LH D      T R +   R+A+ +FYL +V+ GGATIFP 
Sbjct: 361 SMEMSDPFSLINYGLGGHYMLHYDFHEYTNTTRPKQGDRIATVLFYLGEVDSGGATIFPM 420

Query: 180 LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +N+TV P+KGSAVFWYN H +  ++ +  HS CPV  G+K+
Sbjct: 421 INITVTPKKGSAVFWYNLHNSGAMNLKSLHSACPVISGSKY 461


>gi|224068121|ref|XP_002191580.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Taeniopygia
           guttata]
          Length = 539

 Score =  139 bits (351), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 84/231 (36%), Positives = 126/231 (54%), Gaps = 19/231 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
           +IY   C+G  + +    +  L C Y   N N  L I P K E+ +  P +V+ +D + D
Sbjct: 294 DIYEALCRGEGVKMTPRRQKRLFCRYHDGNRNPHLLIAPFKEEDEWDSPHIVRYYDVMSD 353

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI +I +L+K ++ R  V +   G       R+SK  +L  +   D P + K+  R+Q 
Sbjct: 354 EEIEKIKQLAKPRLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVAKVNQRMQH 410

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCD-------ATPRDEGLWRLASFMFYLTDV 169
           +T L +   E     LQ+ NYG+GG Y+ H D       +T + EG  RLA+F+ Y++DV
Sbjct: 411 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSTLKSEGN-RLATFLNYMSDV 465

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           E GGAT+FP     ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 466 EAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 516


>gi|312080225|ref|XP_003142509.1| prolyl 4-hydroxylase 2 [Loa loa]
          Length = 541

 Score =  139 bits (351), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 117/230 (50%), Gaps = 18/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           ++Y   C+  + V     S L C+Y+  +  FL++ P KVE L   P  V   D I D E
Sbjct: 287 DMYEALCRNEVPVSVKATSKLYCYYK-MDRPFLRLAPFKVEILRFSPLAVFFRDVITDEE 345

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           +  I  L+  ++ R  V N   G+      R SK  +L  E   +H  +++I  RI  MT
Sbjct: 346 VTIIQMLATPRLRRATVQNSITGELETASYRTSKSAWLKDE---EHEIVHRINRRIDLMT 402

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
           NL    E+     LQ+ NYG+GGHYD H D   R+E           RLA+ +FY+T  E
Sbjct: 403 NL----EQETSEELQVGNYGIGGHYDPHFDFARREEVNAFQSLNTGNRLATLLFYMTQPE 458

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+F  +  TV P K  A+FWYN   +   D R  H+ CPV +G+KW
Sbjct: 459 SGGATVFTEVKTTVMPSKNDALFWYNLLRSGEGDLRTRHAACPVLIGSKW 508


>gi|240974259|ref|XP_002401836.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215491070|gb|EEC00711.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 490

 Score =  139 bits (351), Expect = 6e-31,   Method: Compositional matrix adjust.
 Identities = 83/231 (35%), Positives = 127/231 (54%), Gaps = 19/231 (8%)

Query: 2   IYPLACQG-NLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           IY   C+G    VP   K  +L C Y +  + FL + P K E ++  PR+V  HD +   
Sbjct: 244 IYERLCRGEKFPVPPLYKDKDLTCQYRTNGSPFLLLQPAKEEVMFPKPRIVIYHDVMSKH 303

Query: 60  EINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
           E++ +  L++ +++R  V NY  G+    + R+SK  +L  E   +H  + ++  RI+ +
Sbjct: 304 EMDVVKLLAQPRLKRATVQNYKSGELEVANYRISKSAWLRNE---EHGVIARVTRRIEHI 360

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDV 169
           T L     E     LQ+ NYG+GGHY+ H D   R+E           R+A+++ Y++DV
Sbjct: 361 TGLSADTAEE----LQVVNYGIGGHYEPHFDFARREEKNAFQSLGTGNRIATWLNYMSDV 416

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
             GGAT+FP L LT++PEKG+A FWYN H +   D    H+ CPV  G+KW
Sbjct: 417 PAGGATVFPQLRLTLWPEKGAAAFWYNLHRSGEGDMLTRHAACPVLAGSKW 467


>gi|156352054|ref|XP_001622587.1| predicted protein [Nematostella vectensis]
 gi|156209158|gb|EDO30487.1| predicted protein [Nematostella vectensis]
          Length = 531

 Score =  139 bits (351), Expect = 7e-31,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 118/230 (51%), Gaps = 12/230 (5%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y   C+G      +  + L+C+Y+   +   +I PLKVEEL+ DP +  + D +YDSE
Sbjct: 281 EAYERLCRGISYRSNEEAAKLRCYYDFTRHPMFRIRPLKVEELHSDPPIWMLRDVMYDSE 340

Query: 61  INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLY-PEIFGDHPFLYKIQTRIQDM 117
           I  I   +  K+ R  V N   G+  + D R+SK  +L  P    +   L ++  R   +
Sbjct: 341 IEYIKRTATPKLRRATVTNLKTGELEFADYRISKSGWLEDPRDDNEEKILNRVNRRTSII 400

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDVE 170
           T L      R    LQI NYG  GHY+ H D AT     +       R+A+ ++Y++DVE
Sbjct: 401 TGL--DTTPRSAEALQIVNYGAAGHYEPHFDHATEAVSSILKLGIGNRIATVLYYMSDVE 458

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+F      V P KG A FWYN H N   D R  H+ CP+ +G+KW
Sbjct: 459 AGGATVFVDAEAIVKPSKGDAAFWYNLHKNGKGDERTRHAACPIIVGSKW 508


>gi|74148153|dbj|BAE36242.1| unnamed protein product [Mus musculus]
          Length = 454

 Score =  139 bits (351), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 81/230 (35%), Positives = 125/230 (54%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 209 YEMLCRGEGIKMTPRRQKRLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 268

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
              + +L+K ++ R  V +   G       R+SK  +L      + P + +I  RIQD+T
Sbjct: 269 NEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---EDPVVSRINMRIQDLT 325

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 326 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFRELGTGNRIATWLFYMSDVS 381

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 382 AGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 431


>gi|195452776|ref|XP_002073495.1| GK13117 [Drosophila willistoni]
 gi|194169580|gb|EDW84481.1| GK13117 [Drosophila willistoni]
          Length = 487

 Score =  139 bits (350), Expect = 8e-31,   Method: Compositional matrix adjust.
 Identities = 85/223 (38%), Positives = 123/223 (55%), Gaps = 18/223 (8%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           C+G      D K  L C Y + ++ FL++ PLK+E + LDP +V  HD I  +EI  + E
Sbjct: 250 CRGEFPALTDAK--LYCIYNTTSSPFLRLAPLKMELIGLDPYMVLYHDVISPNEIAELQE 307

Query: 67  LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD--HPFLYKIQTRIQDMTNLVIGR 124
           ++K +++R +V  Y  T   D +LSK        F D  +    ++  RI DMTN V+  
Sbjct: 308 MAKPQLKRARV--YNSTKNTD-QLSKTRTAKLAWFLDTFNQLTERLNQRIMDMTNFVLNG 364

Query: 125 EERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-------RLASFMFYLTDVELGGATIF 177
            E     LQ+ NYGLGG+Y  H D     +G         R+A+ +FYL DVE GGAT+F
Sbjct: 365 SEM----LQVMNYGLGGYYVKHFDYFNTTKGPHITQINGDRIATVLFYLNDVEQGGATVF 420

Query: 178 PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           P +   VFP++GSA+ WYN   +   +    H+GCPV +G+KW
Sbjct: 421 PEIKKAVFPKRGSAIMWYNLKDDGEGNRDTLHAGCPVIVGSKW 463


>gi|395817618|ref|XP_003782262.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Otolemur
           garnettii]
          Length = 538

 Score =  139 bits (350), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 85/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           E+Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 293 EVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 352

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 353 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNHRMQH 409

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     R+A+F+ Y++DVE
Sbjct: 410 ITGLSVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRVATFLNYMSDVE 465

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 466 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 515


>gi|195572621|ref|XP_002104294.1| GD18523 [Drosophila simulans]
 gi|194200221|gb|EDX13797.1| GD18523 [Drosophila simulans]
          Length = 490

 Score =  139 bits (350), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 83/227 (36%), Positives = 120/227 (52%), Gaps = 34/227 (14%)

Query: 5   LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
           LA + N +      S L C Y +    F +I PLK+EEL LDP +V  HD +YD+EI+ +
Sbjct: 258 LATKQNCTAVVQKPSRLHCRYNTSTTPFTRIAPLKMEELSLDPYMVVFHDVVYDTEIDGM 317

Query: 65  IELSKGKVE------RGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           +  S   +       + +V    D+  VD++                    +  R+ DMT
Sbjct: 318 LNSSNFGISESVSGLKSEVRTSKDSHIVDSK-------------------TLNERVTDMT 358

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLWRLASFMFYLTDVELGG 173
            L +   +    P  + NYGLGGH+ LH D      T R +   R+A+ +FYL +V+ GG
Sbjct: 359 GLSMEMSD----PFSLINYGLGGHFILHHDFHEYTNTTRLKQGDRIATVLFYLGEVDSGG 414

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           ATIFP LN+TV P+KGSAVFWYN H +  ++ +  HS CPV  G+K+
Sbjct: 415 ATIFPMLNITVTPKKGSAVFWYNLHNSGAVNSKTLHSACPVISGSKY 461


>gi|308476969|ref|XP_003100699.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
 gi|308264511|gb|EFP08464.1| hypothetical protein CRE_15564 [Caenorhabditis remanei]
          Length = 573

 Score =  139 bits (350), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 91/249 (36%), Positives = 126/249 (50%), Gaps = 37/249 (14%)

Query: 1   EIYPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           + Y   C+G +  V +  K+ L+C+ +  +  FLKI P+KVE L  DP  V   + I DS
Sbjct: 295 DAYEALCRGEIPPVEKKWKNKLRCYLKR-DKPFLKIAPIKVEILRFDPLAVLFKNVISDS 353

Query: 60  EINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
           EI  I EL+  K++R  V N   G+  +   R+SK  +L  ++   HP + ++  RI+D 
Sbjct: 354 EIKVIKELASPKLKRATVQNSKTGELEHATYRISKSAWLKGDL---HPVIERVNRRIEDF 410

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------------------- 157
           T L  G  E     LQ+ NYGLGGHYD H D A   + GL                    
Sbjct: 411 TGLYQGTSEE----LQVANYGLGGHYDPHFDFARIANYGLGGHYEPHYDMSLKEEKNAFK 466

Query: 158 ------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 211
                 R+A+ +FY++  E GGAT+F  L   VFP K  A+FWYN   +   D R  H+ 
Sbjct: 467 TLNTGNRIATVLFYMSQPERGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAA 526

Query: 212 CPVALGNKW 220
           CPV LG KW
Sbjct: 527 CPVLLGVKW 535


>gi|405965633|gb|EKC30995.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
          Length = 617

 Score =  139 bits (350), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 84/232 (36%), Positives = 124/232 (53%), Gaps = 21/232 (9%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           + Y   C+G  +    +K  LKC Y   NN  L + P K EE+YL+P +V  HD + D E
Sbjct: 372 QTYESLCRGEDTHDYKLKHKLKCRYVHKNNPRLLLKPAKEEEVYLNPWIVIYHDVVSDKE 431

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I+ I  ++   + R  V N   G     + R+SK  +L     GD P ++ +  RI D+T
Sbjct: 432 IDTIKRIATPLLSRATVHNPRTGKLETAEYRVSKSAWLKD---GDDPVIHNVNNRISDIT 488

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQI NYGLGG Y+ H D   R+E           R+A+++ Y+T+V+
Sbjct: 489 GLSMATAEE----LQIANYGLGGQYEPHFDFARREETEAFRDLGSGNRIATWLTYMTNVD 544

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAH--ANTLLDYRMYHSGCPVALGNKW 220
            GGAT+F  + + +FP KG+A FWYN +   + + D R  H+ CPV +G KW
Sbjct: 545 AGGATVFTHIGVKLFPIKGAAAFWYNLYRSGDGIFDTR--HAACPVLVGQKW 594


>gi|354474415|ref|XP_003499426.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
           [Cricetulus griseus]
          Length = 533

 Score =  139 bits (350), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 84/224 (37%), Positives = 123/224 (54%), Gaps = 17/224 (7%)

Query: 7   CQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
           C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D EI RI
Sbjct: 294 CRGEGVKLTPQRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERI 353

Query: 65  IELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
            E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q +T L +
Sbjct: 354 KEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQHITGLTV 410

Query: 123 GREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVELGGATI 176
              E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE GGAT+
Sbjct: 411 KTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVEAGGATV 466

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 467 FPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|195452778|ref|XP_002073496.1| GK13116 [Drosophila willistoni]
 gi|194169581|gb|EDW84482.1| GK13116 [Drosophila willistoni]
          Length = 521

 Score =  139 bits (350), Expect = 9e-31,   Method: Compositional matrix adjust.
 Identities = 82/223 (36%), Positives = 121/223 (54%), Gaps = 18/223 (8%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           C+G      D K  L C Y + ++ FL++ PLK+E + LDP +V  HD I  +EI  + E
Sbjct: 287 CRGEFPALTDAK--LYCIYNTTSSPFLRLAPLKMELIGLDPYMVLYHDVISPNEIAELQE 344

Query: 67  LSKGKVERGKVVNYGDTI--YVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGR 124
           ++K +++R  V N       +V TR +KV +        +    ++  RI DMTN V+  
Sbjct: 345 MAKPELKRATVYNSTKNTNQFVKTRTAKVAWFLDTF---NQLTERLNQRIMDMTNFVLNG 401

Query: 125 EERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLW--RLASFMFYLTDVELGGATIF 177
            E     LQ+ NYGLGG+Y  H D       P    +   R+A+ +FYL DVE GGAT+F
Sbjct: 402 SEM----LQVMNYGLGGYYVKHFDYFNTTTNPHISQINGDRIATVLFYLNDVEQGGATVF 457

Query: 178 PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           P +   VFP++GSA+ WYN   +   +    H+ CPV +G+KW
Sbjct: 458 PEIKKAVFPKRGSAIMWYNLKDDGEGNRDTLHAACPVIVGSKW 500


>gi|195330780|ref|XP_002032081.1| GM23710 [Drosophila sechellia]
 gi|194121024|gb|EDW43067.1| GM23710 [Drosophila sechellia]
          Length = 490

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 83/227 (36%), Positives = 120/227 (52%), Gaps = 34/227 (14%)

Query: 5   LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
           LA + N +      S L C Y +    F +I PLK+EEL LDP +V  HD +YD+EI+ +
Sbjct: 258 LATKQNCTAVIQKPSRLHCRYNTSTTPFTRIAPLKMEELSLDPYMVVFHDVVYDTEIDGM 317

Query: 65  IELSKGKVE------RGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           +  S   +       + +V    D+  VD++                    +  R+ DMT
Sbjct: 318 LNSSNFGISESVSGLKSEVRTSKDSHIVDSK-------------------TLNERVTDMT 358

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLWRLASFMFYLTDVELGG 173
            L +   +    P  + NYGLGGH+ LH D      T R +   R+A+ +FYL +V+ GG
Sbjct: 359 GLSMEMSD----PFSLINYGLGGHFILHHDFHEYTNTTRLKRGDRIATVLFYLGEVDSGG 414

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           ATIFP LN+TV P+KGSAVFWYN H +  ++ +  HS CPV  G+K+
Sbjct: 415 ATIFPMLNITVTPKKGSAVFWYNLHNSGAVNSKTLHSACPVISGSKY 461


>gi|345326417|ref|XP_001510155.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like
           [Ornithorhynchus anatinus]
          Length = 888

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 83/232 (35%), Positives = 126/232 (54%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N T  L I P K E+ +  P +V+ +D + D
Sbjct: 641 DVYEGLCRGEGVKLTPRRQKRLFCRYHDGNRTPQLLIAPFKEEDEWDSPHIVRYYDVLSD 700

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI +I EL+K K+ R  V +   G     + R+SK  +L  E   D P + ++  R+Q 
Sbjct: 701 EEIEKIKELAKPKLARATVRDPKTGVLTVANYRVSKSSWLEEE---DDPVVAQVNRRMQY 757

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D + +DE           R+A+F+ Y++D
Sbjct: 758 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSD 813

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP     ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 814 VEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 865


>gi|260825357|ref|XP_002607633.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
 gi|229292981|gb|EEN63643.1| hypothetical protein BRAFLDRAFT_59428 [Branchiostoma floridae]
          Length = 520

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 79/233 (33%), Positives = 131/233 (56%), Gaps = 21/233 (9%)

Query: 2   IYPLACQGNLSVPEDIKSN----LKCFYESYNN-TFLKIGPLKVEELYLDPRVVKIHDAI 56
           +Y L CQ +     +I S+    LKC Y + NN   L + P+++E+++  P++  +H+ +
Sbjct: 272 VYELLCQADQPEIFNITSSRVKHLKCRYFTNNNHPRLLLAPIRLEQVFDKPKLWVLHNIL 331

Query: 57  YDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRI 114
            D E+  I +L++ ++ R +V +   G+      R+SK  +LY     +H  + ++  R+
Sbjct: 332 TDPEMEVIKKLAQPRLRRARVESPTTGEGELASYRISKSAWLYD---WEHRVIRRVNQRV 388

Query: 115 QDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-------RLASFMFYLT 167
           +D+T L +   E     LQ+ NYG+GGHY+ H D   +DE          R+A+ +FY++
Sbjct: 389 EDVTGLTMETAEL----LQVVNYGIGGHYEPHFDCATKDEEFALDPNEGDRIATMLFYMS 444

Query: 168 DVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           DVE GGAT+FP +   V PEKG+  FWYN   +   D    H+GCPV +G+KW
Sbjct: 445 DVEAGGATVFPQVGARVVPEKGAGAFWYNLLKSGEGDMLTEHAGCPVLVGSKW 497


>gi|410948134|ref|XP_003980796.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Felis
           catus]
          Length = 535

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 83/232 (35%), Positives = 126/232 (54%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           +IY   C+G  + +    +  L C Y   N T  L I P K E+ +  P +V+ +D + D
Sbjct: 288 DIYESLCRGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D + ++E           R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRKNEQDAFKRLGTGNRVATFLNYMSD 460

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|324511726|gb|ADY44875.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
          Length = 550

 Score =  139 bits (349), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 85/229 (37%), Positives = 119/229 (51%), Gaps = 18/229 (7%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           IY   C+  + V     S L C+Y+  +  FL++ P KVE L  +P  V   D I D E 
Sbjct: 285 IYEALCRNEVPVSIKAISQLYCYYK-MDRPFLRLAPFKVEILRFNPLAVLFVDIISDEEA 343

Query: 62  NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
             I +++  +++R  V N   G+      R+SK  +L     GDH  + +I  RI+ MTN
Sbjct: 344 KMIQQIATPRLKRATVQNSKTGELETAAYRISKSAWLKG---GDHELIDRINRRIELMTN 400

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVEL 171
           L+    E     LQI NYG+GGHYD H D   ++E           RLA+ +FYLT+ E+
Sbjct: 401 LIQETSEE----LQIANYGVGGHYDPHFDFARKEEPKAFESLGTGNRLATVLFYLTEPEI 456

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GG T+F  L   V P K  A+FWYN + +   D R  H+ CPV +G KW
Sbjct: 457 GGGTVFTELRTAVMPSKNGALFWYNLYRSGEGDLRTRHAACPVLVGIKW 505


>gi|432926124|ref|XP_004080841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
           latipes]
          Length = 523

 Score =  138 bits (348), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 79/224 (35%), Positives = 124/224 (55%), Gaps = 19/224 (8%)

Query: 8   QGNLSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           QG L  P  + S L C ++ ++ +    IGP+K E+ +  P +V+ HD   + E+  + E
Sbjct: 285 QGALMTPRRL-SRLFCRYFNNHGHPNYLIGPVKQEDEWDSPYIVRYHDVASEKEMETVKE 343

Query: 67  LSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGR 124
           L+K ++ R  V +   G       R+SK  +L      +HP + +I  RI+D+T L +  
Sbjct: 344 LAKPRLRRATVHDPQTGKLTTAQYRVSKSAWLGSH---EHPIVDRINQRIEDITGLDVST 400

Query: 125 EERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVELGGATI 176
            E     LQ+ NYG+GG Y+ H D   +DE           R+A+++ Y++DV+ GG T+
Sbjct: 401 AE----DLQVANYGVGGQYEPHFDFGRKDEADAFEELGTGNRIATWLLYMSDVQAGGNTV 456

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           F  +   V+P+KG+AVFWYN H +   DYR  H+ CPV +GNKW
Sbjct: 457 FTDIGAVVWPKKGTAVFWYNLHRSGEGDYRTRHAACPVLVGNKW 500


>gi|190402274|gb|ACE77683.1| prolyl 4-hydroxylase subunit alpha-2 precursor (predicted) [Sorex
           araneus]
          Length = 533

 Score =  138 bits (348), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 83/230 (36%), Positives = 126/230 (54%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C ++  +    L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGHGAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L      D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTTASYRVSKSSWLEET---DDPVVARVNLRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DVE
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDVE 460

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 AGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 510


>gi|195330778|ref|XP_002032080.1| GM23711 [Drosophila sechellia]
 gi|194121023|gb|EDW43066.1| GM23711 [Drosophila sechellia]
          Length = 490

 Score =  138 bits (348), Expect = 1e-30,   Method: Compositional matrix adjust.
 Identities = 82/227 (36%), Positives = 117/227 (51%), Gaps = 34/227 (14%)

Query: 5   LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
           LA + N +      S L C Y +    F +I PLK+EEL LDP +V  HD +YD+EI+ +
Sbjct: 258 LATKQNCTAVVQKPSRLHCRYNTSTTPFTRIAPLKMEELSLDPYMVVFHDVVYDTEIDGM 317

Query: 65  IELSKGKV------ERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           +  S   +      ++ +V    D+  VD +                    +  R+ DMT
Sbjct: 318 LNSSNFVLSLTDSGQKSEVRTSKDSYIVDAK-------------------SLNERVTDMT 358

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLWRLASFMFYLTDVELGG 173
               G       P  + NYGLGGHY LH D      T R +   R+A+ +FYL +V+ GG
Sbjct: 359 ----GFSMEMSDPFSLINYGLGGHYMLHYDFHEYTNTTRPKQGDRIATVLFYLGEVDSGG 414

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           ATIFP +N+ V P+KGSAVFWYN H +  ++ +  HS CPV  G+K+
Sbjct: 415 ATIFPKINIAVTPKKGSAVFWYNLHNSGAMNLKSLHSACPVISGSKY 461


>gi|195572619|ref|XP_002104293.1| GD18524 [Drosophila simulans]
 gi|194200220|gb|EDX13796.1| GD18524 [Drosophila simulans]
          Length = 472

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 84/221 (38%), Positives = 117/221 (52%), Gaps = 22/221 (9%)

Query: 5   LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
           LA + N +      S L C Y +    F +I PLK+EEL LDP +V  HD +YD+EI+ +
Sbjct: 240 LATKQNCTAVVQKPSRLHCRYNTSTTPFTRIAPLKMEELSLDPYMVVFHDVVYDTEIDGM 299

Query: 65  IELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGR 124
           +  S          N+G ++    + S+V         D   L     R+ DMT    G 
Sbjct: 300 LNSS----------NFGLSLTDSGQKSEVRTSKDSYIVDSESL---NERVTDMT----GF 342

Query: 125 EERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLWRLASFMFYLTDVELGGATIFPS 179
                 P  + NYGLGGHY LH D      T R +   R+A+ +FYL +V+ GGATIFP 
Sbjct: 343 SMEMSDPFSLINYGLGGHYMLHYDFHEYTNTTRPKQGDRIATVLFYLGEVDSGGATIFPK 402

Query: 180 LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +N+ V P+KGSAVFWYN H +  ++ +  HS CPV  G+K+
Sbjct: 403 INIAVTPKKGSAVFWYNLHNSGAMNLKSLHSACPVISGSKY 443


>gi|148226320|ref|NP_001087703.1| prolyl 4-hydroxylase, alpha polypeptide 2 precursor [Xenopus
           laevis]
 gi|51703693|gb|AAH81114.1| MGC83530 protein [Xenopus laevis]
          Length = 533

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 84/230 (36%), Positives = 130/230 (56%), Gaps = 17/230 (7%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N N  L +GP+K+E+ +  PR+V+  D + D
Sbjct: 288 DVYEALCRGEGVKMNPRRQKRLFCRYHDGNRNPRLILGPIKMEDEWDSPRIVRYLDVLSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI +I EL+K ++ R  V +   G     + R+SK  +L  E + D P + ++ +R+Q 
Sbjct: 348 EEIEKIKELAKPRLARATVRDPKTGVLTVANYRVSKSAWL--EEY-DDPVIGRVNSRMQA 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPR--DEGLW----RLASFMFYLTDVE 170
           +T L     E     LQ+ NYG+GG Y+ H D + R  D  L     RLA+++ Y++DVE
Sbjct: 405 ITGLTKDTAEL----LQVANYGMGGQYEPHFDFSRRPFDSNLKTEGNRLATYLNYMSDVE 460

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP     ++P KG+AVFWYN   +   DYR  H+ CPV +G+KW
Sbjct: 461 AGGATVFPDFGAAIWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKW 510


>gi|301754231|ref|XP_002912939.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Ailuropoda
           melanoleuca]
          Length = 535

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 83/232 (35%), Positives = 126/232 (54%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           +IY   C+G  + +    +  L C Y   N T  L I P K E+ +  P +V+ +D + D
Sbjct: 288 DIYESLCRGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNLRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D + ++E           R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRKNEQDAFKRLGTGNRVATFLNYMSD 460

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|195341558|ref|XP_002037373.1| GM12146 [Drosophila sechellia]
 gi|194131489|gb|EDW53532.1| GM12146 [Drosophila sechellia]
          Length = 485

 Score =  138 bits (348), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 82/225 (36%), Positives = 116/225 (51%), Gaps = 15/225 (6%)

Query: 4   PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
           P  C G    P  +K  L C Y      FL++ P+K E L +DP V+  HD +  +E   
Sbjct: 269 PPCCSGRCEGPRKLK-RLYCVYNCVTAPFLRLAPIKTEILSIDPFVILFHDMVSPTEGAL 327

Query: 64  IIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
           I   SK ++   + VN  +   V   R SK  +   +    +    K+  R+ + T L +
Sbjct: 328 IRSSSKNQILPSETVNAANEFEVAKFRTSKSVWFDSDA---NEATLKLTQRLGEATGLDM 384

Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGATI 176
              E    P Q+ NYG+GG ++ H D +  DE  +      RLA+ +FYL DV  GGAT 
Sbjct: 385 KHSE----PFQVINYGIGGVFESHFDTSLADEDRFVNGYIDRLATTLFYLNDVPQGGATH 440

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
           FP LN+TVFP+ G+ + WYN H   LL  R  H+GCPV +G+KWG
Sbjct: 441 FPGLNITVFPKFGTVLMWYNLHTEGLLHVRTMHTGCPVIVGSKWG 485


>gi|410927705|ref|XP_003977281.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
           rubripes]
          Length = 531

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 128/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y   C+G  L +    +S L C +Y++  +    IGP+K E+ +  P +V+ HD + + E
Sbjct: 286 YEQLCRGEGLKMTARRQSQLFCRYYDNGRHPKYVIGPVKQEDEWDRPHIVRYHDILSNRE 345

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           +  + EL+K ++ R  V +   G       R+SK  +L      +HP + +I  RI+D+T
Sbjct: 346 METVKELAKPRLRRATVHDPQTGQLTTAPYRVSKSAWLGA---FEHPVVDRINQRIEDIT 402

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++ Y+++V+
Sbjct: 403 GLDVSTAE----DLQVANYGVGGQYEPHYDFGRKDEPDAFKELGTGNRIATWLLYMSEVQ 458

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+F  +  +V P+KGSAVFWYN H +   DYR  H+ CPV LGNKW
Sbjct: 459 AGGATVFTDIGASVSPKKGSAVFWYNLHPSGDGDYRTRHAACPVLLGNKW 508


>gi|195591302|ref|XP_002085381.1| GD14757 [Drosophila simulans]
 gi|194197390|gb|EDX10966.1| GD14757 [Drosophila simulans]
          Length = 525

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 85/225 (37%), Positives = 122/225 (54%), Gaps = 21/225 (9%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           + L C+G        K+NL C Y+S  NTFL++ PLK+EE+ LDP +   H+ +YDSEI+
Sbjct: 291 FELGCRGLYRQ----KTNLVCRYKSTANTFLRLAPLKLEEISLDPFIAMYHEVLYDSEIH 346

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
            +   S   V  G       T   DT     ++    +  +     +I  RI DMT    
Sbjct: 347 ELKGQSMNMV-NGYASERNGTEIRDTVARYDWWSNTSLVRE-----RINQRIIDMTEFNF 400

Query: 123 GREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLW-RLASFMFYLTDVELGGAT 175
            ++E+    LQI NYG+G ++  H D       TP    L  RLAS +FY ++V  GGAT
Sbjct: 401 SKDEK----LQITNYGVGTYFQPHFDYSSDGFETPNITTLGDRLASILFYASEVPQGGAT 456

Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +FP +N+TVFP+KGS ++W+N H +   D R  HS CPV  G++W
Sbjct: 457 VFPEINVTVFPQKGSMLYWFNLHDDGRPDIRSKHSVCPVINGDRW 501


>gi|348557542|ref|XP_003464578.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
           [Cavia porcellus]
          Length = 535

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 83/232 (35%), Positives = 123/232 (53%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           E+Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 288 EVYESLCRGEGIKLTPQRRKRLFCRYHHGNRAPELLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  E   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEEE---DDPVVARVNRRMQQ 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D +   E           R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRSHERDAFKRLGTGNRVATFLNYMSD 460

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAALWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|327267604|ref|XP_003218589.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Anolis
           carolinensis]
          Length = 542

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 126/230 (54%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  L +    +  L C +Y+   N    + P+K E+ +  PR+V+  + I D E
Sbjct: 297 YEMLCRGEGLKMTPRRQKKLFCRYYDGNRNPKYILRPVKQEDEWDRPRIVRFVEIISDEE 356

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + EL+K ++ R  V +   G       R+SK  +L      ++P + +I TRIQD+T
Sbjct: 357 IETVKELAKPRLSRATVHDPQTGKLTTAHYRVSKSAWLSGY---ENPIVARINTRIQDLT 413

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 414 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 469

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P KG+AVFWYN   +   DY   H+ CPV +GNKW
Sbjct: 470 AGGATVFPEVGASVWPRKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 519


>gi|226874889|ref|NP_001152881.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Bos
           taurus]
 gi|296485624|tpg|DAA27739.1| TPA: prolyl 4-hydroxylase subunit alpha-2 isoform 1 [Bos taurus]
          Length = 535

 Score =  138 bits (347), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 125/232 (53%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNLRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D + +DE           R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRKDEQDAFKRLGTGNRVATFLNYMSD 460

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|395509389|ref|XP_003758980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2
           [Sarcophilus harrisii]
          Length = 536

 Score =  137 bits (346), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 126/232 (54%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N T  L I P K E+ +  P +V+ +D + D
Sbjct: 289 DVYEALCRGEGIKLTPRRQKRLFCRYHDGNRTPQLLIAPFKEEDEWDSPHIVRYYDVLSD 348

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI EL+K K+ R  V +   G     + R+SK  +L     GD P + ++  R+  
Sbjct: 349 EEIERIKELAKPKLARATVRDPKTGVLTVANYRVSKSSWLEE---GDDPVIAQLNRRMHY 405

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D + + E           R+A+F+ Y++D
Sbjct: 406 ITGLSVKTAEL----LQVANYGMGGQYEPHFDFSRKGEQDAFKHLGTGNRVATFLNYMSD 461

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP    T++P+KG++VFWYN   +   DYR  H+ CPV +G+KW
Sbjct: 462 VEAGGATVFPDFGATIWPKKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKW 513


>gi|193688213|ref|XP_001943683.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
           [Acyrthosiphon pisum]
          Length = 552

 Score =  137 bits (346), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 82/231 (35%), Positives = 124/231 (53%), Gaps = 18/231 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           E Y + C+    +   I S L+C Y + N N  L I PLK EE +  PR++   D +YD+
Sbjct: 300 ERYHMLCRNENLMSIQISSQLRCRYTNNNRNPLLLIAPLKEEEAFFSPRIILYRDVLYDN 359

Query: 60  EINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
           EI  I  +++ +++R  V NY  G+  + D R+SK  +L      +   +  +  R++ M
Sbjct: 360 EIEVIKRMAQPRLKRATVQNYKTGELEFADYRISKSAWLKEH---EDVVVANVAKRVEVM 416

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-------GLW-RLASFMFYLTDV 169
           T L     E     LQ+ NYG+GGHYD H D    +E       G   R+A+ +FY++DV
Sbjct: 417 TGLTTETAEE----LQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIATVLFYMSDV 472

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
             GGAT+FP L + + P KG+A  W+N + +   D R  H+ CPV  G+KW
Sbjct: 473 AQGGATVFPWLGVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPVLQGSKW 523


>gi|426229219|ref|XP_004008688.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
           [Ovis aries]
          Length = 535

 Score =  137 bits (346), Expect = 2e-30,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 125/232 (53%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNLRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D + +DE           R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRKDEQDAFKRLGTGNRVATFLNYMSD 460

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|328696638|ref|XP_003240086.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
           [Acyrthosiphon pisum]
          Length = 534

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 82/231 (35%), Positives = 124/231 (53%), Gaps = 18/231 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           E Y + C+    +   I S L+C Y + N N  L I PLK EE +  PR++   D +YD+
Sbjct: 282 ERYHMLCRNENLMSIQISSQLRCRYTNNNRNPLLLIAPLKEEEAFFSPRIILYRDVLYDN 341

Query: 60  EINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
           EI  I  +++ +++R  V NY  G+  + D R+SK  +L      +   +  +  R++ M
Sbjct: 342 EIEVIKRMAQPRLKRATVQNYKTGELEFADYRISKSAWLKEH---EDVVVANVAKRVEVM 398

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-------GLW-RLASFMFYLTDV 169
           T L     E     LQ+ NYG+GGHYD H D    +E       G   R+A+ +FY++DV
Sbjct: 399 TGLTTETAEE----LQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIATVLFYMSDV 454

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
             GGAT+FP L + + P KG+A  W+N + +   D R  H+ CPV  G+KW
Sbjct: 455 AQGGATVFPWLGVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPVLQGSKW 505


>gi|194765178|ref|XP_001964704.1| GF23330 [Drosophila ananassae]
 gi|190614976|gb|EDV30500.1| GF23330 [Drosophila ananassae]
          Length = 537

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 82/226 (36%), Positives = 123/226 (54%), Gaps = 16/226 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G ++     + NL+C     N+ F  + PLK+EE  LDP VV  HD +   +I 
Sbjct: 287 YEKVCRGEVNPTPRQERNLRCRLSQGNHPFRLLAPLKLEEHNLDPYVVTYHDMLSAQKIR 346

Query: 63  RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            + +++  ++ R  V  +  G       R+SK  +L    +  HP +  +   ++D T L
Sbjct: 347 DLRQMAVPRMRRSTVNPLPGGQNKKSAFRVSKNAWL---AYESHPTMEGMLRDLKDATGL 403

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
               +  Y   LQ+ NYG+GGHY+ H D        P +EG  R+A+ +FYL+DVE GGA
Sbjct: 404 ----DTTYCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGN-RIATAIFYLSDVEQGGA 458

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T FP L+  V P+ G+ +FWYN H +  +DYR  H+GCPV  G+KW
Sbjct: 459 TAFPFLDFAVKPQLGNVLFWYNLHRSLDMDYRTKHAGCPVLKGSKW 504


>gi|195575113|ref|XP_002105524.1| GD16980 [Drosophila simulans]
 gi|194201451|gb|EDX15027.1| GD16980 [Drosophila simulans]
          Length = 518

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 81/224 (36%), Positives = 117/224 (52%), Gaps = 15/224 (6%)

Query: 4   PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
           P  C G    P+ +K  L C Y      FL++ P+K E L +DP V+ +HD +  +E   
Sbjct: 268 PHCCSGRCERPQKLK-RLYCVYNCITAPFLRLAPIKTEILSVDPFVILLHDMVSPTEGAL 326

Query: 64  IIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
           I   SK ++   + VN  +   V   R SK  +   +    +    K+  R+ + T L +
Sbjct: 327 IRSSSKNQILPSETVNAANEFEVAKFRTSKSVWFDSDA---NEATLKLTQRLGEATGLDM 383

Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGATI 176
              E    P Q+ NYG+GG ++ H D +  DE  +      RLA+ +FYL DV  GGAT 
Sbjct: 384 KHSE----PFQVINYGIGGVFESHFDTSLADEDRFVNGYIDRLATTLFYLNDVPQGGATH 439

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FP LN+TVFP+ G+ + WYN H   LL  R  H+GCPV +G+KW
Sbjct: 440 FPGLNITVFPKFGTVLMWYNLHTEGLLHVRTMHTGCPVIVGSKW 483


>gi|195110925|ref|XP_002000030.1| GI22756 [Drosophila mojavensis]
 gi|193916624|gb|EDW15491.1| GI22756 [Drosophila mojavensis]
          Length = 533

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 81/226 (35%), Positives = 121/226 (53%), Gaps = 16/226 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G +      +  L+C Y    + +  + PLK+EE  LDP VV  HD +   +I 
Sbjct: 283 YEKVCRGEVGPSAAQQRRLRCRYARGRHAYRLLAPLKLEEHSLDPLVVSYHDMLSPQQIG 342

Query: 63  RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            +  ++   ++R  V  ++ G  +    R+SK  +L    +  HP + ++   + D T L
Sbjct: 343 ELRAMAVPHMQRSTVNPLSGGQRMKSAFRVSKNAWL---PYSTHPMMGRMLRDVGDATGL 399

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
               +  Y   LQ+ NYG+GGHY+ H D        P  EG  R+A+ +FYL+DVE GGA
Sbjct: 400 ----DMTYCEQLQVANYGVGGHYEPHWDFFRDSRHYPAAEGN-RIATAIFYLSDVEQGGA 454

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T FP LN  V P+ G+ +FWYN H ++  DYR  H+GCPV  G+KW
Sbjct: 455 TAFPFLNFAVRPQLGNILFWYNLHRSSDEDYRTKHAGCPVLKGSKW 500


>gi|116008432|ref|NP_651804.2| CG15539, isoform A [Drosophila melanogaster]
 gi|66772391|gb|AAY55507.1| IP10910p [Drosophila melanogaster]
 gi|66772535|gb|AAY55579.1| IP10810p [Drosophila melanogaster]
 gi|113194858|gb|AAF57060.2| CG15539, isoform A [Drosophila melanogaster]
          Length = 386

 Score =  137 bits (346), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 80/206 (38%), Positives = 115/206 (55%), Gaps = 12/206 (5%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           + L C Y++ ++ FL++ PLK+E L LDP +V  HD + D +I  I  L+KGK+ R   V
Sbjct: 168 AKLYCLYKTTSSYFLRLAPLKMELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLARTVTV 227

Query: 79  NYGDTIYVD-TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
           +       D  R +K  +L      ++  + ++    QDMTN  I   +    P Q+ NY
Sbjct: 228 SKDGNYTEDPDRTTKGTWLVE----NNALIQRLSQLTQDMTNFDIHDAD----PFQVLNY 279

Query: 138 GLGGHYDLHCD---ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFW 194
           G+GG Y +H D       D    R+A+ +FYL+DV  GGATIFP L L+VFP+KGSA+ W
Sbjct: 280 GIGGFYGIHFDFLEDAELDNFSDRIATAVFYLSDVPQGGATIFPKLGLSVFPKKGSALLW 339

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
           YN       D R  HS CP  +G++W
Sbjct: 340 YNLDHKGDGDNRTAHSACPTVVGSRW 365


>gi|195505209|ref|XP_002099405.1| GE10885 [Drosophila yakuba]
 gi|194185506|gb|EDW99117.1| GE10885 [Drosophila yakuba]
          Length = 473

 Score =  137 bits (345), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 82/206 (39%), Positives = 114/206 (55%), Gaps = 12/206 (5%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           + L C Y +  + FL++ PLK+E L LDP +V  HD + D +I  I  L+KG + R   V
Sbjct: 255 AKLHCLYNTTASYFLRLAPLKMELLSLDPYMVLFHDVVSDKDITSIRNLAKGGLVRAVTV 314

Query: 79  NYGDTIYVD-TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
               +   D  R +K  +L      +   + ++    QDMTNL I    R   P Q+ NY
Sbjct: 315 TKDGSYEEDPARTTKGTWL----VENSKLIQRLSQLAQDMTNLDI----RDADPFQVLNY 366

Query: 138 GLGGHYDLHCDATPRDE-GLW--RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFW 194
           G+GG+Y  H D     E G +  R+A+ +FYL+DV  GGATIFP L L+VFP+KGSA+ W
Sbjct: 367 GIGGYYGTHFDFLADTEMGNFSNRIATAVFYLSDVPQGGATIFPKLGLSVFPKKGSALLW 426

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
           YN       D R  HS CP  +G++W
Sbjct: 427 YNLDHKGDGDNRTAHSACPTIVGSRW 452


>gi|66770649|gb|AAY54636.1| IP12415p [Drosophila melanogaster]
 gi|66772017|gb|AAY55320.1| IP12615p [Drosophila melanogaster]
          Length = 512

 Score =  137 bits (345), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 83/210 (39%), Positives = 117/210 (55%), Gaps = 17/210 (8%)

Query: 18  KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
           K+NL C Y+S  NTFL++ PLK+EE+ LDP +   H+ +YDSEI  +   S   V  G  
Sbjct: 294 KTNLVCRYKSTANTFLRLAPLKLEEISLDPFMAMYHEVLYDSEIRELKGQSMNMV-NGYA 352

Query: 78  VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
                T   DT +   ++    +  +     +I  RI DMT     ++E+    LQI NY
Sbjct: 353 SQRNGTEIRDTVVRYDWWSNTSLVRE-----RINQRIIDMTGFNFLKDEK----LQIANY 403

Query: 138 GLGGHYDLHCD------ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKGS 190
           GLG ++  H D       TP    L  RLAS +FY ++V  GGAT+FP +N+TVFP+KGS
Sbjct: 404 GLGTYFQPHFDYSSDGFETPNITTLGDRLASILFYASEVPQGGATVFPEINVTVFPQKGS 463

Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            ++W+N H +   D R  HS CPV  G++W
Sbjct: 464 MLYWFNLHDDGKPDIRSLHSVCPVLNGDRW 493


>gi|195499025|ref|XP_002096772.1| GE25857 [Drosophila yakuba]
 gi|194182873|gb|EDW96484.1| GE25857 [Drosophila yakuba]
          Length = 490

 Score =  137 bits (345), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 88/228 (38%), Positives = 116/228 (50%), Gaps = 36/228 (15%)

Query: 5   LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI--- 61
           LA   N +V     S L C Y S    F +I PLK+EEL LDP +V  HD IYD EI   
Sbjct: 258 LATVQNCTVVVQKPSRLHCRYNSTTTPFTRIAPLKMEELSLDPYMVVFHDVIYDREIELM 317

Query: 62  ----NRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
               N I+ L+    E  +V    D+  V+++                    +  R+ DM
Sbjct: 318 LNSSNFILSLTDSGQE-SEVRASKDSYIVESK-------------------TLNDRVTDM 357

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLWRLASFMFYLTDVELG 172
           T L +        P  + NYG+GGHY LH D      T R +   R+A+ +FYL +V+ G
Sbjct: 358 TGLSM----ELSDPFSLINYGIGGHYMLHYDYHKYTNTTRAKYGDRIATLLFYLGEVDSG 413

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GATIFP +N+TV P+KGSAVFWYN H +  L     HS CPV  G+K+
Sbjct: 414 GATIFPRINITVTPKKGSAVFWYNLHNSGALHLETLHSACPVISGSKY 461


>gi|194871348|ref|XP_001972831.1| GG13664 [Drosophila erecta]
 gi|190654614|gb|EDV51857.1| GG13664 [Drosophila erecta]
          Length = 520

 Score =  137 bits (345), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 83/210 (39%), Positives = 116/210 (55%), Gaps = 17/210 (8%)

Query: 18  KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
           ++NL C Y+S  NTFL++ PLK EE+ LDP +   H+ +YDSEI+ +    KGK   G +
Sbjct: 302 RTNLVCRYKSTANTFLRLAPLKFEEISLDPFIAVYHEVLYDSEIHAL----KGK--SGNM 355

Query: 78  VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
           VN        T +      Y           +I  RI DMT     ++E+    LQI NY
Sbjct: 356 VNGYARQRNGTEIRDTVARYDWWSDTSLTRERINQRIIDMTGFNFTKDEK----LQIANY 411

Query: 138 GLGGHYDLHCD------ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKGS 190
           G+G +++ H D       TP    L  RLAS +FY  +V  GGAT+FP +N+TVFP+KGS
Sbjct: 412 GVGTYFEPHFDYSSDGFETPEVTTLGDRLASIIFYAGEVLQGGATVFPEINVTVFPQKGS 471

Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            ++W+N H +   D R  HS CPV  G++W
Sbjct: 472 MLYWFNLHDDGRPDIRSQHSACPVVNGDRW 501


>gi|66771935|gb|AAY55279.1| IP12715p [Drosophila melanogaster]
          Length = 451

 Score =  137 bits (345), Expect = 3e-30,   Method: Compositional matrix adjust.
 Identities = 83/210 (39%), Positives = 117/210 (55%), Gaps = 17/210 (8%)

Query: 18  KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
           K+NL C Y+S  NTFL++ PLK+EE+ LDP +   H+ +YDSEI  +   S   V  G  
Sbjct: 233 KTNLVCRYKSTANTFLRLAPLKLEEISLDPFMAMYHEVLYDSEIRELKGQSMNMV-NGYA 291

Query: 78  VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
                T   DT +   ++    +  +     +I  RI DMT     ++E+    LQI NY
Sbjct: 292 SQRNGTEIRDTVVRYDWWSNTSLVRE-----RINQRIIDMTGFNFLKDEK----LQIANY 342

Query: 138 GLGGHYDLHCD------ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKGS 190
           GLG ++  H D       TP    L  RLAS +FY ++V  GGAT+FP +N+TVFP+KGS
Sbjct: 343 GLGTYFQPHFDYSSDGFETPNITTLGDRLASILFYASEVPQGGATVFPEINVTVFPQKGS 402

Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            ++W+N H +   D R  HS CPV  G++W
Sbjct: 403 MLYWFNLHDDGKPDIRSLHSVCPVLNGDRW 432


>gi|221512818|ref|NP_730346.2| CG32201 [Drosophila melanogaster]
 gi|220902638|gb|AAN11679.2| CG32201 [Drosophila melanogaster]
          Length = 520

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 83/210 (39%), Positives = 117/210 (55%), Gaps = 17/210 (8%)

Query: 18  KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
           K+NL C Y+S  NTFL++ PLK+EE+ LDP +   H+ +YDSEI  +   S   V  G  
Sbjct: 302 KTNLVCRYKSTANTFLRLAPLKLEEISLDPFMAMYHEVLYDSEIRELKGQSMNMV-NGYA 360

Query: 78  VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
                T   DT +   ++    +  +     +I  RI DMT     ++E+    LQI NY
Sbjct: 361 SQRNGTEIRDTVVRYDWWSNTSLVRE-----RINQRIIDMTGFNFLKDEK----LQIANY 411

Query: 138 GLGGHYDLHCD------ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKGS 190
           GLG ++  H D       TP    L  RLAS +FY ++V  GGAT+FP +N+TVFP+KGS
Sbjct: 412 GLGTYFQPHFDYSSDGFETPNITTLGDRLASILFYASEVPQGGATVFPEINVTVFPQKGS 471

Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            ++W+N H +   D R  HS CPV  G++W
Sbjct: 472 MLYWFNLHDDGKPDIRSLHSVCPVLNGDRW 501


>gi|115313004|gb|AAI24075.1| Zgc:152670 [Danio rerio]
          Length = 235

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 76/201 (37%), Positives = 116/201 (57%), Gaps = 15/201 (7%)

Query: 21  LKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
           L C Y +   N  L   P+K EEL+ +P++++ HD I D+EI  + ++++ ++ R +   
Sbjct: 26  LSCRYSTGGGNPRLMYAPVKEEELWDEPKIIRYHDVISDTEIETLKDIARPELTRSQT-- 83

Query: 80  YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
            G  +  + R S+  FL      +   + +I  RI D+T L +   E+    L + NYG+
Sbjct: 84  -GWGVISEIRTSQSVFL-----DEVGTVARISQRIADITGLSVESAEK----LHVQNYGI 133

Query: 140 GGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHA 199
           GG Y  H DA        R A+F+ Y++DVE+GGAT+F ++ + V PEKGSAVFW N H 
Sbjct: 134 GGRYTPHFDAGGDVNE--RTATFLIYMSDVEVGGATVFTNVGVAVKPEKGSAVFWNNLHK 191

Query: 200 NTLLDYRMYHSGCPVALGNKW 220
           N  LD +  H+GCPV +GNKW
Sbjct: 192 NGELDLKTKHAGCPVLVGNKW 212


>gi|157818741|ref|NP_001101745.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Rattus norvegicus]
 gi|149052604|gb|EDM04421.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide (predicted),
           isoform CRA_a [Rattus norvegicus]
          Length = 535

 Score =  137 bits (345), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGIKMTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D +  DE           R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRSDERDAFKRLGTGNRVATFLNYMSD 460

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|324507368|gb|ADY43128.1| Prolyl 4-hydroxylase subunit alpha-2 [Ascaris suum]
          Length = 534

 Score =  137 bits (344), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 89/230 (38%), Positives = 118/230 (51%), Gaps = 18/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           ++Y   C+G   V    +S + C Y   +  FLK+ P+KVE L   P VV     I D E
Sbjct: 280 DVYEALCRGEQKVNVTAQSEVYC-YLKMDRPFLKLAPIKVEILRFSPLVVLFKQVISDYE 338

Query: 61  INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  I +L+  K++R  V N   GD  Y + R+SK  +L      DHP + +I  RI  MT
Sbjct: 339 IEVIEKLAIPKLKRATVQNARTGDLEYANYRISKSAWLKG---TDHPAIDRINKRIDLMT 395

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW-------RLASFMFYLTDVE 170
           NL     +     LQ  NYG+GGHYD H D A   D   +       R+A+ + Y++DVE
Sbjct: 396 NL----NQETAEELQAQNYGIGGHYDPHFDFARKEDINAFKTLNTGNRIATILIYMSDVE 451

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+F  L   VFP K  A+FWYN   +   D R  H+ CPV  G KW
Sbjct: 452 SGGATVFNHLGNAVFPSKYDALFWYNLRRDGEGDLRTRHAACPVLTGIKW 501


>gi|116008128|ref|NP_001036776.1| CG15539, isoform B [Drosophila melanogaster]
 gi|113194857|gb|ABI31220.1| CG15539, isoform B [Drosophila melanogaster]
          Length = 509

 Score =  137 bits (344), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 80/206 (38%), Positives = 115/206 (55%), Gaps = 12/206 (5%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           + L C Y++ ++ FL++ PLK+E L LDP +V  HD + D +I  I  L+KGK+ R   V
Sbjct: 291 AKLYCLYKTTSSYFLRLAPLKMELLSLDPYMVLFHDVVSDKDIVSIRNLTKGKLARTVTV 350

Query: 79  NYGDTIYVD-TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
           +       D  R +K  +L      ++  + ++    QDMTN  I   +    P Q+ NY
Sbjct: 351 SKDGNYTEDPDRTTKGTWL----VENNALIQRLSQLTQDMTNFDIHDAD----PFQVLNY 402

Query: 138 GLGGHYDLHCD---ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFW 194
           G+GG Y +H D       D    R+A+ +FYL+DV  GGATIFP L L+VFP+KGSA+ W
Sbjct: 403 GIGGFYGIHFDFLEDAELDNFSDRIATAVFYLSDVPQGGATIFPKLGLSVFPKKGSALLW 462

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
           YN       D R  HS CP  +G++W
Sbjct: 463 YNLDHKGDGDNRTAHSACPTVVGSRW 488


>gi|226874876|ref|NP_035161.2| prolyl 4-hydroxylase subunit alpha-2 isoform 2 precursor [Mus
           musculus]
 gi|148701601|gb|EDL33548.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_f [Mus
           musculus]
          Length = 537

 Score =  137 bits (344), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 290 DVYESLCRGEGVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 349

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 350 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 406

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D +  DE           R+A+F+ Y++D
Sbjct: 407 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRVATFLNYMSD 462

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 463 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 514


>gi|149052606|gb|EDM04423.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide (predicted),
           isoform CRA_c [Rattus norvegicus]
          Length = 506

 Score =  137 bits (344), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 259 DVYESLCRGEGIKMTPRRQKRLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 318

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 319 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 375

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D +  DE           R+A+F+ Y++D
Sbjct: 376 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRSDERDAFKRLGTGNRVATFLNYMSD 431

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 432 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 483


>gi|387016440|gb|AFJ50339.1| Prolyl 4-hydroxylase subunit alpha-1-like [Crotalus adamanteus]
          Length = 543

 Score =  137 bits (344), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 126/230 (54%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y   C+G  L +    +  L C +Y    N    +GP++ E+ +  PR+V+  D I + E
Sbjct: 298 YEKLCRGEGLKMTPRREKKLFCRYYNGNGNPNYILGPVRQEDEWDRPRIVRFLDIISNEE 357

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I ++ ELSK ++ R  + N   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 358 IEKVKELSKPRLRRATISNPITGVLETAHYRISKSAWLSGY---ENPVVARINQRIQDLT 414

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 415 GLDVSTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVA 470

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V+P+KG+AVFWYN   +   DY   H+ CPV +GNKW
Sbjct: 471 AGGATVFPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 520


>gi|195145314|ref|XP_002013641.1| GL24244 [Drosophila persimilis]
 gi|194102584|gb|EDW24627.1| GL24244 [Drosophila persimilis]
          Length = 496

 Score =  137 bits (344), Expect = 4e-30,   Method: Compositional matrix adjust.
 Identities = 76/229 (33%), Positives = 125/229 (54%), Gaps = 28/229 (12%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           ++   CQG   +P  ++S+L+C Y +  + FL++ PL++E L  DP V   H+ +  +E 
Sbjct: 266 VHQRNCQGRSRLP--VQSSLRCHYSAEGSAFLRLAPLRMELLSRDPLVALYHEVVSAAEQ 323

Query: 62  NRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDH-PFLYKIQTRIQDMTNL 120
             ++ LS+ +++R +   Y D I          F    +  +  P + ++  R++D+T L
Sbjct: 324 RHLMLLSESQLQRQRGHQY-DKIRT--------FASASVAANATPTVEQLHRRLEDITGL 374

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDAT---------PRDEGLWRLASFMFYLTDVEL 171
            +   E    PL+I NYG+GG Y +H D           P++   +RLA+ + YL+DV L
Sbjct: 375 DLAESE----PLRILNYGIGGQYYIHVDCEQPQTHVEPYPKE---YRLATVLLYLSDVRL 427

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GG T FP+L L + P +GSA+ W+NA+     DYR  H+ CPV LG +W
Sbjct: 428 GGFTSFPALGLGIRPNRGSALVWHNANNAGNCDYRALHAACPVLLGTRW 476


>gi|148701597|gb|EDL33544.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_b [Mus
           musculus]
          Length = 506

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 259 DVYESLCRGEGVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 318

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 319 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 375

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D +  DE           R+A+F+ Y++D
Sbjct: 376 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRVATFLNYMSD 431

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 432 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 483


>gi|114601566|ref|XP_001162222.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Pan
           troglodytes]
 gi|114601568|ref|XP_001162843.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 17 [Pan
           troglodytes]
 gi|397518358|ref|XP_003829358.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pan
           paniscus]
 gi|397518362|ref|XP_003829360.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Pan
           paniscus]
 gi|410215944|gb|JAA05191.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410255608|gb|JAA15771.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
 gi|410331279|gb|JAA34586.1| prolyl 4-hydroxylase, alpha polypeptide II [Pan troglodytes]
          Length = 535

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 83/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           +IY   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 288 DIYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D +  DE           R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSD 460

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|195055773|ref|XP_001994787.1| GH17427 [Drosophila grimshawi]
 gi|193892550|gb|EDV91416.1| GH17427 [Drosophila grimshawi]
          Length = 538

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 81/226 (35%), Positives = 122/226 (53%), Gaps = 16/226 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G +S     +  L+C Y    + +  + PLK+EE  LDP VV  HD +   +I 
Sbjct: 288 YEKVCRGEVSASAAQQRPLRCRYARGQHAYRVLAPLKLEEHSLDPLVVSYHDMLSPQQII 347

Query: 63  RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            + +++   ++R  V  +    +     R+SK  +L    +  HP + ++   + D T L
Sbjct: 348 ELRQMAVPHMKRSTVNPLPGRQSKKSAFRVSKNAWLE---YDTHPMMGRMLRDLSDATGL 404

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
            +     Y   LQ+ NYG+GGHY+ H D        P +EG  R+A+ +FYL+DVE GGA
Sbjct: 405 DMT----YCEQLQVANYGVGGHYEPHWDFFVDSQHYPAEEGN-RIATAIFYLSDVEQGGA 459

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T FP LN  V P+ G+ +FWYN H +  +DYR  H+GCPV  G+KW
Sbjct: 460 TAFPFLNFAVRPQLGNILFWYNLHRSLDMDYRTKHAGCPVLKGSKW 505


>gi|195391758|ref|XP_002054527.1| GJ22759 [Drosophila virilis]
 gi|194152613|gb|EDW68047.1| GJ22759 [Drosophila virilis]
          Length = 539

 Score =  137 bits (344), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 81/229 (35%), Positives = 123/229 (53%), Gaps = 18/229 (7%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           +Y   C+  L         L+C   + N       P K+EEL+LDP ++++HD I   + 
Sbjct: 286 LYQQVCREELRPAPAALRELRCRLFAGNGRKSTYAPYKLEELHLDPYIIQVHDVISARDT 345

Query: 62  NRIIELSKGKVERGKVVNYG--DTIYVDTRLSK-VYFLYPEIFGDHPFLYKIQTRIQDMT 118
             +  L++ +++R +V +    + I  + R S+   F Y     DHP + K+   + +++
Sbjct: 346 AELQHLARPELQRSQVYSRTGHEHISANFRTSQGTTFEYT----DHPIMQKMSHHVAEIS 401

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPR--DEGLW-----RLASFMFYLTDVEL 171
               G + R   PLQI NYG+GGHY+ H D+ P   D  L      RLA+ ++YL++VE 
Sbjct: 402 ----GLDMRSAEPLQIANYGIGGHYEPHMDSFPDSYDYSLNMYKTNRLATGIYYLSNVEA 457

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GG T FP L L V PE+GS +FWYN H +   DYR  H+ CPV  G+KW
Sbjct: 458 GGGTAFPFLPLLVTPERGSLLFWYNLHPSGDADYRTKHAACPVLQGSKW 506


>gi|195391760|ref|XP_002054528.1| GJ22757 [Drosophila virilis]
 gi|194152614|gb|EDW68048.1| GJ22757 [Drosophila virilis]
          Length = 534

 Score =  136 bits (343), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 81/226 (35%), Positives = 121/226 (53%), Gaps = 16/226 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G +      +  L+C Y    + +  + PLK+EE  LDP VV  HD +    I 
Sbjct: 284 YEKVCRGEVGASAAQQRPLRCRYTRGEHAYRLLAPLKLEEHSLDPLVVTFHDMLSQHRIA 343

Query: 63  RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            + E++   ++R  V  +  G       R+SK  +L    +  HP + ++   + D T L
Sbjct: 344 ELREMAVPHMQRSTVNPLPGGQRRKSAFRVSKNAWL---PYSTHPTMGRMLRDVSDATGL 400

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
            +   E+    LQ+ NYG+GGHY+ H D        P  EG  R+A+ +FYL+DVE GGA
Sbjct: 401 DMTFCEQ----LQVANYGVGGHYEPHWDFFRDSRHYPAAEGN-RIATAIFYLSDVEQGGA 455

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T FP LN  V P+ G+ +FWYN H ++ +D+R  H+GCPV  G+KW
Sbjct: 456 TAFPFLNFAVRPQLGNILFWYNLHRSSDMDFRTKHAGCPVLKGSKW 501


>gi|195109817|ref|XP_001999478.1| GI23043 [Drosophila mojavensis]
 gi|193916072|gb|EDW14939.1| GI23043 [Drosophila mojavensis]
          Length = 491

 Score =  136 bits (343), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 75/222 (33%), Positives = 120/222 (54%), Gaps = 20/222 (9%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           C+G   +P  +  +L+C Y +  + FL++ PLK+E+L +DP V   H+AI+D+E+  IIE
Sbjct: 259 CRGQRQLP--VSDSLRCRYSAEGSPFLRLAPLKLEQLSIDPYVALCHNAIHDNELEYIIE 316

Query: 67  LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
            S+  ++R  +V+ G        +   + L     G       ++ R++DM+   +    
Sbjct: 317 QSRPYLKRA-LVDQGVVHEKRVTMDAAFDLNASTHG-----RTLRQRLEDMSGFDLSN-- 368

Query: 127 RYKGPLQINNYGLGGHYDLHCDA-TPRDEGLW-------RLASFMFYLTDVELGGATIFP 178
              G L + NYG+GGHY +H D     D   +       R+A+ + YL +V++GG T FP
Sbjct: 369 --SGQLAVLNYGIGGHYSMHFDCWFSSDSAAYEAYIRSNRIATILLYLNEVQMGGITSFP 426

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +L L V P KGSA+ W+N +     DYR  H+ CP  LGN+W
Sbjct: 427 ALGLGVQPIKGSALIWHNMNHEIECDYRTLHAACPTLLGNRW 468


>gi|195069801|ref|XP_001997031.1| GH12975 [Drosophila grimshawi]
 gi|193891500|gb|EDV90366.1| GH12975 [Drosophila grimshawi]
          Length = 242

 Score =  136 bits (343), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 72/213 (33%), Positives = 123/213 (57%), Gaps = 17/213 (7%)

Query: 18  KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
           +  L+C     N       P ++EEL+LDP V+++HD I   E   + +L++ +++R  V
Sbjct: 4   QRKLRCRLHRGNGLRSSYQPYRLEELHLDPYVIQVHDIISAEETIVLQQLARPELQRSMV 63

Query: 78  VNYGDTIYVDT--RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQIN 135
            +  ++ ++ T  R+S+  F     + +HP + ++   +++++ L +   E+    LQ+ 
Sbjct: 64  YSLSNSEHISTNFRISQGTFFE---YHEHPIMQRMSQHLENISGLDMRSAEQ----LQVA 116

Query: 136 NYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVELGGATIFPSLNLTVFPE 187
           NYG+GGHY+ H D+   +            R+A+ ++YL++VE GG T FP L L V PE
Sbjct: 117 NYGIGGHYEPHMDSFSENHNYGINTYMSTNRVATGIYYLSNVEAGGGTAFPFLPLLVEPE 176

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +GS +FWYN H +  LDYR  H+GCPV +G+KW
Sbjct: 177 RGSLLFWYNLHRSGDLDYRTKHAGCPVLMGSKW 209


>gi|195110923|ref|XP_002000029.1| GI22757 [Drosophila mojavensis]
 gi|193916623|gb|EDW15490.1| GI22757 [Drosophila mojavensis]
          Length = 535

 Score =  136 bits (343), Expect = 5e-30,   Method: Compositional matrix adjust.
 Identities = 86/228 (37%), Positives = 123/228 (53%), Gaps = 17/228 (7%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           IY   C+  L      +  L+C Y S +   L     K+EEL+ DP ++++H+ I   E 
Sbjct: 285 IYQQVCREELMPTAAAQRELRCRYFSGHGRSLNYLAYKLEELHRDPYIIQLHEVIGAHES 344

Query: 62  NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
            ++  L++  ++R +V +   G T           F Y E    HP + K+    Q MT 
Sbjct: 345 VQLQHLARPVLQRSEVYSPTNGSTAATFRTSQGTVFEYDE----HPIIEKLS---QHMT- 396

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPR--DEGLWR-----LASFMFYLTDVELG 172
           L+ G +  +  PLQI NYG+GGHY+ H D+ P   D  L R     +A+ +FYL++VE G
Sbjct: 397 LISGLDMGFAEPLQIANYGIGGHYEPHMDSFPESFDYSLQRFKTNRIATGIFYLSNVEAG 456

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GAT FP L L V PE+GS +FWYN H +   DYR  H+GCPV  G+KW
Sbjct: 457 GATAFPFLPLLVKPEQGSLLFWYNLHRSGDADYRTKHAGCPVLQGSKW 504


>gi|195452744|ref|XP_002073481.1| GK14140 [Drosophila willistoni]
 gi|194169566|gb|EDW84467.1| GK14140 [Drosophila willistoni]
          Length = 454

 Score =  136 bits (343), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 80/227 (35%), Positives = 123/227 (54%), Gaps = 16/227 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           ++ P  C GN  V  + +  L C Y + +  FL+I P+K+E L L+P +V  HD I  SE
Sbjct: 217 DVLPYCCSGNCEVDREFQ--LFCLYNTKDAYFLRIAPVKMEILSLNPYIVLCHDVILPSE 274

Query: 61  INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
              +   S  ++E  + ++    + ++   R SK  +L            ++   I+D++
Sbjct: 275 QEFLKTQSSKRLEGARALDQVKNEVVFNFIRTSKATWLKK---NSDNVTRRLSHWIEDVS 331

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGG 173
           NL     + Y    QI NYG+GG ++ H D   +DE  W     R+A+F+FYL DV  GG
Sbjct: 332 NLDSNIGDLY----QIINYGVGGLFEAHSDTMRKDEDRWKVLYDRIATFIFYLQDVPQGG 387

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           AT+F +LNLTVFP+ G+A+FW+N       D    H+GCPV +G+KW
Sbjct: 388 ATLFNNLNLTVFPKAGAALFWFNLDNAGDTDLFTVHTGCPVIVGSKW 434


>gi|355691582|gb|EHH26767.1| hypothetical protein EGK_16829 [Macaca mulatta]
 gi|355750162|gb|EHH54500.1| hypothetical protein EGM_15360 [Macaca fascicularis]
 gi|384939464|gb|AFI33337.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Macaca
           mulatta]
          Length = 535

 Score =  136 bits (343), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D +  DE           R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRNDERHTFKHLGTGNRVATFLNYMSD 460

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|195352182|ref|XP_002042593.1| GM14980 [Drosophila sechellia]
 gi|194124477|gb|EDW46520.1| GM14980 [Drosophila sechellia]
          Length = 520

 Score =  136 bits (343), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 84/225 (37%), Positives = 123/225 (54%), Gaps = 21/225 (9%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           + L C+G        K+NL C ++S  NTFL++ PLK+EE+ LDP +   H+ +YDSEI+
Sbjct: 291 FELGCRGLYRQ----KTNLVCRFKSTANTFLRLAPLKLEEISLDPFIAMYHEVLYDSEIH 346

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
            +   S   V  G       T   DT +   ++    +  +     +I  RI DMT    
Sbjct: 347 ELKGQSMNMV-NGYASERNGTEIRDTVVRYDWWSNISLVRE-----RINQRIIDMTEFNF 400

Query: 123 GREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLW-RLASFMFYLTDVELGGAT 175
            ++E+    LQI NYG+G ++  H D       TP    L  RLAS +FY ++V  GGAT
Sbjct: 401 SKDEK----LQIANYGVGTYFQPHFDYSSDGFETPNITTLGDRLASILFYASEVPQGGAT 456

Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +FP +N+TVFP+KGS ++W+N H +   D R  HS CPV  G++W
Sbjct: 457 VFPEINVTVFPQKGSMLYWFNLHDDGRPDIRSKHSVCPVINGDRW 501


>gi|348518914|ref|XP_003446976.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Oreochromis
           niloticus]
          Length = 536

 Score =  136 bits (343), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 80/232 (34%), Positives = 128/232 (55%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
           E Y   C+G  + + E  +S L C +++   N  L + P+K E+ +  P +V+  D + D
Sbjct: 289 ESYEALCRGEGIQMTEARRSRLFCRYHDGKRNPHLLLKPVKEEDEWDSPHIVRYLDLLSD 348

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI +I EL+K ++ R  V +   G     + R+SK  +L  E   + P + ++  RI+ 
Sbjct: 349 EEIEKIKELAKPRLARATVRDPKTGVLTTANYRVSKSAWLEGE---EDPVIDRVNQRIEA 405

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D + +DE           R+A+F+ Y++D
Sbjct: 406 ITGLTVETAEL----LQVANYGVGGQYEPHFDFSRKDEPDAFKRLGTGNRVATFLNYMSD 461

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP     ++P KG++VFWYN   +   DYR  H+ CPV +G+KW
Sbjct: 462 VEAGGATVFPDFGAAIWPRKGTSVFWYNLFRSGEGDYRTRHAACPVLVGSKW 513


>gi|198417610|ref|XP_002125349.1| PREDICTED: similar to Prolyl 4-hydroxylase subunit alpha-1
           precursor (4-PH alpha-1)
           (Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1) [Ciona intestinalis]
          Length = 527

 Score =  136 bits (343), Expect = 6e-30,   Method: Compositional matrix adjust.
 Identities = 85/233 (36%), Positives = 123/233 (52%), Gaps = 19/233 (8%)

Query: 1   EIYPLACQGNLSVPEDIK-SNLKCFYES-YNNTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
           E +   C+G  ++ +  +   L+C+  +   N  L I P+KVEEL   P +V+ HD + D
Sbjct: 271 ETFFKLCRGEQTLTKKKQHKKLRCYLSTNMGNPKLLIRPVKVEELSKSPDIVQFHDVLSD 330

Query: 59  SEINRIIELSKGKVERGKVVNYGDTIYVDT--RLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
           + IN I +L+K ++ R       DT       R++K+ +L  +   D P + KI  RI D
Sbjct: 331 TVINEIKKLAKPQLFRAIHAGSDDTDLQKAPYRITKLAWLLDD---DGPEVAKITERISD 387

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL--------WRLASFMFYLTD 168
           +T L +   E     +Q+ NYG+GG Y  H D    DE           R+A+F+ YL+D
Sbjct: 388 ITGLTLNTSEE----IQVANYGVGGEYPPHFDIPTTDEERDDLKSQDGERIATFLIYLSD 443

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
           VE+GG T F +  ++  P KGSAVFWYN   +   D R YH  CPVA GNKW 
Sbjct: 444 VEVGGRTAFVNAGVSAKPIKGSAVFWYNVFPSGEPDLRTYHGACPVAFGNKWA 496


>gi|403255941|ref|XP_003920663.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Saimiri
           boliviensis boliviensis]
 gi|403255945|ref|XP_003920665.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Saimiri
           boliviensis boliviensis]
          Length = 535

 Score =  136 bits (342), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D +  DE           R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRNDERDAFKHLGTGNRVATFLNYMSD 460

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|119582752|gb|EAW62348.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_f
           [Homo sapiens]
          Length = 567

 Score =  136 bits (342), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 320 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 379

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 380 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 436

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D +  DE           R+A+F+ Y++D
Sbjct: 437 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSD 492

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 493 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 544


>gi|332221660|ref|XP_003259981.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Nomascus
           leucogenys]
          Length = 537

 Score =  136 bits (342), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 290 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 349

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 350 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 406

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D +  DE           R+A+F+ Y++D
Sbjct: 407 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSD 462

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 463 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 514


>gi|4758868|ref|NP_004190.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
           sapiens]
 gi|217272863|ref|NP_001136071.1| prolyl 4-hydroxylase subunit alpha-2 isoform 1 precursor [Homo
           sapiens]
 gi|20455169|sp|O15460.1|P4HA2_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|2439985|gb|AAB71339.1| prolyl 4-hydroxylase alpha (II) subunit [Homo sapiens]
 gi|18073926|emb|CAC85689.1| Prolyl 4-hydroxylase alpha IIb subunit [Homo sapiens]
 gi|119582746|gb|EAW62342.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_b
           [Homo sapiens]
 gi|119582747|gb|EAW62343.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide II, isoform CRA_b
           [Homo sapiens]
          Length = 535

 Score =  136 bits (342), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D +  DE           R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSD 460

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|297675929|ref|XP_002815906.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 3 [Pongo
           abelii]
          Length = 535

 Score =  136 bits (342), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D +  DE           R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSD 460

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|332221664|ref|XP_003259983.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 5 [Nomascus
           leucogenys]
          Length = 558

 Score =  136 bits (342), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 311 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 370

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 371 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 427

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D +  DE           R+A+F+ Y++D
Sbjct: 428 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSD 483

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 484 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 535


>gi|196011900|ref|XP_002115813.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
 gi|190581589|gb|EDV21665.1| hypothetical protein TRIADDRAFT_59899 [Trichoplax adhaerens]
          Length = 581

 Score =  136 bits (342), Expect = 7e-30,   Method: Compositional matrix adjust.
 Identities = 89/236 (37%), Positives = 125/236 (52%), Gaps = 25/236 (10%)

Query: 3   YPLACQGNLS--VPEDIKSN--LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
           Y   C+GN++    +D+K N  L C Y+ Y N  L   PL VE L L P +V  H+ + +
Sbjct: 305 YKELCRGNVNQKTGDDVKLNNQLNC-YQDYRNPRLLFSPLNVEVLSLQPYIVIYHNLLTN 363

Query: 59  SEINRIIELSKGKVERGKVVNYGDTIYVDT---RLSKVYFLYPEIFGDHPFLYKIQTRIQ 115
           SE+  +  L+   ++R  VV   D  Y +    R+SK  +L  E   DHP + +I T I 
Sbjct: 364 SEVVLLKTLASPLLKRAVVVGKPDKEYGEETTYRISKTAWLDKE---DHPAVKRITTLIG 420

Query: 116 DMTNLVIGREERYKGPLQINNYGLGGHYDLHCD--ATPRDEGLW--------RLASFMFY 165
           D    +IG       PLQI NYG+GGHY+ H D   +   E L         R+A+ + Y
Sbjct: 421 D----IIGLTSETAEPLQIANYGIGGHYEPHLDFIESEDKEALSEYTSRIGNRIATVLIY 476

Query: 166 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
           L++VE GGAT+FP   + V P +GSA FWYN H N   +    H+ CPV +G+KW 
Sbjct: 477 LSNVEAGGATVFPKAGVRVEPRQGSAAFWYNMHRNGEGNKLSVHAACPVLIGSKWA 532


>gi|47550697|ref|NP_999856.1| prolyl 4-hydroxylase, alpha polypeptide I b precursor [Danio rerio]
 gi|28277826|gb|AAH45890.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide I [Danio rerio]
          Length = 536

 Score =  136 bits (342), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 126/230 (54%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y   C+G  + +    +S L C Y + N N  L + P+K E+ +  PR+V+ H+ I DSE
Sbjct: 291 YERLCRGEGIKLTPRRQSRLFCRYSNNNRNPRLLLAPVKQEDEWDRPRIVRYHEIISDSE 350

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + E++K ++ R  + N   G       R+SK  +L      +H  + +I  RI+D+T
Sbjct: 351 IETVKEMAKPRLRRATISNPITGVLETAPYRISKSAWLSG---YEHSTIERINQRIEDVT 407

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 408 GLEMDTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 463

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+F  +   V+P+KG+AVFWYN   +   DY   H+ CPV +GNKW
Sbjct: 464 AGGATVFTDVGAAVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 513


>gi|395736141|ref|XP_003776706.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Pongo abelii]
          Length = 577

 Score =  136 bits (342), Expect = 8e-30,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 330 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 389

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 390 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 446

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D +  DE           R+A+F+ Y++D
Sbjct: 447 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRNDERDTFKHLGTGNRVATFLNYMSD 502

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 503 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 554


>gi|442757047|gb|JAA70682.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
          Length = 532

 Score =  135 bits (341), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 81/229 (35%), Positives = 123/229 (53%), Gaps = 14/229 (6%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y   C+G       + S L+C Y +    F K+ P+K+EE  L P VV + D + D +
Sbjct: 280 ENYKRLCRGEQLRTPKMDSQLRCRYYTGETGFFKLQPIKLEEFNLKPYVVVLRDLLQDRD 339

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           +N +I  +K ++E+ K +   D     +R S   +L  E   D P   ++   +Q +  L
Sbjct: 340 LNDMIAFAKPRLEQSKTLCAADKDGPPSRTSSNTWLNDE---DAPVAARVNQYLQSLLGL 396

Query: 121 --VIGREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLW--RLASFMFYLTDVEL 171
             +  R+E  K   Q+ NYG+GGHY  H D      TP     +  R+A+ M Y++DVE 
Sbjct: 397 GTLFSRDEAEK--YQLANYGIGGHYVPHHDYFEEFQTPSKGNRFGNRVATLMIYMSDVEE 454

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GGAT+FPSL + V P+KG AVFW+N  ++   +   +H+GCPV  G+KW
Sbjct: 455 GGATVFPSLGVRVSPKKGDAVFWWNIMSSWEGEMLTWHAGCPVLYGSKW 503


>gi|410900628|ref|XP_003963798.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Takifugu
           rubripes]
          Length = 548

 Score =  135 bits (341), Expect = 9e-30,   Method: Compositional matrix adjust.
 Identities = 81/230 (35%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +S L C +Y++ +N    + P+K ++ +  P +V+  D I D E
Sbjct: 303 YEMLCRGEGIKMTPRRQSRLFCRYYDNNHNPKYVLSPVKQQDEWDRPYIVRYIDIISDKE 362

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  + N   G       R+SK  +L      +HP +  I  RI+D+T
Sbjct: 363 IETVKKLAKPRLRRATISNPITGVLETASYRISKSAWL---TGYEHPVIEIINQRIEDLT 419

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 420 GLEMDTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVA 475

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +   V+P+KG+AVFWYN  AN   DY   H+ CPV +GNKW
Sbjct: 476 AGGATVFPDVGAAVWPQKGTAVFWYNLFANGEGDYSTRHAACPVLVGNKW 525


>gi|2498741|sp|Q60716.1|P4HA2_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-2; Short=4-PH
           alpha-2; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-2; Flags: Precursor
 gi|836900|gb|AAC52198.1| prolyl 4-hydroxylase alpha(II)-subunit [Mus musculus]
 gi|18073923|emb|CAC85691.1| Prolyl 4-hydroxylase alpha IIb subunit [Mus musculus]
 gi|1096888|prf||2112362B Pro 4-hydroxylase:SUBUNIT=alpha:ISOTYPE=II
          Length = 537

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 125/232 (53%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 290 DVYESLCRGEGVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 349

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 350 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 406

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW-------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D +   DE  +       R+A+F+ Y++D
Sbjct: 407 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRSDDEDAFKRLGTGNRVATFLNYMSD 462

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 463 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 514


>gi|291387304|ref|XP_002710243.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
           precursor (predicted)-like isoform 3 [Oryctolagus
           cuniculus]
          Length = 535

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHDGNGAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + +I  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARINRRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D +  +E           R+A+F+ Y++D
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRNNERDAFKRLGTGNRVATFLNYMSD 460

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|239792190|dbj|BAH72464.1| ACYPI007079 [Acyrthosiphon pisum]
          Length = 249

 Score =  135 bits (341), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 80/227 (35%), Positives = 122/227 (53%), Gaps = 18/227 (7%)

Query: 5   LACQGNLSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
           + C+    +   I S L+C Y + N N  L I PLK EE +  PR++   D +YD+EI  
Sbjct: 1   MLCRNENLMSIQISSQLRCRYTNNNRNPLLLIAPLKEEEAFFSPRIILYRDVLYDNEIEV 60

Query: 64  IIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
           I  +++ +++R  V NY  G+  + D R+SK  +L      +   +  +  R++ MT L 
Sbjct: 61  IKRMAQPRLKRATVQNYKTGELEFADYRISKSAWLKEH---EDVVVANVAKRVEVMTGLT 117

Query: 122 IGREERYKGPLQINNYGLGGHYDLHCDATPRDE-------GLW-RLASFMFYLTDVELGG 173
               E     LQ+ NYG+GGHYD H D    +E       G   R+A+ +FY++DV  GG
Sbjct: 118 TETAEE----LQVVNYGVGGHYDPHYDFARTEEINAFKSLGTGNRIATVLFYMSDVAQGG 173

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           AT+FP L + + P KG+A  W+N + +   D R  H+ CPV  G+KW
Sbjct: 174 ATVFPWLGVALQPVKGTAAVWFNLYPSGNGDLRTRHAACPVLQGSKW 220


>gi|198449641|ref|XP_002136935.1| GA26860 [Drosophila pseudoobscura pseudoobscura]
 gi|198130697|gb|EDY67493.1| GA26860 [Drosophila pseudoobscura pseudoobscura]
          Length = 508

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 76/208 (36%), Positives = 118/208 (56%), Gaps = 16/208 (7%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           S L C Y +    FL++ PLK+E L LDP VV  HD + D E++ +  +++  + R    
Sbjct: 291 SRLYCLYNTTATAFLRLAPLKMELLSLDPYVVLYHDVLADREMSLLKSMAQKDLVRASTY 350

Query: 79  NYGDTIYVD--TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINN 136
           +  D  + +   R +K  +L P     H  + ++    +DMTNL + R E +    Q+ N
Sbjct: 351 DVMDKKHSEDPNRTTKARWLDPS----HSLIRRMGILTEDMTNLDLERLEDF----QVLN 402

Query: 137 YGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
           YG+GGH D+H D    + P  E   R+A+ +FYL+DV LGGAT+FP L+L+VFP++G+ +
Sbjct: 403 YGIGGHDDIHPDYYEGSNP--ELPDRVATLLFYLSDVPLGGATVFPLLDLSVFPKRGAVL 460

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
            WYN         +  HS CPV +G++W
Sbjct: 461 MWYNLDHKGQGIEKTVHSACPVVVGSRW 488


>gi|354474413|ref|XP_003499425.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
           [Cricetulus griseus]
          Length = 535

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 81/226 (35%), Positives = 121/226 (53%), Gaps = 19/226 (8%)

Query: 7   CQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
           C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D EI RI
Sbjct: 294 CRGEGVKLTPQRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERI 353

Query: 65  IELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
            E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q +T L +
Sbjct: 354 KEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQHITGLTV 410

Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVELGGA 174
              E     LQ+ NYG+GG Y+ H D +  DE           R+A+F+ Y++DVE GGA
Sbjct: 411 KTAEL----LQVANYGMGGQYEPHFDFSRSDEQDAFKRLGTGNRVATFLNYMSDVEAGGA 466

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 467 TVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|47213360|emb|CAF90979.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 511

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 82/237 (34%), Positives = 129/237 (54%), Gaps = 26/237 (10%)

Query: 3   YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y   C+G  L +    +S L C +Y++  +    IGP+K E+ +  PR+V+ HD + + E
Sbjct: 261 YEQLCRGEGLRMTPQRQSGLFCRYYDNGRHPKYVIGPVKQEDEWDHPRIVRYHDVLSNRE 320

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           + ++ EL++ ++ R  V +   G       R+SK  +L      +HP + +I  RI+D+T
Sbjct: 321 MEKVKELARPRLRRATVHDPRTGQLTTAPYRVSKSAWLGA---FEHPIVDQINQRIEDIT 377

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFY----- 165
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++ Y     
Sbjct: 378 GLDVSTAE----DLQVANYGVGGQYEPHFDFGQKDEPDAFEELGTGNRIATWLLYVSAAV 433

Query: 166 --LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
             ++DV+ GGAT+F  +  +V P+KGSAVFWYN   +   DYR  H+ CPV LGNKW
Sbjct: 434 LRMSDVQAGGATVFTDIGASVLPQKGSAVFWYNLRPSGDGDYRTRHAACPVLLGNKW 490


>gi|194904100|ref|XP_001981000.1| GG23922 [Drosophila erecta]
 gi|190652703|gb|EDV49958.1| GG23922 [Drosophila erecta]
          Length = 490

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 83/227 (36%), Positives = 116/227 (51%), Gaps = 34/227 (14%)

Query: 5   LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
           LA   N +      S L C Y S    F +I PLK+EEL  DP +V  HD IYDSEI+ +
Sbjct: 258 LATVQNCTAVVQKPSRLHCRYNSSTTPFTRIAPLKMEELSSDPYMVVYHDVIYDSEIDLM 317

Query: 65  IELSKGKV------ERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           +  S   +      ++ +V    D+  VD++                    +  R+ DMT
Sbjct: 318 LNASNFSLSLTNSGQKSEVRASKDSYIVDSK-------------------TLNDRVTDMT 358

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLWRLASFMFYLTDVELGG 173
            L +   +    P  + NYG+GGHY LH D        R++   R+A+ +FYL +V  GG
Sbjct: 359 GLSMEMSD----PFSMINYGIGGHYMLHYDYHEYSNMTREKYGDRIATVLFYLGEVHSGG 414

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           ATIFP +N+TV P+KGSAVFWYN H +  +     HS CPV  G+K+
Sbjct: 415 ATIFPRINITVTPKKGSAVFWYNLHNSGAMHSETLHSACPVISGSKY 461


>gi|194905392|ref|XP_001981188.1| GG11756 [Drosophila erecta]
 gi|190655826|gb|EDV53058.1| GG11756 [Drosophila erecta]
          Length = 509

 Score =  135 bits (340), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 80/206 (38%), Positives = 112/206 (54%), Gaps = 12/206 (5%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           + L C Y +  + FL++ PLK+E L LDP VV  HD + D +I  I  L+KG + R   V
Sbjct: 291 AKLHCLYNTTASHFLRLAPLKMELLSLDPYVVLFHDVVSDQDILSIRNLAKGGLARAVTV 350

Query: 79  NY-GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
              G+      R +K  +L      +   + ++    QDMTN  +    R   P Q+ NY
Sbjct: 351 TQDGNDKEDPARTTKGTWL----VENSKLIQRLSQLSQDMTNFDV----RDADPFQVLNY 402

Query: 138 GLGGHYDLHCDATPRDE-GLW--RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFW 194
           G+GG Y  H D     E G +  R+A+ +FYL+DV  GGAT FP L L+VFPEKG+A+ W
Sbjct: 403 GIGGFYGTHFDFLEDTEMGHFSDRIATAVFYLSDVPQGGATTFPDLGLSVFPEKGAALLW 462

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
           YN     + D R  HS CP  +G++W
Sbjct: 463 YNLDHKGVGDNRTAHSACPTIVGSRW 488


>gi|327265288|ref|XP_003217440.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Anolis
           carolinensis]
          Length = 554

 Score =  135 bits (339), Expect = 1e-29,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 126/232 (54%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
           EIY   C+G  + +    +  L C Y + N N  L I P K E+ +  P +V+ ++ + D
Sbjct: 307 EIYEALCRGEGVKMTPRRQKRLFCRYHNGNQNPHLLIAPFKEEDEWDSPHIVRYYNVLSD 366

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI +I EL+K K+ R  V +   G     + R+SK  +L  E   D   + K+  R++ 
Sbjct: 367 EEIEKIKELAKPKLARATVRDPKTGVLTVANYRVSKSSWLEEE---DDLVVAKVNQRMEH 423

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D + ++E           R+A+F+ Y++D
Sbjct: 424 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRKEEPDAFKRLGTGNRVATFLNYMSD 479

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP     ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 480 VEAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 531


>gi|431892682|gb|ELK03115.1| Prolyl 4-hydroxylase subunit alpha-2 [Pteropus alecto]
          Length = 629

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 86/252 (34%), Positives = 128/252 (50%), Gaps = 39/252 (15%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N T  L I P K E+ +  P +V+ +D + D
Sbjct: 294 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSD 353

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EINRI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 354 EEINRIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 410

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----------------- 157
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL                  
Sbjct: 411 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDEQD 466

Query: 158 ---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
                    R+A+F+ Y++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  
Sbjct: 467 VFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTR 526

Query: 209 HSGCPVALGNKW 220
           H+ CPV +G KW
Sbjct: 527 HAACPVLVGCKW 538


>gi|432904500|ref|XP_004077362.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oryzias
           latipes]
          Length = 555

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 79/230 (34%), Positives = 130/230 (56%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +S L C +Y++ +N    + P+K ++ +  P +V+  D I ++E
Sbjct: 305 YEMLCRGEGVRMTSRRQSRLFCRYYDNKHNPRFVLAPVKQQDEWDRPYIVRYIDIISEAE 364

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           +++I +L+K ++ R  + N   G       R+SK  +L      + P + KI  RI+D+T
Sbjct: 365 MDKIKQLAKPRLRRATISNPVTGVLETAPYRISKSAWL---TAYEDPVVEKINQRIEDLT 421

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 422 GLEMDTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLFYMSDVS 477

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +  +V P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 478 AGGATVFPDVGASVGPQKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 527


>gi|291190274|ref|NP_001167096.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha 1 polypeptide precursor [Salmo
           salar]
 gi|223648100|gb|ACN10808.1| Prolyl 4-hydroxylase subunit alpha-1 precursor [Salmo salar]
          Length = 545

 Score =  135 bits (339), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 81/230 (35%), Positives = 125/230 (54%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYNNTFLKI-GPLKVEELYLDPRVVKIHDAIYDSE 60
           Y   C+G  + +    +S + C Y   N   L + GP+K E+ +  PR+++ HD + +SE
Sbjct: 300 YEQLCRGEGIKMTPRRQSRMFCRYSDNNRHPLYVLGPVKQEDEWDRPRIIRYHDVLSNSE 359

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I ++ EL+K ++ R  + N   G       R+SK  +L      + P + KI  RI+D+T
Sbjct: 360 IEKVKELAKPRLRRATISNPITGVLETAHYRISKSAWL---TAYEDPVVDKINQRIEDIT 416

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++ Y++DV 
Sbjct: 417 GLNVKTAEE----LQVANYGVGGQYEPHFDFGRKDEPDAFKELGTGNRIATWLIYMSDVP 472

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+F  +   V+P+KGSAVFWYN   +   DY   H+ CPV +GNKW
Sbjct: 473 SGGATVFTDVGAAVWPKKGSAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 522


>gi|20269814|gb|AAM18062.1|AF495540_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE2
           [Drosophila melanogaster]
 gi|19528175|gb|AAL90202.1| AT27756p [Drosophila melanogaster]
          Length = 542

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 84/227 (37%), Positives = 113/227 (49%), Gaps = 15/227 (6%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           + P  C G   VP ++ SNL C Y    + FL++ P+K E L +DP VV +HD I   E 
Sbjct: 288 VLPPCCSGRCQVPRNL-SNLYCVYNHVTSPFLQLAPIKTEILSIDPFVVLLHDMISQKES 346

Query: 62  NRIIELSKGKVERGKVVN---YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
             I   SK  +      +     D   VDT  +     Y   F D     KI  R+ D T
Sbjct: 347 TLIRTSSKEHMLPSATTDPDASDDETQVDTYRTSKSVWYSSDFNDTT--KKITERLGDAT 404

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGG 173
            L +   E Y    Q+ NYGLGG ++ H D    ++  +     R+A+ +FYL +V  GG
Sbjct: 405 GLDMNSTEFY----QVINYGLGGFFETHLDMLLSEKNRFNGTSDRIATTLFYLNEVRQGG 460

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            T FP LNLTVFP+ GSA+FWYN            H+GCPV +G+KW
Sbjct: 461 GTYFPRLNLTVFPQPGSALFWYNLDTKGNDHMGSLHTGCPVIVGSKW 507


>gi|78706702|ref|NP_001027154.1| CG18749 [Drosophila melanogaster]
 gi|21429852|gb|AAM50604.1| GH05783p [Drosophila melanogaster]
 gi|23175900|gb|AAN14309.1| CG18749 [Drosophila melanogaster]
 gi|220956638|gb|ACL90862.1| CG18749-PB [synthetic construct]
          Length = 491

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 82/224 (36%), Positives = 118/224 (52%), Gaps = 34/224 (15%)

Query: 8   QGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIEL 67
           Q   +V +     L C Y +    F +I PLK+EEL LDP +V  HD IYD+EI+ ++  
Sbjct: 262 QNCTAVVQKPSKKLHCRYNTSTTPFTRIAPLKMEELGLDPYMVVFHDVIYDTEIDGMLNS 321

Query: 68  SK-GKVE-----RGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
           S  G  E     + +V    D+  VD +                    +  R+ DMT L 
Sbjct: 322 SDFGLSESVSGLKSEVRTSKDSHIVDAK-------------------TLNERVTDMTGLS 362

Query: 122 IGREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLWRLASFMFYLTDVELGGATI 176
           +   +    P  + NYGLGGH+ LH D      T R +   R+A+ +FYL +V+ GGAT+
Sbjct: 363 MEMSD----PFSLINYGLGGHFILHHDFHEYTNTTRLKQGDRIATVLFYLREVDSGGATV 418

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FP LN+TV P+KGSAVFWYN H +  ++ +  H+ CPV  G+K+
Sbjct: 419 FPMLNITVMPKKGSAVFWYNLHNSGAVNSKTLHTACPVISGSKY 462


>gi|211938649|gb|ACJ13221.1| FI08532p [Drosophila melanogaster]
          Length = 543

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 84/227 (37%), Positives = 113/227 (49%), Gaps = 15/227 (6%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           + P  C G   VP ++ SNL C Y    + FL++ P+K E L +DP VV +HD I   E 
Sbjct: 289 VLPPCCSGRCQVPRNL-SNLYCVYNHVTSPFLQLAPIKTEILSIDPFVVLLHDMISQKES 347

Query: 62  NRIIELSKGKVERGKVVN---YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
             I   SK  +      +     D   VDT  +     Y   F D     KI  R+ D T
Sbjct: 348 TLIRTSSKEHMLPSATTDPDASDDETQVDTYRTSKSVWYSSDFNDTT--KKITERLGDAT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGG 173
            L +   E Y    Q+ NYGLGG ++ H D    ++  +     R+A+ +FYL +V  GG
Sbjct: 406 GLDMNSTEFY----QVINYGLGGFFETHLDMLLSEKNRFNGTSDRIATTLFYLNEVRQGG 461

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            T FP LNLTVFP+ GSA+FWYN            H+GCPV +G+KW
Sbjct: 462 GTYFPRLNLTVFPQPGSALFWYNLDTKGNDHMGSLHTGCPVIVGSKW 508


>gi|24651430|ref|NP_733378.1| prolyl-4-hydroxylase-alpha NE2 [Drosophila melanogaster]
 gi|23172699|gb|AAF57061.2| prolyl-4-hydroxylase-alpha NE2 [Drosophila melanogaster]
          Length = 542

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 84/227 (37%), Positives = 113/227 (49%), Gaps = 15/227 (6%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           + P  C G   VP ++ SNL C Y    + FL++ P+K E L +DP VV +HD I   E 
Sbjct: 288 VLPPCCSGRCQVPRNL-SNLYCVYNHVTSPFLQLAPIKTEILSIDPFVVLLHDMISQKES 346

Query: 62  NRIIELSKGKVERGKVVN---YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
             I   SK  +      +     D   VDT  +     Y   F D     KI  R+ D T
Sbjct: 347 TLIRTSSKEHMLPSATTDPDASDDETQVDTYRTSKSVWYSSDFNDTT--KKITERLGDAT 404

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGG 173
            L +   E Y    Q+ NYGLGG ++ H D    ++  +     R+A+ +FYL +V  GG
Sbjct: 405 GLDMNSTEFY----QVINYGLGGFFETHLDMLLSEKNRFNGTSDRIATTLFYLNEVRQGG 460

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            T FP LNLTVFP+ GSA+FWYN            H+GCPV +G+KW
Sbjct: 461 GTYFPRLNLTVFPQPGSALFWYNLDTKGNDHMGSLHTGCPVIVGSKW 507


>gi|344264847|ref|XP_003404501.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 1
           [Loxodonta africana]
          Length = 536

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 81/232 (34%), Positives = 124/232 (53%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N T  L I P K E+ +  P +V+ +D + D
Sbjct: 289 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSD 348

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI +++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 349 EEIERIKQIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVAQVNRRMQH 405

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D +   E           R+A+F+ Y++D
Sbjct: 406 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRSHEQDAFKRLGTGNRVATFLNYMSD 461

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 462 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 513


>gi|113682363|ref|NP_001038463.1| prolyl 4-hydroxylase, alpha polypeptide I a precursor [Danio rerio]
          Length = 522

 Score =  134 bits (338), Expect = 2e-29,   Method: Compositional matrix adjust.
 Identities = 85/252 (33%), Positives = 127/252 (50%), Gaps = 41/252 (16%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y   C+G  L +    + +L C Y + N + F  IGP+K E+ +  PR+++ H+ I + E
Sbjct: 255 YEKLCRGEGLKMTPRRQKHLFCRYFNGNRHPFYTIGPVKQEDEWDRPRIIRYHEIITEQE 314

Query: 61  INRIIELSKGKVERGKVVN------------------------YGDTIYVDTRLSKVYFL 96
           I +I ELSK ++ R  + N                         G       R+SK  +L
Sbjct: 315 IEKIKELSKPRLRRATISNPITGVLETAHYRISKRRATVHDPQTGKLTTAQYRVSKSAWL 374

Query: 97  YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL 156
                 +HP + +I  RI+D+T L +   E     LQ+ NYG+GG Y+ H D   +DE  
Sbjct: 375 AAY---EHPVVDRINQRIEDITGLNVKTAEE----LQVANYGVGGQYEPHFDFGRKDEPD 427

Query: 157 W--------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
                    R+A+++FY++DV  GGAT+FP +   V P KG+AVFWYN   +   DY   
Sbjct: 428 AFKELGTGNRIATWLFYMSDVAAGGATVFPEVGAAVKPLKGTAVFWYNLFPSGEGDYSTR 487

Query: 209 HSGCPVALGNKW 220
           H+ CPV +GNKW
Sbjct: 488 HAACPVLVGNKW 499


>gi|198449502|ref|XP_001357605.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
 gi|198130635|gb|EAL26739.2| GA15937 [Drosophila pseudoobscura pseudoobscura]
          Length = 510

 Score =  134 bits (337), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 75/208 (36%), Positives = 111/208 (53%), Gaps = 9/208 (4%)

Query: 15  EDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVER 74
           E     L C Y      FL++ PLK+E L   P VV  HD + DSEI  I+E+++ ++ R
Sbjct: 287 EQSPKALHCCYNFTTTPFLRLAPLKMELLGEHPYVVVYHDVLSDSEIAEILEMAERRMAR 346

Query: 75  GKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
              V   +     TR +   +L       +    +I  R++DM+ L +   ER    +Q+
Sbjct: 347 TSTVAQPNRTSSPTRTAMGAWLK---RSSNALTRRIARRVRDMSGLQLEGSER----MQV 399

Query: 135 NNYGLGGHYDLHCDATPRDEGLW--RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
            NYG+GGHY  H D   +   +   RLA+ +FYLTDVE GGAT+F      V P +G+A+
Sbjct: 400 INYGIGGHYVPHKDWFTQHPEVMGNRLATVLFYLTDVEQGGATMFNKAEHKVLPRRGTAL 459

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FWYN H +   D+   H+ CP+ +G+KW
Sbjct: 460 FWYNLHTDGEGDWSTTHAACPIIVGSKW 487


>gi|195159144|ref|XP_002020442.1| GL13995 [Drosophila persimilis]
 gi|194117211|gb|EDW39254.1| GL13995 [Drosophila persimilis]
          Length = 535

 Score =  134 bits (336), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 75/208 (36%), Positives = 111/208 (53%), Gaps = 9/208 (4%)

Query: 15  EDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVER 74
           E     L C Y      FL++ PLK+E L   P VV  HD + DSEI  I+E+++ ++ R
Sbjct: 312 EQSPKALHCCYNFTTTPFLRLAPLKMELLGEHPYVVVYHDVLSDSEIAEILEMAERRMAR 371

Query: 75  GKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
              V   +     TR +   +L       +    +I  R++DM+ L +   ER    +Q+
Sbjct: 372 TSTVAQPNRTSSPTRTALGAWLK---RSSNALTRRIARRVRDMSGLQLEGSER----MQV 424

Query: 135 NNYGLGGHYDLHCDATPRDEGLW--RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
            NYG+GGHY  H D   +   +   RLA+ +FYLTDVE GGAT+F      V P +G+A+
Sbjct: 425 INYGIGGHYVPHKDWFTQHPEVMGNRLATVLFYLTDVEQGGATMFNKAEHKVLPRRGTAL 484

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FWYN H +   D+   H+ CP+ +G+KW
Sbjct: 485 FWYNLHTDGEGDWSTTHAACPIIVGSKW 512


>gi|195438148|ref|XP_002066999.1| GK24258 [Drosophila willistoni]
 gi|194163084|gb|EDW77985.1| GK24258 [Drosophila willistoni]
          Length = 217

 Score =  134 bits (336), Expect = 3e-29,   Method: Compositional matrix adjust.
 Identities = 82/224 (36%), Positives = 120/224 (53%), Gaps = 16/224 (7%)

Query: 5   LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
           L C+G+L  P +   NL C Y S    FL++ P K EE+ LDP ++  H+AIYD+EI+  
Sbjct: 3   LGCRGHLKAPSN--RNLFCSYNSTTTPFLRLAPFKTEEISLDPFILLFHNAIYDNEISYF 60

Query: 65  IELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGR 124
            ++ +  +      NY  T     R+ +V     E  GD      +  R++D++ L  G 
Sbjct: 61  TKVKRKDMREAHTDNY-TTPNEQYRIMQVKVY--EGIGDK-MDKTLLERVKDISGLSAGN 116

Query: 125 EERYKGPLQINNYGLGGHYDLHCD-----ATPR-DEGLWRLASFMFYLTDVELGGATIFP 178
               K  L   NYGLG ++  H D      +P  +E   RLA+ +FYL+DV  GG TIFP
Sbjct: 117 ----KSELAAGNYGLGSYFPEHSDYRDIKVSPELNETGDRLATILFYLSDVAQGGHTIFP 172

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGK 222
             N+TV P+KGSA+FW+N H +   + +  H  CP+  GN+W K
Sbjct: 173 LANVTVQPKKGSALFWFNLHNDGEPNIKSLHGVCPIIEGNRWSK 216


>gi|195505216|ref|XP_002099408.1| GE23378 [Drosophila yakuba]
 gi|194185509|gb|EDW99120.1| GE23378 [Drosophila yakuba]
          Length = 546

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 82/231 (35%), Positives = 115/231 (49%), Gaps = 22/231 (9%)

Query: 4   PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
           P  C G    P  +K  L C Y      FL++ P+K E L +DP +V +HD +   E   
Sbjct: 289 PPCCSGCCEGPRKLK-RLYCVYNGVTAPFLRLAPIKTEILSIDPFIVLLHDMVSVEEGAL 347

Query: 64  IIELSKGKVERGKVVNYGDTIYVDT--------RLSKVYFLYPEIFGDHPFLYKIQTRIQ 115
           +   SK  +   +     D+             R SK  +L  +    +    K+  R+ 
Sbjct: 348 LRTFSKNMISPSETAELSDSEEKSIFEFEVGSFRTSKSVWLDNDA---NEATLKLTQRLG 404

Query: 116 DMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDV 169
           D T L I   E    P Q+ NYG+GG ++ H D + +DE  +      RLA+ +FYL DV
Sbjct: 405 DATGLDISHSE----PFQVINYGIGGIFESHFDTSLQDENRFLDGYMDRLATTLFYLNDV 460

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
             GGAT FP LN+TVFP+ G+A+FWYN     LL  R  H+GCPV +G+KW
Sbjct: 461 PQGGATHFPGLNITVFPKFGTALFWYNLDTKGLLRLRTMHTGCPVIVGSKW 511


>gi|432109537|gb|ELK33711.1| Prolyl 4-hydroxylase subunit alpha-2 [Myotis davidii]
          Length = 555

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 85/252 (33%), Positives = 127/252 (50%), Gaps = 39/252 (15%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N T  L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPKRQKRLFCRYHDGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIQRIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----------------- 157
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL                  
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDEQD 460

Query: 158 ---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
                    R+A+F+ Y++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  
Sbjct: 461 VFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTR 520

Query: 209 HSGCPVALGNKW 220
           H+ CPV +G KW
Sbjct: 521 HAACPVLVGCKW 532


>gi|395817620|ref|XP_003782263.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 isoform 2 [Otolemur
           garnettii]
          Length = 540

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 123/232 (53%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           E+Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 293 EVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 352

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 353 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNHRMQH 409

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D +   E           R+A+F+ Y++D
Sbjct: 410 ITGLSVKTAEL----LQVANYGVGGQYEPHFDFSRNHERDAFKRLGTGNRVATFLNYMSD 465

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 466 VEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 517


>gi|351706369|gb|EHB09288.1| Prolyl 4-hydroxylase subunit alpha-2 [Heterocephalus glaber]
          Length = 535

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 82/232 (35%), Positives = 124/232 (53%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           E+Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ ++ + D
Sbjct: 288 EVYESLCRGEGVKLTPQRQKRLFCRYHHGNRAPELLIAPFKEEDEWDSPHIVRYYNVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI+RI EL+K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIDRIKELAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQY 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D +   E           R+A+F+ Y++D
Sbjct: 405 ITGLTVQTAEL----LQVANYGMGGQYEPHFDFSRNHERDAFKRLGTGNRVATFLNYMSD 460

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 461 VEAGGATVFPDLGAALWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 512


>gi|348523976|ref|XP_003449499.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Oreochromis
           niloticus]
          Length = 594

 Score =  134 bits (336), Expect = 4e-29,   Method: Compositional matrix adjust.
 Identities = 80/230 (34%), Positives = 127/230 (55%), Gaps = 19/230 (8%)

Query: 3   YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y   C+G  + +    +S L C +Y++  +    IGP+K E+ +  P +V+ H+ + + +
Sbjct: 349 YEQLCRGQGIKLTPRRQSRLFCRYYDNNRHPRYVIGPVKQEDEWDSPHIVRYHNIVSEKD 408

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           + ++ EL+K ++ R  + N   G       R+SK  +L      +HP + KI   I+D+T
Sbjct: 409 MEKVKELAKPRLRRATISNPVTGVLETAHYRISKSAWLGAY---EHPVVDKINQLIEDVT 465

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYGLGG Y+ H D   +DE           R+A+++ Y+TDV+
Sbjct: 466 GLNVKTAE----DLQVANYGLGGQYEPHFDFGRKDEPDAFEELGTGNRIATWLLYMTDVQ 521

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+F  +   V P+KG+AVFWYN + +   DYR  H+ CPV LGNKW
Sbjct: 522 AGGATVFTDIGAAVKPKKGTAVFWYNLYPSGEGDYRTRHAACPVLLGNKW 571


>gi|357605723|gb|EHJ64752.1| prolyl 4-hydroxylase alpha subunit [Danaus plexippus]
          Length = 235

 Score =  133 bits (335), Expect = 5e-29,   Method: Compositional matrix adjust.
 Identities = 71/200 (35%), Positives = 115/200 (57%), Gaps = 15/200 (7%)

Query: 29  NNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYV 86
           N+ FL++ P+++E LY +P ++  +D + D EI+ I  +++ +  R  V +   G+ +  
Sbjct: 4   NHPFLRLAPVRMEYLYRNPDIIVFNDVLSDYEIDYIKRIAQPRFRRATVHDPATGELVPA 63

Query: 87  DTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH 146
             R+SK  +L  E   +   + ++  R+ D+T L +   E     LQ+ NYG+GGHYD H
Sbjct: 64  HYRISKSAWLKDE---ESAVVARVSRRVADITGLSMTTAEE----LQVVNYGIGGHYDPH 116

Query: 147 CDATPRDEGLW------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHAN 200
            D   ++E  +      R+A+ +FY++DV  GGAT+F  L L+VFP +GSAVFW N H +
Sbjct: 117 FDFARKEENAFEKFNGNRIATVLFYMSDVAQGGATVFTELGLSVFPRRGSAVFWLNLHPS 176

Query: 201 TLLDYRMYHSGCPVALGNKW 220
              D    H+ CPV  G+KW
Sbjct: 177 GEGDLATRHAACPVLRGSKW 196


>gi|198477150|ref|XP_002136737.1| GA29215 [Drosophila pseudoobscura pseudoobscura]
 gi|198145042|gb|EDY71754.1| GA29215 [Drosophila pseudoobscura pseudoobscura]
          Length = 508

 Score =  133 bits (334), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 116/206 (56%), Gaps = 12/206 (5%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           S L C Y +    FL++ PLK+E L LDP VV  HD + D E++ +  +++  + R    
Sbjct: 291 SRLYCLYNTTATAFLRLAPLKMELLSLDPYVVLYHDVLADREMSLLKLMAQRDLVRAVTY 350

Query: 79  NYGDTIYVD--TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINN 136
           N  +  + +   R +K  +L P     H  + ++    +DM+NL + R E +    Q+ N
Sbjct: 351 NATEKKHSEDPNRTTKAGWLDPS----HNLIRRMGILTEDMSNLDLERSEDF----QVLN 402

Query: 137 YGLGGHYDLHCD--ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFW 194
           YG+GGHY +H D       E   R+A+ +FYL+DV LGGAT+FP L+L+VFP+KG+ + W
Sbjct: 403 YGIGGHYAVHPDFFEGSNPELPDRVATLLFYLSDVPLGGATVFPLLDLSVFPKKGAVLMW 462

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
           YN         +  HS CPV +G++W
Sbjct: 463 YNLDHKGQGMEKTIHSACPVVVGSRW 488


>gi|66770643|gb|AAY54633.1| IP12395p [Drosophila melanogaster]
          Length = 538

 Score =  133 bits (334), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 78/224 (34%), Positives = 115/224 (51%), Gaps = 15/224 (6%)

Query: 4   PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
           P  C G    P  + + L C Y      FL++ P+K E L +DP V+ +HD +   E   
Sbjct: 288 PPCCSGRCEGPRKL-NRLYCVYNCVTAPFLRLAPIKTEILSVDPFVILLHDMVSHKEGAL 346

Query: 64  IIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
           I   SK ++   + VN  +   +   R SK  +   +    +    K+  R+ + T L +
Sbjct: 347 IRSSSKNQILPSETVNAANEFEIAKFRTSKSVWFDSDA---NEATLKLTQRLGEATGLDM 403

Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGATI 176
              E    P Q+ NYG+GG ++ H D +  DE  +      RLA+ +FYL DV  GGAT 
Sbjct: 404 KHSE----PFQVINYGIGGVFESHFDTSLADEDRFVNGYIDRLATTLFYLNDVPQGGATH 459

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FP LN+TVFP+ G+ + WYN H   +L  R  H+GCPV +G+KW
Sbjct: 460 FPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVIVGSKW 503


>gi|449673565|ref|XP_002167120.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
           magnipapillata]
          Length = 571

 Score =  133 bits (334), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 78/228 (34%), Positives = 121/228 (53%), Gaps = 18/228 (7%)

Query: 3   YPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           Y   C+G +  + +  ++ +KC+Y S  +  LK+ P KVE +++DP +  + + I + +I
Sbjct: 329 YEQLCRGEVRPLTKKEQAKMKCWY-SAKDPVLKLKPQKVERVWVDPEIFILRNIISEKQI 387

Query: 62  NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
           N I E +   + R  + +   G   + D R+SK  +L    +    FL  ++ R Q  T 
Sbjct: 388 NLIKEAASPMLRRATIQDPITGKLRHADYRISKSAWLSTNKYN---FLQALEARTQATTG 444

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-------RLASFMFYLTDVELG 172
           L +     Y   LQ+ NYGLGGHY+ H D +  +E  +       R+A+ +FYL+DVE G
Sbjct: 445 LDLS----YAEQLQVANYGLGGHYEPHFDHSRENEDRFTDLGMGNRIATVLFYLSDVEAG 500

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GAT+F      VFP KG AVFW+N   N   +    H+ CPV +G KW
Sbjct: 501 GATVFTVGKTAVFPSKGDAVFWFNLKRNGKGNPNTRHAACPVLVGQKW 548


>gi|116008130|ref|NP_001036777.1| CG31524, isoform B [Drosophila melanogaster]
 gi|113194860|gb|ABI31221.1| CG31524, isoform B [Drosophila melanogaster]
          Length = 535

 Score =  133 bits (334), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 78/224 (34%), Positives = 115/224 (51%), Gaps = 15/224 (6%)

Query: 4   PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
           P  C G    P  + + L C Y      FL++ P+K E L +DP V+ +HD +   E   
Sbjct: 285 PPCCSGRCEGPRKL-NRLYCVYNCVTAPFLRLAPIKTEILSVDPFVILLHDMVSHKEGAL 343

Query: 64  IIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
           I   SK ++   + VN  +   +   R SK  +   +    +    K+  R+ + T L +
Sbjct: 344 IRSSSKNQILPSETVNAANEFEIAKFRTSKSVWFDSDA---NEATLKLTQRLGEATGLDM 400

Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGATI 176
              E    P Q+ NYG+GG ++ H D +  DE  +      RLA+ +FYL DV  GGAT 
Sbjct: 401 KHSE----PFQVINYGIGGVFESHFDTSLADEDRFVNGYIDRLATTLFYLNDVPQGGATH 456

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FP LN+TVFP+ G+ + WYN H   +L  R  H+GCPV +G+KW
Sbjct: 457 FPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVIVGSKW 500


>gi|116008537|ref|NP_733379.2| CG31524, isoform A [Drosophila melanogaster]
 gi|113194861|gb|AAN14239.2| CG31524, isoform A [Drosophila melanogaster]
          Length = 536

 Score =  133 bits (334), Expect = 6e-29,   Method: Compositional matrix adjust.
 Identities = 78/224 (34%), Positives = 115/224 (51%), Gaps = 15/224 (6%)

Query: 4   PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
           P  C G    P  + + L C Y      FL++ P+K E L +DP V+ +HD +   E   
Sbjct: 286 PPCCSGRCEGPRKL-NRLYCVYNCVTAPFLRLAPIKTEILSVDPFVILLHDMVSHKEGAL 344

Query: 64  IIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
           I   SK ++   + VN  +   +   R SK  +   +    +    K+  R+ + T L +
Sbjct: 345 IRSSSKNQILPSETVNAANEFEIAKFRTSKSVWFDSDA---NEATLKLTQRLGEATGLDM 401

Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGATI 176
              E    P Q+ NYG+GG ++ H D +  DE  +      RLA+ +FYL DV  GGAT 
Sbjct: 402 KHSE----PFQVINYGIGGVFESHFDTSLADEDRFVNGYIDRLATTLFYLNDVPQGGATH 457

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FP LN+TVFP+ G+ + WYN H   +L  R  H+GCPV +G+KW
Sbjct: 458 FPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVIVGSKW 501


>gi|196011908|ref|XP_002115817.1| hypothetical protein TRIADDRAFT_30052 [Trichoplax adhaerens]
 gi|190581593|gb|EDV21669.1| hypothetical protein TRIADDRAFT_30052, partial [Trichoplax
           adhaerens]
          Length = 495

 Score =  133 bits (334), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 81/207 (39%), Positives = 118/207 (57%), Gaps = 15/207 (7%)

Query: 21  LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY 80
           LKC+Y S  +  L + P+ VEE+ LDP +V  +D I D +I  I ++S  K    K  N+
Sbjct: 274 LKCYY-SNQSPLLYLAPIPVEEISLDPFIVIYYDIINDHQIETIKKISPSK--SNKSPNH 330

Query: 81  GDTIY-VDTRLSKVYFLYPEIFGDH---PFLYKIQTRIQDMTNLVIGREERYKGPLQINN 136
                 + +  ++V       + +    P + KI    Q++T+L +     Y   LQ+ N
Sbjct: 331 AMLCSGIKSEATQVSIFCCSTWLEDAYDPVVEKISRLTQELTHLDVN----YAEDLQVAN 386

Query: 137 YGLGGHYDLHCDAT---PRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVF 193
           YG+GGHY  H D+T   P D  L RLA+ MFYL++VE+GGATIFP L + V P+KGSA+F
Sbjct: 387 YGIGGHYVPHYDSTIIAPEDP-LQRLATMMFYLSNVEIGGATIFPRLGVAVRPQKGSALF 445

Query: 194 WYNAHANTLLDYRMYHSGCPVALGNKW 220
           W N   N L + +  H+ CPV +G+KW
Sbjct: 446 WINLKRNGLTNRQTLHAACPVVIGSKW 472


>gi|261245137|gb|ACX54875.1| FI12021p [Drosophila melanogaster]
          Length = 538

 Score =  133 bits (334), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 78/224 (34%), Positives = 115/224 (51%), Gaps = 15/224 (6%)

Query: 4   PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
           P  C G    P  + + L C Y      FL++ P+K E L +DP V+ +HD +   E   
Sbjct: 288 PPCCSGRCEGPRKL-NRLYCVYNCVTAPFLRLAPIKTEILSVDPFVILLHDMVSHKEGAL 346

Query: 64  IIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
           I   SK ++   + VN  +   +   R SK  +   +    +    K+  R+ + T L +
Sbjct: 347 IRSSSKNQILPSETVNAANEFEIAKFRTSKSVWFDSDA---NEATLKLTQRLGEATGLDM 403

Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGATI 176
              E    P Q+ NYG+GG ++ H D +  DE  +      RLA+ +FYL DV  GGAT 
Sbjct: 404 KHSE----PFQVINYGIGGVFESHFDTSLADEDRFVNGYIDRLATTLFYLNDVPQGGATH 459

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FP LN+TVFP+ G+ + WYN H   +L  R  H+GCPV +G+KW
Sbjct: 460 FPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVIVGSKW 503


>gi|66771513|gb|AAY55068.1| IP12095p [Drosophila melanogaster]
          Length = 538

 Score =  133 bits (334), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 78/224 (34%), Positives = 115/224 (51%), Gaps = 15/224 (6%)

Query: 4   PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
           P  C G    P  + + L C Y      FL++ P+K E L +DP V+ +HD +   E   
Sbjct: 288 PPCCSGRCEGPRKL-NRLYCVYNCVTAPFLRLAPIKTEILSVDPFVILLHDMVSHKEGAL 346

Query: 64  IIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
           I   SK ++   + VN  +   +   R SK  +   +    +    K+  R+ + T L +
Sbjct: 347 IRSSSKNQILPSETVNAANEFEIAKFRTSKSVWFDSDA---NEATLKLTQRLGEATGLDM 403

Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGATI 176
              E    P Q+ NYG+GG ++ H D +  DE  +      RLA+ +FYL DV  GGAT 
Sbjct: 404 KHSE----PFQVINYGIGGVFESHFDTSLADEDRFVNGYIDRLATTLFYLNDVPQGGATH 459

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FP LN+TVFP+ G+ + WYN H   +L  R  H+GCPV +G+KW
Sbjct: 460 FPGLNITVFPKFGTVLMWYNLHTEGMLHVRTMHTGCPVIVGSKW 503


>gi|195390805|ref|XP_002054058.1| GJ23004 [Drosophila virilis]
 gi|194152144|gb|EDW67578.1| GJ23004 [Drosophila virilis]
          Length = 446

 Score =  133 bits (334), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 83/216 (38%), Positives = 121/216 (56%), Gaps = 17/216 (7%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           C  +L  P    S+L C Y ++   FL+I PLK+EEL +DP VV  H+ IYDSEI     
Sbjct: 222 CAASLQRP----SHLHCRYNNWTTPFLRIAPLKMEELSIDPFVVLYHNVIYDSEIEWF-- 275

Query: 67  LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
           L++       +++YG   +   R  K  F+  E       +  I+ R+ DM+ L +   +
Sbjct: 276 LTQSFDYTPALLDYGG--FSAHRSGKNVFIELE---KGELVKTIEMRVTDMSGLSMEGSD 330

Query: 127 RYKGPLQINNYGLGGHYDLHCDATPRDEGLW--RLASFMFYLTDVELGGATIFPSLNLTV 184
                L + NYG+GGHY  H D+   +E     R+A+ +FYL+DVELGGAT FP LNLT+
Sbjct: 331 ----DLSLINYGIGGHYIPHHDSFSEEENKTEDRIATALFYLSDVELGGATTFPLLNLTI 386

Query: 185 FPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            PEKG+AV W+N   +     +  H+ CPV +G+K+
Sbjct: 387 SPEKGTAVLWHNLKDSGTPHPKTVHAACPVIVGSKY 422


>gi|443709454|gb|ELU04126.1| hypothetical protein CAPTEDRAFT_167710 [Capitella teleta]
          Length = 535

 Score =  133 bits (334), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 81/229 (35%), Positives = 116/229 (50%), Gaps = 17/229 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           + Y   C+G   +P      L C Y  ++  F  I PL+ E +  DP +   H  + D +
Sbjct: 292 QTYEALCRGEDVIPIKDAHKLTCQYRVWHPMF-TINPLREETMNFDPWIAVYHQLMSDKD 350

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I+ I  L+  ++ R  VVN   G+  +   R+SK  +L  E   +HP + KI  R   +T
Sbjct: 351 IDDIKALATPRLARATVVNSVTGELEFAKYRISKSGWLKDE---EHPTVAKISNRCSALT 407

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE----GLWR---LASFMFYLTDVEL 171
           NL +   E     LQI NYG+GGHY+ H D +   E      WR   + + +FYL+DVE 
Sbjct: 408 NLSLSTVEE----LQIANYGIGGHYEPHFDYSRLAEVTSFDHWRGNRILTVIFYLSDVEA 463

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GG T+F +    + PEKG+A  WYN H +   D    H+ CPV  GNKW
Sbjct: 464 GGGTVFMTAGTKLRPEKGAAAVWYNLHPDGTGDDETKHAACPVLTGNKW 512


>gi|442747091|gb|JAA65705.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
          Length = 533

 Score =  133 bits (334), Expect = 7e-29,   Method: Compositional matrix adjust.
 Identities = 80/230 (34%), Positives = 123/230 (53%), Gaps = 15/230 (6%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y   C+G       + S L+C Y +    F K+ P+K+EE  L P VV + D + D +
Sbjct: 280 ENYKRLCRGEQLRTPKMDSQLRCRYYTGETGFFKLQPIKLEEYNLKPYVVVLRDLLQDRD 339

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           +N +I  +K ++E+ K +   D      R S   +L  +   D P   ++   +Q +  L
Sbjct: 340 LNDMIAFAKPRLEQSKTLCAADKDGPPPRTSSNTWLDDD---DAPVAARVNQYLQSLLGL 396

Query: 121 --VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLW--RLASFMFYLTDVE 170
             + G++E  K   Q+ NYG+GGHY  H D       + +   L+  R+A+ M Y++DVE
Sbjct: 397 GTLYGKDEAEK--YQLANYGIGGHYVPHHDYLEESLTSSKKHRLFGDRVATLMIYMSDVE 454

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FPSL + V P KG AVFW+N  ++   D   +H+GCPV  G+KW
Sbjct: 455 EGGATVFPSLGVRVSPRKGDAVFWWNIKSSWEGDVLTWHAGCPVLYGSKW 504


>gi|387016442|gb|AFJ50340.1| Prolyl 4-hydroxylase subunit alpha-2-like [Crotalus adamanteus]
          Length = 533

 Score =  132 bits (333), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 83/231 (35%), Positives = 124/231 (53%), Gaps = 19/231 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
           E Y   C+G  + +    +  L C Y + N N  L I P K E+ +  P +V+ ++ + D
Sbjct: 288 ETYEALCRGEGVKLTPRRQKGLFCRYHNGNRNPHLIIAPFKEEDEWDSPHIVRYYEVLSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI +I EL+K K+ R  V +   G     + R+SK  +L  E   D   + ++  R++ 
Sbjct: 348 EEIEKIKELAKPKLARATVRDPKTGVLTVANYRVSKSSWLEEE---DDLVVARVNHRMEQ 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPR-------DEGLWRLASFMFYLTDV 169
           +T L     E     LQ+ NYG+GG Y+ H D + R        EG  RLA+F+ Y++DV
Sbjct: 405 ITGLTTKTAEL----LQVANYGMGGQYEPHFDFSRRPFDITLKTEGN-RLATFLNYMSDV 459

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           E GGAT+FP     ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 460 EAGGATVFPDFGAAIWPKKGTAVFWYNLFRSGEGDYRTRHAACPVLVGCKW 510


>gi|125772813|ref|XP_001357665.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
 gi|54637397|gb|EAL26799.1| GA21991 [Drosophila pseudoobscura pseudoobscura]
          Length = 534

 Score =  132 bits (333), Expect = 9e-29,   Method: Compositional matrix adjust.
 Identities = 81/226 (35%), Positives = 122/226 (53%), Gaps = 16/226 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G +      +  L+C Y   ++ +  + PLK+EE  LDP VV  HD +   +I 
Sbjct: 284 YEKVCRGEVGPSPRQERPLRCRYSLGSHPYRHLAPLKLEEHSLDPFVVTYHDMLSPRKIA 343

Query: 63  RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            +  ++  ++ R  V  +  G       R+SK  +L    +  HP +  + + + D T L
Sbjct: 344 DLRLMAVPRMHRSTVNPLPGGQNKKSSFRVSKNAWL---AYDSHPTMGGMLSDLSDATGL 400

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
            +   E+    LQ+ NYG+GGHY+ H D        P +EG  R+A+ +FYL+DVE GGA
Sbjct: 401 DMTFCEQ----LQVANYGVGGHYEPHWDFFRDPDHYPAEEGN-RMATAIFYLSDVEQGGA 455

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T FP LN  V P+ G+ +FWYN H +  +DYR  H+GCPV  G+KW
Sbjct: 456 TAFPFLNFAVKPQLGNVLFWYNVHRSLDVDYRTKHAGCPVLKGSKW 501


>gi|301613004|ref|XP_002936004.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
           (Silurana) tropicalis]
          Length = 526

 Score =  132 bits (332), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 78/232 (33%), Positives = 126/232 (54%), Gaps = 19/232 (8%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
           E Y   C+G  + +    +  L C +++   +  L + P K E+ +  PR+V+ HD I D
Sbjct: 279 EKYEKLCRGEGVKMTSRRQKRLFCRYFDGKKDPLLILSPTKQEDEWDKPRIVRYHDIISD 338

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI+++ EL+K ++ R  + N   G       R++K  +L      + P + ++  RI+ 
Sbjct: 339 EEISKVKELAKPRLRRATISNPITGVLETAQYRITKSAWLSGY---EDPVVARLNRRIEG 395

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           +T L +   E     LQ+ NYG+GG Y+ H D   + E           R+A+++FY++D
Sbjct: 396 VTGLDMSTAEE----LQVANYGIGGQYEPHFDFLRKYEPDAFKKLGTGNRVATWLFYMSD 451

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE GGAT+FP +   V+P+KG+AVFWYN   +   DY   H+ CPV +GNKW
Sbjct: 452 VEAGGATVFPEVGAAVYPKKGTAVFWYNLLESGEGDYSTRHAACPVLVGNKW 503


>gi|395814850|ref|XP_003780953.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Otolemur
           garnettii]
          Length = 544

 Score =  132 bits (332), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 82/222 (36%), Positives = 123/222 (55%), Gaps = 13/222 (5%)

Query: 7   CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           CQ   S P   +  +L C YE+ ++ +L + P++ E ++L+P V   HD + DSE  +I 
Sbjct: 305 CQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPIRKEVIHLEPFVALYHDFVSDSEAQKIR 364

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
           EL++  ++R  V +    + VD R+SK  +L   +    P L  +  RI  +T L +  +
Sbjct: 365 ELAEPWLQRSVVASGEKQLQVDYRISKSAWLKDTV---DPMLVTLDHRIAALTGLDV--Q 419

Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
             Y   LQ+ NYG+GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F 
Sbjct: 420 PPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFI 479

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
             N +V   K +A+FW+N H N   D    H+GCPV +G+KW
Sbjct: 480 YANFSVPVVKNAALFWWNLHRNGEGDSDTLHAGCPVLVGDKW 521


>gi|195159317|ref|XP_002020528.1| GL14042 [Drosophila persimilis]
 gi|194117297|gb|EDW39340.1| GL14042 [Drosophila persimilis]
          Length = 534

 Score =  132 bits (332), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 81/226 (35%), Positives = 122/226 (53%), Gaps = 16/226 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G +      +  L+C Y   ++ +  + PLK+EE  LDP VV  HD +   +I 
Sbjct: 284 YEKVCRGEVGPSPRQERPLRCRYSLGSHPYRHLAPLKLEEHSLDPFVVTYHDMLSPRKIA 343

Query: 63  RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            +  ++  ++ R  V  +  G       R+SK  +L    +  HP +  + + + D T L
Sbjct: 344 DLRLMAVPRMHRSTVNPLPGGQNKKSSFRVSKNAWL---AYDSHPTMGGMLSDLSDATGL 400

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
            +   E+    LQ+ NYG+GGHY+ H D        P +EG  R+A+ +FYL+DVE GGA
Sbjct: 401 DMTFCEQ----LQVANYGVGGHYEPHWDFFRDPDHYPAEEGN-RMATAIFYLSDVEQGGA 455

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T FP LN  V P+ G+ +FWYN H +  +DYR  H+GCPV  G+KW
Sbjct: 456 TAFPFLNFAVKPQLGNVLFWYNVHRSLDVDYRTKHAGCPVLKGSKW 501


>gi|321463241|gb|EFX74258.1| hypothetical protein DAPPUDRAFT_22132 [Daphnia pulex]
          Length = 523

 Score =  132 bits (332), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 80/229 (34%), Positives = 118/229 (51%), Gaps = 19/229 (8%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           C+G   + + + + LKC+Y++ +  +  + P+K+E+   +P +   HD + D EI  I E
Sbjct: 279 CRGERLLNDKLLAELKCWYDTRHQFYFLLMPIKIEQHSFEPAIYTFHDVLSDEEIETIKE 338

Query: 67  LSKGKVERGKV---VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIG 123
           L+K  + R  V   +  G  +  + R SK  +L PE  G HP L ++  RI  +T L   
Sbjct: 339 LAKPLLARSMVQGKLGVGHEVS-NVRTSKTAWL-PE--GLHPLLNRLSRRIGLITGLKTD 394

Query: 124 REERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------------RLASFMFYLTDVEL 171
                   LQ+ NYG+GGHY  H D   +D+  +            R+A+FMFYL DVE 
Sbjct: 395 PIRDEAELLQVANYGIGGHYSPHHDYLMKDKADFEYMHHRELQAGDRIATFMFYLNDVER 454

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GG+T FP   + V P KG A FW+N   +   D    H  CPV LG+KW
Sbjct: 455 GGSTAFPRAGVAVKPVKGGAAFWFNLKRSGKPDPLTLHGACPVLLGHKW 503


>gi|312092237|ref|XP_003147267.1| hypothetical protein LOAG_11701 [Loa loa]
          Length = 553

 Score =  132 bits (332), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 82/230 (35%), Positives = 126/230 (54%), Gaps = 20/230 (8%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           + Y   C+  + V    +S L C+Y+  +  +L++ P+KVE +Y +P  V  HD + D E
Sbjct: 282 DTYQALCRQEMPVNIKAQSRLYCYYK-MDRPYLRLAPIKVEIVYQNPLAVLFHDIMSDEE 340

Query: 61  INRIIE-LSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
            +RIIE L+  K++R  V  V  G+      R+SK  +L      +H  + +I  R+   
Sbjct: 341 -SRIIEMLAVPKLDRATVHNVETGNLETASYRISKSAWLRS---TEHEVVNRINRRLDLA 396

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-------RLASFMFYLTDVE 170
           TNL I   E     LQ+ NYG+GGHY+ H D + RDE  +       R+A+ + Y+T+ E
Sbjct: 397 TNLEIATAEE----LQVQNYGIGGHYEPHLDCS-RDEDAFERTGTGNRIATILIYMTEPE 451

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +GG T+F +L  +V   K +A+FWYN   +  +D R YH+ CPV  G KW
Sbjct: 452 IGGRTVFINLKASVPCTKNAALFWYNLMRSGAVDMRSYHAACPVLTGTKW 501


>gi|403183473|gb|EJY58123.1| AAEL017524-PA, partial [Aedes aegypti]
          Length = 212

 Score =  132 bits (332), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 82/197 (41%), Positives = 113/197 (57%), Gaps = 20/197 (10%)

Query: 35  IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVY 94
           I P K+EE  LDP +V  H+AI D EI +II++SK  ++R  V         + R S+  
Sbjct: 1   IAPFKLEEASLDPLIVIYHNAISDKEIEQIIQVSKPMLKRSMVGESFSKEVSNERTSQNA 60

Query: 95  FLYPEIFGDHPF-LYKIQT-RIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATP- 151
           +L      D+ F L K+ + R +DMT L    + +    LQ+NNYG+GG Y  H D    
Sbjct: 61  WL-----ADYDFELVKVLSLRTEDMTGL----DRKSYESLQVNNYGIGGFYLPHFDWVRT 111

Query: 152 -------RDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLL 203
                  +D GL  R+A+ M+YL+DVE GGAT+FP + + VFP+KGSA+FWYN   +   
Sbjct: 112 NGTEEPYKDMGLGNRIATLMYYLSDVEQGGATVFPQIGVGVFPKKGSAIFWYNLLPDGTG 171

Query: 204 DYRMYHSGCPVALGNKW 220
           D R  H  CPV LG+KW
Sbjct: 172 DERTLHGACPVLLGSKW 188


>gi|24651420|ref|NP_733374.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
 gi|7301952|gb|AAF57058.1| prolyl-4-hydroxylase-alpha NE1 [Drosophila melanogaster]
 gi|363987308|gb|AEW43896.1| FI16820p1 [Drosophila melanogaster]
          Length = 537

 Score =  132 bits (331), Expect = 1e-28,   Method: Compositional matrix adjust.
 Identities = 80/226 (35%), Positives = 124/226 (54%), Gaps = 18/226 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G   V   ++  L+C Y   N+ +  + PLK+EE  LDP V   HD +   +I+
Sbjct: 289 YEKVCRGE--VHPIVRQELRCRYSRGNHPYRFLAPLKLEEHSLDPYVATFHDILSPGKIS 346

Query: 63  RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           ++ E++  ++ R  V  +  G       R+SK  +L    +  HP +  +   ++D T L
Sbjct: 347 QLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWL---AYESHPTMVGMLRDLKDATGL 403

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
               +  +   LQ+ NYG+GGHY+ H D        P +EG  R+A+ +FYL++VE GGA
Sbjct: 404 ----DTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGN-RIATAIFYLSEVEQGGA 458

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T FP L++ V P+ G+ +FWYN H +   DYR  H+GCPV  G+KW
Sbjct: 459 TAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 504


>gi|227553849|gb|ACP40552.1| IP22178p [Drosophila melanogaster]
          Length = 467

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 80/226 (35%), Positives = 124/226 (54%), Gaps = 18/226 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G   V   ++  L+C Y   N+ +  + PLK+EE  LDP V   HD +   +I+
Sbjct: 219 YEKVCRGE--VHPIVRQELRCRYSRGNHPYRFLAPLKLEEHSLDPYVATFHDILSPGKIS 276

Query: 63  RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           ++ E++  ++ R  V  +  G       R+SK  +L    +  HP +  +   ++D T L
Sbjct: 277 QLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWL---AYESHPTMVGMLRDLKDATGL 333

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
               +  +   LQ+ NYG+GGHY+ H D        P +EG  R+A+ +FYL++VE GGA
Sbjct: 334 ----DTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGN-RIATAIFYLSEVEQGGA 388

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T FP L++ V P+ G+ +FWYN H +   DYR  H+GCPV  G+KW
Sbjct: 389 TAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 434


>gi|291387302|ref|XP_002710242.1| PREDICTED: prolyl 4-hydroxylase, alpha II subunit isoform 1
           precursor (predicted)-like isoform 2 [Oryctolagus
           cuniculus]
 gi|217273039|gb|ACK28132.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Oryctolagus cuniculus]
          Length = 555

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 85/252 (33%), Positives = 126/252 (50%), Gaps = 39/252 (15%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHDGNGAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + +I  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARINRRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----------------- 157
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL                  
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNNERD 460

Query: 158 ---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
                    R+A+F+ Y++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  
Sbjct: 461 AFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTR 520

Query: 209 HSGCPVALGNKW 220
           H+ CPV +G KW
Sbjct: 521 HAACPVLVGCKW 532


>gi|390459659|ref|XP_002806656.2| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-2 [Callithrix jacchus]
          Length = 579

 Score =  132 bits (331), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 84/252 (33%), Positives = 127/252 (50%), Gaps = 39/252 (15%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNN-TFLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N  + L I P K E+ +  P +V+ +D + D
Sbjct: 312 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRASQLLIAPFKEEDEWDSPHIVRYYDVMSD 371

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 372 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 428

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----------------- 157
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL                  
Sbjct: 429 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDERD 484

Query: 158 ---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
                    R+A+F+ Y++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  
Sbjct: 485 AFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGXGDYRTR 544

Query: 209 HSGCPVALGNKW 220
           H+ CPV +G KW
Sbjct: 545 HAACPVLVGCKW 556


>gi|390178148|ref|XP_001358756.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
 gi|388859341|gb|EAL27899.3| GA13990 [Drosophila pseudoobscura pseudoobscura]
          Length = 498

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 79/223 (35%), Positives = 116/223 (52%), Gaps = 20/223 (8%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C G    P    S L C Y S    F +I PLK+EEL  DP +V  HD +Y+SEI+
Sbjct: 263 YKRGCNGVFRAP----SYLHCRYNSTTTAFARIAPLKMEELSHDPYMVLFHDVVYESEID 318

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
            ++  ++ K         G   Y   R SK        + D   +  +  R+ DMT L +
Sbjct: 319 FLLNATQLKASL-----VGQYQYSPVRTSKEQHFVE--YNDTAVVKTLHRRLNDMTGLDM 371

Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGGATIF 177
              +     L + NYG+GGHYD+H D+    E        R+A+ +FY+ +V+ GGAT F
Sbjct: 372 IESD----ALTLINYGMGGHYDVHYDSHNYSEANRLILGDRIATVLFYVGEVDSGGATTF 427

Query: 178 PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           P +N++V P+KGSAV WYN      ++ +  H+GCPV +G+K+
Sbjct: 428 PYINVSVTPKKGSAVLWYNLDNAGQMNPKAIHAGCPVIVGSKY 470


>gi|332211329|ref|XP_003254773.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Nomascus
           leucogenys]
          Length = 544

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 81/224 (36%), Positives = 124/224 (55%), Gaps = 18/224 (8%)

Query: 5   LACQGNL-SVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
           L CQ  L  +P     +L C YE+ +N +L + P++ E ++L+P +   HD + DSE  +
Sbjct: 308 LGCQPTLYQIP-----SLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQK 362

Query: 64  IIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIG 123
           I EL++  ++R  V +    + V+ R+SK  +L   +    P L  +  RI  +T L + 
Sbjct: 363 IRELAEPWLQRSVVASGEKQLQVEYRISKSAWLKDTV---DPMLVTLNHRIAALTGLDV- 418

Query: 124 REERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATI 176
               Y   LQ+ NYG+GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT 
Sbjct: 419 -RPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATA 477

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           F   NL+V   + +A+FW+N H +   D    H+GCPV +G+KW
Sbjct: 478 FIYANLSVPVVRNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKW 521


>gi|20269818|gb|AAM18064.1| prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE1
           [Drosophila melanogaster]
          Length = 286

 Score =  131 bits (330), Expect = 2e-28,   Method: Compositional matrix adjust.
 Identities = 80/226 (35%), Positives = 124/226 (54%), Gaps = 18/226 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G   V   ++  L+C Y   N+ +  + PLK+EE  LDP V   HD +   +I+
Sbjct: 38  YEKVCRGE--VHPIVRQELRCRYSRGNHPYRFLAPLKLEEHSLDPYVATFHDILSPGKIS 95

Query: 63  RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           ++ E++  ++ R  V  +  G       R+SK  +L    +  HP +  +   ++D T L
Sbjct: 96  QLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWL---AYESHPTMVGMLRDLKDATGL 152

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
               +  +   LQ+ NYG+GGHY+ H D        P +EG  R+A+ +FYL++VE GGA
Sbjct: 153 ----DTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGN-RIATAIFYLSEVEQGGA 207

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T FP L++ V P+ G+ +FWYN H +   DYR  H+GCPV  G+KW
Sbjct: 208 TAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 253


>gi|296217074|ref|XP_002754870.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Callithrix
           jacchus]
          Length = 544

 Score =  131 bits (329), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 78/208 (37%), Positives = 117/208 (56%), Gaps = 12/208 (5%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
           +L C YE+ +N +L + P++ E L+L+P +   HD + DSE  +I E ++  ++R  V +
Sbjct: 319 SLYCSYETNSNPYLVLQPIQKEILHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVAS 378

Query: 80  YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
               + V+ R+SK  +L   +    P L  +  RI  +T L +     Y   LQ+ NYG+
Sbjct: 379 GEKQLQVEYRISKSAWLKDTV---DPMLVTLNHRIAALTGLDV--RPPYAEYLQVVNYGI 433

Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
           GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F   NL+V   K +A+
Sbjct: 434 GGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSVPVVKNAAL 493

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FW+N H +   D    H+GCPV +GNKW
Sbjct: 494 FWWNLHRSGEGDSDTLHAGCPVLVGNKW 521


>gi|403263105|ref|XP_003923900.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-3, partial [Saimiri boliviensis boliviensis]
          Length = 534

 Score =  131 bits (329), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 79/208 (37%), Positives = 117/208 (56%), Gaps = 12/208 (5%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
           +L C YE  +N +L + P++ E L+L+P +   HD + DSE  +I EL++  ++R  V +
Sbjct: 309 SLYCSYEINSNPYLLLQPIQKEVLHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVAS 368

Query: 80  YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
               + V+ R+SK  +L   +    P L  +  RI  +T L +     Y   LQ+ NYG+
Sbjct: 369 GEKQLQVEYRISKSAWLKDTV---DPMLVTLNHRIAALTGLDV--RPPYAEYLQVVNYGI 423

Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
           GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F   NL+V   K +A+
Sbjct: 424 GGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSVPVVKNAAL 483

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FW+N H +   D    H+GCPV +GNKW
Sbjct: 484 FWWNLHRSGEGDSDTLHAGCPVLVGNKW 511


>gi|195575099|ref|XP_002105517.1| GD17024 [Drosophila simulans]
 gi|194201444|gb|EDX15020.1| GD17024 [Drosophila simulans]
          Length = 537

 Score =  131 bits (329), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 80/226 (35%), Positives = 123/226 (54%), Gaps = 18/226 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G   V    +  L+C Y   N+ +  + PLK+EE  LDP V   HD +   +I+
Sbjct: 289 YEKVCRGE--VHPIARQELRCRYSRGNHPYRFLAPLKLEEHSLDPYVATFHDMLSPRKIS 346

Query: 63  RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           ++ E++  ++ R  V  +  G       R+SK  +L    +  HP +  +   ++D T L
Sbjct: 347 QLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWL---AYESHPTMVGMLRDLKDATGL 403

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
               +  +   LQ+ NYG+GGHY+ H D        P +EG  R+A+ +FYL++VE GGA
Sbjct: 404 ----DTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGN-RIATAIFYLSEVEQGGA 458

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T FP L++ V P+ G+ +FWYN H +   DYR  H+GCPV  G+KW
Sbjct: 459 TAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 504


>gi|167045848|gb|ABZ10515.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Callithrix jacchus]
          Length = 555

 Score =  131 bits (329), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 84/252 (33%), Positives = 127/252 (50%), Gaps = 39/252 (15%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNN-TFLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N  + L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRASQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----------------- 157
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL                  
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDERD 460

Query: 158 ---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
                    R+A+F+ Y++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  
Sbjct: 461 AFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTR 520

Query: 209 HSGCPVALGNKW 220
           H+ CPV +G KW
Sbjct: 521 HAACPVLVGCKW 532


>gi|197215651|gb|ACH53042.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Otolemur garnettii]
          Length = 555

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 85/252 (33%), Positives = 126/252 (50%), Gaps = 39/252 (15%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           E+Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 288 EVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNHRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----------------- 157
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL                  
Sbjct: 405 ITGLSVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRVATFLNYNHERD 460

Query: 158 ---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
                    R+A+F+ Y++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  
Sbjct: 461 AFKRLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTR 520

Query: 209 HSGCPVALGNKW 220
           H+ CPV +G KW
Sbjct: 521 HAACPVLVGCKW 532


>gi|195341544|ref|XP_002037366.1| GM12151 [Drosophila sechellia]
 gi|194131482|gb|EDW53525.1| GM12151 [Drosophila sechellia]
          Length = 537

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 80/226 (35%), Positives = 123/226 (54%), Gaps = 18/226 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G   V    +  L+C Y   N+ +  + PLK+EE  LDP V   HD +   +I+
Sbjct: 289 YEKVCRGE--VHPIARQELRCRYSRGNHPYRFLAPLKLEEHSLDPYVATFHDMLNPRKIS 346

Query: 63  RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           ++ E++  ++ R  V  +  G       R+SK  +L    +  HP +  +   ++D T L
Sbjct: 347 QLREMAVPRMHRSTVNPLPGGQLKKSAFRVSKNAWL---AYESHPTMVGMLRDLKDATGL 403

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
               +  +   LQ+ NYG+GGHY+ H D        P +EG  R+A+ +FYL++VE GGA
Sbjct: 404 ----DTTFCEQLQVANYGVGGHYEPHWDFFRDPNHYPAEEGN-RIATAIFYLSEVEQGGA 458

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T FP L++ V P+ G+ +FWYN H +   DYR  H+GCPV  G+KW
Sbjct: 459 TAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 504


>gi|170649696|gb|ACB21278.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Callicebus moloch]
          Length = 555

 Score =  130 bits (328), Expect = 3e-28,   Method: Compositional matrix adjust.
 Identities = 84/252 (33%), Positives = 126/252 (50%), Gaps = 39/252 (15%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----------------- 157
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL                  
Sbjct: 405 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDERD 460

Query: 158 ---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
                    R+A+F+ Y++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  
Sbjct: 461 AFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTR 520

Query: 209 HSGCPVALGNKW 220
           H+ CPV +G KW
Sbjct: 521 HAACPVLVGCKW 532


>gi|281183175|ref|NP_001162504.1| prolyl 4-hydroxylase subunit alpha-2 [Papio anubis]
 gi|159461520|gb|ABW96795.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase, alpha
           polypeptide II, isoform 1 (predicted) [Papio anubis]
          Length = 578

 Score =  130 bits (327), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 84/252 (33%), Positives = 126/252 (50%), Gaps = 39/252 (15%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 311 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 370

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 371 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 427

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----------------- 157
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL                  
Sbjct: 428 ITGLTVKTAEL----LQVANYGVGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNDERH 483

Query: 158 ---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
                    R+A+F+ Y++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  
Sbjct: 484 TFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTR 543

Query: 209 HSGCPVALGNKW 220
           H+ CPV +G KW
Sbjct: 544 HAACPVLVGCKW 555


>gi|195390825|ref|XP_002054068.1| GJ24233 [Drosophila virilis]
 gi|194152154|gb|EDW67588.1| GJ24233 [Drosophila virilis]
          Length = 533

 Score =  130 bits (327), Expect = 4e-28,   Method: Compositional matrix adjust.
 Identities = 83/215 (38%), Positives = 118/215 (54%), Gaps = 23/215 (10%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           + L C+Y++  + FL++ P K+E L  DP +   HD IY SEI  +I + +  ++R  V 
Sbjct: 295 TRLTCYYKTNPSEFLRLAPFKLELLSKDPYIAVFHDVIYASEIAELIRIGEPMLKRTAVQ 354

Query: 79  NYGDTIYVDTRLSK------VYFLYPEIFG-DHPFLYKIQTRIQDMTNLVI-GREERYKG 130
           N   T  VDT +SK       + L   +   +   +++IQ RI+DMT L+I G  E+   
Sbjct: 355 NI--TQNVDTYISKDRTATGSWILNGNLTKLERNMIWRIQRRIEDMTGLLITGFSEQ--- 409

Query: 131 PLQINNYGLGGHYDLH-----CDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVF 185
            LQ+ NY  GGHY  H     C + P D    R+A+ + YL DV  GGAT+FP L+L V 
Sbjct: 410 DLQLLNYVFGGHYQSHYDFFNCPSFPHD----RIATTLIYLNDVVRGGATVFPKLDLVVQ 465

Query: 186 PEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
           PE+G  + WYN   +T   D R  H GCPV +G K
Sbjct: 466 PERGKVLHWYNMLPDTFDYDRRSLHGGCPVLIGEK 500


>gi|195341556|ref|XP_002037372.1| GM12148 [Drosophila sechellia]
 gi|194131488|gb|EDW53531.1| GM12148 [Drosophila sechellia]
          Length = 542

 Score =  130 bits (326), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 84/227 (37%), Positives = 114/227 (50%), Gaps = 15/227 (6%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           + P  C G  +VP ++ S L C Y    + FL++ P+K E L +DP VV +HD I   E 
Sbjct: 288 VLPPCCSGRCAVPRNLNS-LYCVYNHVTSPFLQLAPIKTEILSVDPFVVLLHDMISQKES 346

Query: 62  NRIIELSKGKVERGKVVN--YGDT-IYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
             I   SK  +      +    DT   VDT  +     Y   F D     KI  R+ D T
Sbjct: 347 TLIRNSSKEHMLPSATTDPDASDTETQVDTYRTSKSVWYSSDFNDTT--KKITERLGDAT 404

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGG 173
            L +   E Y    Q+ NYGLGG ++ H D    ++  +     R+A+ +FYL +V  GG
Sbjct: 405 GLDMNFTEFY----QVINYGLGGFFETHLDMLLSEKNRFNGTRDRIATTLFYLNEVRQGG 460

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            T FP LNLTVFP+ GSA+FWYN            H+GCPV +G+KW
Sbjct: 461 GTYFPRLNLTVFPQPGSALFWYNLDTKGNDHMDSLHTGCPVIVGSKW 507


>gi|443712762|gb|ELU05926.1| hypothetical protein CAPTEDRAFT_153364 [Capitella teleta]
          Length = 491

 Score =  130 bits (326), Expect = 5e-28,   Method: Compositional matrix adjust.
 Identities = 87/232 (37%), Positives = 119/232 (51%), Gaps = 20/232 (8%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELY-LDPRVVKIHDAIYDSEI 61
           Y   C+G++ V E  KS L C Y    +  L   P+  EE++ +DP V   +D I D+E 
Sbjct: 240 YQELCRGDMIVEESKKSLLYCRYAKGRDIPL---PIYKEEVHNVDPHVAIFYDVISDAEA 296

Query: 62  NRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT--N 119
           + II  +   + RG V N       D R+SKV +L+  +      + K+  RI D+T  N
Sbjct: 297 DHIIRHAFPGMFRGLVGNSTLRQSSDQRISKVGWLFDNV---DTLIKKLSARIGDVTGLN 353

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----------RLASFMFYLTD 168
            V          +Q+ NYG+GG Y+ H D     E L            R+++F+FYL+ 
Sbjct: 354 TVYTPVRSPVEAMQVVNYGIGGQYEPHLDFYEDPEMLKNVNPSLQDTGDRISTFLFYLSR 413

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           V LGGAT+FP LN+ V P K  A FWYNA  N   D R  H+GCPV LG KW
Sbjct: 414 VHLGGATVFPKLNVRVPPVKNGAAFWYNARPNGEHDKRTLHAGCPVVLGEKW 465


>gi|194764881|ref|XP_001964556.1| GF23245 [Drosophila ananassae]
 gi|190614828|gb|EDV30352.1| GF23245 [Drosophila ananassae]
          Length = 460

 Score =  130 bits (326), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 77/223 (34%), Positives = 123/223 (55%), Gaps = 21/223 (9%)

Query: 5   LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
           L  + N S    + ++L C Y S  + FL I PLK+EE+  DP +V  HD IY++EIN +
Sbjct: 227 LGAKRNCSAKFRLPNHLHCRYNSSTSPFLHIAPLKMEEISTDPYMVVYHDVIYENEINWL 286

Query: 65  IELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHP--FLYKIQTRIQDMTNLVI 122
           ++ S          ++  ++  ++++S +       FG +    +  I+ RI+DMT L +
Sbjct: 287 LDNS----------DFRTSLVGESQISTLRTSQDMPFGANSGEVMRNIEKRIKDMTGLSM 336

Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGGATIF 177
              E +     + NYG+GG Y +H D     E L      R+ + +FYL DVEL G+T+F
Sbjct: 337 DLSEDF----MLINYGIGGTYKMHYDFYVYSEPLRFLRGERIVTVLFYLGDVELSGSTVF 392

Query: 178 PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           P LN+++ P+KGSAV WYN H +  +  +  H  CPV +G+K+
Sbjct: 393 PFLNISITPKKGSAVMWYNLHNSGDVHQKTQHCACPVVVGSKY 435


>gi|195575111|ref|XP_002105523.1| GD16991 [Drosophila simulans]
 gi|194201450|gb|EDX15026.1| GD16991 [Drosophila simulans]
          Length = 542

 Score =  129 bits (325), Expect = 6e-28,   Method: Compositional matrix adjust.
 Identities = 83/227 (36%), Positives = 115/227 (50%), Gaps = 15/227 (6%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           + P  C G  +VP ++ S+L C Y    + FL++ P+K E L +DP V+ +HD I   E 
Sbjct: 288 VLPPCCSGRCAVPRNL-SSLYCVYNHVTSPFLQLAPIKTEILSVDPFVLLLHDMISQKES 346

Query: 62  NRIIELSKGKVERGKVVN--YGDT-IYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
             I   SK  +      +    DT   VDT  +     Y   F D     KI  R+ D T
Sbjct: 347 TLIRNSSKEHMLPSATTDPDSSDTETQVDTYRTSKSVWYSSDFNDTT--KKITERLGDAT 404

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGG 173
            L     E Y    Q+ NYGLGG ++ H D    ++  +     R+A+ +FYL +V  GG
Sbjct: 405 GLDTNFTEFY----QVINYGLGGFFETHLDMLLSEKNRFNGTRDRIATTLFYLNEVRQGG 460

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            T FP +NLTVFP+ GSA+FWYN   N        H+GCPV +G+KW
Sbjct: 461 GTYFPRINLTVFPQPGSALFWYNLDTNGNDHMGSLHTGCPVIVGSKW 507


>gi|297689698|ref|XP_002822285.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pongo abelii]
          Length = 544

 Score =  129 bits (325), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 76/208 (36%), Positives = 118/208 (56%), Gaps = 12/208 (5%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
           +L C YE+ +N +L + P++ E ++L+P +   HD + DSE  +I EL++  ++R  V +
Sbjct: 319 SLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVAS 378

Query: 80  YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
               + V+ R+SK  +L   +    P L  +  RI  +T L +     Y   LQ+ NYG+
Sbjct: 379 GEKQLQVEYRISKSAWLKDTV---DPMLVTLNHRIAALTGLDV--RPPYAEYLQVVNYGI 433

Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
           GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F   NL+V   + +A+
Sbjct: 434 GGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAAL 493

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FW+N H +   D    H+GCPV +G+KW
Sbjct: 494 FWWNLHRSGEGDSDTLHAGCPVLVGDKW 521


>gi|195159305|ref|XP_002020522.1| GL13469 [Drosophila persimilis]
 gi|194117291|gb|EDW39334.1| GL13469 [Drosophila persimilis]
          Length = 253

 Score =  129 bits (325), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 74/209 (35%), Positives = 113/209 (54%), Gaps = 24/209 (11%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           S L C Y +    FL++ PLK+E L LDP VV  HD + D E++ +  +++  + R    
Sbjct: 32  SRLYCLYNTTATAFLRLAPLKMELLSLDPYVVLYHDVLADREMSLLKLMAQRDLVRAVTY 91

Query: 79  NYGDTIYVD--TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINN 136
           N  +  + +   R +K  +L P     H  + ++    +DM+NL + R E +    Q+ N
Sbjct: 92  NATEKKHSEDPNRTTKAGWLDPS----HNLIRRMGILTEDMSNLDLERSEDF----QVLN 143

Query: 137 YGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYN 196
           YG+GGHY +H D               F L+DV LGGAT+FP L+L+VFP+KG+ + WYN
Sbjct: 144 YGIGGHYAVHPD--------------FFELSDVPLGGATVFPLLDLSVFPKKGAVLMWYN 189

Query: 197 AHANTLLDYRMYHSGCPVALGNKWGKLLL 225
                    +  HS CPV +G++WGK+ L
Sbjct: 190 LDHKGQGMEKTIHSACPVVVGSRWGKINL 218


>gi|350014318|dbj|GAA37183.1| prolyl 4-hydroxylase [Clonorchis sinensis]
          Length = 595

 Score =  129 bits (325), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 84/229 (36%), Positives = 119/229 (51%), Gaps = 17/229 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           EIY   C+G    P      + C Y    + + KIGP+K E LY DPR+V  +D I+ SE
Sbjct: 345 EIYQALCRGEQLFPPPPDDQVYCRY-YIPHPYYKIGPVKEEVLYPDPRIVMWYDVIHPSE 403

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           + RI EL+  ++ R  V N   G       R SK  +L     G     +++  RI  +T
Sbjct: 404 VGRIQELALPRLRRATVKNPVTGKLENAYYRTSKSAWLQD---GLDEVTHRLNQRIHALT 460

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDVEL 171
            L +   E     LQ+ NYG+GG+Y  H D    R++  +      R+A+ +FYLTDV+ 
Sbjct: 461 GLAMETAE----DLQVGNYGIGGYYAPHFDFGRKREKDAFEVENGNRIATIIFYLTDVKA 516

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GGAT+F     +V P +G+A FWYN H +   D R  H  CPV +G+KW
Sbjct: 517 GGATVFNRFGASVKPVRGAAGFWYNLHPSGEGDLRTRHVACPVLVGSKW 565


>gi|229368743|gb|ACQ63024.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Dasypus novemcinctus]
          Length = 556

 Score =  129 bits (325), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 84/252 (33%), Positives = 126/252 (50%), Gaps = 39/252 (15%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N T  L I P K E+ +  P +V+ +D + D
Sbjct: 289 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDIMSD 348

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L      D P + ++  R++ 
Sbjct: 349 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEE---NDDPVVAQVNRRMEH 405

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----------------- 157
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL                  
Sbjct: 406 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYNHEQD 461

Query: 158 ---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
                    R+A+F+ Y++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  
Sbjct: 462 VFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTR 521

Query: 209 HSGCPVALGNKW 220
           H+ CPV +G KW
Sbjct: 522 HAACPVLVGCKW 533


>gi|195064500|ref|XP_001996577.1| GH12091 [Drosophila grimshawi]
 gi|193895397|gb|EDV94263.1| GH12091 [Drosophila grimshawi]
          Length = 521

 Score =  129 bits (325), Expect = 7e-28,   Method: Compositional matrix adjust.
 Identities = 78/224 (34%), Positives = 124/224 (55%), Gaps = 20/224 (8%)

Query: 5   LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
           + C+G+   P+  + NL C Y      FL++ PLK+EE+  DP VV  H+ IYDSEI  +
Sbjct: 288 VGCRGHF--PK--RHNLSCRYNFTTTPFLRLAPLKLEEINHDPYVVMYHNVIYDSEIEEM 343

Query: 65  IELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGR 124
             LS  +++ G +  Y       T+++ +      +  + PFL ++  RI DMT    G 
Sbjct: 344 KRLSP-QMQNGYIHGYKAN---QTKVTDIAARVNWLVENTPFLERMNQRITDMT----GF 395

Query: 125 EERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLW----RLASFMFYLTDVELGGATI 176
           + +    +Q+ N+G+G +++ H D       R E +     RLAS +FY +DV LGGAT+
Sbjct: 396 DLKEFPSVQVANFGIGNNFEAHYDYIFGKRVRKEDVGDLGDRLASIIFYSSDVPLGGATV 455

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FP + + V P+KG+++ WYN   +   D R  HS CPV +G++W
Sbjct: 456 FPDIQVAVQPQKGNSLLWYNLFDDGTPDPRSLHSVCPVVVGSRW 499


>gi|55925444|ref|NP_001007286.1| prolyl 4-hydroxylase subunit alpha-2 precursor [Danio rerio]
 gi|49900294|gb|AAH76508.1| Procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide 2 [Danio rerio]
 gi|182891794|gb|AAI65288.1| P4ha2 protein [Danio rerio]
          Length = 514

 Score =  129 bits (324), Expect = 8e-28,   Method: Compositional matrix adjust.
 Identities = 78/224 (34%), Positives = 117/224 (52%), Gaps = 25/224 (11%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYD 58
           E Y   C+G  + +    +S L C Y   N N  L + P+K E+ +  P +V+  +A+ D
Sbjct: 289 EAYEALCRGEGVKMTTKRQSRLFCRYRDGNRNPRLLLKPMKEEDEWDSPHIVRFLEALSD 348

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI +I E++  K+ R  V +   G       R+SK  +L  E   D P + ++  RI+D
Sbjct: 349 EEIQKIKEIATPKLARATVRDPKTGVLTVAHYRVSKSAWLEGE---DDPVIARVNQRIED 405

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATI 176
           +T L +   E     LQ+ NYG+GG Y+ H D +               ++DVE GGAT+
Sbjct: 406 ITGLTVDTAEL----LQVANYGVGGQYEPHFDFS--------------RMSDVEAGGATV 447

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FP    +V+P KG+AVFWYN   +   DYR  H+ CPV +G+KW
Sbjct: 448 FPDFGASVWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGSKW 491


>gi|402894624|ref|XP_003910453.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-3 [Papio anubis]
          Length = 535

 Score =  129 bits (324), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 76/208 (36%), Positives = 117/208 (56%), Gaps = 12/208 (5%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
           +L C YE+ +N +L + P++ E ++L+P +   HD + DSE  +I E ++  ++R  V +
Sbjct: 310 SLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVAS 369

Query: 80  YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
               + V+ R+SK  +L   +    P L  +  RI  +T L +     Y   LQ+ NYG+
Sbjct: 370 GEKQLQVEYRISKSAWLKDTV---DPMLVTLNHRIAALTGLDV--RPPYAEYLQVVNYGI 424

Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
           GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F   NL+V   K +A+
Sbjct: 425 GGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSVPVVKNAAL 484

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FW+N H +   D    H+GCPV +G+KW
Sbjct: 485 FWWNLHRSGEGDSDTLHAGCPVLVGDKW 512


>gi|195444366|ref|XP_002069834.1| GK11733 [Drosophila willistoni]
 gi|194165919|gb|EDW80820.1| GK11733 [Drosophila willistoni]
          Length = 517

 Score =  129 bits (324), Expect = 9e-28,   Method: Compositional matrix adjust.
 Identities = 86/222 (38%), Positives = 113/222 (50%), Gaps = 20/222 (9%)

Query: 6   ACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
            C+G    P+     L C+YE   + FL+I P KVE L   P V   +D + DSEI  + 
Sbjct: 287 CCRGEYKPPK----GLSCYYEYGADPFLRIAPFKVELLNRSPYVAAYYDVLNDSEIEELK 342

Query: 66  ELSKGKVERGKVVNYG---DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL-- 120
            +S  ++ R  + N+    D   VD   + V+     I      L  I  R  DMT+L  
Sbjct: 343 LMSSPQIRRSLLYNHTLDIDQADVDRTSNSVFMEETGI----TLLETISQRAADMTDLYV 398

Query: 121 -VIGREERYKGPLQINNYGLGGHYDLHCDATPRD-EGLWRLASFMFYLTDVELGGATIFP 178
             I  E+     LQ+ NYGLGG Y  HCD    + E   RLA+ +FYLTDV+ GGAT+FP
Sbjct: 399 TAISSED-----LQVINYGLGGQYTPHCDYFDENAENGDRLATVLFYLTDVQQGGATVFP 453

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            L L+ FP+KGSA+ + N       D    HS CPV  GNKW
Sbjct: 454 FLRLSYFPKKGSALIFRNLDNAMSGDKDSTHSACPVLFGNKW 495


>gi|116496629|gb|AAI26171.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
          Length = 544

 Score =  129 bits (324), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 76/208 (36%), Positives = 119/208 (57%), Gaps = 12/208 (5%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
           +L C YE+ +N +L + P++ E ++L+P +   HD + DSE  +I EL++  ++R  V +
Sbjct: 319 SLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVAS 378

Query: 80  YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
               + V+ R+SK  +L   +   +P L  +  RI  +T L +     Y   LQ+ NYG+
Sbjct: 379 GEKQLQVEYRISKSAWLKDTV---NPKLVTLNHRIAALTGLDV--RPPYAEYLQVVNYGI 433

Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
           GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F   NL+V   + +A+
Sbjct: 434 GGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAAL 493

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FW+N H +   D    H+GCPV +G+KW
Sbjct: 494 FWWNLHRSGEGDSDTLHAGCPVLVGDKW 521


>gi|195145084|ref|XP_002013526.1| GL24185 [Drosophila persimilis]
 gi|194102469|gb|EDW24512.1| GL24185 [Drosophila persimilis]
          Length = 229

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 75/207 (36%), Positives = 113/207 (54%), Gaps = 16/207 (7%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           S L C Y S    F +I PLK+EEL  DP +V  HD +Y+SEI+ ++  ++ K       
Sbjct: 6   SYLHCRYNSTTTAFARIAPLKMEELSHDPYMVLFHDVVYESEIDFLLNATQLKASL---- 61

Query: 79  NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYG 138
             G   Y   R SK        + D   +  +  R+ DMT L +   +     L + NYG
Sbjct: 62  -VGQYQYSPVRTSKEQHFVE--YNDTAVVKTLHRRLNDMTGLDMIESDT----LTLINYG 114

Query: 139 LGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVF 193
           +GGHYD+H D+    E        R+A+ +FY+ +V+ GGAT FP +N++V P+KGSAV 
Sbjct: 115 MGGHYDVHYDSHNYSEANRLILGDRIATVLFYVGEVDSGGATTFPYINVSVTPKKGSAVL 174

Query: 194 WYNAHANTLLDYRMYHSGCPVALGNKW 220
           WYN   +  ++ +  H+GCPV +G+K+
Sbjct: 175 WYNLDNSGQMNPKAIHAGCPVIVGSKY 201


>gi|195505251|ref|XP_002099423.1| GE23370 [Drosophila yakuba]
 gi|194185524|gb|EDW99135.1| GE23370 [Drosophila yakuba]
          Length = 534

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 77/231 (33%), Positives = 118/231 (51%), Gaps = 24/231 (10%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           + L+C G    P +  + L CFY      FL++ PLK E++ LDP VV  H+ +   EI+
Sbjct: 287 FKLSCNG----PHESSTRLHCFYNFTTTPFLRLAPLKTEQIGLDPYVVLYHEVLSAREIS 342

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
            +I  +   ++  +V           R +K ++L  E    +    +I  RI DMT   +
Sbjct: 343 MLISKAAQNMKNTRVHRETKPKTNRGRTAKGHWLKKE---SNELTRRITRRIVDMTGFDL 399

Query: 123 GREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEGLW----RLASFMFYLTDV 169
              E +    Q+ NYG+GGHY LH D           PR         R+A+ +FYL+DV
Sbjct: 400 ADSEDF----QVINYGIGGHYFLHMDYFDYASSNYTGPRSRQSKVLGDRIATVLFYLSDV 455

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           E GGAT+F ++  +V+P+ G+A+FWYN   +   D    H+ CPV +G+KW
Sbjct: 456 EQGGATVFGNVGYSVYPQAGTAIFWYNLDTDGNGDPLTRHASCPVIVGSKW 506


>gi|426369750|ref|XP_004051847.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Gorilla
           gorilla gorilla]
          Length = 517

 Score =  129 bits (323), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 76/208 (36%), Positives = 118/208 (56%), Gaps = 12/208 (5%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
           +L C YE+ +N +L + P++ E ++L+P +   HD + DSE  +I EL++  ++R  V +
Sbjct: 292 SLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVAS 351

Query: 80  YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
               + V+ R+SK  +L   +    P L  +  RI  +T L +     Y   LQ+ NYG+
Sbjct: 352 GEKQLQVEYRISKSAWLKDTV---DPKLVALNHRIAALTGLDV--RPPYAEYLQVVNYGI 406

Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
           GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F   NL+V   + +A+
Sbjct: 407 GGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAAL 466

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FW+N H +   D    H+GCPV +G+KW
Sbjct: 467 FWWNLHRSGEGDSDTLHAGCPVLVGDKW 494


>gi|195505202|ref|XP_002099402.1| GE23382 [Drosophila yakuba]
 gi|194185503|gb|EDW99114.1| GE23382 [Drosophila yakuba]
          Length = 537

 Score =  128 bits (322), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 79/226 (34%), Positives = 123/226 (54%), Gaps = 18/226 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G   V    +  L+C Y   ++ +  + PLK+EE  LDP V   HD +   +I+
Sbjct: 289 YEKVCRGE--VHPIARQELRCRYSRGSHPYRYLAPLKLEEHSLDPYVATYHDMLSPRKIS 346

Query: 63  RIIELSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           ++ E++  ++ R  V  +  G       R+SK  +L    +  HP +  +   +++ T L
Sbjct: 347 QLREMAVPRMRRSTVNPLPGGQHKKSAFRVSKNAWL---AYESHPTMVGMLRDLKEATGL 403

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
               +  Y   LQ+ NYG+GGHY+ H D        P +EG  R+A+ +FYL++VE GGA
Sbjct: 404 ----DTTYCEQLQVANYGVGGHYEPHWDFFRDPNHYPEEEGN-RIATAIFYLSEVEQGGA 458

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T FP L++ V P+ G+ +FWYN H +   DYR  H+GCPV  G+KW
Sbjct: 459 TAFPFLDIAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 504


>gi|33589818|ref|NP_878907.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Homo sapiens]
 gi|114639354|ref|XP_001174896.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan troglodytes]
 gi|397487266|ref|XP_003814725.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Pan paniscus]
 gi|74738714|sp|Q7Z4N8.1|P4HA3_HUMAN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|33188232|gb|AAP97874.1| prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
 gi|36962719|gb|AAQ87603.1| collagen prolyl 4-hydroxylase alpha III subunit [Homo sapiens]
 gi|37182165|gb|AAQ88885.1| GPGA711 [Homo sapiens]
 gi|109658570|gb|AAI17334.1| Prolyl 4-hydroxylase, alpha polypeptide III [Homo sapiens]
 gi|119595341|gb|EAW74935.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide III, isoform CRA_b
           [Homo sapiens]
 gi|410219716|gb|JAA07077.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
 gi|410248278|gb|JAA12106.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
 gi|410336087|gb|JAA36990.1| prolyl 4-hydroxylase, alpha polypeptide III [Pan troglodytes]
          Length = 544

 Score =  128 bits (322), Expect = 1e-27,   Method: Compositional matrix adjust.
 Identities = 76/208 (36%), Positives = 118/208 (56%), Gaps = 12/208 (5%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
           +L C YE+ +N +L + P++ E ++L+P +   HD + DSE  +I EL++  ++R  V +
Sbjct: 319 SLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVAS 378

Query: 80  YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
               + V+ R+SK  +L   +    P L  +  RI  +T L +     Y   LQ+ NYG+
Sbjct: 379 GEKQLQVEYRISKSAWLKDTV---DPKLVTLNHRIAALTGLDV--RPPYAEYLQVVNYGI 433

Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
           GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F   NL+V   + +A+
Sbjct: 434 GGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAAL 493

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FW+N H +   D    H+GCPV +G+KW
Sbjct: 494 FWWNLHRSGEGDSDTLHAGCPVLVGDKW 521


>gi|59809017|gb|AAH89446.1| P4HA3 protein [Homo sapiens]
          Length = 528

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 76/208 (36%), Positives = 118/208 (56%), Gaps = 12/208 (5%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
           +L C YE+ +N +L + P++ E ++L+P +   HD + DSE  +I EL++  ++R  V +
Sbjct: 303 SLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVAS 362

Query: 80  YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
               + V+ R+SK  +L   +    P L  +  RI  +T L +     Y   LQ+ NYG+
Sbjct: 363 GEKQLQVEYRISKSAWLKDTV---DPKLVTLNHRIAALTGLDV--RPPYAEYLQVVNYGI 417

Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
           GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F   NL+V   + +A+
Sbjct: 418 GGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAAL 477

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FW+N H +   D    H+GCPV +G+KW
Sbjct: 478 FWWNLHRSGEGDSDTLHAGCPVLVGDKW 505


>gi|195425415|ref|XP_002061004.1| GK10713 [Drosophila willistoni]
 gi|194157089|gb|EDW71990.1| GK10713 [Drosophila willistoni]
          Length = 502

 Score =  128 bits (322), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 80/224 (35%), Positives = 117/224 (52%), Gaps = 18/224 (8%)

Query: 6   ACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
            C+G    P+     L C+Y+S +  FL + P KVE L   P V   HD +YD EI  + 
Sbjct: 250 CCRGEYEHPK----GLSCYYDSKDEPFLFLAPFKVEILNNLPFVAIYHDVLYDREIEELK 305

Query: 66  ELSKGKVERGKVVNYGD--TIYVDTRLSKVYFLYPEIFGDHPFLYKI-QTRIQDMTNLVI 122
            L+   + R  + +Y     + V+ R S   FL      +  +L  I + R+ DMT+L +
Sbjct: 306 RLAVPTITRSTIYDYDKEGNVPVNFRTSNSVFL----LNNASYLVDILRQRVADMTHLNV 361

Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGGATIF 177
            +       LQ+ NYGLGG+Y  H D   +DE        R+ + + Y+TDV+ GGAT+F
Sbjct: 362 FKNS--SDDLQVMNYGLGGYYRYHFDFFGKDESPNKLLGDRIITVLIYMTDVQQGGATVF 419

Query: 178 PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
           P+L +T FP+KGSA+ + N   N   D    H+GCPV  G+KW 
Sbjct: 420 PALRITNFPKKGSALIFRNLDNNISPDPSTLHAGCPVLFGSKWA 463


>gi|443709455|gb|ELU04127.1| hypothetical protein CAPTEDRAFT_149240 [Capitella teleta]
          Length = 532

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 80/229 (34%), Positives = 116/229 (50%), Gaps = 17/229 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           + Y   C+G   VP      L C Y  ++  F  I PL+ E   L+P +   H  + D E
Sbjct: 291 QTYEALCRGEDVVPVKDPHKLTCQYRFWHPMFY-INPLREETASLEPWIAVYHQLMNDHE 349

Query: 61  INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I RI E++  ++ R  V N   G   +   R+SK  +L  E   + P + +I  R   +T
Sbjct: 350 IERIKEMATPRLARATVHNSATGQLEHAKYRISKSGWLRDE---EDPLIARISERCSALT 406

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL----WR---LASFMFYLTDVEL 171
           NL +   E     LQ+ NYG+GG Y+ H D + R E      WR   + + ++Y+TDVE 
Sbjct: 407 NLSLTTVEE----LQVVNYGIGGQYEPHFDFSRRSEPTAFEKWRGNRILTVIYYMTDVEA 462

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GGAT+F    + V+PEKGSA  W+N   +   D R  H+ CPV  G+KW
Sbjct: 463 GGATVFLDAGVKVYPEKGSAAVWHNLLPSGEGDMRTRHAACPVLTGSKW 511


>gi|47218149|emb|CAG10069.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 595

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 80/252 (31%), Positives = 125/252 (49%), Gaps = 41/252 (16%)

Query: 3   YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +S L C +Y+S  +    + P+K ++ +  P +V+  D I D E
Sbjct: 328 YEMLCRGEGIRLTPRRQSRLFCRYYDSKRHPRYILSPVKQQDEWDRPYIVRYLDIISDKE 387

Query: 61  INRIIELSKGKVERGKVVN------------------------YGDTIYVDTRLSKVYFL 96
           I  + +L+K ++ R  + N                         G       R+SK  +L
Sbjct: 388 IELVKQLAKPRLRRATISNPITGVLETASYRISKRRATVHDPQTGKLTTAQYRVSKSAWL 447

Query: 97  YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL 156
                 +HP +  I  RI+D+T L +   E     LQ+ NYG+GG Y+ H D   +DE  
Sbjct: 448 ---TGYEHPVIETINQRIEDLTGLEVDTAEE----LQVANYGVGGQYEPHFDFGRKDEPD 500

Query: 157 W--------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
                    R+A+++FY++DV  GGAT+FP +   V+P+KGSAVFWYN   +   DY   
Sbjct: 501 AFKELGTGNRIATWLFYMSDVAAGGATVFPDVGAAVWPQKGSAVFWYNLFTSGEGDYSTR 560

Query: 209 HSGCPVALGNKW 220
           H+ CPV +GNKW
Sbjct: 561 HAACPVLVGNKW 572


>gi|195452730|ref|XP_002073475.1| GK13125 [Drosophila willistoni]
 gi|194169560|gb|EDW84461.1| GK13125 [Drosophila willistoni]
          Length = 539

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 78/231 (33%), Positives = 120/231 (51%), Gaps = 23/231 (9%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           +Y   C+G        +  L+C       + L    L++EEL+ DP VV++H+ +   ++
Sbjct: 287 LYEQVCRGETRPSAKSQRELRC---RLQRSRLSYEVLELEELHQDPFVVQVHNIVSQKDM 343

Query: 62  NRIIELSKGKVERGKVV----NYGDTIYVDTRLSK-VYFLYPEIFGDHPFLYKIQTRIQD 116
           N + ++++  ++R +V     N  +T+    R SK   F Y E    H  +  +   + D
Sbjct: 344 NLLQKIARPNIQRSQVYAQDHNANETVAAAYRTSKGATFEYFE----HRSMELLSRHVAD 399

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDA-------TPRDEGLWRLASFMFYLTDV 169
           ++ L +   E     LQI NYG+GGHY+ H D         P D    R+A+ ++YL++V
Sbjct: 400 LSGLDMNSAEL----LQIANYGIGGHYEPHWDCFPDHHVYLPDDRDGNRIATGIYYLSEV 455

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           E GG T FP L L V PE+GS VFWYN H +   DYR  H+ CPV  G+KW
Sbjct: 456 EAGGGTAFPFLPLLVTPERGSLVFWYNLHRSGDQDYRTKHAACPVLQGSKW 506


>gi|194905410|ref|XP_001981191.1| GG11931 [Drosophila erecta]
 gi|190655829|gb|EDV53061.1| GG11931 [Drosophila erecta]
          Length = 537

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 80/226 (35%), Positives = 121/226 (53%), Gaps = 18/226 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G   V    +  L+C Y   ++ +  + PLK+EE  LDP V   HD +   +I+
Sbjct: 289 YEEVCRGE--VQPIARQELRCRYSRGSHPYRILAPLKLEEHSLDPYVASFHDMLSPRKIS 346

Query: 63  RIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           ++ E++  +++R  V     G       R+SK  +L  E    HP +  +   ++D T L
Sbjct: 347 QLREMAVPRMQRSTVNPRPGGQHKKSAFRVSKNAWLAYEA---HPTMAGMLRDLKDATGL 403

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGA 174
               +  +   LQ+ NYG+GGHY+ H D        P  EG  R+A+ +FYL++VE GGA
Sbjct: 404 ----DTTFCEQLQVANYGVGGHYEPHWDFFRDPSHYPAAEGN-RIATAIFYLSEVEQGGA 458

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T FP L+  V P+ G+ +FWYN H +   DYR  H+GCPV  G+KW
Sbjct: 459 TAFPFLDFAVKPQLGNVLFWYNLHRSLDKDYRTKHAGCPVLKGSKW 504


>gi|195575105|ref|XP_002105520.1| GD21524 [Drosophila simulans]
 gi|194201447|gb|EDX15023.1| GD21524 [Drosophila simulans]
          Length = 448

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 76/201 (37%), Positives = 107/201 (53%), Gaps = 12/201 (5%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           + L C Y +  + FL++ PLK+E L LDP +V  HD + D +I  I  ++KG++ R   V
Sbjct: 255 TKLYCLYNTTASYFLRLAPLKMELLSLDPYMVLFHDVVSDKDIVSIRNMAKGRLARAVTV 314

Query: 79  NYGDTIYVD-TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
           +       D  R +K  +L      +   + ++    QDMTN  I   +    P Q+ NY
Sbjct: 315 SKDGNYTEDPDRTTKGTWL----VENSKLIQRLSQLTQDMTNFEIHDAD----PFQVLNY 366

Query: 138 GLGGHYDLHCD---ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFW 194
           G+GG Y +H D       D    R+A+ +FYL+DV  GGATIFP L L+VFP+KGSA+ W
Sbjct: 367 GIGGFYGIHLDFLGEAELDNFSDRIATAVFYLSDVPQGGATIFPKLGLSVFPKKGSALLW 426

Query: 195 YNAHANTLLDYRMYHSGCPVA 215
           YN       D R  HS CP  
Sbjct: 427 YNLDHKGDGDNRTAHSACPTV 447


>gi|256083648|ref|XP_002578053.1| prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
 gi|360044447|emb|CCD81995.1| putative prolyl 4-hydroxylase alpha subunit 1 [Schistosoma mansoni]
          Length = 584

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 85/229 (37%), Positives = 120/229 (52%), Gaps = 17/229 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E+Y   C+     P     +L C Y +  + F KIGP+K E L  DPR+V  +D I+ SE
Sbjct: 334 ELYESLCRNENPFPTVPSHHLTCRYYT-PHAFFKIGPVKEETLNPDPRIVMWYDLIFPSE 392

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDT--RLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I +I EL+  ++ R  V N    I      R SK  +L P    +     +I  RI+ +T
Sbjct: 393 IEKIKELATPRLRRATVKNPVTGILEIAFYRTSKSAWL-PHSMSE--ITDQISQRIRAVT 449

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDVEL 171
            L +   E     LQ+ NYGLGGHY  H D    R++  +      R+A+ +FYL+DV+ 
Sbjct: 450 GLSLETAE----DLQVGNYGLGGHYAPHFDFGRKREKDAFEVKNGNRIATIIFYLSDVQA 505

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GGAT+F  +   V P+KG+A FW+N   N   D R  H+ CPV  G+KW
Sbjct: 506 GGATVFNRIGTRVVPKKGAAGFWFNLLPNGEGDLRTRHAACPVLAGSKW 554


>gi|297301157|ref|XP_001103971.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Macaca
           mulatta]
          Length = 512

 Score =  128 bits (321), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 76/222 (34%), Positives = 116/222 (52%), Gaps = 25/222 (11%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFP 178
            L +   E     LQ+ NYG+GG Y+ H D                 ++DV  GGAT+FP
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFD--------------FARMSDVSAGGATVFP 447

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 448 EVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 489


>gi|313229039|emb|CBY18191.1| unnamed protein product [Oikopleura dioica]
          Length = 522

 Score =  127 bits (320), Expect = 2e-27,   Method: Compositional matrix adjust.
 Identities = 81/227 (35%), Positives = 123/227 (54%), Gaps = 14/227 (6%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCF-YESYNNTFL-KIGPLKVEELYLDPRVVKIHDAIYD 58
           E+  L      ++ ++   +L+CF ++ + + F  ++GP KVEE+   P VV+  D + D
Sbjct: 275 ELCQLGYNNEHTIRDNNDDSLRCFLFKGHEDDFFSQLGPWKVEEIAKQPYVVRFFDILND 334

Query: 59  SEINRIIELSKGKVERGKVVNYG--DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
           +EIN +  L + K+ R  V +      +  D R+SK  +L  E   D   + K   RI  
Sbjct: 335 NEINSLERLGEEKLARATVFDPATHKLVNADYRVSKSAWLKDE---DSDTVEKYNRRISR 391

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW---RLASFMFYLTDVELGG 173
           +T L +     Y   LQ++NYG+GG Y+ H D + R+  ++   R+A+++ YLT VE GG
Sbjct: 392 LTGLDL----EYAEQLQMSNYGIGGQYEPHYDYSRREWDIYNNRRIATWLSYLTTVEQGG 447

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            T+F  L L +   KGSAVFWYN   N   D R  H+ CPV  GNKW
Sbjct: 448 GTVFTELGLHIRSIKGSAVFWYNLLPNGSGDERTRHAACPVLRGNKW 494


>gi|195591296|ref|XP_002085378.1| GD14754 [Drosophila simulans]
 gi|194197387|gb|EDX10963.1| GD14754 [Drosophila simulans]
          Length = 508

 Score =  127 bits (320), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 80/220 (36%), Positives = 115/220 (52%), Gaps = 31/220 (14%)

Query: 11  LSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKG 70
           LS       +L C YE   + FL+I PLKVE L L P +V  HD IYDSEI+++  +S  
Sbjct: 279 LSSVSQTSQHLSCHYEQNTSEFLRIAPLKVETLSLKPHIVLYHDVIYDSEISKVKNISLP 338

Query: 71  KVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKG 130
            ++    +   D +  + +L+       +I  DH     +  RI+DMT    G + +   
Sbjct: 339 SLKSP--LRIIDAVDYNLKLA-------QIRDDHQ--SPLSLRIKDMT----GEDVQEDS 383

Query: 131 PLQINNYGLGGHYDLHCDATPRDEGLW----RLASFMFYLTDVELGGATIFPSLNLTVFP 186
             QI+NYG+ G  + H D     +       RL S +F++TDV  GGA  FP+LNLT++P
Sbjct: 384 DFQIDNYGICGFRNFHTDNIEMQDQTAELGDRLTSILFFMTDVVQGGAFAFPNLNLTIWP 443

Query: 187 EKGSAVFWYNAHANTLLDYRM------YHSGCPVALGNKW 220
           +KGSA+ W N      LD+RM       H  CPV +G+KW
Sbjct: 444 QKGSALVWRN------LDHRMQPNKDLLHVSCPVVVGSKW 477


>gi|221512810|ref|NP_649043.3| CG18234 [Drosophila melanogaster]
 gi|66771545|gb|AAY55084.1| IP12246p [Drosophila melanogaster]
 gi|220902636|gb|AAF49255.4| CG18234 [Drosophila melanogaster]
          Length = 515

 Score =  127 bits (319), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 75/206 (36%), Positives = 108/206 (52%), Gaps = 19/206 (9%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
            +L C YE   + FL+I PLKVE L L P +V  HD IYDSEI+++  +S   ++    +
Sbjct: 288 QHLSCHYEKNTSEFLRIAPLKVETLSLKPHIVLYHDVIYDSEISKVKNISLPSLKSPLRI 347

Query: 79  NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYG 138
            Y     +D  L      + +I  DH     +  RI+DMT    G + +     QI+NYG
Sbjct: 348 LYA----IDYNLK-----FAKIREDHQ--SPLSLRIKDMT----GEDVQEDTDFQIDNYG 392

Query: 139 LGGHYDLHCDATPRDEGLW----RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFW 194
           + G  + H D     +       RL S MF++ DV  GGA  FP+LNLT++P+KGSA+ W
Sbjct: 393 ICGFRNFHTDNIELQDQTAELGDRLTSIMFFMNDVAQGGALAFPNLNLTIWPQKGSALVW 452

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
            N       +  + H  CPV +G+KW
Sbjct: 453 RNLDHRMQPNQDLLHVSCPVVVGSKW 478


>gi|194213450|ref|XP_001495951.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Equus
           caballus]
          Length = 548

 Score =  127 bits (319), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 82/228 (35%), Positives = 125/228 (54%), Gaps = 13/228 (5%)

Query: 1   EIYPLACQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           + Y   CQ   S P   +  +L C YE+ ++ FL + P++ E ++L+P VV  HD + DS
Sbjct: 303 DTYEGLCQTLGSQPTHYQIPSLYCSYETNSSPFLLLQPVRKEVIHLEPYVVLYHDFVSDS 362

Query: 60  EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
           E  +I  L++  ++R  V +    + V+ R+SK  +L   +    P L  +  RI  +T 
Sbjct: 363 EAQKIRGLAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTV---DPMLVTLDHRIAALTG 419

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELG 172
           L +  +  Y   LQ+ NYG+GGHY+ H D AT     L+R+      A+FM YL+ VE G
Sbjct: 420 LDV--QPPYAEYLQVVNYGIGGHYEPHFDHATSPTSPLYRMKSGNRVATFMIYLSSVEAG 477

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GAT F   N +V   K +A+FW+N H +   D    H+GCPV +G+KW
Sbjct: 478 GATAFIYANFSVPVVKNAALFWWNLHRSGEGDSDTLHAGCPVLVGDKW 525


>gi|348555277|ref|XP_003463450.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cavia porcellus]
          Length = 584

 Score =  127 bits (319), Expect = 3e-27,   Method: Compositional matrix adjust.
 Identities = 76/208 (36%), Positives = 117/208 (56%), Gaps = 12/208 (5%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
           +L C YE+ ++ +L + P++ E ++L+P V   HD + D E  +I EL++  ++R  V +
Sbjct: 359 SLYCSYETNSSPYLLLQPVRKEVIHLEPYVALYHDFVSDPEAQKIRELAEPWLQRSVVAS 418

Query: 80  YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
            G  + V+ R+SK  +L   +    P L  +  RI  +T L +     Y   LQ+ NYG+
Sbjct: 419 GGKQLQVEYRISKSAWLKDTV---DPMLVTLNHRIAALTGLDV--RPPYAEYLQVVNYGI 473

Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
           GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F   N +V   K +A+
Sbjct: 474 GGHYEPHFDHATSPSSPLFRMKSGNRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAAL 533

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FW+N H +   D    H+GCPV +G+KW
Sbjct: 534 FWWNLHRSGEGDGDTLHAGCPVLVGDKW 561


>gi|184185444|gb|ACC68850.1| prolyl 4-hydroxylase, alpha II subunit isoform 1 precursor
           (predicted) [Rhinolophus ferrumequinum]
          Length = 555

 Score =  127 bits (319), Expect = 4e-27,   Method: Compositional matrix adjust.
 Identities = 83/252 (32%), Positives = 126/252 (50%), Gaps = 39/252 (15%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N T  L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRTPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI +I E++K K+ R  V +   G       R+SK  +L      + P + ++  R+Q 
Sbjct: 348 EEIEKIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEET---EDPVVARLNLRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----------------- 157
           +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL                  
Sbjct: 405 ITGLSVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDNGLKTEGNRLATFLNYNDEHD 460

Query: 158 ---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
                    R+A+F+ Y++DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  
Sbjct: 461 VFKHLGTGNRVATFLNYMSDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTR 520

Query: 209 HSGCPVALGNKW 220
           H+ CPV +G KW
Sbjct: 521 HAACPVLVGCKW 532


>gi|196011902|ref|XP_002115814.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
 gi|190581590|gb|EDV21666.1| hypothetical protein TRIADDRAFT_30039 [Trichoplax adhaerens]
          Length = 534

 Score =  126 bits (317), Expect = 6e-27,   Method: Compositional matrix adjust.
 Identities = 80/235 (34%), Positives = 121/235 (51%), Gaps = 23/235 (9%)

Query: 1   EIYPLACQGNLSVP----EDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAI 56
           + Y   C+G   V     + + ++L C Y+   +  L   P+ VE + L P ++  H+ +
Sbjct: 285 DFYKKLCRGGPKVKAGDNKMVSNHLTC-YQLRQHARLLFSPINVEVISLQPYILIYHNLL 343

Query: 57  YDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRI 114
            D E+  +  L+   ++R  V N   G   Y   R+SK  +L  +   DHP + +I T I
Sbjct: 344 NDLEVEALKTLAAPMLQRATVHNKDTGKLEYATYRISKSAWLNDD---DHPLVRRISTLI 400

Query: 115 QDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL-----W----RLASFMFY 165
           +D+T L +   E     LQI NYG+GGHY+ H D      G      W    R+A+ + Y
Sbjct: 401 EDVTGLTMESAE----ALQIANYGIGGHYEPHFDHADVRSGTDVFKTWKGGNRIATMLIY 456

Query: 166 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           L+ VELGGAT+F S  + + P +GSA FWYN H N   +    H+ CPV +G+KW
Sbjct: 457 LSSVELGGATVFSSAGVRIEPRQGSAAFWYNLHRNGNGNNLTRHAACPVLIGSKW 511


>gi|340367965|ref|XP_003382523.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Amphimedon
           queenslandica]
          Length = 525

 Score =  126 bits (316), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 74/230 (32%), Positives = 121/230 (52%), Gaps = 18/230 (7%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFY-ESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           +Y   C+    +P  +   L C+Y  +  N  L + P+K E  ++ P++   +D + D E
Sbjct: 280 VYEKLCREPAPIPSHLHKKLICYYFNNKRNPRLILSPIKTEVAFVKPKIYIFYDIVTDRE 339

Query: 61  INRIIELSKGKVERGKVV-NYGDTIYVDTRLSKVYFLYPEIFGDHPFLY--KIQTRIQDM 117
           I R+ EL+  K+ R  V    G+ ++   R+SK  +L      D P  Y  +I  RI+D+
Sbjct: 340 IERLKELANPKLNRATVHGENGELLHATYRISKSGWLSG---SDDPLGYVDRIDQRIEDV 396

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-------RLASFMFYLTDVE 170
           T L +   E+    LQ+ NYG+GG Y+ H D     E  +       R+++ + Y++DVE
Sbjct: 397 TGLTMSTAEQ----LQVVNYGIGGQYEPHYDFARTGEDTFTSLGSGNRISTLLIYMSDVE 452

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GGAT+FP +   + P K +A +W+N   +   DY   H+GCPV +G+KW
Sbjct: 453 KGGATVFPGVGARLVPIKRAAAYWWNLKRSGDGDYSTRHAGCPVLVGSKW 502


>gi|390178051|ref|XP_002137433.2| GA30144 [Drosophila pseudoobscura pseudoobscura]
 gi|388859305|gb|EDY67991.2| GA30144 [Drosophila pseudoobscura pseudoobscura]
          Length = 546

 Score =  126 bits (316), Expect = 7e-27,   Method: Compositional matrix adjust.
 Identities = 71/218 (32%), Positives = 116/218 (53%), Gaps = 20/218 (9%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           ++   CQG   +P  ++S+L+C Y +  + FL++ PL++E L  DP V   H+ +  +E 
Sbjct: 267 VHQRNCQGRSRLP--VQSSLRCHYSAEGSAFLRLAPLRMELLSRDPLVAVYHEVVSAAEQ 324

Query: 62  NRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
             ++ LS+ +++R +   Y D I      S      P +        ++  R++D+T L 
Sbjct: 325 RHLMLLSESQLQRQRGHQY-DKIRTFASASVAANATPTV-------EQLHRRLEDITGLD 376

Query: 122 IGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL------WRLASFMFYLTDVELGGAT 175
           +   E    PL+I NYG+GG Y +H D       +      +RLA+ + YL+DV LGG T
Sbjct: 377 LAESE----PLRILNYGIGGQYYIHVDCEQPQTHVEPYPKEYRLATVLLYLSDVRLGGFT 432

Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 213
            FP+L L + P +GSA+ W+NA+     DYR  H+ CP
Sbjct: 433 SFPALGLGIRPNRGSALVWHNANNAGNCDYRALHAACP 470


>gi|194765172|ref|XP_001964701.1| GF23326 [Drosophila ananassae]
 gi|190614973|gb|EDV30497.1| GF23326 [Drosophila ananassae]
          Length = 885

 Score =  125 bits (315), Expect = 9e-27,   Method: Compositional matrix adjust.
 Identities = 77/219 (35%), Positives = 109/219 (49%), Gaps = 22/219 (10%)

Query: 4   PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
           P  C G   +    K +L C Y +  + FL++ P+K E L  DP +   HD +Y  E+ R
Sbjct: 660 PRCCNGRCEIAR--KFSLYCLYNTKTSPFLRLAPIKTELLSKDPYIAIFHDVVYPKELTR 717

Query: 64  IIELSKGKVERGKVVNYGDTIY-VDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
           I    K  +     +NY    Y VD+ R SK  ++  +    +    +I   + D T L 
Sbjct: 718 IRTACKSHLIASTTINYTSNAYSVDSYRTSKSVWIPTD---SNNLTQRITNLVGDATGLE 774

Query: 122 IGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLN 181
           +   E +    Q+ NYG+GG ++ H D    +            L+DVE GGATIF  LN
Sbjct: 775 MTTSEMF----QVINYGIGGLFEAHMDPVLSNA-----------LSDVEQGGATIFTKLN 819

Query: 182 LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           LTVFP+ GSA+FWYN       D R  H+GCPV +G+KW
Sbjct: 820 LTVFPQSGSALFWYNLDNWGNEDKRTEHAGCPVIVGSKW 858



 Score = 65.1 bits (157), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 46/170 (27%), Positives = 77/170 (45%), Gaps = 13/170 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E++ L C G     + I+ NL CFY++  +  L I P+K E L +DP +   HD I   E
Sbjct: 288 EVFSLCCNGKCQKDKKIQ-NLYCFYDTKTSNALIIAPVKKEILSVDPYIALFHDVISQKE 346

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
              +  +SK  +     ++  +    + R+SK  + Y   + D      +  R+      
Sbjct: 347 QKILQSVSKIHLMASTTIHNNNKAVKNYRISKSVW-YASDYND------VTKRLTTFMEQ 399

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFY 165
             G + +     Q+ NYGLGG +D H D    D+  +     R+A+ +FY
Sbjct: 400 ATGYDMKSSELFQVINYGLGGRFDGHEDYLLTDKTRFNGTSDRIATTLFY 449


>gi|344296798|ref|XP_003420090.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Loxodonta
           africana]
          Length = 544

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 80/222 (36%), Positives = 121/222 (54%), Gaps = 13/222 (5%)

Query: 7   CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           CQ   S P   +  +L C YE+ +N +L + P + E ++L+P VV  HD + D E  +I 
Sbjct: 305 CQTLGSQPTHYQIPSLYCSYETNSNPYLLLQPFRKEVIHLEPYVVLYHDFVNDMEAQKIK 364

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
            L++  ++R  V +    + VD R+SK  +L   +    P L  +  RI  +T L +  +
Sbjct: 365 GLAEPWLQRSVVASGEKQLQVDYRISKSAWLKDSV---DPMLVTLDHRIAALTGLDV--Q 419

Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
             Y   LQ+ NYG+GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F 
Sbjct: 420 PPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSAVEAGGATAFI 479

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
             N ++   K +A+FW+N H +   D    H+GCPV +G+KW
Sbjct: 480 YANFSMPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 521


>gi|194905294|ref|XP_001981167.1| GG11919 [Drosophila erecta]
 gi|190655805|gb|EDV53037.1| GG11919 [Drosophila erecta]
          Length = 533

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 77/231 (33%), Positives = 118/231 (51%), Gaps = 24/231 (10%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           +  +C G L  P    + L CFY      FL++ PLK E++ L P VV  H+ +   EI+
Sbjct: 286 FKTSCNGLLEKP----TRLHCFYNFTTTPFLRLAPLKTEQIGLKPYVVLYHEVLSAREIS 341

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
            ++  +   ++  +V +         R +K Y+L  E    +    +I  RI DMT   +
Sbjct: 342 MLMGKAAQNMKNTRVQSEKAVNTNRERTAKGYWLKKE---SNEMTRRITRRIVDMTGFDL 398

Query: 123 GREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLW---------RLASFMFYLTDV 169
              E +    Q+ NYG+GGHY LH D    A+    G           R+A+ +FYLTDV
Sbjct: 399 ADSEDF----QVINYGIGGHYSLHFDYFGFASSNYTGERSHHSIVLGDRIATVLFYLTDV 454

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           E GGAT+F ++  +V+P+ G+A+FWYN   +   D    H+ CPV +G+KW
Sbjct: 455 EQGGATVFGNVGYSVYPQAGTAIFWYNLDTDGNGDPLTRHASCPVVVGSKW 505


>gi|195575143|ref|XP_002105539.1| GD16913 [Drosophila simulans]
 gi|194201466|gb|EDX15042.1| GD16913 [Drosophila simulans]
          Length = 534

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 77/221 (34%), Positives = 113/221 (51%), Gaps = 21/221 (9%)

Query: 14  PEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVE 73
           P +  + L CFY      FL++ PLK+E++ LDP VV  H+ +   EI+ +I  +   ++
Sbjct: 293 PLESSTRLHCFYNFTTTPFLRLAPLKIEQIGLDPYVVLYHEVLSAREISMLIGKAAQNMK 352

Query: 74  RGKV-VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPL 132
             +V    G       R +K ++   E    +     I  RI DMT   +   E +    
Sbjct: 353 NTRVHKEQGVPKKNRGRTAKGFWFKKE---SNELTKGITRRIMDMTGFDLADSEGF---- 405

Query: 133 QINNYGLGGHYDLHCD--------ATPRDEGLW-----RLASFMFYLTDVELGGATIFPS 179
           Q+ NYG+GGHY LH D         T    G       R+A+ +FYLTDVE GGAT+F  
Sbjct: 406 QVINYGIGGHYLLHMDYFDFASSNHTDTRSGYSMDLGDRIATVLFYLTDVEQGGATVFAD 465

Query: 180 LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +  +V+P+ G+A+FWYN   N   D R  H+ CPV +G+KW
Sbjct: 466 VGYSVYPQAGTAIFWYNLDTNGKGDPRTRHAACPVIVGSKW 506


>gi|195159297|ref|XP_002020518.1| GL13472 [Drosophila persimilis]
 gi|194117287|gb|EDW39330.1| GL13472 [Drosophila persimilis]
          Length = 526

 Score =  125 bits (315), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 79/236 (33%), Positives = 113/236 (47%), Gaps = 23/236 (9%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y L C G+  +    + +L+C Y +  + FL + PLK EEL  DP +V  HD IY SE
Sbjct: 253 EAYRLTCSGHSRLTAREQRHLRCGYMTETHPFLLLAPLKAEELSHDPLLVLYHDVIYQSE 312

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           I+ I +L+  ++ R  V     +   + R S++ F+      +H  L  I  R+ DMTNL
Sbjct: 313 IDVIRQLTTNRMARAMVTLTNQSTVSNVRTSQITFIAK---TEHEVLQTIDRRVADMTNL 369

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLW-------RLASFMFYLTDVE 170
            +     Y    Q  NYG+GGHY  H D    T  D GL        R+A+ +FY   + 
Sbjct: 370 NMD----YAEDHQFANYGIGGHYGQHMDWFTETTFDNGLVSSTEMGNRIATVLFYNISLN 425

Query: 171 ------LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
                 +  A   P L   +  +K +A FW+N HA    D R  H  CP+  G+KW
Sbjct: 426 SSRMWLMSAALTCPYLKQHLRLKKYAAAFWHNLHAAGRGDARTQHGACPIIAGSKW 481


>gi|390176836|ref|XP_003736216.1| GA26872, isoform B [Drosophila pseudoobscura pseudoobscura]
 gi|388858809|gb|EIM52289.1| GA26872, isoform B [Drosophila pseudoobscura pseudoobscura]
          Length = 567

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 78/212 (36%), Positives = 112/212 (52%), Gaps = 16/212 (7%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           + L C Y +    FL++ PL++EEL LDP +V  H+ + D E+ R+  +S   + R +V 
Sbjct: 330 ARLHCRYNATTTAFLRLAPLRMEELSLDPYIVLYHNVLSDEEMARLENMSTPLLHRARVF 389

Query: 79  NYG---DTIYVDTRLSKVYFLYPEIFG-DHPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
           + G     I       +V    P++   D   + +IQ R+ D+T LV+    R    +Q 
Sbjct: 390 DSGIRKPKISPARTADEVQIPNPKLVAEDIQLVERIQKRMTDLTGLVLTSMRR----IQF 445

Query: 135 NNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEK 188
             YG GG Y  H D       T R  G  R+A+ +FYL DVE GGAT FP+L+L V  E+
Sbjct: 446 LKYGFGGIYVPHHDFFSVHTPTSRLHGD-RIATVIFYLNDVEHGGATAFPNLDLVVPTER 504

Query: 189 GSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
           G+ +FW+N    T  LDYR  H  CPV +G K
Sbjct: 505 GAVLFWHNMDGETYDLDYRTLHGACPVIVGTK 536


>gi|195341588|ref|XP_002037388.1| GM12140 [Drosophila sechellia]
 gi|194131504|gb|EDW53547.1| GM12140 [Drosophila sechellia]
          Length = 534

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 77/222 (34%), Positives = 114/222 (51%), Gaps = 23/222 (10%)

Query: 14  PEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVE 73
           P +  + L CFY      FL++ PLK+E++ LDP VV  H+ +   EI+ +I  +   ++
Sbjct: 293 PLESSTRLHCFYNFTTTPFLRLAPLKIEQIGLDPYVVLYHEVLSAREISMLIGKATQNMK 352

Query: 74  RGKV-VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPL 132
             +V    G       R +K ++   E    +     I  RI DMT   +   E +    
Sbjct: 353 NTRVHKEQGVPKKNRGRTAKGFWFKKE---SNELTKGITRRIMDMTGFDLADSEGF---- 405

Query: 133 QINNYGLGGHYDLHCD--------------ATPRDEGLWRLASFMFYLTDVELGGATIFP 178
           Q+ NYG+GGHY LH D              +   D G  R+A+ +FYLTDVE GGAT+F 
Sbjct: 406 QVINYGIGGHYLLHMDYFDFASSNHTDTRSSYSMDLGD-RIATVLFYLTDVEQGGATVFA 464

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            +  +V+P+ G+A+FWYN   N   D R  H+ CPV +G+KW
Sbjct: 465 DVGYSVYPQAGTAIFWYNLDTNGKGDPRTKHAACPVIVGSKW 506


>gi|198449518|ref|XP_002136915.1| GA26872, isoform A [Drosophila pseudoobscura pseudoobscura]
 gi|198130643|gb|EDY67473.1| GA26872, isoform A [Drosophila pseudoobscura pseudoobscura]
          Length = 543

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 78/212 (36%), Positives = 112/212 (52%), Gaps = 16/212 (7%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           + L C Y +    FL++ PL++EEL LDP +V  H+ + D E+ R+  +S   + R +V 
Sbjct: 306 ARLHCRYNATTTAFLRLAPLRMEELSLDPYIVLYHNVLSDEEMARLENMSTPLLHRARVF 365

Query: 79  NYG---DTIYVDTRLSKVYFLYPEIFG-DHPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
           + G     I       +V    P++   D   + +IQ R+ D+T LV+    R    +Q 
Sbjct: 366 DSGIRKPKISPARTADEVQIPNPKLVAEDIQLVERIQKRMTDLTGLVLTSMRR----IQF 421

Query: 135 NNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEK 188
             YG GG Y  H D       T R  G  R+A+ +FYL DVE GGAT FP+L+L V  E+
Sbjct: 422 LKYGFGGIYVPHHDFFSVHTPTSRLHGD-RIATVIFYLNDVEHGGATAFPNLDLVVPTER 480

Query: 189 GSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
           G+ +FW+N    T  LDYR  H  CPV +G K
Sbjct: 481 GAVLFWHNMDGETYDLDYRTLHGACPVIVGTK 512


>gi|195591298|ref|XP_002085379.1| GD14755 [Drosophila simulans]
 gi|194197388|gb|EDX10964.1| GD14755 [Drosophila simulans]
          Length = 515

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 84/226 (37%), Positives = 124/226 (54%), Gaps = 29/226 (12%)

Query: 5   LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
           L CQG    P+  KSNL C Y S  N FL++ PLK+EE+  DP +V  H+ I D EI  +
Sbjct: 285 LGCQG--LFPK--KSNLVCRYNSSTNAFLQLAPLKMEEVSRDPYIVLFHEMISDKEIEEM 340

Query: 65  IELSKGKVERGKVVNYGDTIYVDTR--LSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
               KG++     +  G T   D++  +S+VY++  E      F  +I  RI DMT   +
Sbjct: 341 ----KGEITE---MENGWTSLGDSKEIVSRVYWIRKE----SSFSKRINQRISDMTGFKL 389

Query: 123 GREERYKGPLQINNYGLGG----HYDLHCDATPR---DEGLW-RLASFMFYLTDVELGGA 174
              E +   +Q+ N+G+GG    HYD + D       +  L  R+ S +FY  +V  GG 
Sbjct: 390 ---EEFPA-IQLANFGVGGYFKPHYDYYTDRLKEVDVNNTLGDRIGSIIFYAGEVSQGGQ 445

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T+FP L + V P+KG+A+FW+NA  ++  D R  HS CPV +G++W
Sbjct: 446 TVFPDLKVAVEPKKGNALFWFNAFDDSSPDPRTLHSVCPVIVGSRW 491


>gi|195166681|ref|XP_002024163.1| GL22882 [Drosophila persimilis]
 gi|194107518|gb|EDW29561.1| GL22882 [Drosophila persimilis]
          Length = 534

 Score =  125 bits (314), Expect = 1e-26,   Method: Compositional matrix adjust.
 Identities = 79/227 (34%), Positives = 118/227 (51%), Gaps = 23/227 (10%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y + C+G        ++NL C Y      FL++ PLK+EE+  DP +V  H+ + D EI 
Sbjct: 296 YEIGCRGLFPK----RTNLVCRYNFTTTPFLRLAPLKMEEVNHDPYIVMYHEVLSDREIE 351

Query: 63  RIIELSKGKVERGKVVN-YGDTIYVD-TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            +    KG+   G++ N + D    + T++  +   +           ++  RI DMTN 
Sbjct: 352 EM----KGR--SGQMSNGWADQKEANSTKIRDIVCRHTWWREQSAIKERVNRRISDMTNF 405

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLW-RLASFMFYLTDVELGG 173
               +E     LQ+ NYGLG H+  H D       TP    L  RL S +FY +DV  GG
Sbjct: 406 DFPPQE----DLQVANYGLGTHFKPHYDYTSDGYETPDVLTLGDRLGSIIFYASDVPQGG 461

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           AT+FP   +++FP KGS+VFWYN + +  +D R  HS CPV +G++W
Sbjct: 462 ATVFPRSRVSIFPRKGSSVFWYNLYDDGRIDTRSQHSVCPVIVGDRW 508


>gi|195392288|ref|XP_002054791.1| GJ24631 [Drosophila virilis]
 gi|194152877|gb|EDW68311.1| GJ24631 [Drosophila virilis]
          Length = 499

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 75/228 (32%), Positives = 121/228 (53%), Gaps = 32/228 (14%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           C+G+ S+P  + S+L+C Y + +  FL++ PLK+E+L LDP +V  HD +  +E   I++
Sbjct: 267 CRGH-SLPL-VSSSLRCRYNTASAPFLRLAPLKLEQLSLDPYMVLYHDVVQANEREHIMQ 324

Query: 67  LSKGKVER---GKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIG 123
           L+K  + R   G    +     ++   S         + D     +++ R++DM+    G
Sbjct: 325 LAKPHLRRALVGAARAHSQRFAMNAGFS---------YNDSRQGQRLRQRLEDMS----G 371

Query: 124 REERYKGPLQINNYGLGGHYDLHCD-----------ATPRDEGLWRLASFMFYLTDVELG 172
            +    G L + NYG+GG Y +H D           A+ +D    R+A+ + YLTDV+LG
Sbjct: 372 FDLTNSGQLAVLNYGIGGQYYMHYDCWFSQDDAAQVASIKDN---RIATILLYLTDVQLG 428

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           G T FP+L L V P  GSA+ W+N +     D R  H+ CP+ LG +W
Sbjct: 429 GLTSFPALGLAVQPSPGSALIWHNMNNAAECDRRTLHAACPLLLGTRW 476


>gi|194905381|ref|XP_001981186.1| GG11928 [Drosophila erecta]
 gi|190655824|gb|EDV53056.1| GG11928 [Drosophila erecta]
          Length = 543

 Score =  125 bits (313), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 81/228 (35%), Positives = 114/228 (50%), Gaps = 17/228 (7%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           ++P  C     V  ++ + L C Y    + FL++ P+K E L LDP V+ +HD +   E 
Sbjct: 289 LHPPCCSARCEVVRNL-TRLYCVYNRVTSPFLQLAPIKTEILSLDPFVLLLHDMVRQKES 347

Query: 62  NRIIELSKGKVERGKVVNYGDTIYVDT----RLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
             I   SK  + + ++ N   +   D     R SK  + Y   F D     KI  R+ D 
Sbjct: 348 TLIRASSKEHLLQSEITNTDASSSEDNVAIFRTSKSVW-YSSDFND--TTKKITERLADA 404

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELG 172
           T L +   E +    Q+ NYGLGG +  H D    D+  +     R+A+ +FYL  V  G
Sbjct: 405 TGLDMHFTEYF----QVINYGLGGFFATHLDMLLSDKTRFNGTSDRIATTVFYLNGVRQG 460

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GAT FP LNLTVFP+ GSA+FWYN            H+GCPV +G+KW
Sbjct: 461 GATHFPLLNLTVFPQPGSALFWYNLDTKGNDQRSTMHTGCPVIVGSKW 508


>gi|66772633|gb|AAY55628.1| IP02961p [Drosophila melanogaster]
          Length = 409

 Score =  124 bits (312), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 79/231 (34%), Positives = 118/231 (51%), Gaps = 24/231 (10%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           +Y   C+G L+     + NL+C       + L   P K+EEL+LDP VV++H  I   + 
Sbjct: 158 MYEQVCRGELAPLPSKQRNLRC---RLRKSRLGYAPFKLEELHLDPLVVQLHQVIGSKDS 214

Query: 62  NRIIELSKGKVERGKVV----NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR-IQD 116
           + + + ++ +++R  V     N G T           F Y           K+ +R + D
Sbjct: 215 DSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQGASFNYSRNAAT-----KLLSRHVGD 269

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD----EGLW---RLASFMFYLTDV 169
            + L +     Y   LQ+ NYG+GGHY+ H D+ P +    EG     R+A+ ++YL DV
Sbjct: 270 FSGLNMD----YAEDLQVANYGIGGHYEPHWDSFPENHIYQEGDLHGNRMATGIYYLADV 325

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           E GG T FP L L V PE+GS +FWYN H +   D+R  H+ CPV  G+KW
Sbjct: 326 EAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 376


>gi|195061021|ref|XP_001995909.1| GH14207 [Drosophila grimshawi]
 gi|193891701|gb|EDV90567.1| GH14207 [Drosophila grimshawi]
          Length = 477

 Score =  124 bits (312), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 77/207 (37%), Positives = 112/207 (54%), Gaps = 10/207 (4%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           S L C Y+  ++ FL++ PLK+E L +DP VV  H+AIYDSEI+ +  L + ++ R ++ 
Sbjct: 261 SRLICNYKMDSSPFLRLAPLKMEMLSMDPYVVVFHEAIYDSEIDELRRLCESRLSRTEIA 320

Query: 79  NYGDTIYVDTRLSKVYFLYPEIFGDH-PFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
             G    + +  S V+    ++       L +I+ R+ DM+ L+I    +    +Q   Y
Sbjct: 321 KQGKNKSIRSS-SGVWIFELDLNRQQLELLERIRRRVADMSGLLIDFNSQ---EVQYMEY 376

Query: 138 GLGGHYDLHCD--ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWY 195
             GGHY  H D    P  E   R+A+ +FYL DV  GGATIFP L L V PE+G  + W+
Sbjct: 377 VFGGHYYPHWDFKGIPHLED--RIATVLFYLNDVARGGATIFPDLELLVQPERGKVLHWH 434

Query: 196 NAHANTL-LDYRMYHSGCPVALGNKWG 221
           N    T  L+ R  H  CPV +G K G
Sbjct: 435 NMDLGTYDLEKRSLHGACPVIMGKKEG 461


>gi|4336512|gb|AAD17844.1| prolyl 4-hydroxylase alpha subunit [Drosophila melanogaster]
          Length = 535

 Score =  124 bits (312), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 79/231 (34%), Positives = 119/231 (51%), Gaps = 24/231 (10%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           +Y   C+G L+     + NL+C       + L   P K+EEL+LDP VV++H  I   + 
Sbjct: 284 MYEQVCRGELAPLPSKQRNLRC---RLRKSRLGYAPFKLEELHLDPLVVQLHQVIGSKDS 340

Query: 62  NRIIELSKGKVERGKVV----NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR-IQD 116
           + + + ++ +++R  V     N G T           F Y           K+ +R + D
Sbjct: 341 DSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQGASFNYSRNAAT-----KLLSRHVGD 395

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD----EGLW---RLASFMFYLTDV 169
            + L +     Y   LQ+ NYG+GGHY+ H D+ P +    EG     R+A+ ++YL+DV
Sbjct: 396 FSGLNMD----YAEDLQVANYGIGGHYEPHWDSFPENHIYQEGDLHGNRMATGIYYLSDV 451

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           E GG T FP L L V PE+GS +FWYN H +   D+R  H+ CPV  G+KW
Sbjct: 452 EAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502


>gi|386771382|ref|NP_649044.3| CG18233 [Drosophila melanogaster]
 gi|383291998|gb|AAF49254.3| CG18233 [Drosophila melanogaster]
          Length = 515

 Score =  124 bits (312), Expect = 2e-26,   Method: Compositional matrix adjust.
 Identities = 79/213 (37%), Positives = 117/213 (54%), Gaps = 25/213 (11%)

Query: 18  KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
           KSNL C Y S  N FLK+ PLK+EE+  DP +V  H+ I D +I  +    KG++     
Sbjct: 294 KSNLVCRYNSSTNAFLKLAPLKMEEISRDPYIVMFHEVISDKDIEEM----KGEITE--- 346

Query: 78  VNYGDTIYVDTR--LSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQIN 135
           +  G T   D +  +S+VY++  E      F  +I  RI DMT   +   E +   +Q+ 
Sbjct: 347 MENGWTSLGDPKEIVSRVYWIRKE----SSFSKRINQRISDMTGFKL---EEFPA-IQLA 398

Query: 136 NYGLGG----HYDLHCDATPR---DEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPE 187
           N+G+GG    HYD + D       +  L  R+ S +FY  +V  GG T+FP L + V P+
Sbjct: 399 NFGVGGYFKPHYDFYTDRLKEVDVNNTLGDRIGSIIFYAGEVSQGGQTVFPDLKVAVEPK 458

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           KG+A+FW+NA  ++  D R  HS CPV +G++W
Sbjct: 459 KGNALFWFNAFDDSTPDPRSLHSVCPVLVGSRW 491


>gi|24651418|ref|NP_524594.2| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
 gi|7301951|gb|AAF57057.1| prolyl-4-hydroxylase-alpha MP [Drosophila melanogaster]
 gi|359807686|gb|AEV66559.1| FI17802p1 [Drosophila melanogaster]
          Length = 535

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 79/231 (34%), Positives = 118/231 (51%), Gaps = 24/231 (10%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           +Y   C+G L+     + NL+C       + L   P K+EEL+LDP VV++H  I   + 
Sbjct: 284 MYEQVCRGELAPLPSKQRNLRC---RLRKSRLGYAPFKLEELHLDPLVVQLHQVIGSKDS 340

Query: 62  NRIIELSKGKVERGKVV----NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR-IQD 116
           + + + ++ +++R  V     N G T           F Y           K+ +R + D
Sbjct: 341 DSLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQGASFNYSRNAAT-----KLLSRHVGD 395

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD----EGLW---RLASFMFYLTDV 169
            + L +     Y   LQ+ NYG+GGHY+ H D+ P +    EG     R+A+ ++YL DV
Sbjct: 396 FSGLNMD----YAEDLQVANYGIGGHYEPHWDSFPENHIYQEGDLHGNRMATGIYYLADV 451

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           E GG T FP L L V PE+GS +FWYN H +   D+R  H+ CPV  G+KW
Sbjct: 452 EAGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502


>gi|198449648|ref|XP_001357666.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
 gi|198130700|gb|EAL26801.2| GA21989 [Drosophila pseudoobscura pseudoobscura]
          Length = 536

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 75/229 (32%), Positives = 115/229 (50%), Gaps = 21/229 (9%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           +Y   C+G L+     + +L+C  +          P K+EEL+ DP +V++HD +   E 
Sbjct: 286 MYEQVCRGELTPSPTAQRHLRCRLQRRR---FDYAPFKLEELHADPPIVQVHDMVSQRES 342

Query: 62  NRIIELSKGKVERGKVVNYGDTIYVDT--RLSK-VYFLYPEIFGDHPFLYKIQTRIQDMT 118
             +   ++ +++R  V N           R S+   F Y +        Y    R+    
Sbjct: 343 LFLQNAARPRIQRSTVYNQAGAGTTAAAFRTSQGASFNYSQ--------YATTQRLSQHV 394

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPR-----DEGLW--RLASFMFYLTDVEL 171
             + G +  Y   LQI NYG+GGHY+ H D+ P      ++ L+  RLA+ ++YL+DV  
Sbjct: 395 ADLSGLDMDYAENLQIANYGIGGHYEPHWDSFPEHHEYPEDDLYGNRLATAIYYLSDVVA 454

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GG T FP L L V PE+GS +FWYN H +   D+R  H+ CPV  G+KW
Sbjct: 455 GGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 503


>gi|195113247|ref|XP_002001179.1| GI22114 [Drosophila mojavensis]
 gi|193917773|gb|EDW16640.1| GI22114 [Drosophila mojavensis]
          Length = 487

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 76/213 (35%), Positives = 114/213 (53%), Gaps = 17/213 (7%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           + L C Y++  + FL + P K+E L  DP +V  HD IY+SEI  +  +SK  ++R  VV
Sbjct: 277 TRLVCSYKTKPSKFLYLAPFKMELLSEDPYMVVFHDVIYESEIEHLNRISKPFLQRATVV 336

Query: 79  ---NYGDTIYVDTRLSKVYFLYPEIFG--DHPFLYKIQTRIQDMTNLVIGREERYKGPLQ 133
              N  DT+ +  R +   FLY +     D   + +I  R++DM++L I  +       +
Sbjct: 337 VEDNSEDTL-IKFRTANGAFLYRDKISPKDVQLVERIFQRMRDMSDLQINDD-----AFE 390

Query: 134 INNYGLGGHYDLHCDATPRDEGLW---RLASFMFYLTDVELGGATIFPSLNLTVFPEKGS 190
              Y  GGHYD+H D     +  +   R A+F+ YL DV  GGAT+FP + + V PE+G 
Sbjct: 391 YLKYDFGGHYDIHADYFNYTDDQFTDDRFATFVIYLNDVARGGATVFPDVEIAVHPERGK 450

Query: 191 AVFWYNAHANTLLDYRM--YHSGCPVALGNKWG 221
            + WYN +  +  DY +  YH  CPV +G K G
Sbjct: 451 VIHWYNMNPKS-FDYELHSYHGACPVLIGQKIG 482


>gi|195159319|ref|XP_002020529.1| GL14044 [Drosophila persimilis]
 gi|194117298|gb|EDW39341.1| GL14044 [Drosophila persimilis]
          Length = 536

 Score =  124 bits (311), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 75/229 (32%), Positives = 115/229 (50%), Gaps = 21/229 (9%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           +Y   C+G L+     + +L+C  +          P K+EEL+ DP +V++HD +   E 
Sbjct: 286 MYEQVCRGELTPSPTAQRHLRCRLQRRR---FDYAPFKLEELHADPPIVQVHDMVSQRES 342

Query: 62  NRIIELSKGKVERGKVVNYGDTIYVDT--RLSK-VYFLYPEIFGDHPFLYKIQTRIQDMT 118
             +   ++ +++R  V N           R S+   F Y +        Y    R+    
Sbjct: 343 LFLQNAARPRIQRSTVYNQAGAGTTAAAFRTSQGASFNYSQ--------YATTQRLSQHV 394

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPR-----DEGLW--RLASFMFYLTDVEL 171
             + G +  Y   LQI NYG+GGHY+ H D+ P      ++ L+  RLA+ ++YL+DV  
Sbjct: 395 ADLSGLDMDYAENLQIANYGIGGHYEPHWDSFPEHHEYPEDDLYGNRLATAIYYLSDVVA 454

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GG T FP L L V PE+GS +FWYN H +   D+R  H+ CPV  G+KW
Sbjct: 455 GGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 503


>gi|195159164|ref|XP_002020452.1| GL13506 [Drosophila persimilis]
 gi|194117221|gb|EDW39264.1| GL13506 [Drosophila persimilis]
          Length = 536

 Score =  124 bits (310), Expect = 4e-26,   Method: Compositional matrix adjust.
 Identities = 78/214 (36%), Positives = 114/214 (53%), Gaps = 22/214 (10%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           S L C Y +    FL++ PL++EEL LDP +V  H+ + D+EI ++  +++  +   K +
Sbjct: 304 SRLHCRYNATTTPFLRLAPLRMEELSLDPYIVVYHNVLSDAEIAKVERVAEPLL---KSI 360

Query: 79  NYGDTIYVDTRLSKVYFLYPEIFGDH-------PFLYKIQTRIQDMTNLVIGREERYKGP 131
             G+    +++ SKV         D        P + +I  RI DMT L+I R +     
Sbjct: 361 GVGEMD--NSKKSKVRTALGAWIPDENMHISGWPVIQRIVRRIHDMTGLIIKRGQ----V 414

Query: 132 LQINNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFP 186
           +Q+  YG GGHYD H D    + P  + L  R+A+ +FYL DV+ GG+T+FP L L V  
Sbjct: 415 VQLIKYGYGGHYDTHFDYLNDSLPITQALGDRMATVLFYLNDVKHGGSTVFPVLQLKVPS 474

Query: 187 EKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
           E+G  + WYN H  T  LD R  H  CPV  G K
Sbjct: 475 ERGKVLVWYNMHGETHDLDSRTLHGSCPVIDGAK 508


>gi|195575097|ref|XP_002105516.1| GD17035 [Drosophila simulans]
 gi|194201443|gb|EDX15019.1| GD17035 [Drosophila simulans]
          Length = 535

 Score =  123 bits (309), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 77/230 (33%), Positives = 117/230 (50%), Gaps = 22/230 (9%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           +Y   C+G L+     + +L+C       + L   P K+EEL+LDP VV++H  I  ++ 
Sbjct: 284 MYEQVCRGELAPLSSKQRSLRC---RLRKSRLGYAPFKLEELHLDPLVVQLHQVIGSNDS 340

Query: 62  NRIIELSKGKVERGKVV----NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
             + + ++ +++R  V     N G T           F Y      +     +   + D 
Sbjct: 341 ESLQKTARPRIKRSTVYSLGGNGGSTAAAFRTSQGASFNYSR----NAATKLLSHHVGDF 396

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD----EGLW---RLASFMFYLTDVE 170
           + L +     Y   LQ+ NYG+GGHY+ H D+ P +    EG     R+A+ ++YL+DVE
Sbjct: 397 SGLNMD----YAEDLQVANYGIGGHYEPHWDSFPENHIYQEGDLHGNRIATGIYYLSDVE 452

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GG T FP L L V PEKGS +FWYN H +   D+R  H+ CPV  G+KW
Sbjct: 453 AGGGTAFPFLPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502


>gi|198428011|ref|XP_002120302.1| PREDICTED: similar to prolyl 4-hydroxylase alpha-2 subunit, partial
           [Ciona intestinalis]
          Length = 233

 Score =  123 bits (309), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 83/219 (37%), Positives = 115/219 (52%), Gaps = 21/219 (9%)

Query: 18  KSNLKC---FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVER 74
           KSNLK    F+  + N  L I P+K EEL   P VV+ +D + D +   II L+   + R
Sbjct: 3   KSNLKLKCYFHNGWKNPRLLIQPIKSEELCDSPHVVRFYDVLSDRDSEEIIRLAAPLMFR 62

Query: 75  GKVVNYGDTIYVD----TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKG 130
             V   GD   ++     R+ K  +L      + P +    TR+ D+T L +G E     
Sbjct: 63  SGVT--GDDGAINDNPMERVGKNAWL-----DNSPVVNNFMTRVADITGLNVGAEIY--- 112

Query: 131 PLQINNYGLGGHYDLHCDATPRDEGLW--RLASFMFYLTDVELGGATIFPSLNLTVFPEK 188
            LQ+ NYG+GGH+D H D T   E +   R+A+F+ Y +DVE GG T F    +   P K
Sbjct: 113 -LQVANYGIGGHFDPHIDETGGYENIMERRIATFLTYFSDVEYGGNTPFVYQEVVAEPIK 171

Query: 189 GSAVFWYNAHANTLLDYRMYHSGCPVALGNKW-GKLLLS 226
           GSA+FWY+   +   D R  H+ CPV LGNKW G L L+
Sbjct: 172 GSAIFWYDVFNDGSADERTEHAACPVVLGNKWAGNLWLT 210


>gi|85857698|gb|ABC86384.1| IP10964p [Drosophila melanogaster]
          Length = 534

 Score =  123 bits (309), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 81/239 (33%), Positives = 119/239 (49%), Gaps = 39/239 (16%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           + L+C G    P +  + L CFY      FL++ PLK E++ LDP VV  H+ +   EI+
Sbjct: 286 FKLSCNG----PLESSTRLHCFYNFTTTPFLRLAPLKTEQIGLDPYVVLYHEVLSAREIS 341

Query: 63  RII-----ELSKGKVERGKVV---NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRI 114
            +I      +   K+ + + V   N G       R +K ++L  E    +    +I  RI
Sbjct: 342 MLIGKAAQNMKNTKIHKERAVPKKNRG-------RTAKGFWLKKE---SNELTKRITRRI 391

Query: 115 QDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLW---------RLAS 161
            DMT   +   E +    Q+ NYG+GGHY LH D    A+                R+A+
Sbjct: 392 MDMTGFDLADSEGF----QVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDLGDRIAT 447

Query: 162 FMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            +FYLTDVE GGAT+F  +   V P+ G+A+FWYN   +   D R  H+ CPV +G+KW
Sbjct: 448 VLFYLTDVEQGGATVFGDVGYYVSPQAGTAIFWYNLDTDGNGDPRTRHAACPVIVGSKW 506


>gi|221460681|ref|NP_733394.3| CG31013 [Drosophila melanogaster]
 gi|220903261|gb|AAF57073.4| CG31013 [Drosophila melanogaster]
          Length = 534

 Score =  123 bits (309), Expect = 5e-26,   Method: Compositional matrix adjust.
 Identities = 79/236 (33%), Positives = 118/236 (50%), Gaps = 33/236 (13%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           + L+C G    P +  + L CFY      FL++ PLK E++ LDP VV  H+ +   EI+
Sbjct: 286 FKLSCNG----PLESSTRLHCFYNFTTTPFLRLAPLKTEQIGLDPYVVLYHEVLSAREIS 341

Query: 63  RII-----ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
            +I      +   K+ + + V   +      R +K ++L  E    +    +I  RI DM
Sbjct: 342 MLIGKAAQNMKNTKIHKERAVPKKNR----GRTAKGFWLKKE---SNELTKRITRRIMDM 394

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLW---------RLASFMF 164
           T   +   E +    Q+ NYG+GGHY LH D    A+                R+A+ +F
Sbjct: 395 TGFDLADSEGF----QVINYGIGGHYFLHMDYFDFASSNHTDTRSRYSIDLGDRIATVLF 450

Query: 165 YLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           YLTDVE GGAT+F  +   V P+ G+A+FWYN   +   D R  H+ CPV +G+KW
Sbjct: 451 YLTDVEQGGATVFGDVGYYVSPQAGTAIFWYNLDTDGNGDPRTRHAACPVIVGSKW 506


>gi|195069738|ref|XP_001997014.1| GH23597 [Drosophila grimshawi]
 gi|193892024|gb|EDV90890.1| GH23597 [Drosophila grimshawi]
          Length = 239

 Score =  123 bits (309), Expect = 6e-26,   Method: Compositional matrix adjust.
 Identities = 76/205 (37%), Positives = 111/205 (54%), Gaps = 10/205 (4%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           S L C Y+  ++ FL++ PLK+E L +DP VV  H+AIYDSEI+ +  L + ++ R ++ 
Sbjct: 21  SRLICNYKMDSSPFLRLAPLKMEMLSMDPYVVVFHEAIYDSEIDELRRLCESRLSRTEIA 80

Query: 79  NYGDTIYVDTRLSKVYFLYPEIFGDH-PFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
             G    + +  S V+    ++       L +I+ R+ DM+ L+I    +    +Q   Y
Sbjct: 81  KQGKNKSIRSS-SGVWIFELDLNRQQLELLERIRRRVADMSGLLIDFNSQ---EVQYMEY 136

Query: 138 GLGGHYDLHCD--ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWY 195
             GGHY  H D    P  E   R+A+ +FYL DV  GGATIFP L L V PE+G  + W+
Sbjct: 137 VFGGHYYPHWDFKGIPHLED--RIATVLFYLNDVARGGATIFPDLELLVQPERGKVLHWH 194

Query: 196 NAHANTL-LDYRMYHSGCPVALGNK 219
           N    T  L+ R  H  CPV +G K
Sbjct: 195 NMDLGTYDLEKRSLHGACPVIMGKK 219


>gi|395521232|ref|XP_003764722.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Sarcophilus
           harrisii]
          Length = 521

 Score =  123 bits (308), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 79/228 (34%), Positives = 121/228 (53%), Gaps = 13/228 (5%)

Query: 1   EIYPLACQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           + Y   CQ   S P   +  +L C YE+  + +L + P++ E L+L+P +V  HD + DS
Sbjct: 276 DTYEGLCQTLGSQPTHYQIPSLYCAYETNGSPYLLLQPVRKEVLHLEPYIVLYHDFVSDS 335

Query: 60  EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
           E  +I   +   ++R  V +      V+ R+SK  +L   +    P L  +  RI  +T 
Sbjct: 336 EAQKIRGFAAPWLQRSVVASGEKQQQVEYRISKSAWLKDTV---DPILVSLDRRIAALTG 392

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELG 172
           L +  +  Y   LQ+ NYG+GGHY+ H D AT     L+R+      A+FM YL+ VE G
Sbjct: 393 LNV--QPPYAEHLQVVNYGIGGHYEPHFDHATSPSSPLYRMNSGNRVATFMIYLSSVEAG 450

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           G+T F   N +V   K +A+FW+N H +   D    H+GCPV +G+KW
Sbjct: 451 GSTAFIYANFSVPVVKNAALFWWNLHRSGQGDGDTLHAGCPVLVGDKW 498


>gi|344274276|ref|XP_003408943.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3
           [Loxodonta africana]
          Length = 516

 Score =  123 bits (308), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 74/224 (33%), Positives = 115/224 (51%), Gaps = 25/224 (11%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+V+ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIVRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEVVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKG--PLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATI 176
            L +   E  +   P      G G                 R+A+++FY++DV  GGAT+
Sbjct: 406 GLDVSTAEELQKDEPDAFKELGTGN----------------RIATWLFYMSDVSAGGATV 449

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 450 FPDVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 493


>gi|195159160|ref|XP_002020450.1| GL13507 [Drosophila persimilis]
 gi|194117219|gb|EDW39262.1| GL13507 [Drosophila persimilis]
          Length = 543

 Score =  123 bits (308), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 78/212 (36%), Positives = 111/212 (52%), Gaps = 16/212 (7%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           + L C Y +    FL++ PL++EEL LDP +V  H  + D E+ R+  +S   + R +V 
Sbjct: 306 ARLHCRYNATTTAFLRLAPLRMEELSLDPYIVLYHSVLSDEEMARLENMSTPLLHRARVF 365

Query: 79  NYG---DTIYVDTRLSKVYFLYPEIFGDHPFLYK-IQTRIQDMTNLVIGREERYKGPLQI 134
           + G     I       +V    P++  +   L + IQ RI D+T L++    R    +Q 
Sbjct: 366 DSGIRKPKISPARTADEVQIPNPKLVAEDIQLVECIQKRITDLTGLMLTSMRR----IQF 421

Query: 135 NNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEK 188
             YG GG Y  H D       T R  G  R+A+ +FYL DVE GGAT FP+L+L V  E+
Sbjct: 422 LKYGFGGIYVPHHDFFSVHTPTSRLHGD-RIATVIFYLNDVEHGGATAFPNLDLVVPTER 480

Query: 189 GSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
           G+ +FW+N    T  LDYR  H  CPV +G K
Sbjct: 481 GAVLFWHNMDGETYDLDYRTLHGACPVIVGTK 512


>gi|194905419|ref|XP_001981192.1| GG11932 [Drosophila erecta]
 gi|190655830|gb|EDV53062.1| GG11932 [Drosophila erecta]
          Length = 535

 Score =  123 bits (308), Expect = 7e-26,   Method: Compositional matrix adjust.
 Identities = 77/230 (33%), Positives = 118/230 (51%), Gaps = 22/230 (9%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           +Y   C+G L+     + +L+C       + L   P K+EEL+LDP VV++H  I   + 
Sbjct: 284 MYEQVCRGELAPLPSKQRDLRC---RLWRSRLGYAPFKLEELHLDPPVVQLHQVIGSKDA 340

Query: 62  NRIIELSKGKVERGKV---VNYGDTIYVDTRLSK-VYFLYPEIFGDHPFLYKIQTRIQDM 117
             +   ++ +++R  V      GD+     R S+   F Y      +     +   + D 
Sbjct: 341 ESLQRTARPRIKRSTVYSLAGNGDSTAAAFRTSQGASFNYSR----NAATKLLSHHVGDF 396

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD----EGLW---RLASFMFYLTDVE 170
           + L +     Y   LQ+ NYG+GGHY+ H D+ P +    EG     R+A+ ++YL+DVE
Sbjct: 397 SGLNM----EYAEDLQVANYGIGGHYEPHWDSFPDNHVYQEGDLHGNRIATAIYYLSDVE 452

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GG T FP L L V PE+GS +FWYN H +   D+R  H+ CPV  G+KW
Sbjct: 453 AGGGTAFPFLPLLVTPERGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502


>gi|198466403|ref|XP_002135183.1| GA23911 [Drosophila pseudoobscura pseudoobscura]
 gi|198150584|gb|EDY73810.1| GA23911 [Drosophila pseudoobscura pseudoobscura]
          Length = 534

 Score =  122 bits (307), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 78/227 (34%), Positives = 117/227 (51%), Gaps = 23/227 (10%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y + C+G        ++ L C Y      FL++ PLK+EE+  DP +V  H+ + D EI 
Sbjct: 296 YEIGCRGLFPK----RTKLVCRYNFTTTPFLRLAPLKMEEVNHDPYIVMYHEVLSDREIE 351

Query: 63  RIIELSKGKVERGKVVN-YGDTIYVD-TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            +    KG+   G++ N + D    + T++  +   +           ++  RI DMTN 
Sbjct: 352 EM----KGR--SGQMSNGWADQKEANSTKIRDIVCRHTWWREQSAIKERVNRRISDMTNF 405

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD------ATPRDEGLW-RLASFMFYLTDVELGG 173
               +E     LQ+ NYGLG H+  H D       TP    L  RL S +FY +DV  GG
Sbjct: 406 DFPPQE----DLQVANYGLGTHFKPHYDYTSDGYETPDVLTLGDRLGSIIFYASDVPQGG 461

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           AT+FP   +++FP KGS+VFWYN + +  +D R  HS CPV +G++W
Sbjct: 462 ATVFPRSRVSIFPRKGSSVFWYNLYDDGRIDTRSQHSVCPVIVGDRW 508


>gi|301759032|ref|XP_002915381.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Ailuropoda
           melanoleuca]
          Length = 539

 Score =  122 bits (307), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 79/222 (35%), Positives = 122/222 (54%), Gaps = 13/222 (5%)

Query: 7   CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           CQ   S P   +  +L C YE+ ++ +L + P++ E ++L+P VV  HD + D E  +I 
Sbjct: 300 CQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDGEAQKIR 359

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
            L++  ++R  V +    + V+ R+SK  +L   +    P L  +  RI  +T L +  +
Sbjct: 360 GLAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTV---DPLLVTLDHRIGALTGLDV--Q 414

Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
             Y   LQ+ NYG+GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F 
Sbjct: 415 PPYAEYLQVVNYGIGGHYEPHFDHATSPTSPLYRMKSGNRVATFMIYLSSVEAGGATAFI 474

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
             N +V   K +A+FW+N H +   D    H+GCPV +G+KW
Sbjct: 475 YANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 516


>gi|195494568|ref|XP_002094893.1| GE19959 [Drosophila yakuba]
 gi|194180994|gb|EDW94605.1| GE19959 [Drosophila yakuba]
          Length = 486

 Score =  122 bits (307), Expect = 8e-26,   Method: Compositional matrix adjust.
 Identities = 78/214 (36%), Positives = 117/214 (54%), Gaps = 25/214 (11%)

Query: 18  KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
           K+NL C Y S  N FLK+ PLK+EE+  DP +V  H+ I D EI  +    KG +   + 
Sbjct: 246 KTNLVCRYNSSTNAFLKLAPLKMEEISRDPYIVMFHEVISDKEIEEM----KGDI---RE 298

Query: 78  VNYGDTIYVDTR--LSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQIN 135
           +  G T   D +  +S VY++  E      F  +I  RI DMT   +   E +   +Q+ 
Sbjct: 299 MENGWTGLEDPKEIVSSVYWIREET----SFSKRINQRISDMTGFKL---EEFVA-IQLA 350

Query: 136 NYGLGGHYDLHCDA-TPRDEGLW-------RLASFMFYLTDVELGGATIFPSLNLTVFPE 187
           N+G+GG++  H D  T R  G+        R+AS +FY  +V  GG T+FP L + V P+
Sbjct: 351 NFGVGGYFKPHFDYYTERLRGVDANNTLGDRIASIIFYAGEVSQGGQTVFPDLKVVVEPK 410

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
           +G+A+FW+N   ++  D R  HS CPV +G++W 
Sbjct: 411 RGNALFWFNKLDDSSPDPRSLHSVCPVIVGSRWS 444


>gi|81870817|sp|Q6W3F0.1|P4HA3_MOUSE RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|36962749|gb|AAQ87604.1| collagen prolyl 4-hydroxylase alpha III subunit [Mus musculus]
          Length = 542

 Score =  122 bits (307), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 79/222 (35%), Positives = 120/222 (54%), Gaps = 13/222 (5%)

Query: 7   CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           CQ   S P   +  +L C YE+ ++ +L + P + E ++L P +   HD + D E  +I 
Sbjct: 303 CQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPARKEVVHLRPLIALYHDFVSDEEAQKIR 362

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
           EL++  ++R  V +    + V+ R+SK  +L   +    P L  +  RI  +T L I  +
Sbjct: 363 ELAEPWLQRSVVASGEKQLQVEYRISKSAWLKDTV---DPMLVTLDHRIAALTGLDI--Q 417

Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
             Y   LQ+ NYG+GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F 
Sbjct: 418 PPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFI 477

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
             N +V   K +A+FW+N H +   D    H+GCPV +G+KW
Sbjct: 478 YGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 519


>gi|195505214|ref|XP_002099407.1| GE23379 [Drosophila yakuba]
 gi|194185508|gb|EDW99119.1| GE23379 [Drosophila yakuba]
          Length = 547

 Score =  122 bits (307), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 78/227 (34%), Positives = 109/227 (48%), Gaps = 15/227 (6%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           ++P  C G   V  ++ + L C Y    + FL++ P+K E L +DP V+  HD I   E 
Sbjct: 293 LFPPCCSGRCEVSRNL-TGLYCVYNHVTSPFLQLAPIKTEILSIDPFVLLFHDMISQKES 351

Query: 62  NRIIELSKGKVERGKVVNY---GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
             I   SK  +      +    G   +V T  +     Y     D     +I  R+ D T
Sbjct: 352 TLIRSSSKEHMLPSATTDVDASGSEDHVATFRTSKSVWYSSTSNDTT--KRITERLGDAT 409

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGG 173
            L +   E +    Q+ NYGLGG ++ H D    D   +     RLA+ +FYL +V  GG
Sbjct: 410 GLDMNFTEYF----QVINYGLGGFFETHLDMLLSDRSRFNGTRDRLATTLFYLNEVRQGG 465

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            T FP LNLTVFP+ GSA+FWYN            H+GCPV +G+KW
Sbjct: 466 GTHFPRLNLTVFPQPGSALFWYNLDTRGNDHTSTLHTGCPVIVGSKW 512


>gi|227908832|ref|NP_796135.3| prolyl 4-hydroxylase subunit alpha-3 precursor [Mus musculus]
          Length = 542

 Score =  122 bits (307), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 79/222 (35%), Positives = 120/222 (54%), Gaps = 13/222 (5%)

Query: 7   CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           CQ   S P   +  +L C YE+ ++ +L + P + E ++L P +   HD + D E  +I 
Sbjct: 303 CQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPARKEVVHLRPLIALYHDFVSDEEAQKIR 362

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
           EL++  ++R  V +    + V+ R+SK  +L   +    P L  +  RI  +T L I  +
Sbjct: 363 ELAEPWLQRSVVASGEKQLQVEYRISKSAWLKDTV---DPMLVTLDHRIAALTGLDI--Q 417

Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
             Y   LQ+ NYG+GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F 
Sbjct: 418 PPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFI 477

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
             N +V   K +A+FW+N H +   D    H+GCPV +G+KW
Sbjct: 478 YGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 519


>gi|194751825|ref|XP_001958224.1| GF23629 [Drosophila ananassae]
 gi|190625506|gb|EDV41030.1| GF23629 [Drosophila ananassae]
          Length = 523

 Score =  122 bits (307), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 76/210 (36%), Positives = 111/210 (52%), Gaps = 17/210 (8%)

Query: 18  KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
           K NL C Y      FL++ PLK+EE+ LDP VV  H+ +Y++EI  + + S G ++ G  
Sbjct: 303 KMNLFCRYNFTTTPFLRLAPLKLEEINLDPYVVMYHEVLYETEIEELKKQS-GHMKNGYA 361

Query: 78  VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
                T+Y       V   +     + P   +I  RI+DMT L     +     LQ+ NY
Sbjct: 362 DQKNGTMY-----RAVVARHSWWSDESPTRERINRRIRDMTGLDFPITD----TLQVANY 412

Query: 138 GLGGHYDLHCD------ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKGS 190
           G G ++  H D       TP  + L  RL + +FY +DV  GGAT+FP + +++ P KGS
Sbjct: 413 GCGTYFKPHFDYTSDGYETPNADALGDRLGTIIFYASDVLQGGATVFPDIKVSITPRKGS 472

Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +VFWYN + +   D R  HS CPV  G++W
Sbjct: 473 SVFWYNLYDDGRPDIRSRHSVCPVINGDRW 502


>gi|326923465|ref|XP_003207956.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 3
           [Meleagris gallopavo]
          Length = 518

 Score =  122 bits (307), Expect = 9e-26,   Method: Compositional matrix adjust.
 Identities = 75/224 (33%), Positives = 115/224 (51%), Gaps = 25/224 (11%)

Query: 3   YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  L +    +  L C +Y+   N    +GP+K E+ +  PR+V+  D I D E
Sbjct: 291 YEMLCRGEGLKMTPRRQKRLFCRYYDGNRNPRYILGPVKQEDEWDKPRIVRFLDIISDEE 350

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + EL+K ++ R  V +   G       R+SK  +L      + P + +I TRIQD+T
Sbjct: 351 IETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSGY---ESPVVSRINTRIQDLT 407

Query: 119 NLVIGREERYKG--PLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATI 176
            L +   E  +   P      G G                 R+A+++FY++DV  GGAT+
Sbjct: 408 GLDVSTAEELQKDEPDAFKELGTGN----------------RIATWLFYMSDVSAGGATV 451

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FP +  +V+P+KG+AVFWYN   +   DY   H+ CPV +GNKW
Sbjct: 452 FPEVGASVWPKKGTAVFWYNLFPSGEGDYSTRHAACPVLVGNKW 495


>gi|52139015|gb|AAH82538.1| P4ha3 protein [Mus musculus]
          Length = 404

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 80/228 (35%), Positives = 122/228 (53%), Gaps = 13/228 (5%)

Query: 1   EIYPLACQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           + Y   CQ   S P   +  +L C YE+ ++ +L + P + E ++L P +   HD + D 
Sbjct: 159 DTYEGLCQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPARKEVVHLRPLIALYHDFVSDE 218

Query: 60  EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
           E  +I EL++  ++R  V +    + V+ R+SK  +L   +    P L  +  RI  +T 
Sbjct: 219 EAQKIRELAEPWLQRSVVASGEKQLQVEYRISKSAWLKDTV---DPMLVTLDHRIAALTG 275

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELG 172
           L I  +  Y   LQ+ NYG+GGHY+ H D AT     L+R+      A+FM YL+ VE G
Sbjct: 276 LDI--QPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAG 333

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GAT F   N +V   K +A+FW+N H +   D    H+GCPV +G+KW
Sbjct: 334 GATAFIYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 381


>gi|395820528|ref|XP_003783616.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Otolemur
           garnettii]
          Length = 516

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 73/224 (32%), Positives = 115/224 (51%), Gaps = 25/224 (11%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKG--PLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATI 176
            L +   E  +   P      G G                 R+A+++FY++DV  GGAT+
Sbjct: 406 GLDVSTAEELQKDEPDAFKELGTGN----------------RIATWLFYMSDVSAGGATV 449

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 450 FPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 493


>gi|354504916|ref|XP_003514519.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Cricetulus
           griseus]
          Length = 509

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 81/228 (35%), Positives = 123/228 (53%), Gaps = 13/228 (5%)

Query: 1   EIYPLACQGNLSVPEDIKS-NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           + Y   CQ   S P   ++  L C YE+ ++ +L + P + E ++L P V   HD + D+
Sbjct: 264 DTYEGLCQTLGSQPTHYQNPRLYCSYETNSSPYLLLQPARKEVIHLRPFVALYHDFVSDA 323

Query: 60  EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
           E  +I EL++  ++R  V +    + V+ R+SK  +L   +    P L  +  RI  +T 
Sbjct: 324 EAQKIRELAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTV---DPMLGTLDHRIAALTG 380

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELG 172
           L I  +  Y   LQ+ NYG+GGHY+ H D AT     L+R+      A+FM YL+ VE G
Sbjct: 381 LDI--QPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSAVEAG 438

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GAT F   N +V   K +A+FW+N H +   D    H+GCPV +G+KW
Sbjct: 439 GATAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 486


>gi|291404186|ref|XP_002718473.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit isoform 3
           [Oryctolagus cuniculus]
          Length = 516

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 73/224 (32%), Positives = 115/224 (51%), Gaps = 25/224 (11%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKG--PLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATI 176
            L +   E  +   P      G G                 R+A+++FY++DV  GGAT+
Sbjct: 406 GLDVSTAEELQKDEPDAFKELGTGN----------------RIATWLFYMSDVSAGGATV 449

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 450 FPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 493


>gi|148701598|gb|EDL33545.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_c [Mus
           musculus]
 gi|149052607|gb|EDM04424.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide (predicted),
           isoform CRA_d [Rattus norvegicus]
          Length = 189

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 70/173 (40%), Positives = 100/173 (57%), Gaps = 15/173 (8%)

Query: 56  IYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR 113
           + D EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R
Sbjct: 1   MSDEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRR 57

Query: 114 IQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLT 167
           +Q +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++
Sbjct: 58  MQHITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMS 113

Query: 168 DVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 114 DVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 166


>gi|217272851|ref|NP_001136068.1| prolyl 4-hydroxylase subunit alpha-1 isoform 3 precursor [Homo
           sapiens]
 gi|114631189|ref|XP_001140871.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 10 [Pan
           troglodytes]
          Length = 516

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 73/224 (32%), Positives = 115/224 (51%), Gaps = 25/224 (11%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKG--PLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATI 176
            L +   E  +   P      G G                 R+A+++FY++DV  GGAT+
Sbjct: 406 GLDVSTAEELQKDEPDAFKELGTGN----------------RIATWLFYMSDVSAGGATV 449

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 450 FPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 493


>gi|126327904|ref|XP_001367838.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Monodelphis
           domestica]
          Length = 559

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 79/228 (34%), Positives = 121/228 (53%), Gaps = 13/228 (5%)

Query: 1   EIYPLACQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           + Y   CQ   S P   +  +L C YE+  + +L + P++ E L+L+P +V  HD + DS
Sbjct: 314 DTYEGLCQTLGSQPTHYQIPSLYCAYETNASPYLLLQPVRKEVLHLEPYIVLYHDFVSDS 373

Query: 60  EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
           E  +I   +   ++R  V +      V+ R+SK  +L   +    P L  +  RI  +T 
Sbjct: 374 EAQKIRGFAAPWLQRSVVASGEKQQQVEYRISKSAWLKDTV---DPMLVSLDHRIAALTG 430

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELG 172
           L +  +  Y   LQ+ NYG+GGHY+ H D AT     L+R+      A+FM YL+ VE G
Sbjct: 431 LNV--QPPYAEHLQVVNYGIGGHYEPHFDHATSPSSPLYRMNSGNRVATFMIYLSSVEAG 488

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           G+T F   N +V   K +A+FW+N H +   D    H+GCPV +G+KW
Sbjct: 489 GSTAFIYANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 536


>gi|442747045|gb|JAA65682.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
          Length = 538

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 71/234 (30%), Positives = 116/234 (49%), Gaps = 17/234 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           + Y   C+G L     + S L+C Y    + F  + P+K+EE+ L P ++ +HD + D +
Sbjct: 279 QSYKRLCRGELLRSPKMDSQLRCRYYKGQDGFFSLQPIKLEEINLKPYIIVMHDVVQDKD 338

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           I  ++  ++ ++ER       + +    R S   +L  +   + P   ++ + ++ +  +
Sbjct: 339 IKDLMAYAEPRLERSTTYTGSEMVPSPVRTSSTAWLNED---EAPIAVRMNSYLRALLGM 395

Query: 121 VIGREERYKGPLQINNYGLGG----HYD-----LHCDATPRDEGLW-----RLASFMFYL 166
                       Q+ NYG GG    H+D     LH   +  D  L      RLA+ M Y+
Sbjct: 396 GTSDTNEEAEAYQLANYGTGGQFLPHHDFLQDSLHSYNSSADYYLQYGTGDRLATLMIYM 455

Query: 167 TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           TDVE GGAT+FPSL + + P+KG A FW+N  A+   D    H+GCPV  G+KW
Sbjct: 456 TDVEEGGATVFPSLGIRLTPKKGDAAFWWNLKASGEGDRLTTHAGCPVLYGSKW 509


>gi|198449524|ref|XP_002136918.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
 gi|198130646|gb|EDY67476.1| GA26871 [Drosophila pseudoobscura pseudoobscura]
          Length = 530

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 77/214 (35%), Positives = 114/214 (53%), Gaps = 22/214 (10%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           S L C Y +    FL++ PL++EEL LDP +V  H+ + D+EI ++  +++  +   K +
Sbjct: 298 SRLHCRYNATTTPFLRLAPLRMEELSLDPYIVVYHNVLSDAEIAKVERVAEPLL---KSI 354

Query: 79  NYGDTIYVDTRLSKVYFLYPEIFGDH-------PFLYKIQTRIQDMTNLVIGREERYKGP 131
             G+    +++ SKV         D        P + +I  RI DMT L+I    ++   
Sbjct: 355 GVGEMD--NSKKSKVRTALGAWIPDKNMHISGWPVIQRIVRRIHDMTGLII----KHGQV 408

Query: 132 LQINNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFP 186
           +Q+  YG GGHYD H D    + P  + L  R+A+ +FYL DV+ GG+T+FP L L V  
Sbjct: 409 VQLIKYGYGGHYDTHFDYLNDSLPITQALGDRMATVLFYLNDVKHGGSTVFPVLKLKVPS 468

Query: 187 EKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
           E+G  + WYN H  T  LD R  H  CPV  G K
Sbjct: 469 ERGKVLVWYNMHGETHDLDSRTLHGSCPVIDGAK 502


>gi|355709028|gb|AES03457.1| prolyl 4-hydroxylase, alpha polypeptide III [Mustela putorius furo]
          Length = 477

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 75/208 (36%), Positives = 117/208 (56%), Gaps = 12/208 (5%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
           +L C YE+ ++ +L + P++ E ++L+P VV  HD + D E  +I  L++  ++R  V +
Sbjct: 253 SLYCSYETNSSPYLLLQPIRKEVIHLEPYVVLYHDFVSDMEAQKIRGLAEPWLQRSVVAS 312

Query: 80  YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
               + V+ R+SK  +L   +    P L  +  RI  +T L +  +  Y   LQ+ NYG+
Sbjct: 313 GEKQLPVEYRISKSAWLKDTV---DPLLVNLDHRIGALTGLDV--QPPYAEYLQVVNYGI 367

Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
           GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F   N +V   K +A+
Sbjct: 368 GGHYEPHFDHATSPTSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAAL 427

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FW+N H +   D    H+GCPV +G+KW
Sbjct: 428 FWWNLHRSGEGDGDTLHAGCPVLVGDKW 455


>gi|321461762|gb|EFX72791.1| hypothetical protein DAPPUDRAFT_308081 [Daphnia pulex]
          Length = 561

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 80/236 (33%), Positives = 119/236 (50%), Gaps = 23/236 (9%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y   C+G      +I++ L+C   +  +  L + P+KVEE  LDP +V +HD I + +
Sbjct: 303 EHYERLCRGEKLRSANIEAGLRCRLVTRGHPALLLQPIKVEEQSLDPMIVVLHDLITERQ 362

Query: 61  INRIIELSKGKV----ERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
              + +L + K+     RG     G  +    R SK  +L      ++  L  I+ R++ 
Sbjct: 363 TEILRQLGEPKLATSLHRG---GEGKFVRSMIRTSKNAWLQEH---ENASLPAIRHRMEL 416

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYD------LHCDATPRDEGLW------RLASFMF 164
            T L+ G E   +   QI NYG+GG Y       +H D  P D+  W      R+A+ M 
Sbjct: 417 ATGLIYGPETASEY-FQIANYGIGGLYKTHTDNVIHPDVRPEDQDPWNLYVGDRIATLMV 475

Query: 165 YLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           YL+DVE GGAT+FP   +T +P KGSA FW+N + +   D    H  CPV  G+KW
Sbjct: 476 YLSDVEAGGATVFPRAGVTCWPRKGSAAFWWNLYKSGEPDLTTRHGACPVLHGSKW 531


>gi|195505199|ref|XP_002099401.1| GE23383 [Drosophila yakuba]
 gi|194185502|gb|EDW99113.1| GE23383 [Drosophila yakuba]
          Length = 535

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 77/230 (33%), Positives = 114/230 (49%), Gaps = 22/230 (9%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           +Y   C+G L+     + NL+C       + L   P K+EEL+LDP +V++H  I   + 
Sbjct: 284 MYEQVCRGELAPLPAKQRNLRC---RLRKSRLGYAPFKLEELHLDPLLVQLHQVIGAKDS 340

Query: 62  NRIIELSKGKVERGKVV----NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
             +   ++ +++R  V     N G T           F Y          +     + D 
Sbjct: 341 ESLQRTARPRIKRSTVYSLAGNGGSTAAAFRTSQGASFNYSRSAATKLLSH----HVGDF 396

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD----EGLW---RLASFMFYLTDVE 170
           + L +     Y   LQ+ NYG+GGHY+ H D+ P +    EG     R+A+ ++YL+DVE
Sbjct: 397 SGLNM----EYAEDLQVANYGIGGHYEPHWDSFPENHVYQEGDLHGNRIATGIYYLSDVE 452

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GG T FP L L V PEKGS +FWYN H +   D+R  H+ CPV  G+KW
Sbjct: 453 AGGGTAFPFLPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502


>gi|386766694|ref|NP_651648.5| CG11828 [Drosophila melanogaster]
 gi|383293009|gb|AAF56834.5| CG11828 [Drosophila melanogaster]
          Length = 458

 Score =  122 bits (306), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 78/228 (34%), Positives = 117/228 (51%), Gaps = 26/228 (11%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           C+G   +P   KS L+C Y    + FL++ P+K+E+L ++P V   HDAI  +E   ++ 
Sbjct: 239 CRGKNLLPS--KSYLRCRYLRDGSPFLRMAPVKLEQLNIEPFVGLFHDAISPAEQKDLLH 296

Query: 67  LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
           L+  ++E  K  +      VDT  S           DH  + +I  RI+D+T   +   E
Sbjct: 297 LTDSRLEHRKKDSSSVEAKVDTNAS-----------DH--VRRIHQRIEDITGFDLEESE 343

Query: 127 RYKGPLQINNYGLGGHYDLHCDATPRDEGL------WRLASFMFYLTDVELGGATIFPSL 180
               PL ++NYG+GG   +H D     E +      +R AS MFYL+DV++GG   FP L
Sbjct: 344 ----PLTVSNYGIGGQDFIHLDCEQPKEFIGYYPKEYRSASAMFYLSDVQMGGYASFPDL 399

Query: 181 NLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW-GKLLLSG 227
                P +GSA+ W+N   +   D R   + CPV LGN+W  K  +SG
Sbjct: 400 GFGFKPRRGSALVWHNTDNSGNCDTRSLQATCPVLLGNQWVAKKWISG 447


>gi|73988166|ref|XP_851718.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Canis lupus
           familiaris]
          Length = 544

 Score =  122 bits (305), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 79/222 (35%), Positives = 122/222 (54%), Gaps = 13/222 (5%)

Query: 7   CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           CQ   S P   +  +L C YE+ ++ +L + P++ E ++L+P VV  HD + D E  +I 
Sbjct: 305 CQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPVRKEVIHLEPYVVLYHDFVNDVEAQKIR 364

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
            L++  ++R  V +    + V+ R+SK  +L   +    P L  +  RI  +T L +  +
Sbjct: 365 GLAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTV---DPLLVTLDHRIGALTGLDV--Q 419

Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
             Y   LQ+ NYG+GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F 
Sbjct: 420 PPYAEYLQVVNYGIGGHYEPHFDHATSPTSPLYRMKSGNRVATFMIYLSSVEAGGATAFI 479

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
             N +V   K +A+FW+N H +   D    H+GCPV +G+KW
Sbjct: 480 YANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 521


>gi|443697961|gb|ELT98195.1| hypothetical protein CAPTEDRAFT_181380 [Capitella teleta]
          Length = 530

 Score =  122 bits (305), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 77/227 (33%), Positives = 111/227 (48%), Gaps = 17/227 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G  +     K  L C Y+ Y+  F  I PLK E L  DP +   HD + DS+  
Sbjct: 289 YEKLCRGEETHKRPFKHRLVCRYQRYHPIFY-ISPLKEEMLNFDPAIYVYHDVLTDSQNA 347

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD--HPFLYKIQTRIQDMTNL 120
            I E+S+ K+ R  V +  D    DT LS           D  HP + ++  +   ++NL
Sbjct: 348 IIKEVSRPKLHRSGVFSKTD---ADTGLSNFRTSQTAWHDDSTHPLIARLSQKASAISNL 404

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-------GLWRLASFMFYLTDVELGG 173
            +   E     LQ+ NYG+GG Y+ H D    +E          R+A+F+ YL+++E GG
Sbjct: 405 TLETVEH----LQVLNYGIGGLYEPHWDFVQGEERNEFSESDRNRVATFICYLSELEAGG 460

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            T++P++   V P K S   WYN   N   DYR YH+ CP+  G KW
Sbjct: 461 YTVYPTVGAAVVPRKNSCALWYNLMRNGTGDYRTYHAACPILYGYKW 507


>gi|443697959|gb|ELT98193.1| hypothetical protein CAPTEDRAFT_162820 [Capitella teleta]
          Length = 347

 Score =  122 bits (305), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 77/227 (33%), Positives = 111/227 (48%), Gaps = 17/227 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G  +     K  L C Y+ Y+  F  I PLK E L  DP +   HD + DS+  
Sbjct: 106 YEKLCRGEETHKRPFKHRLVCRYQRYHPIFY-ISPLKEEMLNFDPAIYVYHDVLTDSQNA 164

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD--HPFLYKIQTRIQDMTNL 120
            I E+S+ K+ R  V +  D    DT LS           D  HP + ++  +   ++NL
Sbjct: 165 IIKEVSRPKLHRSGVFSKTD---ADTGLSNFRTSQTAWHDDSTHPLIARLSQKASAISNL 221

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-------GLWRLASFMFYLTDVELGG 173
            +   E     LQ+ NYG+GG Y+ H D    +E          R+A+F+ YL+++E GG
Sbjct: 222 TLETVEH----LQVLNYGIGGLYEPHWDFVQGEERNEFSESDRNRVATFICYLSELEAGG 277

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            T++P++   V P K S   WYN   N   DYR YH+ CP+  G KW
Sbjct: 278 YTVYPTVGAAVVPRKNSCALWYNLMRNGTGDYRTYHAACPILYGYKW 324


>gi|351696981|gb|EHA99899.1| Prolyl 4-hydroxylase subunit alpha-3 [Heterocephalus glaber]
          Length = 572

 Score =  122 bits (305), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 74/207 (35%), Positives = 115/207 (55%), Gaps = 12/207 (5%)

Query: 21  LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY 80
           L C YE+ ++ +L + P++ E ++L+P V   HD + D E  +I +L++  ++R  V + 
Sbjct: 348 LYCSYETNSSPYLLLQPVRKEVIHLEPYVALYHDFVSDPEAQKIRKLAEPWLQRSVVASG 407

Query: 81  GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG 140
              + V+ R+SK  +L        P L  +  RI  +T L +  +  Y   LQ+ NYG+G
Sbjct: 408 EKQLQVEYRISKSAWLKDTA---DPVLVTLDHRIAALTGLDV--QHPYAEYLQVVNYGIG 462

Query: 141 GHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVF 193
           GHY+ H D AT     L+R+      A+FM YL+ VE GGAT F   N +V   K +A+F
Sbjct: 463 GHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANFSVPVVKNAALF 522

Query: 194 WYNAHANTLLDYRMYHSGCPVALGNKW 220
           W+N H +   D    H+GCPV +G+KW
Sbjct: 523 WWNLHRSGEGDGDTLHAGCPVLVGDKW 549


>gi|74216495|dbj|BAE25162.1| unnamed protein product [Mus musculus]
          Length = 187

 Score =  122 bits (305), Expect = 1e-25,   Method: Compositional matrix adjust.
 Identities = 70/171 (40%), Positives = 99/171 (57%), Gaps = 15/171 (8%)

Query: 58  DSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQ 115
           D EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q
Sbjct: 1   DEEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQ 57

Query: 116 DMTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT--PRDEGLW----RLASFMFYLTDV 169
            +T L +   E     LQ+ NYG+GG Y+ H D +  P D GL     RLA+F+ Y++DV
Sbjct: 58  HITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYMSDV 113

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           E GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KW
Sbjct: 114 EAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKW 164


>gi|198466401|ref|XP_002135182.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
 gi|198150583|gb|EDY73809.1| GA23910 [Drosophila pseudoobscura pseudoobscura]
          Length = 530

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 73/223 (32%), Positives = 117/223 (52%), Gaps = 19/223 (8%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y + C+G        ++NL C Y      FL++ PLK+EE+  DP +V  H+ +YD EI 
Sbjct: 296 YEIGCRGLFPK----RTNLVCRYNFTTTPFLRLAPLKMEEVNHDPYIVLYHEVLYDREIE 351

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
            + + SK       ++N       + ++ ++   +   +       +I  RI D+T   +
Sbjct: 352 ELKKQSKN------MINGFSEPQQENKIREIIARHAWWWEQTTTRARIYQRITDITGFQL 405

Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGL-W----RLASFMFYLTDVELGGATIF 177
             +E     L + NYGLG  +  H D TP +  + W     L + +FY++D++ GGATIF
Sbjct: 406 FVQEE----LNVANYGLGTIFGPHYDYTPENYDIGWFMGGPLGTILFYVSDLQQGGATIF 461

Query: 178 PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           PS+N+TV P KGSA+ W+N + +   D R  HS CPV  G++W
Sbjct: 462 PSINITVSPRKGSALLWFNLYDDGEPDPRTLHSSCPVIEGDRW 504


>gi|268572523|ref|XP_002641343.1| C. briggsae CBR-DPY-18 protein [Caenorhabditis briggsae]
 gi|94442971|emb|CAJ98658.1| prolyl 4-hydroxylase [Caenorhabditis briggsae]
          Length = 559

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 77/229 (33%), Positives = 113/229 (49%), Gaps = 18/229 (7%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           +Y   C+  + V +   S L C+Y+  +  FL   P+KVE    +P  V   D I D E+
Sbjct: 284 MYEALCRNEVPVSQKDISKLYCYYKR-DRPFLIYAPIKVEIKRFNPLAVLFKDVISDEEV 342

Query: 62  NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
             I EL+K K+ R  V +   G  +    R+SK  +L      +H  + ++  RI  MTN
Sbjct: 343 ATIQELAKPKLARATVHDSVTGKLVTATYRISKSAWLKA---WEHEVVERVNKRIDLMTN 399

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVEL 171
           L +   E     LQI NYG+GGHYD H D   ++E           R+A+ +FY++    
Sbjct: 400 LEMETAEE----LQIANYGIGGHYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSH 455

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GG T+F  +  TV P K  A+FWYN +     +    H+ CPV +G KW
Sbjct: 456 GGGTVFTEVKSTVLPTKNDALFWYNLYKQGDGNPDTRHAACPVLVGIKW 504


>gi|410972729|ref|XP_003992809.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Felis catus]
          Length = 533

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 79/222 (35%), Positives = 122/222 (54%), Gaps = 13/222 (5%)

Query: 7   CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           CQ   S P   +  +L C YE+ ++ +L + P++ E ++L+P VV  HD + D E  +I 
Sbjct: 294 CQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPIRKEVIHLEPYVVLYHDFVNDLEAQKIR 353

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
            L++  ++R  V +    + V+ R+SK  +L   +    P L  +  RI  +T L +  +
Sbjct: 354 GLAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTV---DPLLVTLDHRIGALTGLDV--Q 408

Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
             Y   LQ+ NYG+GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F 
Sbjct: 409 PPYAEYLQVVNYGIGGHYEPHFDHATSPTSPLYRMKSGNRVATFMIYLSSVEAGGATAFI 468

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
             N +V   K +A+FW+N H +   D    H+GCPV +G+KW
Sbjct: 469 YANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 510


>gi|308497208|ref|XP_003110791.1| CRE-DPY-18 protein [Caenorhabditis remanei]
 gi|308242671|gb|EFO86623.1| CRE-DPY-18 protein [Caenorhabditis remanei]
          Length = 559

 Score =  122 bits (305), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 77/229 (33%), Positives = 113/229 (49%), Gaps = 18/229 (7%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           +Y   C+  + V +   S L C+Y+  +  FL   P+KVE    +P  V   D I D E+
Sbjct: 284 MYEALCRNEVPVSQKDISRLYCYYKR-DRPFLVYAPIKVEIKRFNPLAVLFKDVISDDEV 342

Query: 62  NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
             I EL+K K+ R  V +   G  +    R+SK  +L      +H  + ++  RI+ MTN
Sbjct: 343 ATIQELAKPKLARATVHDSATGKLVTATYRISKSAWLKE---WEHEVVERVNKRIELMTN 399

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVEL 171
           L +   E     LQI NYG+GGHYD H D   ++E           R+A+ +FY++    
Sbjct: 400 LEMETAEE----LQIANYGIGGHYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSH 455

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GG T+F  +  TV P K  A+FWYN       +    H+ CPV +G KW
Sbjct: 456 GGGTVFTEVKSTVLPTKNDALFWYNLFKQGDGNPDTRHAACPVLVGIKW 504


>gi|195341542|ref|XP_002037365.1| GM12152 [Drosophila sechellia]
 gi|194131481|gb|EDW53524.1| GM12152 [Drosophila sechellia]
          Length = 535

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 77/230 (33%), Positives = 118/230 (51%), Gaps = 22/230 (9%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           +Y   C+G L+     + +L+C       + L   P K+EEL+LDP VV++H  I  ++ 
Sbjct: 284 MYEQVCRGELAPLPSKQRSLRC---RLRKSRLGYAPFKLEELHLDPLVVQLHQVIGSNDS 340

Query: 62  NRIIELSKGKVERGKVV----NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
             + + ++  ++R  V     N G T           F Y +    +     +   + D 
Sbjct: 341 ESLQKSARPMIKRSTVYSLGGNGGSTAAAFRTSQGASFNYSK----NAATKLLSHHVGDF 396

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD----EGLW---RLASFMFYLTDVE 170
           ++L +     Y   LQ+ NYG+GGHY+ H D+ P +    EG     R+A+ ++YL+DVE
Sbjct: 397 SDLNMD----YAEDLQVANYGIGGHYEPHWDSFPENHIYQEGDLHGNRIATGIYYLSDVE 452

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            GG T FP L L V PEKGS +FWYN H +   D+R  H+ CPV  G+KW
Sbjct: 453 AGGGTAFPFLPLLVTPEKGSLLFWYNLHPSGDQDFRTKHAACPVLQGSKW 502


>gi|195352176|ref|XP_002042590.1| GM14977 [Drosophila sechellia]
 gi|194124474|gb|EDW46517.1| GM14977 [Drosophila sechellia]
          Length = 485

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 80/225 (35%), Positives = 118/225 (52%), Gaps = 32/225 (14%)

Query: 9   GNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELS 68
           G LSV +    +L C YE   + FL+I PLKVE L L P +V  HD IY+SEI++I  +S
Sbjct: 279 GCLSVWQ-TSQHLSCHYEQNTSEFLRIAPLKVETLSLKPHIVLYHDVIYESEISKIKNIS 337

Query: 69  KGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERY 128
              ++    +   D +  + +L+++         + P    +  RI+DMT    G + + 
Sbjct: 338 LPSLKSP--LRIIDAVDYNLKLAQIR--------EDP-QSPLSLRIKDMT----GEDVKE 382

Query: 129 KGPLQINNYGLGGHYDLHCDATPRDEGLW----RLASFMFYLTDVELGGATIFPSLNLTV 184
               QI+NYG+ G  + H D     +       RL S +F++ DV  GGA  FP+LNLT+
Sbjct: 383 DTDFQIDNYGICGFRNFHTDNIEIQDQTAELGDRLTSILFFMNDVVQGGAFAFPNLNLTI 442

Query: 185 FPEKGSAVFWYNAHANTLLDYRM------YHSGCPVALGNKWGKL 223
           +P KGSA+ W N      LD+RM       H  CPV +G+KW +L
Sbjct: 443 WPHKGSALVWRN------LDHRMQPNKDLLHVSCPVVVGSKWSEL 481


>gi|417402564|gb|JAA48127.1| Putative prolyl 4-hydroxylase alpha subunit [Desmodus rotundus]
          Length = 544

 Score =  121 bits (304), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 78/222 (35%), Positives = 120/222 (54%), Gaps = 13/222 (5%)

Query: 7   CQGNLSVPEDIKS-NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           CQ   S P   ++ +L C YE+  + +L + P++ E ++L+P VV  HD + D E  +I 
Sbjct: 305 CQTLGSQPTHYQNPSLHCSYETGASPYLLLQPIRKEVVHLEPYVVLYHDFVNDLEAQKIR 364

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
             ++  ++R  V +    + V+ R+SK  +L   +    P L  +  RI  +T L    +
Sbjct: 365 GFAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTV---DPMLVTLDRRIAALTGL--DTQ 419

Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
             Y   LQ+ NYG+GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F 
Sbjct: 420 PPYAEHLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFI 479

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
             N +V   K +A+FW+N H +   D    H+GCPV +G+KW
Sbjct: 480 YANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 521


>gi|308451420|ref|XP_003088665.1| CRE-PHY-2 protein [Caenorhabditis remanei]
 gi|308246199|gb|EFO90151.1| CRE-PHY-2 protein [Caenorhabditis remanei]
          Length = 609

 Score =  121 bits (303), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 89/285 (31%), Positives = 124/285 (43%), Gaps = 73/285 (25%)

Query: 1   EIYPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           + Y   C+G +  V +  K+ L+C+ +  +  FLKI P+KVE L  DP  V   + I DS
Sbjct: 295 DAYEALCRGEIPPVEKKWKNKLRCYLKR-DKPFLKIAPIKVEILRFDPLAVLFKNVISDS 353

Query: 60  EINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
           EI  I EL+  K++R  V N   G+  +   R+SK  +L  ++   HP + ++  RI+D 
Sbjct: 354 EIKVIKELASPKLKRATVQNSKTGELEHATYRISKSAWLKGDL---HPVIERVNRRIEDF 410

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCD----------------------------- 148
           T L  G  E     LQ+ NYGLGGHYD H D                             
Sbjct: 411 TGLYQGTSEE----LQVANYGLGGHYDPHFDFARIANYGLGGHYEPHYDMSLVGYHPIQL 466

Query: 149 -----------ATPRDEGLWRLASFMFY----------------------LTDVELGGAT 175
                        P  +   R+A+ +FY                      ++  E GGAT
Sbjct: 467 TVSLEYFQRGVPEPYGKNGNRIATVLFYKEEKNAFKTLNTGNRIATVLFYMSQPERGGAT 526

Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +F  L   VFP K  A+FWYN   +   D R  H+ CPV LG KW
Sbjct: 527 VFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLLGVKW 571


>gi|426245942|ref|XP_004016760.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3, partial [Ovis
           aries]
          Length = 514

 Score =  121 bits (303), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 78/222 (35%), Positives = 122/222 (54%), Gaps = 13/222 (5%)

Query: 7   CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           CQ   S P   +  +L C YE+ ++ +L + P++ E ++L+P VV  HD + D+E  +I 
Sbjct: 275 CQTLGSQPTHYQIPSLYCSYETSSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDAEAQKIR 334

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
            L++  ++R  V +    + V+ R+SK  +L   +    P L  +  RI  +T L +  +
Sbjct: 335 GLAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTV---DPVLVTLDHRIAALTGLDV--Q 389

Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
             Y   LQ+ NYG+GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F 
Sbjct: 390 PPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMNSGNRVATFMIYLSSVEAGGATAFI 449

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
             N +V   K +A+FW+N H +   D    H+ CPV +G+KW
Sbjct: 450 YGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAACPVLVGDKW 491


>gi|281353153|gb|EFB28737.1| hypothetical protein PANDA_003344 [Ailuropoda melanoleuca]
          Length = 456

 Score =  121 bits (303), Expect = 2e-25,   Method: Compositional matrix adjust.
 Identities = 79/222 (35%), Positives = 122/222 (54%), Gaps = 13/222 (5%)

Query: 7   CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           CQ   S P   +  +L C YE+ ++ +L + P++ E ++L+P VV  HD + D E  +I 
Sbjct: 240 CQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDGEAQKIR 299

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
            L++  ++R  V +    + V+ R+SK  +L   +    P L  +  RI  +T L +  +
Sbjct: 300 GLAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTV---DPLLVTLDHRIGALTGLDV--Q 354

Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
             Y   LQ+ NYG+GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F 
Sbjct: 355 PPYAEYLQVVNYGIGGHYEPHFDHATVTMGPLYRMKSGNRVATFMIYLSSVEAGGATAFI 414

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
             N +V   K +A+FW+N H +   D    H+GCPV +G+KW
Sbjct: 415 YANFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 456


>gi|426255748|ref|XP_004021510.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 3 [Ovis
           aries]
          Length = 516

 Score =  120 bits (302), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 73/224 (32%), Positives = 115/224 (51%), Gaps = 25/224 (11%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKG--PLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATI 176
            L +   E  +   P      G G                 R+A+++FY++DV  GGAT+
Sbjct: 406 GLDVSTAEELQKDEPDAFKELGTGN----------------RIATWLFYMSDVLAGGATV 449

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 450 FPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 493


>gi|431838427|gb|ELK00359.1| Prolyl 4-hydroxylase subunit alpha-3 [Pteropus alecto]
          Length = 483

 Score =  120 bits (302), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 78/222 (35%), Positives = 120/222 (54%), Gaps = 13/222 (5%)

Query: 7   CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           CQ   S P   +  +L C YE+ ++ +L + P++ E ++L+P VV  HD + D E  +I 
Sbjct: 244 CQTLGSQPTHYQIPSLHCSYETNSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDLEAQKIR 303

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
            L++  ++R  V +    + V+ R+SK  +L        P L  +  RI  +T L +  +
Sbjct: 304 GLAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTA---DPMLVTLDHRIAALTGLDV--Q 358

Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
             Y   LQ+ NYG+GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F 
Sbjct: 359 PPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFI 418

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
             N +V   K +A+FW+N H +   D    H+ CPV +G+KW
Sbjct: 419 YANFSVPVVKNAALFWWNLHRSGEGDSDTLHAACPVLVGDKW 460


>gi|195338688|ref|XP_002035956.1| GM16188 [Drosophila sechellia]
 gi|194129836|gb|EDW51879.1| GM16188 [Drosophila sechellia]
          Length = 392

 Score =  120 bits (302), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 75/222 (33%), Positives = 111/222 (50%), Gaps = 20/222 (9%)

Query: 6   ACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
            CQG           L C Y S    F++I PLK EE+  DP +   HD IYDSEI ++ 
Sbjct: 165 GCQGKFPP----GPQLVCRYNSTTTPFMRIAPLKEEEISRDPLIWLYHDVIYDSEITQLT 220

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
            L++ ++  G   NY        R+++++ +             +  R+ D++ L +G  
Sbjct: 221 NLTREEMILGTTTNYT----TPDRVNRLFHIKVTNDDGGKLDKTLVNRMADISGLDMGNT 276

Query: 126 ERYKGPLQINNYGLGGHYDLHCD-------ATPRDEGLWRLASFMFYLTDVELGGATIFP 178
                 L   NYGLGG++  H D           +EG  RL +F+FY+TDV +GG TIFP
Sbjct: 277 TT----LARINYGLGGYFQEHSDYMDIKLHPELTEEGD-RLMTFLFYMTDVLVGGGTIFP 331

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
              L + P+KGSA+FWYN H N   +    H+ CP  +G++W
Sbjct: 332 GAQLAIQPKKGSALFWYNLHNNGDPNPLTRHAVCPTIVGSRW 373


>gi|51490656|emb|CAF31507.1| prolyl 4-hydroxylase 2 precursor [Brugia malayi]
          Length = 551

 Score =  120 bits (302), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 76/229 (33%), Positives = 119/229 (51%), Gaps = 18/229 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           + Y   C+  + +    +S L C+Y+  +  +L++ P KVE ++ +P VV   D + D E
Sbjct: 283 DTYEALCRQEVPINTKAQSRLYCYYK-MDRPYLRLAPFKVEIVHQNPLVVLFRDIVSDEE 341

Query: 61  INRIIE-LSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
           + RIIE L+  K+ R  V N   G+      R S+  +L      +H  + +I  R+   
Sbjct: 342 M-RIIEMLAVPKLARATVHNVVTGNIETAFYRTSQSSWLGS---TEHEVVKRINKRLDLA 397

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVEL 171
           TNL    E      LQ+ NYG+GGHY+ H D + R+          R+A+ + Y+T+ E+
Sbjct: 398 TNL----ETETAEELQVQNYGIGGHYEPHYDCSRRENVFEKTKNGNRIATILIYMTEPEI 453

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GG T+F  L  +V   K +A+FWYN   +  +D R YH+ CPV  G KW
Sbjct: 454 GGGTVFIDLKTSVSCTKNAALFWYNLMRSGAVDMRSYHAACPVLTGTKW 502


>gi|48675383|ref|NP_001001598.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
 gi|75053350|sp|Q75UG4.1|P4HA3_BOVIN RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|47115494|dbj|BAD18888.1| Collagen prolyl 4-hydroxylase alpha III subunit [Bos taurus]
 gi|296479828|tpg|DAA21943.1| TPA: prolyl 4-hydroxylase subunit alpha-3 precursor [Bos taurus]
          Length = 544

 Score =  120 bits (302), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 78/222 (35%), Positives = 121/222 (54%), Gaps = 13/222 (5%)

Query: 7   CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           CQ   S P   +  +L C YE+ ++ +L + P++ E ++L+P VV  HD + D+E   I 
Sbjct: 305 CQTLGSQPTHYRIPSLYCSYETSSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDAEAQTIR 364

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
            L++  ++R  V +    + V+ R+SK  +L   +    P L  +  RI  +T L +  +
Sbjct: 365 GLAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTV---DPVLVTLDHRIAALTGLDV--Q 419

Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
             Y   LQ+ NYG+GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F 
Sbjct: 420 PPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMNSGNRVATFMIYLSSVEAGGATAFI 479

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
             N +V   K +A+FW+N H +   D    H+ CPV +G+KW
Sbjct: 480 YGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAACPVLVGDKW 521


>gi|301613006|ref|XP_002936013.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Xenopus
           (Silurana) tropicalis]
          Length = 504

 Score =  120 bits (301), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 73/228 (32%), Positives = 114/228 (50%), Gaps = 41/228 (17%)

Query: 3   YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y   C+G  + +    +  L C +++   +  L + P K E+ +  PR+V+ HD I D E
Sbjct: 285 YEKLCRGEGVKMTSRRQKRLFCRYFDGNKDPLLILSPTKQEDEWDKPRIVRYHDIISDEE 344

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           I+++ EL+K ++ R  + N                    I G    L   Q RI     +
Sbjct: 345 ISKVKELAKPRLRRATISN-------------------PITG---VLETAQYRISKRWAI 382

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVELG 172
           +          L++ NYG+GG Y+ H D   +DE           R+A+++FY++DVE G
Sbjct: 383 M---------ELEVANYGMGGQYEPHFDFARKDEPDAFKELGTGNRVATWLFYMSDVEAG 433

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GAT+FP +   V+P+KG+AVFWYN   +   DY   H+ CPV +GNKW
Sbjct: 434 GATVFPEVGAAVYPKKGTAVFWYNLFESGEGDYSTRHAACPVLVGNKW 481


>gi|440899661|gb|ELR50930.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Bos grunniens mutus]
          Length = 478

 Score =  120 bits (301), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 78/222 (35%), Positives = 121/222 (54%), Gaps = 13/222 (5%)

Query: 7   CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           CQ   S P   +  +L C YE+ ++ +L + P++ E ++L+P VV  HD + D+E   I 
Sbjct: 239 CQTLGSQPTHYRIPSLYCSYETSSSPYLLLQPVRKEVIHLEPYVVLYHDFVSDAEAQTIR 298

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
            L++  ++R  V +    + V+ R+SK  +L   +    P L  +  RI  +T L +  +
Sbjct: 299 GLAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTV---DPVLVTLDHRIAALTGLDV--Q 353

Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
             Y   LQ+ NYG+GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F 
Sbjct: 354 PPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMNSGNRVATFMIYLSSVEAGGATAFI 413

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
             N +V   K +A+FW+N H +   D    H+ CPV +G+KW
Sbjct: 414 YGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAACPVLVGDKW 455


>gi|92109908|gb|ABE73278.1| IP10618p [Drosophila melanogaster]
          Length = 501

 Score =  120 bits (301), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 74/221 (33%), Positives = 111/221 (50%), Gaps = 20/221 (9%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           CQG           L C Y S    F++I PLK EE+  DP +   HD IYDSEI ++  
Sbjct: 275 CQGKFPP----GPQLVCRYNSTTTPFMRIAPLKEEEISRDPLIWLYHDVIYDSEIAQLTN 330

Query: 67  LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
           +++ ++  G   NY        R+++++ +             +  R+ D++ L +G   
Sbjct: 331 VTREEMILGTTTNYT----TPDRVNRLFHIKVTDDDGGKLDKTLVNRMADISGLDVGN-- 384

Query: 127 RYKGPLQINNYGLGGHYDLHCDATP-------RDEGLWRLASFMFYLTDVELGGATIFPS 179
                L   NYGLGG++  H D           +EG  RL +F+FY+TDV +GG TIFP 
Sbjct: 385 --TTTLARINYGLGGYFQEHSDYMDIKLYPELTEEGD-RLMTFLFYMTDVPVGGTTIFPG 441

Query: 180 LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
             L + P+KGSA+FWYN H N   +    H+ CP  +G++W
Sbjct: 442 AQLAIQPKKGSALFWYNLHNNGDPNLLTRHAVCPTIVGSRW 482


>gi|161076739|ref|NP_001097101.1| CG34345 [Drosophila melanogaster]
 gi|157400090|gb|ABV53635.1| CG34345 [Drosophila melanogaster]
          Length = 504

 Score =  120 bits (301), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 74/221 (33%), Positives = 111/221 (50%), Gaps = 20/221 (9%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           CQG           L C Y S    F++I PLK EE+  DP +   HD IYDSEI ++  
Sbjct: 278 CQGKFPP----GPQLVCRYNSTTTPFMRIAPLKEEEISRDPLIWLYHDVIYDSEIAQLTN 333

Query: 67  LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
           +++ ++  G   NY        R+++++ +             +  R+ D++ L +G   
Sbjct: 334 VTREEMILGTTTNYT----TPDRVNRLFHIKVTDDDGGKLDKTLVNRMADISGLDVGN-- 387

Query: 127 RYKGPLQINNYGLGGHYDLHCDATP-------RDEGLWRLASFMFYLTDVELGGATIFPS 179
                L   NYGLGG++  H D           +EG  RL +F+FY+TDV +GG TIFP 
Sbjct: 388 --TTTLARINYGLGGYFQEHSDYMDIKLYPELTEEGD-RLMTFLFYMTDVPVGGTTIFPG 444

Query: 180 LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
             L + P+KGSA+FWYN H N   +    H+ CP  +G++W
Sbjct: 445 AQLAIQPKKGSALFWYNLHNNGDPNLLTRHAVCPTIVGSRW 485


>gi|195159148|ref|XP_002020444.1| GL13996 [Drosophila persimilis]
 gi|194117213|gb|EDW39256.1| GL13996 [Drosophila persimilis]
          Length = 559

 Score =  120 bits (301), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 77/212 (36%), Positives = 111/212 (52%), Gaps = 16/212 (7%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           + L C Y +    FL++ PL++EEL LDP +V  H+ + D E+ R+  +S   + R ++ 
Sbjct: 322 ARLHCRYNATTTAFLRLAPLRMEELSLDPYIVLYHNVLSDEEMARLENMSTPLLHRARIF 381

Query: 79  NYGDT---IYVDTRLSKVYFLYPE-IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
           +       I       +V    P+ + GD   +  IQ RI D+T L++    R    +Q 
Sbjct: 382 DKETKKPKISPVRSADEVGIPNPKLVTGDIQLVECIQKRITDLTGLMLTSMRR----IQF 437

Query: 135 NNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEK 188
             YG GG Y  H D       T R  G  R+A+ +FYL DVE GGAT FP+L+L V  E+
Sbjct: 438 LKYGFGGIYVPHHDFFSVHTPTSRLHGD-RIATVIFYLNDVEHGGATAFPNLDLVVPTER 496

Query: 189 GSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
           G+ +FW+N    T  LDYR  H  CPV +G K
Sbjct: 497 GAVLFWHNMDGETYDLDYRTLHGACPVIVGTK 528


>gi|194765182|ref|XP_001964706.1| GF22908 [Drosophila ananassae]
 gi|190614978|gb|EDV30502.1| GF22908 [Drosophila ananassae]
          Length = 509

 Score =  120 bits (301), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 73/224 (32%), Positives = 116/224 (51%), Gaps = 16/224 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   CQG   +PE   + LKC+ +   + F  + PLKVE+++LDP +   H  +   +I+
Sbjct: 263 YSRLCQGK-RLPEKQDNILKCYLDGKRHAFFTLAPLKVEQVHLDPDITVYHGVLSSKQIS 321

Query: 63  RIIELS--KGKVERGKVVNYG-DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
            I   S  K ++  G     G D    D R+S+  +L         +       +  +  
Sbjct: 322 SIFTESNKKERIRSGVAGENGEDRTVKDIRVSQQTWL--------NYSTPTMQYVNRINE 373

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGAT 175
            + G   R    +Q+ NYG+GG Y+ H D      P D    R+++ MFYL++V+ GG T
Sbjct: 374 YICGLTMRGAEEMQVANYGVGGQYEPHPDYFEFDLPPDFDGDRISTSMFYLSNVQQGGYT 433

Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           +FP+LN+ + P KGS V W+N H +  +D R +H+GCPV +G+K
Sbjct: 434 VFPNLNVFLPPVKGSMVLWHNLHYSLDVDARTWHAGCPVIVGSK 477


>gi|119595340|gb|EAW74934.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide III, isoform CRA_a
           [Homo sapiens]
          Length = 657

 Score =  120 bits (301), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 73/202 (36%), Positives = 113/202 (55%), Gaps = 12/202 (5%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
           +L C YE+ +N +L + P++ E ++L+P +   HD + DSE  +I EL++  ++R  V +
Sbjct: 351 SLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVAS 410

Query: 80  YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
               + V+ R+SK  +L   +    P L  +  RI  +T L +     Y   LQ+ NYG+
Sbjct: 411 GEKQLQVEYRISKSAWLKDTV---DPKLVTLNHRIAALTGLDV--RPPYAEYLQVVNYGI 465

Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
           GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F   NL+V   + +A+
Sbjct: 466 GGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSVPVVRNAAL 525

Query: 193 FWYNAHANTLLDYRMYHSGCPV 214
           FW+N H +   D    H+GCPV
Sbjct: 526 FWWNLHRSGEGDSDTLHAGCPV 547


>gi|195494570|ref|XP_002094894.1| GE19958 [Drosophila yakuba]
 gi|194180995|gb|EDW94606.1| GE19958 [Drosophila yakuba]
          Length = 498

 Score =  120 bits (301), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 75/214 (35%), Positives = 113/214 (52%), Gaps = 33/214 (15%)

Query: 18  KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERG-K 76
           + NL C YE + +  L+I PLKVE L L P +V  HD IYDSEI+++  +S   ++   +
Sbjct: 288 RKNLSCHYEKHTSDLLRIAPLKVETLSLKPHIVLYHDVIYDSEISKVKNISLPSLKSPLR 347

Query: 77  VVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINN 136
           +++  D    + +L+K+            +   +  RI+DMT    G + +     QI+N
Sbjct: 348 ILHAEDH---NLKLAKI---------SEDYHSPLNLRIKDMT----GEDVKEDTDFQIDN 391

Query: 137 YGLGGHYDLHCDATPRDEGLW----RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
           YG+ G    H D     +       RL S MF++ DV  GGA +F  LNLT++P+KGSA+
Sbjct: 392 YGICGFRYYHTDNLESQDQTAELGDRLTSIMFFMNDVAQGGAFVFLHLNLTIWPQKGSAL 451

Query: 193 FWYNAHANTLLDYRM------YHSGCPVALGNKW 220
            W N      LD+RM       H+ CPV +G+KW
Sbjct: 452 VWRN------LDHRMQPNEDLLHASCPVIVGSKW 479


>gi|324510827|gb|ADY44523.1| Prolyl 4-hydroxylase subunit alpha-1 [Ascaris suum]
          Length = 551

 Score =  120 bits (300), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 75/230 (32%), Positives = 116/230 (50%), Gaps = 18/230 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           +I+   C+  + V     S L C+Y+  +  +L++ P+KVE + L+P  V  H  + D E
Sbjct: 284 DIFEALCRHEVPVSTKALSRLYCYYK-MDRPYLRLAPIKVEIMRLNPLAVLFHQIMSDEE 342

Query: 61  INRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
            + I  L+  K+ R  V N   G       R+SK  +L P    +H  + +   R+   T
Sbjct: 343 AHIIEMLAIPKLNRATVQNAMTGGLETASYRISKSAWLKPH---EHEVVDRFNKRLDMAT 399

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
           NL +   E     LQI NYG+GGHYD H D   ++E           R+A+ + Y+T+ E
Sbjct: 400 NLEMETAEE----LQIQNYGVGGHYDPHFDCARKEEKNAFKELGTGNRVATILVYMTEPE 455

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +GG T+F  +  +V   K +A+FWYN   +  +D R  H+ CPV  G KW
Sbjct: 456 IGGGTVFTEVKTSVACTKNAALFWYNLLRSGEVDMRSRHAACPVLTGVKW 505


>gi|335294484|ref|XP_003357239.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Sus scrofa]
          Length = 545

 Score =  120 bits (300), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 80/223 (35%), Positives = 123/223 (55%), Gaps = 14/223 (6%)

Query: 7   CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           CQ   S P   +  +L C YE+ ++ +L + P++ E ++L+P VV  HD + D+E  +I 
Sbjct: 305 CQTLGSQPTHYQIPSLYCSYETSSSPYLLLQPIRKEVIHLEPYVVLYHDFVTDAEAQKIR 364

Query: 66  ELSKGKVERGKVVNYGD-TIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGR 124
            L++  V    +V  G+  + V+ R+SK  +L   +    P L  +  RI  +T L +  
Sbjct: 365 GLAEPWVTAEILVASGEKQLPVEYRISKSAWLKDTV---DPMLVTLDHRIAALTGLDV-- 419

Query: 125 EERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIF 177
           +  Y   LQ+ NYG+GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F
Sbjct: 420 QPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAF 479

Query: 178 PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
              N +V   K +A+FW+N H +   D    H+GCPV +G+KW
Sbjct: 480 IYGNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 522


>gi|195577074|ref|XP_002078398.1| GD23422 [Drosophila simulans]
 gi|194190407|gb|EDX03983.1| GD23422 [Drosophila simulans]
          Length = 513

 Score =  120 bits (300), Expect = 5e-25,   Method: Compositional matrix adjust.
 Identities = 74/222 (33%), Positives = 111/222 (50%), Gaps = 20/222 (9%)

Query: 6   ACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
            CQG           L C Y S    F++I PLK EE+  DP +   H+ IYDSEI ++ 
Sbjct: 286 GCQGKFPP----GPQLVCRYNSTTTPFMRIAPLKEEEISRDPLIWLYHNVIYDSEIAQLT 341

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
            L++ ++  G   NY        R+ +++ +             +  R+ D++ L +G  
Sbjct: 342 NLTREEMILGTTTNYT----TPDRVDRLFHIKVTDDDGGKLDKTLVNRMADISGLDVGN- 396

Query: 126 ERYKGPLQINNYGLGGHYDLHCDATP-------RDEGLWRLASFMFYLTDVELGGATIFP 178
                 L   NYGLGG++  H D           +EG  RL +F+FY+TD+ +GGATIFP
Sbjct: 397 ---TTTLARINYGLGGYFQEHSDYMDIKLHPELTEEGD-RLMTFLFYMTDIPVGGATIFP 452

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
              L + P+KGSA+FWYN H N   +    H+ CP  +G++W
Sbjct: 453 GAQLAIQPKKGSALFWYNLHNNGDPNPLTRHAVCPTIVGSRW 494


>gi|38454288|ref|NP_942070.1| prolyl 4-hydroxylase subunit alpha-3 precursor [Rattus norvegicus]
 gi|81870816|sp|Q6W3E9.1|P4HA3_RAT RecName: Full=Prolyl 4-hydroxylase subunit alpha-3; Short=4-PH
           alpha-3; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-3; Flags: Precursor
 gi|36962768|gb|AAQ87605.1| collagen prolyl 4-hydroxylase alpha III subunit [Rattus norvegicus]
          Length = 544

 Score =  120 bits (300), Expect = 6e-25,   Method: Compositional matrix adjust.
 Identities = 79/222 (35%), Positives = 118/222 (53%), Gaps = 13/222 (5%)

Query: 7   CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           CQ   S P   +  +L C YE+ ++ +L + P + E ++L P V   HD + D E  +I 
Sbjct: 305 CQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPARKEVIHLRPLVALYHDFVSDEEAQKIR 364

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
           EL++  ++R  V +    + V+ R+SK  +L   +    P L  +  RI  +T L I  +
Sbjct: 365 ELAEPWLQRSVVASGEKQLQVEYRISKSAWLKDTV---DPVLVTLDRRIAALTGLDI--Q 419

Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDVELGGATIFP 178
             Y   LQ+ NYG+GGHY+ H D AT     L+      R A+ M YL+ VE GGAT F 
Sbjct: 420 PPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYKMKSGNRAATLMIYLSSVEAGGATAFI 479

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
             N +V   K +A+FW+N H +   D    H+GCPV +G+KW
Sbjct: 480 YGNFSVPVVKNAALFWWNLHRSGEGDDDTLHAGCPVLVGDKW 521


>gi|195505244|ref|XP_002099420.1| GE10895 [Drosophila yakuba]
 gi|194185521|gb|EDW99132.1| GE10895 [Drosophila yakuba]
          Length = 533

 Score =  119 bits (299), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 79/215 (36%), Positives = 111/215 (51%), Gaps = 14/215 (6%)

Query: 15  EDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVER 74
           E   S L C Y +    FL++ PL++EEL LDP VV  H+ + D EI ++  +S+  +ER
Sbjct: 288 ESKPSRLHCRYNATTTPFLRLAPLRMEELSLDPYVVLYHNVLSDPEIEKLQLMSEPFLER 347

Query: 75  GKVVNY---GDTIYVDTRLSKVYFLYPEIF-GDHPFLYKIQTRIQDMTNLVIGREERYKG 130
            KV       D I         +  + E    D   L +I  RI D+T    G   R   
Sbjct: 348 AKVFRVEKGSDEIGASRAADGAWLPHQETEPEDLEVLNRIGRRIGDIT----GLSTRSGR 403

Query: 131 PLQINNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVF 185
            +Q+  YG GGH+  H D     T   E +  R+A+ +FYL +VE GGAT+FPS+NL V 
Sbjct: 404 QMQLLKYGFGGHFTPHFDYFDSKTLYLEKVGDRIATVLFYLNNVEHGGATVFPSINLAVP 463

Query: 186 PEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
            +KGSA+FW+N    +   D R +H  CP+  G K
Sbjct: 464 TQKGSALFWHNLDGQSYDYDTRTFHGACPLISGTK 498


>gi|339236271|ref|XP_003379690.1| prolyl 4-hydroxylase subunit alpha-1 [Trichinella spiralis]
 gi|316977627|gb|EFV60702.1| prolyl 4-hydroxylase subunit alpha-1 [Trichinella spiralis]
          Length = 558

 Score =  119 bits (299), Expect = 7e-25,   Method: Compositional matrix adjust.
 Identities = 80/252 (31%), Positives = 121/252 (48%), Gaps = 41/252 (16%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           ++Y   C+    + +  ++ L C+Y+  N  +LK+ P+KVE ++  P++V     I D E
Sbjct: 291 DVYEGLCRSEYPISDKDRAKLYCYYKR-NRPYLKLAPIKVEVMHWKPKIVYFRGVISDEE 349

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDT---RLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
           I  I +L+   ++R  V N  DT  ++T   R+SK  +L      +H  + +I  RI  M
Sbjct: 350 IAVIKQLASPLLKRATVHN-ADTGQLETASYRISKSAWLKD---TEHEVVKRISDRIDMM 405

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPR------DEGLW-------------- 157
           T+L +   E     LQI NYG+GGHYD H D + R      +EG                
Sbjct: 406 TDLTMETAEL----LQIANYGIGGHYDPHFDMSTRGESDPYEEGTGNRIATVLFYTNDPY 461

Query: 158 ---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMY 208
                    R+A+ +FY++  E GG T+F S  +TV P K  A FW+N       D    
Sbjct: 462 SFESLNAGNRIATVLFYISQPEAGGGTVFTSHKITVEPSKYDAAFWFNVLQGGEPDMSTR 521

Query: 209 HSGCPVALGNKW 220
           H+ CPV  G KW
Sbjct: 522 HAACPVLAGTKW 533


>gi|241598362|ref|XP_002404733.1| prolyl 4-hydroxylase alpha subunit 1, putative [Ixodes scapularis]
 gi|215500464|gb|EEC09958.1| prolyl 4-hydroxylase alpha subunit 1, putative [Ixodes scapularis]
          Length = 340

 Score =  119 bits (299), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 63/210 (30%), Positives = 109/210 (51%), Gaps = 9/210 (4%)

Query: 17  IKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGK 76
           + S L+C Y    + F  + P+K+EE+ L P V+ +HD + D +I  ++  ++ ++ER  
Sbjct: 1   MDSQLRCRYYKGQDGFFSLQPIKLEEINLKPYVIVMHDVVQDKDIEDLMAFAEPRLERST 60

Query: 77  VVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINN 136
                + +    R S   +L  +   + P   ++ + ++ +  +     +      Q+ N
Sbjct: 61  TYTGNEMMPSPERTSSTAWLNED---EAPIAVRMNSYLRALLGMGTSDTDEEAEAYQLAN 117

Query: 137 YGLGGHY----DLHCDATPRDEGLW--RLASFMFYLTDVELGGATIFPSLNLTVFPEKGS 190
           YG GGH+    D   D+   D  +   RLA+ M Y+TDVE GG T+FP+L + + P+KG 
Sbjct: 118 YGTGGHFLPHHDFLQDSLQADNSVTGDRLATLMIYMTDVEEGGTTVFPNLGIRLTPKKGD 177

Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           A FW+N  A+   +    H+GCPV  G+KW
Sbjct: 178 AAFWWNLKASGDGERLTTHAGCPVLYGSKW 207


>gi|194765180|ref|XP_001964705.1| GF23331 [Drosophila ananassae]
 gi|190614977|gb|EDV30501.1| GF23331 [Drosophila ananassae]
          Length = 535

 Score =  119 bits (299), Expect = 8e-25,   Method: Compositional matrix adjust.
 Identities = 72/227 (31%), Positives = 115/227 (50%), Gaps = 14/227 (6%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           ++Y   C+G+L+        L+C    +  + L   P K+EEL  +P V ++H  +    
Sbjct: 283 KMYEQVCRGDLNPSPAKLRELRC---RFRRSRLGYAPFKLEELSHEPLVFQVHQVVSSKS 339

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
              I ++++ K++R  V + G          +        +  +     +   + D+++L
Sbjct: 340 AEFIKKMARPKIKRSTVYSIGGGGGSQAAAFRTSQGASFNYSRNAATKILSRHVGDLSSL 399

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPR----DEGL---WRLASFMFYLTDVELGG 173
            +     +   LQ+ NYG+GGHY+ H D+ P     DEG     R+A+ ++YL+DVE GG
Sbjct: 400 DMN----FAEELQVANYGIGGHYEPHWDSFPENHIYDEGDDRGNRIATGIYYLSDVEAGG 455

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            T FP L L V PEKGS +FWYN H +   DYR  H+ CPV  G+KW
Sbjct: 456 GTAFPFLPLLVTPEKGSLLFWYNLHESGDQDYRTKHAACPVLQGSKW 502


>gi|37496185|emb|CAE47803.1| Prolyl 4-hydroxylase alpha subunit [Sus scrofa]
          Length = 263

 Score =  119 bits (298), Expect = 9e-25,   Method: Compositional matrix adjust.
 Identities = 71/209 (33%), Positives = 115/209 (55%), Gaps = 19/209 (9%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 58  YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 117

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I+ + +L+K ++ R  V +   G       R+SK  +L      ++P + ++  RIQD+T
Sbjct: 118 IDIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRLNMRIQDLT 174

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 175 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 230

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHA 199
            GGAT+FP +  +V+P+KG+AVFWYN  A
Sbjct: 231 AGGATVFPEVGASVWPKKGTAVFWYNLFA 259


>gi|195159321|ref|XP_002020530.1| GL13464 [Drosophila persimilis]
 gi|194117299|gb|EDW39342.1| GL13464 [Drosophila persimilis]
          Length = 533

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 74/223 (33%), Positives = 119/223 (53%), Gaps = 14/223 (6%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   CQG     +   ++L+CF +   + +  + PL+VE+++LDP +   H  +   +I+
Sbjct: 287 YSRLCQGRRLPEKGSGTSLRCFLDGKRHAYFTLAPLQVEQVHLDPDIDVYHGILTLDQID 346

Query: 63  RIIELS-KGKVERGKVVNYGDT-IYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            I E + K ++ R  V   G T   VD R+S+  +L  E     P +  I   +  ++  
Sbjct: 347 SIFEAADKQEMTRSGVAGDGGTRTVVDLRVSQQTWLDYE----SPIMKSIARLVVFISGF 402

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGATI 176
            I   E     +Q+ NYG+GG Y+ H D      P D    R+++ MFYL+DVE GG T+
Sbjct: 403 DIAGAE----AMQVANYGVGGQYEPHPDYFEVNLPSDFKGDRISTSMFYLSDVEQGGYTV 458

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           F  LN+ + P KG+ V W+N H +  +D R +H+GCPV +G+K
Sbjct: 459 FTKLNVFLPPIKGALVMWHNLHRSLDVDPRTHHAGCPVIVGSK 501


>gi|281362877|ref|NP_733393.3| CG31016, isoform B [Drosophila melanogaster]
 gi|442621939|ref|NP_001263119.1| CG31016, isoform C [Drosophila melanogaster]
 gi|272477249|gb|AAF57071.5| CG31016, isoform B [Drosophila melanogaster]
 gi|440218076|gb|AGB96498.1| CG31016, isoform C [Drosophila melanogaster]
          Length = 536

 Score =  119 bits (297), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 78/215 (36%), Positives = 109/215 (50%), Gaps = 14/215 (6%)

Query: 15  EDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVER 74
           E   S L C Y +    FLK+ P ++EEL LDP V+  H+ + D+EI ++  + K  +ER
Sbjct: 291 ESKPSRLHCRYNTITTPFLKLAPFRMEELSLDPYVIFYHNVLSDAEIEKLKPMGKPFLER 350

Query: 75  GKVVNY---GDTIYVDTRLSKVYFLYPEIFGDH-PFLYKIQTRIQDMTNLVIGREERYKG 130
            KV       D I         +  +  I  D    L +I  RI+DMT    G   R   
Sbjct: 351 AKVFRVEKGSDEIDPSRSADGAWLPHQNIDPDDLEVLNRIGRRIEDMT----GLNTRSGS 406

Query: 131 PLQINNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVF 185
            +Q   YG GGH+  H D     T   E +  R+A+ +FYL +V+ GGAT+FP LNL V 
Sbjct: 407 KMQFLKYGFGGHFVPHYDYFNSKTFSLETVGDRIATVLFYLNNVDHGGATVFPKLNLAVP 466

Query: 186 PEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
            +KGSA+FW+N    +   D R +H  CP+  G K
Sbjct: 467 TQKGSALFWHNIDRKSYDYDTRTFHGACPLISGTK 501


>gi|159884097|gb|ABX00727.1| IP12176p [Drosophila melanogaster]
          Length = 538

 Score =  118 bits (296), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 78/215 (36%), Positives = 109/215 (50%), Gaps = 14/215 (6%)

Query: 15  EDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVER 74
           E   S L C Y +    FLK+ P ++EEL LDP V+  H+ + D+EI ++  + K  +ER
Sbjct: 293 ESKPSRLHCRYNTITTPFLKLAPFRMEELSLDPYVIFYHNVLSDAEIEKLKPMGKPFLER 352

Query: 75  GKVVNY---GDTIYVDTRLSKVYFLYPEIFGDH-PFLYKIQTRIQDMTNLVIGREERYKG 130
            KV       D I         +  +  I  D    L +I  RI+DMT    G   R   
Sbjct: 353 AKVFRVEKGSDEIDPSRSADGAWLPHQNIDPDDLEVLNRIGRRIEDMT----GLNTRSGS 408

Query: 131 PLQINNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVF 185
            +Q   YG GGH+  H D     T   E +  R+A+ +FYL +V+ GGAT+FP LNL V 
Sbjct: 409 KMQFLKYGFGGHFVPHYDYFNSKTFSLETVGDRIATVLFYLNNVDHGGATVFPKLNLAVP 468

Query: 186 PEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
            +KGSA+FW+N    +   D R +H  CP+  G K
Sbjct: 469 TQKGSALFWHNIDRKSYDYDTRTFHGACPLISGTK 503


>gi|443707037|gb|ELU02831.1| hypothetical protein CAPTEDRAFT_181697 [Capitella teleta]
          Length = 538

 Score =  118 bits (296), Expect = 1e-24,   Method: Compositional matrix adjust.
 Identities = 73/208 (35%), Positives = 111/208 (53%), Gaps = 18/208 (8%)

Query: 23  CFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY-- 80
           C Y   +  F+ + P K E ++LDP +   H+ + D E + I  +SK K+ R  V  Y  
Sbjct: 316 CNYVRPHPMFILV-PAKEEVMFLDPFIAIYHNLMTDKEADMIKRISKPKLHRSGVFTYSG 374

Query: 81  GDTIYV-DTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
           G+   V D R SK  ++  E   +HP + ++  R   +T+L +   E +    Q+ NYG+
Sbjct: 375 GNQKPVQDYRTSKSAWIEDE---EHPMIRRVSERTSALTDLSLDTVELF----QVVNYGI 427

Query: 140 GGHYDLHCD-ATPRDEGLW------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
           GGHY+ H D A P +   +      R+ + +FY+   E GGAT+FP L + ++PEKGS  
Sbjct: 428 GGHYEPHFDFARPNEIATFDPEVGNRIITVIFYVAAPEAGGATVFPDLGVKLWPEKGSCA 487

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
            W+N   N   DYR  H+GCP   G+KW
Sbjct: 488 VWWNLMRNGEGDYRTKHAGCPTITGSKW 515


>gi|198449650|ref|XP_001357661.2| GA13747 [Drosophila pseudoobscura pseudoobscura]
 gi|198130701|gb|EAL26795.2| GA13747 [Drosophila pseudoobscura pseudoobscura]
          Length = 533

 Score =  118 bits (296), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 74/223 (33%), Positives = 119/223 (53%), Gaps = 14/223 (6%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   CQG     +   ++L+CF +   + +  + PL+VE+++LDP +   H  +   +I+
Sbjct: 287 YSRLCQGRRLPEKGSGTSLRCFLDGKRHAYFTLAPLQVEQVHLDPDIDVYHGILTLDQID 346

Query: 63  RIIELS-KGKVERGKVVNYGDT-IYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            I E + K ++ R  V   G T   VD R+S+  +L  E     P +  I   +  ++  
Sbjct: 347 SIFEAADKQEMTRSGVAGDGGTRTVVDLRVSQQTWLDYE----SPIMKSIARLVVFISGF 402

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGATI 176
            I   E     +Q+ NYG+GG Y+ H D      P D    R+++ MFYL+DVE GG T+
Sbjct: 403 DIAGAE----AMQVANYGVGGQYEPHPDYFEVNLPSDFKGDRISTSMFYLSDVEQGGYTV 458

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           F  LN+ + P KG+ V W+N H +  +D R +H+GCPV +G+K
Sbjct: 459 FTKLNVFLPPIKGALVMWHNLHRSLDVDPRTHHAGCPVIVGSK 501


>gi|195128345|ref|XP_002008624.1| GI13596 [Drosophila mojavensis]
 gi|193920233|gb|EDW19100.1| GI13596 [Drosophila mojavensis]
          Length = 527

 Score =  118 bits (296), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 79/231 (34%), Positives = 121/231 (52%), Gaps = 27/231 (11%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y L C+G    P+  ++NL C Y +    FL++ P K+EE+ LDP +V  H+ I DSE
Sbjct: 287 EPYYLGCRG--GYPK--RTNLHCRYNTTTTPFLRLAPFKMEEVSLDPYIVLYHNVISDSE 342

Query: 61  INRIIELSKG---KVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
           I  I + +      + R  ++N  D   +  R+  V  + P       F  +I  RI D+
Sbjct: 343 IEDIKQHATNFTNGLSRNPLLNVTDKPQIVARMQWVEKMTP-------FTDRINLRITDI 395

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE-------GLW-RLASFMFYLTDV 169
           T   +   +     +QI NYG+GGH+  H D T           G+  R A+ +FY +D+
Sbjct: 396 TGFGVDECK----TVQIANYGIGGHFIPHFDYTTEGRVSINDTFGIGDRTATIVFYASDM 451

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + GGAT+FP++ +TV P+KGSA+ WYN   +   +    HS CPV  G++W
Sbjct: 452 Q-GGATVFPNIQVTVQPQKGSALHWYNLFDDDSPNPLTLHSVCPVISGSRW 501


>gi|195452728|ref|XP_002073474.1| GK14137 [Drosophila willistoni]
 gi|194169559|gb|EDW84460.1| GK14137 [Drosophila willistoni]
          Length = 536

 Score =  118 bits (295), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 72/224 (32%), Positives = 124/224 (55%), Gaps = 16/224 (7%)

Query: 3   YPLACQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           Y   CQG+  +PE     +L C+ ++  +    + PLKVE++++DP +   H  + D++I
Sbjct: 290 YTRLCQGH-RLPEPFTGKSLHCYLDAKRHVSFILAPLKVEQVHVDPDINVYHGVLNDAQI 348

Query: 62  NRIIELS-KGKVERGKVV-NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
            +I++ S + ++ R  V  + G     D R+S+  +L        P +  +   I D++ 
Sbjct: 349 EKILQESDQNEMMRSAVSGDKGSATIADLRVSQQTWLNY----SSPIMRSLSNLISDISG 404

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGAT 175
             +   E+    +Q+ NYG+GG Y+ H D      P++    R+++ MFYL+DVELGG T
Sbjct: 405 FDMAGAEQ----MQVANYGVGGQYEPHPDYFEVNLPQEFKGDRISTSMFYLSDVELGGNT 460

Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           +F  LN+ + P KG+ V W+N H +  +D R  H+GCPV +G+K
Sbjct: 461 VFIKLNVFLPPIKGAMVMWHNLHYSLDVDRRTIHAGCPVLIGSK 504


>gi|194871359|ref|XP_001972833.1| GG13662 [Drosophila erecta]
 gi|190654616|gb|EDV51859.1| GG13662 [Drosophila erecta]
          Length = 515

 Score =  118 bits (295), Expect = 2e-24,   Method: Compositional matrix adjust.
 Identities = 80/226 (35%), Positives = 121/226 (53%), Gaps = 29/226 (12%)

Query: 5   LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
           + CQG    P+  K+NL C Y    N FLK+ PLK+EE+  DP +V  H+ I D EI  +
Sbjct: 285 IGCQGLF--PK--KTNLVCRYNFSTNAFLKLAPLKMEEISRDPYIVMFHEVISDKEIEEM 340

Query: 65  IELSKGKVERGKVVNYGDTIYVDTR--LSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
               KG++   K +  G T   + +  +S +Y++  E      F  +I  RI DMT   +
Sbjct: 341 ----KGEI---KQMENGWTSLEEPKEIVSHIYWITKE----SSFSKRINDRISDMTGFKV 389

Query: 123 GREERYKGPLQINNYGLGGHYDLHCDA-TPRDEGLW-------RLASFMFYLTDVELGGA 174
              E +   +Q+ N+G+GG++  H D  T R + L        RLAS + Y  +V  GG 
Sbjct: 390 ---EEFPA-IQLANFGVGGYFKPHYDYYTERLKELDANNTLGDRLASIIIYAGEVSQGGQ 445

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T+FP + + V P+KG A+FW+N   ++  D R  HS CPV +G++W
Sbjct: 446 TVFPDIKVAVEPKKGKALFWFNDFDDSSPDPRSLHSVCPVIVGSRW 491


>gi|198449506|ref|XP_002136910.1| GA26925 [Drosophila pseudoobscura pseudoobscura]
 gi|198130637|gb|EDY67468.1| GA26925 [Drosophila pseudoobscura pseudoobscura]
          Length = 543

 Score =  117 bits (294), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 76/212 (35%), Positives = 111/212 (52%), Gaps = 16/212 (7%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           + L C Y +    FL++ PL++EEL LDP +V  H+ + D E+ R+  +S   + R ++ 
Sbjct: 306 ARLHCRYNATTTAFLRLAPLRMEELSLDPYIVLYHNVLSDEEMARLENMSTPLLHRARIF 365

Query: 79  NYGDT---IYVDTRLSKVYFLYPEIFGDHPFLYK-IQTRIQDMTNLVIGREERYKGPLQI 134
           +       I       +V    P++  +   L + IQ RI D+T L++    R    +Q 
Sbjct: 366 DKETKKPKISPVRSADEVGIPNPKLVTEDIQLVECIQKRITDLTGLMLTSMRR----IQF 421

Query: 135 NNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEK 188
             YG GG Y  H D       T R  G  R+A+ +FYL DVE GGAT FP+L+L V  E+
Sbjct: 422 LKYGFGGIYVPHHDFFSVHTPTSRLHGD-RIATVIFYLNDVEHGGATAFPNLDLVVPTER 480

Query: 189 GSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
           G+ +FW+N    T  LDYR  H  CPV +G K
Sbjct: 481 GAVLFWHNMDGETYDLDYRTLHGACPVIVGTK 512


>gi|198466397|ref|XP_002135180.1| GA23908 [Drosophila pseudoobscura pseudoobscura]
 gi|198150581|gb|EDY73807.1| GA23908 [Drosophila pseudoobscura pseudoobscura]
          Length = 403

 Score =  117 bits (293), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 74/207 (35%), Positives = 102/207 (49%), Gaps = 28/207 (13%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
           N  C YES    FL++ PLKVE L LDP +   HD IY+ EI R++ L+   ++      
Sbjct: 212 NRSCHYESTRTAFLRLAPLKVEMLSLDPYIAIYHDVIYEREIARVMTLALSSLK------ 265

Query: 80  YGDTIYVDTRLS--KVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
            G   Y   R    K   +Y E         ++  R +DMT    G + +     +I N 
Sbjct: 266 -GPGRYSKRREHNIKSVTVYEEENS------QLNQRTRDMT----GEQVKEDKDFRIYNS 314

Query: 138 GLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNA 197
           G+GG+   H D   ++E           L +V  GGA  FP L  TV+P KGSA+ W+N 
Sbjct: 315 GIGGYIRYHMDNLAKEEQ---------QLNEVPHGGAISFPQLEFTVWPRKGSALVWHNL 365

Query: 198 HANTLLDYRMYHSGCPVALGNKWGKLL 224
           + N  LDYR+ H  CPV +G+KW K L
Sbjct: 366 NNNLELDYRVAHISCPVIVGSKWSKFL 392


>gi|291224083|ref|XP_002732036.1| PREDICTED: prolyl 4-hydroxylase, alpha I subunit-like [Saccoglossus
           kowalevskii]
          Length = 491

 Score =  117 bits (293), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 78/225 (34%), Positives = 116/225 (51%), Gaps = 17/225 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           + Y   C+G    P D  S +KC Y +  N  L + P K E ++ +PRVV  HD I D E
Sbjct: 257 DAYEALCRGERRKPLD-SSKVKCQYVTNGNYRLLLQPAKQEIMHHNPRVVLYHDVISDEE 315

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIF---GDHPFLYKIQTRIQDM 117
           IN +I+L+K K+ R  VV  G +          Y +    +    D   + K+  RI D+
Sbjct: 316 INEVIKLAKPKLRRSLVVTKGSSPSGTGSSDAEYRVSSGGWLEDWDGTVIAKLTRRISDI 375

Query: 118 TNL--VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGAT 175
           + L  +   E R+   LQI N       D+H   +       R+A++MFY+++V+ GG T
Sbjct: 376 SGLSTLTAPEYRHAEALQIENS------DVHLPGSRN-----RIATWMFYMSEVKAGGYT 424

Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +FP ++  V P K +AVFWYN  A+   D    H+GCPV +G+KW
Sbjct: 425 VFPEVDAFVPPVKNAAVFWYNLKASGESDDLTRHAGCPVLIGSKW 469


>gi|449668268|ref|XP_002154169.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
           magnipapillata]
          Length = 531

 Score =  117 bits (293), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 74/232 (31%), Positives = 121/232 (52%), Gaps = 16/232 (6%)

Query: 3   YPLAC---QGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           Y  AC   Q   ++     +NL CFY++ N   L + PLKV  ++ +P V+  H+ I + 
Sbjct: 296 YARACRRDQRTKTIAVKDVNNLVCFYKN-NKPRLILKPLKVTRMHDNPDVLVFHEMITEE 354

Query: 60  EINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
              +I +++  ++   +V++      +    R+SK  F       +     K++  ++D 
Sbjct: 355 VAEKIRDVANPRLRPSEVIDPIIQKHVTASYRVSKNVFFDDAFEEELEISRKLRPLVEDA 414

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCD----ATPRD--EGLWRLASFMFYLTDVEL 171
           T+L     + +   LQ+NNYGLGG Y+ H D     +P D  E   R+A+ + YL+DVE 
Sbjct: 415 TDL----NDDFSEQLQVNNYGLGGQYEFHVDFGDPGSPLDKHEHGNRIATLLIYLSDVER 470

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKL 223
           GG T+F  L L++ P+ G A FW+N + N    Y   H+ CPV  G+KWGK+
Sbjct: 471 GGDTVFTRLGLSLKPKLGDAAFWHNLYKNGSGIYATEHASCPVVSGSKWGKI 522


>gi|198449504|ref|XP_002136909.1| GA26876 [Drosophila pseudoobscura pseudoobscura]
 gi|198130636|gb|EDY67467.1| GA26876 [Drosophila pseudoobscura pseudoobscura]
          Length = 527

 Score =  117 bits (292), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 71/211 (33%), Positives = 112/211 (53%), Gaps = 14/211 (6%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           S L C Y +    FL++ PL++EEL LDP +V  H+ + D+EI  +  +++  ++R  V 
Sbjct: 295 SRLHCRYNTTTTPFLRLAPLRMEELSLDPYIVVYHNVLSDAEIAEVERVTEPLLKRSVVF 354

Query: 79  N-YGDTIYVDTRLSKVYFLYPEIFGD---HPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
           +  G+ +    R + +    P+   D      + +I  RI ++T L+I   +     +Q+
Sbjct: 355 DGKGNKMSTSKRRTALGAWLPDDNMDVSGRAVIQRIFRRIHELTGLIINDRQ----DMQL 410

Query: 135 NNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
             YG GGHYD+H D    +TP  +    R+A+ +FYL D++ GG+T F  L L V  E+G
Sbjct: 411 IKYGYGGHYDIHFDYFNTSTPITKARGDRMATVLFYLNDMKHGGSTAFTDLQLKVPSERG 470

Query: 190 SAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
             +FWYN    T  +D R  H  CPV  G K
Sbjct: 471 KVLFWYNMRGETHDVDSRTLHGACPVINGTK 501


>gi|17552840|ref|NP_499464.1| Protein DPY-18 [Caenorhabditis elegans]
 gi|20455505|sp|Q10576.2|P4HA1_CAEEL RecName: Full=Prolyl 4-hydroxylase subunit alpha-1; Short=4-PH
           alpha-1; AltName:
           Full=Procollagen-proline,2-oxoglutarate-4-dioxygenase
           subunit alpha-1; AltName: Full=Protein dumpy-18; Flags:
           Precursor
 gi|3881011|emb|CAA21045.1| Protein DPY-18 [Caenorhabditis elegans]
 gi|6900013|emb|CAB71298.1| prolyl 4-hydroxylase alpha subunit 1 [Caenorhabditis elegans]
          Length = 559

 Score =  117 bits (292), Expect = 4e-24,   Method: Compositional matrix adjust.
 Identities = 78/229 (34%), Positives = 112/229 (48%), Gaps = 18/229 (7%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           +Y   C+  + V +   S L C+Y+  +  FL   P+KVE    +P  V   D I D E+
Sbjct: 284 MYEALCRNEVPVSQKDISRLYCYYKR-DRPFLVYAPIKVEIKRFNPLAVLFKDVISDDEV 342

Query: 62  NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
             I EL+K K+ R  V +   G  +    R+SK  +L  E  GD   +  +  RI  MTN
Sbjct: 343 AAIQELAKPKLARATVHDSVTGKLVTATYRISKSAWL-KEWEGD--VVETVNKRIGYMTN 399

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVEL 171
           L +   E     LQI NYG+GGHYD H D   ++E           R+A+ +FY++    
Sbjct: 400 LEMETAEE----LQIANYGIGGHYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSH 455

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GG T+F     T+ P K  A+FWYN +     +    H+ CPV +G KW
Sbjct: 456 GGGTVFTEAKSTILPTKNDALFWYNLYKQGDGNPDTRHAACPVLVGIKW 504


>gi|196011912|ref|XP_002115819.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
 gi|190581595|gb|EDV21671.1| hypothetical protein TRIADDRAFT_59908 [Trichoplax adhaerens]
          Length = 300

 Score =  117 bits (292), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 73/225 (32%), Positives = 118/225 (52%), Gaps = 19/225 (8%)

Query: 7   CQGNLSVPEDIKSN-LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           C GN ++P     + LKC+Y  Y ++  +  P  +EE+  DP ++  H+   ++E+  + 
Sbjct: 64  CIGNENLPAKSSGHHLKCYY-FYPSSKTRFMPYAIEEMSRDPLIILYHNLTSNAEMESLK 122

Query: 66  ELSKGKVERGKV--VNYGDTIYVD--TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
            L+  +++   V      D   ++  TR++K+ F+  E   +      I  R+QD+T L 
Sbjct: 123 ALAAKQLQPAGVYHTTSADNRNLEGYTRIAKMAFILDE---ESAVASAITQRLQDVTGLN 179

Query: 122 IGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGAT 175
           +   E    PLQ+ NYG+ G Y  H D  P   G        RLA+ + YL+DVE GGAT
Sbjct: 180 MNFSE----PLQVINYGIAGQYTPHYDTFPAKSGDRSHPSHDRLATAILYLSDVERGGAT 235

Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +F ++N+ V P KG+ + WYN   +  L     H+GCPV +G+KW
Sbjct: 236 VFTNINVRVLPRKGNVIIWYNYLPDGNLHPGTLHAGCPVLVGSKW 280


>gi|341884171|gb|EGT40106.1| CBN-PHY-2 protein [Caenorhabditis brenneri]
          Length = 607

 Score =  117 bits (292), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 90/292 (30%), Positives = 130/292 (44%), Gaps = 73/292 (25%)

Query: 1   EIYPLACQGNLS-VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           + Y   C+G +  V E  ++ L+C+ +  +  FLKI P+KVE L  DP  V   + I DS
Sbjct: 279 DAYEALCRGEIPPVEEKWRNKLRCYLKR-DKPFLKIAPIKVEILRFDPLAVLFKNVISDS 337

Query: 60  EINRIIELSKGKVERGKVVNY-GDTIYVDTRLSK-------VYFLYPE------------ 99
           EI  I EL+  K+ER  V    G  I VD R++K       ++ + P+            
Sbjct: 338 EIEVIKELASPKLERATVKGPDGTLITVDYRIAKRLVNWNTLHIVSPKGGFPKSKKMKNK 397

Query: 100 --------IFGD-HPFLYKIQTRIQDMTNLVIGREERYKGP--------------LQINN 136
                   + GD  P + ++  RI+D T L     E  +                 +I N
Sbjct: 398 CLVGFSAWLKGDLDPVIDRVNRRIEDFTGLNQATSEELQVANYGLGGHYDPHFDFARIAN 457

Query: 137 YGLGGHYDLHCDATPR---------------------DEGLW-------RLASFMFYLTD 168
           YGLGGHY+ H D + R                     ++  +       R+A+ +FY++ 
Sbjct: 458 YGLGGHYEPHYDMSLRGVPEPYGKNGNRIATVLFYKEEKNAFKTLNTGNRIATVLFYMSQ 517

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            ELGGAT+F  L   VFP K  A+FWYN   +   D R  H+ CPV LG KW
Sbjct: 518 PELGGATVFNHLGTAVFPSKNDALFWYNLRRDGEGDLRTRHAACPVLLGVKW 569


>gi|427783867|gb|JAA57385.1| Putative prolyl 4-hydroxylase subunit alpha-1 [Rhipicephalus
           pulchellus]
          Length = 548

 Score =  117 bits (292), Expect = 5e-24,   Method: Compositional matrix adjust.
 Identities = 79/236 (33%), Positives = 118/236 (50%), Gaps = 26/236 (11%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G       + S L+C Y    N FL++ P+K+EE  L P ++  HD I D +IN
Sbjct: 292 YKRLCRGEQLRTPKMDSQLRCRYYYGRNGFLRLQPVKIEEANLKPYIITFHDIIGDRDIN 351

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
            ++  +  ++ R    +YG+     T  S +        GD      + TR+      ++
Sbjct: 352 DLLAYATPRLFRS--THYGEH---GTETSLIRTSSTAWLGDQD--APVATRLNRFVESLL 404

Query: 123 GREERY-KGPL---QINNYGLGGHYDLHCD------ATP--------RDEGLWRLASFMF 164
           G   +Y KG     Q+ NYG+GG Y  H D      A P        R  G  R+A+ MF
Sbjct: 405 GLGSQYLKGEAEYYQLANYGVGGQYIAHHDFLADIYADPNRKLDDFERSAGD-RIATLMF 463

Query: 165 YLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           YL+DVE GGAT+FP L + + P+KG+A FW+N +++   +    H GCPV  G+KW
Sbjct: 464 YLSDVEEGGATVFPHLGVRLTPKKGNAAFWWNLNSDGEGEQLTKHGGCPVLYGSKW 519


>gi|326914688|ref|XP_003203656.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Meleagris
           gallopavo]
          Length = 539

 Score =  116 bits (291), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 79/228 (34%), Positives = 115/228 (50%), Gaps = 13/228 (5%)

Query: 1   EIYPLACQG-NLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           + Y   CQG    +  +  S L C YE+  + +L + P K E L L P +V  HD + D+
Sbjct: 294 DAYEELCQGLGAQMAPEQPSQLGCSYETNGSPYLLLQPAKKETLRLQPYIVLYHDFVSDA 353

Query: 60  EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
           E   I  L+   ++R  V +      V+ R+SK  +L        P +  ++ R+  +T 
Sbjct: 354 EAETIKGLAGPWLQRSVVASGEKQQKVEYRISKSAWLKDTA---DPVVRALELRMAAITG 410

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELG 172
           L +     Y   LQ+ NYGLGGHY+ H D AT R   L+R+      A+ M YL+ VE G
Sbjct: 411 LDL--RPPYAEYLQVVNYGLGGHYEPHFDHATSRKSPLYRMKSGNRIATVMIYLSAVEAG 468

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           G+T F   N +V   K +A+FW+N   N   D    H+GCPV  G+KW
Sbjct: 469 GSTAFIYANFSVPVVKNAALFWWNLRRNGDGDGDTLHAGCPVLAGDKW 516


>gi|301626782|ref|XP_002942567.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Xenopus
           (Silurana) tropicalis]
          Length = 716

 Score =  116 bits (291), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 75/221 (33%), Positives = 112/221 (50%), Gaps = 20/221 (9%)

Query: 1   EIYPLACQGNLSVPEDIKS-NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           ++Y   CQ   S P   +  ++ C Y++ ++ +L + P+K E + L P+VV  HD + D 
Sbjct: 492 DLYEGLCQTLGSQPTSYEDPHMSCMYDTNSHPYLLLQPMKKEIVSLRPQVVLYHDFVSDL 551

Query: 60  EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
           E  +I EL+   + R  V +       + R+SK  +L   I   HPF+  + TRI  +T 
Sbjct: 552 EAEKIKELASPWLHRSVVASGEKQAEAEYRISKSAWLKDTI---HPFVQNLDTRISGVTG 608

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPS 179
           L       Y   LQ+ NYG+GGHY+ H D                 L+ V+LGG+T F  
Sbjct: 609 L--NAHPPYAEYLQVVNYGIGGHYEPHFDHAT--------------LSHVDLGGSTAFVF 652

Query: 180 LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            N +    K +AVFW+N H N L D    H+GCPV +G+KW
Sbjct: 653 ANFSSPVVKNAAVFWWNLHRNGLGDEDTLHAGCPVIIGSKW 693


>gi|363729586|ref|XP_417248.3| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Gallus gallus]
          Length = 542

 Score =  116 bits (291), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 79/228 (34%), Positives = 116/228 (50%), Gaps = 13/228 (5%)

Query: 1   EIYPLACQG-NLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           + Y   CQG    +  +  S+L C YE+  + +L + P K E L L P +V  HD + D+
Sbjct: 297 DAYEELCQGLGAQMAPERPSHLGCSYETNGSPYLLLQPAKKETLRLQPYIVLYHDFVSDA 356

Query: 60  EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
           E   I  L+   ++R  V +      V+ R+SK  +L        P +  ++ R+  +T 
Sbjct: 357 EAETIKGLAGPWLQRSVVASGEKQQKVEYRISKSAWLKDTA---DPVVQALELRMAAITG 413

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELG 172
           L +     Y   LQ+ NYGLGGHY+ H D AT R   L+R+      A+ M YL+ VE G
Sbjct: 414 LDL--RPPYAEYLQVVNYGLGGHYEPHFDHATSRKSPLYRMKSGNRIATVMIYLSAVEAG 471

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           G+T F   N +V   K +A+FW+N   N   D    H+GCPV  G+KW
Sbjct: 472 GSTAFIYANFSVPVVKNAALFWWNLRRNGDGDGDTLHAGCPVLAGDKW 519


>gi|194751829|ref|XP_001958226.1| GF23628 [Drosophila ananassae]
 gi|190625508|gb|EDV41032.1| GF23628 [Drosophila ananassae]
          Length = 484

 Score =  116 bits (291), Expect = 6e-24,   Method: Compositional matrix adjust.
 Identities = 76/212 (35%), Positives = 118/212 (55%), Gaps = 24/212 (11%)

Query: 18  KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
           ++NL C Y +    FLK+ PLK+EE+ LDP +V  H+ I D EI    E  KG ++    
Sbjct: 265 QNNLVCRYNATTTPFLKLAPLKLEEVSLDPYIVLYHNVISDREI----EEMKGLIDE--- 317

Query: 78  VNYGDTIYVDTR--LSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQIN 135
           ++ G T   ++R  +S++ +L  E      F  ++  RI+D+T   +   +  +G LQI 
Sbjct: 318 MDNGWTDLNESREIVSRLVWLTKE----SRFRKRLNLRIRDITGFNV---DEIRG-LQIA 369

Query: 136 NYGLGG----HYDLHCDATPRDEGLW---RLASFMFYLTDVELGGATIFPSLNLTVFPEK 188
           N+G+GG    HYD   +   R        R+AS +FY+ DV  GG T+FP + + V P+K
Sbjct: 370 NFGVGGQFKPHYDYFTERILRLNNTILGDRIASIIFYVGDVVHGGQTVFPDIQIAVKPQK 429

Query: 189 GSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GS++FW+N   +   D R  HS CPV +G++W
Sbjct: 430 GSSLFWFNTFDDATPDPRSLHSVCPVLIGDRW 461


>gi|195440206|ref|XP_002067933.1| GK11220 [Drosophila willistoni]
 gi|194164018|gb|EDW78919.1| GK11220 [Drosophila willistoni]
          Length = 459

 Score =  116 bits (291), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 75/235 (31%), Positives = 115/235 (48%), Gaps = 33/235 (14%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y L C+G    P      L C Y    + FL++ PLK EE+ LDP +V  HD ++D EI 
Sbjct: 224 YHLGCRGLFLPP----GKLVCRYNFTTSPFLRLAPLKQEEINLDPYIVVYHDVLHDREIA 279

Query: 63  RIIE------LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
           ++ E      +S   +E  K           +++ +V      +     F+  +  RI D
Sbjct: 280 QMKEEMANAHISNAWIEERKANQ--------SQMRQVIGRVSWLTDSSNFMDSVNQRIMD 331

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW---------RLASFMFYLT 167
           MT   +   E     LQ+ NYG G ++  H D     EG           RLAS +FY +
Sbjct: 332 MTGFSMKGIE----SLQVCNYGPGCNFKPHYDYMA--EGYEPPNILTLGDRLASVIFYAS 385

Query: 168 DVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGK 222
           +V LGGAT+FP L++ + P+KG+ + WYN + ++  D R  H+ CP  +G++W K
Sbjct: 386 EVHLGGATVFPRLDVAITPKKGAGLVWYNTYDDSTHDQRSQHAVCPTLMGSRWSK 440


>gi|198459366|ref|XP_002138685.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
 gi|198136669|gb|EDY69243.1| GA24919 [Drosophila pseudoobscura pseudoobscura]
          Length = 448

 Score =  116 bits (291), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 80/229 (34%), Positives = 110/229 (48%), Gaps = 18/229 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y   C+G    P+     L C Y+ + +  L++ P KVE L  DP +   HD IYDSE
Sbjct: 210 EHYVRGCRGLFDPPK----GLSCHYDFHTHPVLRLAPFKVEPLSQDPYIAMYHDVIYDSE 265

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDHPF--LYKIQTRIQDM 117
           I  + + +   +ER KV  Y D    DT R S   F       DH +  + K+  R+  M
Sbjct: 266 IEELKDNAFPDMERSKVYTYSDKDGKDTGRTSMSAFQ-----TDHQYTAVTKVNRRVMHM 320

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCD--ATPRDEGLWR---LASFMFYLTDVELG 172
           T   +   +     L + NY     Y  H D       E + R   +A+ +FYL DVE G
Sbjct: 321 TGFEV-LADGSSDELLVLNYATAAQYLTHSDYFGPAYSEYIQRGDRIATVLFYLNDVEQG 379

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
           G T+FP L +   P KGSAV +YN +++   D R  H GCPV +G KW 
Sbjct: 380 GKTVFPRLGIFRSPMKGSAVVFYNLNSSLQGDPRTEHGGCPVLVGTKWA 428


>gi|195128343|ref|XP_002008623.1| GI13594 [Drosophila mojavensis]
 gi|193920232|gb|EDW19099.1| GI13594 [Drosophila mojavensis]
          Length = 511

 Score =  116 bits (290), Expect = 8e-24,   Method: Compositional matrix adjust.
 Identities = 76/231 (32%), Positives = 120/231 (51%), Gaps = 27/231 (11%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y L C+G        ++NL C Y +    FL++ P K+EE+ LDP +V  H+ I D E
Sbjct: 275 EPYYLGCRGGYPK----RTNLHCRYNTTTTPFLRLAPFKMEEVSLDPYIVLYHNVISDRE 330

Query: 61  INRIIELSKGKVERGKV---VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
           I  + + +        +   +N  D   +  R+  V  + P       F  +I  RI D+
Sbjct: 331 IEDMKQHATNFANGLSISPDLNVTDKPQIVARMQWVRKMTP-------FTDRINLRITDI 383

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD----EGLW----RLASFMFYLTDV 169
           T   +   + +K  +QI NYG+GGH+  H D T  D    E ++    R A+ +FY ++V
Sbjct: 384 TGFEV---DEFKA-VQIGNYGIGGHFMPHFDYTTPDRLRIEDIYGLGDRTATIVFYASEV 439

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + GGAT+FP++ +TV P+KGSA+ WYN   +   +    H+ CPV  G++W
Sbjct: 440 Q-GGATVFPNIQVTVQPQKGSALHWYNLFDDDSPNPLSLHTACPVISGSRW 489


>gi|195452736|ref|XP_002073477.1| GK14138 [Drosophila willistoni]
 gi|194169562|gb|EDW84463.1| GK14138 [Drosophila willistoni]
          Length = 518

 Score =  116 bits (290), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 74/227 (32%), Positives = 117/227 (51%), Gaps = 34/227 (14%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           + P  C G   V ++++  L C Y + ++ FL+I P+K+E L L+P +V  HD I   E 
Sbjct: 298 VLPFCCNGKCQVSKELQ--LYCLYNTKDSYFLRIAPVKMEVLSLNPYIVLYHDFILPREQ 355

Query: 62  NRIIELSKGKVERGKVVNYGDTIYVDT--------RLSKVYFLYPEIFGDHPFLYKIQTR 113
             +      K +  K ++  +TIY DT        R +K  +           + +I  R
Sbjct: 356 GSL------KAQSIKYLSVAETIYPDTGEWQADSSRTAKAMWFED---SSAEVISRISQR 406

Query: 114 IQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGG 173
           I+D+TNL   + E Y    QI NYG+GG Y+ H D    +E           L DV  GG
Sbjct: 407 IEDITNLNPEKGELY----QIINYGIGGLYETHYDYLYENE-----------LQDVPQGG 451

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           AT+  +++L+VFP+ G+A+FWYN +     ++ + H+ CPV +G+KW
Sbjct: 452 ATLLNNISLSVFPKAGAALFWYNLNNAGDTEWNVAHTACPVIVGSKW 498


>gi|444731524|gb|ELW71877.1| Prolyl 4-hydroxylase subunit alpha-3 [Tupaia chinensis]
          Length = 562

 Score =  116 bits (290), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 79/239 (33%), Positives = 122/239 (51%), Gaps = 27/239 (11%)

Query: 7   CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           CQ   S P   +  +L C YE+ ++ +L + P++ E ++L+P +   HD + DSE  +I 
Sbjct: 303 CQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPVRKELIHLEPYIALYHDFVSDSEAQKIR 362

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSK-----------------VYFLYPEIFGDHPFLY 108
            L++  ++R  V +    + V+ R+SK                 VYF         P L 
Sbjct: 363 ALAEPWLQRSVVASGEKQLQVEYRISKRRRLVVSGIASLMPQSVVYFSAWLKDTVDPMLV 422

Query: 109 KIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------AS 161
            +  RI  +T L +  +  Y   LQ+ NYG+GGHY+ H D AT     L+R+      A+
Sbjct: 423 TLDHRIAALTGLDV--QPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVAT 480

Query: 162 FMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FM YL+ VE GGAT F   N +V   K +A+FW+N H +   +    H+GCPV +G+KW
Sbjct: 481 FMIYLSSVEAGGATAFIYANFSVPVVKNAALFWWNLHRSGEGNSDTLHAGCPVLVGDKW 539


>gi|443721482|gb|ELU10773.1| hypothetical protein CAPTEDRAFT_174752 [Capitella teleta]
          Length = 525

 Score =  115 bits (289), Expect = 9e-24,   Method: Compositional matrix adjust.
 Identities = 83/241 (34%), Positives = 116/241 (48%), Gaps = 28/241 (11%)

Query: 1   EIYPLACQG-NLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           + Y   C+G  L +P D+ S  +     Y    L     K E L   P +V  HD + D+
Sbjct: 270 KAYEALCRGEQLKLP-DVDSEQQALKCRYKPGILPFVRYKEEMLNRKPHIVLFHDVMSDA 328

Query: 60  EINRIIELSKGKVERGKVVN----YGDTIYVDTRLSKVYFLYPEIFGDHP--FLYKIQTR 113
           E   +   +  K+ER  V +    +G +     R+S+V +L+     DH    ++++  R
Sbjct: 329 EAKTMKMEAMHKLERAHVADNENKHGHSASA-KRISQVSWLW----DDHANKTIHQLSRR 383

Query: 114 IQDMTNLVIGREE--RYKGPLQINNYGLGGHYDLHCD---------ATP---RDEGLWRL 159
           + D+T L  G         P QI NYG+GG Y+ H D         + P   R  G  RL
Sbjct: 384 VADITGLQTGVVSGLHSAEPFQILNYGIGGQYEPHVDYFAGNHSHSSLPEHVRASGN-RL 442

Query: 160 ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           A+FMFYL DV  GGAT+FP L + + P K  A FWYN   N  +D    H+GCPV LG K
Sbjct: 443 ATFMFYLNDVHAGGATVFPKLKVGIPPTKNGAAFWYNIGLNGDVDPLTEHAGCPVLLGQK 502

Query: 220 W 220
           W
Sbjct: 503 W 503


>gi|198466399|ref|XP_002135181.1| GA23909 [Drosophila pseudoobscura pseudoobscura]
 gi|198150582|gb|EDY73808.1| GA23909 [Drosophila pseudoobscura pseudoobscura]
          Length = 530

 Score =  115 bits (288), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 67/213 (31%), Positives = 113/213 (53%), Gaps = 21/213 (9%)

Query: 18  KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSK---GKVER 74
           ++NL C Y S    FL++ PLK+EE+  DP +V  H  + D E+  + +L++     +  
Sbjct: 303 RTNLVCRYNSTTTPFLRLAPLKMEEVNHDPYIVMYHQVLSDREMEEMKQLARPMTNGMSG 362

Query: 75  GKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
            ++ N  + + +  R++        +    PF  ++  RI DMT   +     +K  LQ+
Sbjct: 363 SEMANLTEPLEIVARVAW-------LIEASPFRERLNLRIGDMTGFDVSD---FKA-LQL 411

Query: 135 NNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDVELGGATIFPSLNLTVFPE 187
            N+G+G ++  H D  T R   L       R  S +FY ++V  GGATIFP + +TV P+
Sbjct: 412 ANFGVGSYFKAHYDYRTERVNDLGVTELGDRTGSIIFYASEVPQGGATIFPDIQVTVTPQ 471

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           KG+++FW+N   ++  D R  H+ CPV  G++W
Sbjct: 472 KGNSLFWFNTFDDSTPDPRSLHAICPVIAGSRW 504


>gi|195172672|ref|XP_002027120.1| GL20071 [Drosophila persimilis]
 gi|194112933|gb|EDW34976.1| GL20071 [Drosophila persimilis]
          Length = 455

 Score =  115 bits (288), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 79/229 (34%), Positives = 110/229 (48%), Gaps = 18/229 (7%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y   C+G    P+     L C Y+ + +  L++ P KVE L  DP +   HD IYDSE
Sbjct: 217 EHYVRGCRGLFDPPK----GLSCHYDFHTHPVLRLAPFKVEPLSQDPYIAMYHDVIYDSE 272

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDHPF--LYKIQTRIQDM 117
           I  + + +   +ER KV  Y D    +T R S   F       DH +  + K+  R+  M
Sbjct: 273 IEELKDNAFPDMERSKVYTYSDEDSKNTGRTSMSAFQ-----TDHQYKAVTKVNRRVMHM 327

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCD--ATPRDEGLWR---LASFMFYLTDVELG 172
           T   +   +     L + NY     Y  H D       E + R   +A+ +FYL DVE G
Sbjct: 328 TGFEV-LADGSSDELLVLNYATAAQYLTHSDYFGPAYSEYIQRGDRIATVLFYLNDVEQG 386

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
           G T+FP L +   P KGSAV +YN +++   D R  H GCPV +G KW 
Sbjct: 387 GKTVFPRLGIFRSPMKGSAVVFYNMNSSLQGDPRTEHGGCPVLVGTKWA 435


>gi|426365135|ref|XP_004049642.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Gorilla gorilla
           gorilla]
          Length = 500

 Score =  115 bits (288), Expect = 1e-23,   Method: Compositional matrix adjust.
 Identities = 77/228 (33%), Positives = 115/228 (50%), Gaps = 27/228 (11%)

Query: 8   QGNLSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           Q   S     +  L C Y   N N    + P K E+ +  PR+++ HD I D+EI  + +
Sbjct: 262 QSTASFTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAEIEIVKD 321

Query: 67  LSKGKVERGKVVN--YGDTIYVDTRLSK--VYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
           L+K ++ R  V +   G       R+SK  +  LY         L +  TR+  +  L  
Sbjct: 322 LAKPRLSRATVHDPETGKLTTAQYRVSKRTICLLYIN-------LKRYYTRLGFLFLLY- 373

Query: 123 GREERYKGPL--QINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVELG 172
                   P   Q+ NYG+GG Y+ H D   +DE           R+A+++FY++DV  G
Sbjct: 374 ----NTTCPFVPQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVSAG 429

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV +GNKW
Sbjct: 430 GATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPVLVGNKW 477


>gi|198477152|ref|XP_002136738.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
 gi|198145043|gb|EDY71755.1| GA29216 [Drosophila pseudoobscura pseudoobscura]
          Length = 517

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 77/227 (33%), Positives = 115/227 (50%), Gaps = 19/227 (8%)

Query: 5   LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
           L C+G    P      L C Y +    FL++ P K E L L P +V  HD I   E   +
Sbjct: 282 LCCRG--GCPYRDMHRLTCSYNTTAAPFLRLAPFKTEILSLSPYMVLYHDVITPLESLTL 339

Query: 65  IELSKGKVERGKVV---NYGDTIYVDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
             LSK  ++R  +V   N     ++D+ R S   +L      ++  + +++ R+  MTN 
Sbjct: 340 KNLSKPLMKRRAMVMVNNLKVRPFIDSGRTSNSVWLASH---ENAVMERLERRVGVMTNF 396

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD--ATPRDE----GLWRLASFMFYLTDVELGGA 174
            +   E Y    Q+ NYG+GGHY  H D   TP+      G  R+A+ +FYL+DV  GGA
Sbjct: 397 EMENSEVY----QLINYGIGGHYKPHTDHFETPQAPEHRGGGDRIATVLFYLSDVPQGGA 452

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
           T+FP LN++V P +G A+ WYN +     +    H+ CP+  G+KW 
Sbjct: 453 TLFPRLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIQGSKWA 499


>gi|195166675|ref|XP_002024160.1| GL22879 [Drosophila persimilis]
 gi|194107515|gb|EDW29558.1| GL22879 [Drosophila persimilis]
          Length = 484

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 72/207 (34%), Positives = 101/207 (48%), Gaps = 28/207 (13%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
           N  C YES    F+++ PLKVE L LDP +   HD IY+ EI R++ L+   ++      
Sbjct: 293 NRSCHYESTRTAFVRLAPLKVEMLSLDPYIAIYHDVIYEREIARVMTLALSSLK------ 346

Query: 80  YGDTIYVDTRLS--KVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
            G   Y   R    K   +Y E         ++  R +DMT    G + +     +I N 
Sbjct: 347 -GPGRYSKRREHNIKSVTVYEEENS------QLNQRTRDMT----GEQVKEDKDFRIYNS 395

Query: 138 GLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNA 197
           G+GG+   H D   ++E           L +V  GGA  FP L  TV+P KGSA+ W+N 
Sbjct: 396 GIGGYIRYHMDNLAKEEQ---------QLNEVPHGGAISFPQLEFTVWPRKGSALVWHNL 446

Query: 198 HANTLLDYRMYHSGCPVALGNKWGKLL 224
           + N  LDYR+ H  CPV +G+KW K  
Sbjct: 447 NNNLELDYRVAHISCPVIVGSKWSKFF 473


>gi|194905424|ref|XP_001981193.1| GG11755 [Drosophila erecta]
 gi|190655831|gb|EDV53063.1| GG11755 [Drosophila erecta]
          Length = 527

 Score =  115 bits (287), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 72/224 (32%), Positives = 121/224 (54%), Gaps = 16/224 (7%)

Query: 3   YPLACQGNLSVPEDIKSN-LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           Y + CQG   +PE+  ++ LKC+ +   + +  + PL+VE ++LDP +   H  +  ++I
Sbjct: 281 YTMLCQGR-RLPEERSADPLKCYLDGKRHAYFTLAPLQVEPVHLDPDINVYHGMLSANQI 339

Query: 62  NRII-ELSKGKVERGKVV-NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
             I+ E  K ++ R  V  N G++   D R+S+  +L         +   +   +  +  
Sbjct: 340 LSILDEAEKMQMFRSAVSGNGGNSTVKDLRVSQQTWL--------DYKSAVMKSVGRINE 391

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGAT 175
           LV G +      +Q+ NYG+GG Y+ H D      P +    R+++ MFYL+DVE GG T
Sbjct: 392 LVSGFDMAGAEYMQVANYGVGGQYEPHPDYFGVNLPVEFKGDRISTSMFYLSDVEQGGYT 451

Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           +FP LN+ + P  G+ V W+N H +  +D R  H+GCPV +G+K
Sbjct: 452 VFPKLNVFLPPVSGALVMWHNLHRSLDVDARTLHAGCPVIVGSK 495


>gi|313242424|emb|CBY34571.1| unnamed protein product [Oikopleura dioica]
          Length = 503

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 75/230 (32%), Positives = 119/230 (51%), Gaps = 15/230 (6%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNN-TFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           E Y   C+    +P +    LKCFY + N+  FL +GP+K EEL+ +P +++ ++ I D 
Sbjct: 245 EYYEKLCRIPNELPREKADTLKCFYWTNNDHPFLVLGPVKAEELWDEPEIIRFYEIITDE 304

Query: 60  EINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEI-FGDHPFLYKIQTRIQD 116
           E++ I E ++ K     V +   G  +  D R+S+  +L           L + + RI  
Sbjct: 305 ELDIINEQARPKSNLATVQDPITGKLVNADYRISESAWLPANTDSAQDEKLRQFRKRISI 364

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDV 169
           +T L + R E     +Q +NYG+GG Y+ H D +T  D G +      R+A+++ YL + 
Sbjct: 365 ITGLTMERAE----DIQYSNYGIGGQYEPHYDMSTENDAGKFDEEDGNRIATWLTYLNEP 420

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           + GG T+F    +   P   SAVFWYN   +   DYR  H+ CPV +G K
Sbjct: 421 KHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLIGQK 470


>gi|390176896|ref|XP_002136934.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
 gi|388858831|gb|EDY67492.2| GA26861 [Drosophila pseudoobscura pseudoobscura]
          Length = 513

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 75/223 (33%), Positives = 113/223 (50%), Gaps = 15/223 (6%)

Query: 5   LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
           L C+G    P      L C Y +    FL++ P K E L L P +V  HD I   E   +
Sbjct: 282 LCCRG--GCPYRDMHRLTCSYNTTAAPFLRLAPFKTEILSLSPYMVLYHDVITPLESLTL 339

Query: 65  IELSKGKVERGKVVNYGDTI--YVDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
             LSK  ++R  +      +   +D+ R S   +L      ++  + +++ R+  MTN  
Sbjct: 340 KNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNSVWLTSH---ENAVMERLERRVGVMTNFE 396

Query: 122 IGREERYKGPLQINNYGLGGHYDLHCD--ATPRDEGLW-RLASFMFYLTDVELGGATIFP 178
           +   E Y    Q+ NYG+GGHY  H D   TP+  G   R+A+ +FYL+DV  GGAT+FP
Sbjct: 397 MENSEVY----QLINYGIGGHYKPHTDHFETPQHRGGGDRIATVLFYLSDVPQGGATLFP 452

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
            LN++V P +G A+ WYN +     +    H+ CP+  G+KW 
Sbjct: 453 RLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIQGSKWA 495


>gi|21358233|ref|NP_651814.1| prolyl-4-hydroxylase-alpha NE3 [Drosophila melanogaster]
 gi|20269810|gb|AAM18060.1|AF495538_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]NE3
           [Drosophila melanogaster]
 gi|15291443|gb|AAK92990.1| GH21465p [Drosophila melanogaster]
 gi|23172714|gb|AAN14251.1| prolyl-4-hydroxylase-alpha NE3 [Drosophila melanogaster]
 gi|220945610|gb|ACL85348.1| PH4alphaNE3-PA [synthetic construct]
 gi|220955396|gb|ACL90241.1| PH4alphaNE3-PA [synthetic construct]
          Length = 481

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 72/208 (34%), Positives = 107/208 (51%), Gaps = 19/208 (9%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           S L C Y S  + FL + PLK+EE+ L+P +V  HD + D +I ++I L++  ++     
Sbjct: 277 SKLHCRYNSTTSAFLILAPLKMEEISLEPHIVVYHDILPDKDIQQLITLAEPLLK----- 331

Query: 79  NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYG 138
               T   D   ++    Y    G  P L  +  R++D+T L I    R   P+ I  YG
Sbjct: 332 ---PTEMFDDNKNEARSSYRTPLGG-PLLDSLTQRMRDITGLQI----RQGNPINIIKYG 383

Query: 139 LGG----HYDLHCDATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVF 193
            G     +YD         +G   R+A+FMFYL D   GGAT+FP LN+ V  E+G  +F
Sbjct: 384 FGAPYTNYYDFFKKRNSESKGFGDRMATFMFYLNDAPYGGATVFPRLNVKVPAERGKVLF 443

Query: 194 WYNAHANTL-LDYRMYHSGCPVALGNKW 220
           WYN + +T  ++    H+ CPV  G+KW
Sbjct: 444 WYNLNGDTHDMEPTTMHAACPVFHGSKW 471


>gi|195113245|ref|XP_002001178.1| GI22115 [Drosophila mojavensis]
 gi|193917772|gb|EDW16639.1| GI22115 [Drosophila mojavensis]
          Length = 498

 Score =  114 bits (286), Expect = 2e-23,   Method: Compositional matrix adjust.
 Identities = 72/209 (34%), Positives = 110/209 (52%), Gaps = 16/209 (7%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           + L C Y++  + FL + P K+E L  DP +V  HD IYDSEI  +   ++  + R  V 
Sbjct: 263 TRLVCSYKTKPSKFLYLAPFKMELLSEDPYIVVFHDVIYDSEIKHLRNTAEPLLHRSYVK 322

Query: 79  -NYGDTIYVDTRLSKVYFLYPEIFGDHP--FLYKIQTRIQDMTNLVIGREERYKGPLQIN 135
            +  +++    R +K  F++ +         + +++ R+ D+++L I RE      +Q  
Sbjct: 323 KSNNESVVSKVRTAKGAFMHADRLSPESAQVVQRLKQRMGDLSDLNIKREGY--NEMQYL 380

Query: 136 NYGLGGHYDLHCD---ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
           NY  G HY LH D    +  D    R+A+F+ YL DV  GG TIFP +   V PEKG  +
Sbjct: 381 NYDFGDHYLLHMDYFNISMND----RIATFLIYLNDVTRGGGTIFPQVKQAVHPEKGKLI 436

Query: 193 FWYNAHANTLLDYRM--YHSGCPVALGNK 219
            WYN ++N  LDY +   H  CPV +G K
Sbjct: 437 LWYNMNSN--LDYELASLHGACPVLIGRK 463


>gi|195341584|ref|XP_002037386.1| GM12898 [Drosophila sechellia]
 gi|194131502|gb|EDW53545.1| GM12898 [Drosophila sechellia]
          Length = 536

 Score =  114 bits (285), Expect = 3e-23,   Method: Compositional matrix adjust.
 Identities = 79/215 (36%), Positives = 110/215 (51%), Gaps = 14/215 (6%)

Query: 15  EDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVER 74
           E   S L C Y +    FLK+ P ++EEL LDP VV  H+ + D EI ++  +SK  +ER
Sbjct: 291 ESKPSRLHCRYNTTTTPFLKLAPFRMEELSLDPYVVLYHNVLSDPEIEKLKPMSKPFLER 350

Query: 75  GKV--VNYGDTIYVDTRLSKVYFLYPEIF--GDHPFLYKIQTRIQDMTNLVIGREERYKG 130
            KV  V  G      +R +   +L  +     D   L +I  RI+D+T    G   R   
Sbjct: 351 AKVFRVEKGSDEIAPSRSADGAWLPHQDTDPDDLEVLRRIGRRIKDLT----GLNTRSGS 406

Query: 131 PLQINNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVF 185
            +Q   YG GGH+  H D     T   E +  R+A+ +FYL +V+ GGAT FP LNL V 
Sbjct: 407 QMQFLKYGFGGHFVPHYDYFNSKTSYLERVGDRIATVLFYLNNVDHGGATAFPKLNLVVP 466

Query: 186 PEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
            +KGSA+FW+N    +   D   +H  CP+  G K
Sbjct: 467 TQKGSALFWHNLDRKSYDYDTCTFHGACPLISGTK 501


>gi|195390831|ref|XP_002054071.1| GJ22995 [Drosophila virilis]
 gi|194152157|gb|EDW67591.1| GJ22995 [Drosophila virilis]
          Length = 485

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 75/221 (33%), Positives = 105/221 (47%), Gaps = 25/221 (11%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G        ++NL C Y+   + FL++ PLK+E L + P +V  HD +   EI 
Sbjct: 264 YARGCRGQFVQ----QTNLICKYKFRPSPFLRLAPLKMEVLVVKPFIVAFHDVLSPHEIG 319

Query: 63  RIIELSKGKVERGKVVNYGDTIYVD---TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
            + +L+   ++R  V +    ++     TR SK  +L       +    +I  RI DMT 
Sbjct: 320 ELQQLAMPLLKRTTVYDSNAGLHGSVKGTRTSKGIWLSR---SHNNLTKRIGRRISDMT- 375

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPS 179
              G        LQ+ NYGL GHY LH D     E           L+DVE GG T+FP 
Sbjct: 376 ---GFHLEGSTSLQVMNYGLSGHYALHTDYFNTAE-----------LSDVEQGGDTVFPR 421

Query: 180 LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +     PE+G A+ WYN H N   D R  H  CPV +G+KW
Sbjct: 422 IEQAFKPERGKALLWYNLHRNGTGDKRTEHGACPVLVGSKW 462


>gi|195159146|ref|XP_002020443.1| GL13510 [Drosophila persimilis]
 gi|194117212|gb|EDW39255.1| GL13510 [Drosophila persimilis]
          Length = 527

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 70/211 (33%), Positives = 111/211 (52%), Gaps = 14/211 (6%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           S L C Y +    FL++ PL++EEL LDP +V  H+ + D+EI  +  +++  ++R  V 
Sbjct: 295 SRLHCRYNTTTTPFLRLAPLRMEELSLDPYIVVYHNVLSDAEIAEVERVTEPLLKRSVVF 354

Query: 79  N-YGDTIYVDTRLSKVYFLYPEIFGD---HPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
           +   + +    + + +    P+   D      + +I  RI ++T L+I   +     +Q+
Sbjct: 355 DGKENKMSTSKKRTALGAWLPDDNMDVSGRAVIQRIFRRIHELTGLIINDRQ----DMQL 410

Query: 135 NNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
             YG GGHYD+H D    +TP  +    R+A+ +FYL D++ GG+T F  L L V  E+G
Sbjct: 411 IKYGYGGHYDIHFDYFNTSTPITKARGDRMATVLFYLNDMKHGGSTAFTDLQLKVPSERG 470

Query: 190 SAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
             +FWYN    T  LD R  H  CPV  G K
Sbjct: 471 KVLFWYNMRGETHDLDSRTLHGACPVINGTK 501


>gi|195166677|ref|XP_002024161.1| GL22880 [Drosophila persimilis]
 gi|194107516|gb|EDW29559.1| GL22880 [Drosophila persimilis]
          Length = 507

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 66/213 (30%), Positives = 112/213 (52%), Gaps = 21/213 (9%)

Query: 18  KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSK---GKVER 74
           ++NL C Y S    FL++ PLK+EE+  DP +V  H  + D E+  + +L++     +  
Sbjct: 246 RTNLVCRYNSTTTPFLRLAPLKMEEVNHDPYIVMYHQVLSDREMEEMKQLARPMTNGMSG 305

Query: 75  GKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
            ++ N  + + +  R++        +    PF  ++  RI DMT   +     +K  LQ+
Sbjct: 306 SEMANLTEPLEIVARVAW-------LIEASPFRERLNLRIGDMTGFDVSD---FKA-LQL 354

Query: 135 NNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDVELGGATIFPSLNLTVFPE 187
            N+G+G ++  H D  T R   L       R  S +FY ++V  GG TIFP + +TV P+
Sbjct: 355 ANFGVGSYFKAHYDYRTERVNDLGVTELGDRTGSIIFYASEVPQGGTTIFPDIQVTVTPQ 414

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           KG+++FW+N   ++  D R  H+ CPV  G++W
Sbjct: 415 KGNSLFWFNTFDDSTPDPRSLHAICPVIAGSRW 447


>gi|194765144|ref|XP_001964687.1| GF22917 [Drosophila ananassae]
 gi|190614959|gb|EDV30483.1| GF22917 [Drosophila ananassae]
          Length = 529

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 77/213 (36%), Positives = 111/213 (52%), Gaps = 16/213 (7%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           + L C Y S    FLKI PLK+EE+ LDP +V  HD + D +I+ ++ LS+ K+E  +VV
Sbjct: 291 TRLHCRYNSTTTPFLKIAPLKMEEISLDPYIVVYHDVLPDGDISEVLRLSETKLEPAQVV 350

Query: 79  NYGDT---IYVDTRLSKVYFLYPEIFGDHPF--LY-KIQTRIQDMTNLVIGREERYKGPL 132
           +   T   +   T L      Y E+    P   LY +++  ++D+T LVI   + +    
Sbjct: 351 STPRTSNNVKFRTALGSWLPDYEEVVKGPPKGPLYGRLRNILRDVTGLVIWDYQFF---- 406

Query: 133 QINNYGLGGHYDLHCD---ATPRDEGLW--RLASFMFYLTDVELGGATIFPSLNLTVFPE 187
           Q+  Y  G HY  H D    + +   L   R+A+ +FYL D   GGAT+FP LN+ V  E
Sbjct: 407 QVLKYQFGAHYAQHHDYFNMSLKSTVLQGDRIATVLFYLNDAPHGGATVFPMLNVKVPAE 466

Query: 188 KGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
           KG  +FWYN    T   D +  H  CP+  G K
Sbjct: 467 KGKILFWYNLKGETHDFDEKTLHGACPIFHGTK 499


>gi|195379218|ref|XP_002048377.1| GJ13934 [Drosophila virilis]
 gi|194155535|gb|EDW70719.1| GJ13934 [Drosophila virilis]
          Length = 469

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 66/201 (32%), Positives = 100/201 (49%), Gaps = 24/201 (11%)

Query: 21  LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY 80
           L C Y   ++ +L++ PLK+E L L P +   HD ++DSEI  +  ++     R    N 
Sbjct: 275 LTCRYVQQHSAYLRLAPLKMEILSLQPLIQLYHDVLHDSEIEAVKNVTN---HRAMAENL 331

Query: 81  GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG 140
             T+ + T              D P    +  RI DM+ L + +       L + N+GLG
Sbjct: 332 ASTVKLIT------------LRDAPHTQNMHRRITDMSGLDMAQN----NTLHLLNFGLG 375

Query: 141 GHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHAN 200
           G+               R+A+ +FY +DV+LGGATIFP L L V P++GSA+ WYN +A 
Sbjct: 376 GYLGKQLKLQGN-----RIATVIFYASDVQLGGATIFPRLQLVVKPKRGSALLWYNLNAA 430

Query: 201 TLLDYRMYHSGCPVALGNKWG 221
              D    H+ CPV +G++W 
Sbjct: 431 GKPDPLTRHAVCPVVVGSRWA 451


>gi|198449508|ref|XP_002136911.1| GA26875 [Drosophila pseudoobscura pseudoobscura]
 gi|198130638|gb|EDY67469.1| GA26875 [Drosophila pseudoobscura pseudoobscura]
          Length = 516

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 70/211 (33%), Positives = 111/211 (52%), Gaps = 14/211 (6%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           S L C Y +    FL++ PL++EEL LDP +V  H+ + D+EI  +  +++  ++R  V 
Sbjct: 284 SRLHCRYNTTTTPFLRLAPLRMEELSLDPYIVVYHNVLCDAEIAEVERVTEPLLKRSVVF 343

Query: 79  N-YGDTIYVDTRLSKVYFLYPEIFGD---HPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
           +   + +    + + +    P+   D      + +I  RI ++T L+I      +  +Q+
Sbjct: 344 DGKENKMSTSKKRTALGAWLPDDNMDVSGRAVIQRIFRRIHELTGLIIND----RQDMQL 399

Query: 135 NNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
             YG GGHYD+H D    ++P  +    R+A+ +FYL DV+ GG+T F  L L V  E+G
Sbjct: 400 IKYGYGGHYDIHFDYFNTSSPITKARGDRMATVLFYLNDVKHGGSTAFTDLQLKVPSERG 459

Query: 190 SAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
             +FWYN    T  LD R  H  CPV  G K
Sbjct: 460 KVLFWYNMRGETHDLDSRTLHGACPVIDGTK 490


>gi|194760358|ref|XP_001962408.1| GF14452 [Drosophila ananassae]
 gi|190616105|gb|EDV31629.1| GF14452 [Drosophila ananassae]
          Length = 498

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 77/221 (34%), Positives = 121/221 (54%), Gaps = 23/221 (10%)

Query: 18  KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE-LSKGKVERGK 76
           +S L C Y +    F++I PLK EE+  DP +   HD ++DSE+  + + L++ ++ +G 
Sbjct: 278 QSRLVCRYNTTTTPFMRIAPLKEEEISKDPLIWLYHDVLFDSEMALLTKNLTREEMIQGY 337

Query: 77  VVNYGDTIYVDTRLSKVYFLYP-EIF-GDHPFLYK-IQTRIQDMTNLVIGREERYKGPLQ 133
             N        T   K Y ++  +++ GD   L + +  R+ D++ L +G        L 
Sbjct: 338 TNN-------QTTPDKGYRIFQVKVYEGDGGKLDRTLVNRMTDISGLDVGNHTY----LA 386

Query: 134 INNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGATIFPSLNLTVFPE 187
             NYGLG H+  H D     E         RL +F+FY +DVE+GGATIFP+ N+++ P+
Sbjct: 387 RANYGLGTHFQEHSDYVDLRENPDLGSEGDRLFTFLFYASDVEMGGATIFPAANISIKPK 446

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW--GKLLLS 226
           KGSA+FWYN H +   +    H+ CP+ LGN+W   K +LS
Sbjct: 447 KGSALFWYNLHNDWEPNPLSRHAVCPMVLGNRWILNKSMLS 487


>gi|328713119|ref|XP_003244997.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Acyrthosiphon
           pisum]
          Length = 487

 Score =  114 bits (284), Expect = 4e-23,   Method: Compositional matrix adjust.
 Identities = 79/207 (38%), Positives = 104/207 (50%), Gaps = 12/207 (5%)

Query: 22  KCFYESYNNTFLKI-GPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY 80
           KC Y++ NN F +I  P K E++  +P +   HD +YD EI +I  +S   +   KV   
Sbjct: 268 KCRYQT-NNLFYRILMPFKEEDINSEPFIKIYHDVLYDDEILKIKTMSLANMSDAKVKTS 326

Query: 81  GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG 140
            D+I  +   S   +   E+     F   + TRI+  T       ERY    QI NYGLG
Sbjct: 327 NDSILRERSRSGQVYRMNEVDAIEYFD-ALNTRIESFTGFSTKTAERY----QIVNYGLG 381

Query: 141 GHYDLHCDA----TPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYN 196
           GHY  H D     T   E   RL + +FYLTDV+  G T FP LN+    EKGSA+ W N
Sbjct: 382 GHYFPHFDTFKKGTENMEFGNRLVTVLFYLTDVQNDGYTSFPMLNIIAPAEKGSALVWNN 441

Query: 197 AH-ANTLLDYRMYHSGCPVALGNKWGK 222
            H ++  L Y   H  CP+  GNKW K
Sbjct: 442 LHMSDGQLCYESLHGACPLLKGNKWSK 468


>gi|156370129|ref|XP_001628324.1| predicted protein [Nematostella vectensis]
 gi|156215298|gb|EDO36261.1| predicted protein [Nematostella vectensis]
          Length = 541

 Score =  113 bits (283), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 75/248 (30%), Positives = 125/248 (50%), Gaps = 36/248 (14%)

Query: 3   YPLACQGNL-SVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           Y   C+G + S+ +  +  + C ++  ++    + P ++E +++ P V+   + I DSEI
Sbjct: 266 YEKLCRGEVRSLTKWEQGQMSC-WQIRDDPLTVLKPGRIERVFVKPEVLIFRNFITDSEI 324

Query: 62  NRIIELSKGKVERGKVVN--YGDTIYVDTRLSK--VYFLYPEIFGDHPF----------- 106
            RI EL+  +++R  V +   G+ I+ + R+SK      +P + G   F           
Sbjct: 325 KRIKELATPRLKRATVKDPVTGELIFANYRISKRRATIQHP-VTGKLEFANYRISKSGWL 383

Query: 107 -------LYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-- 157
                  + +I  R+Q  + L +   E     LQ+ NYG+GGHY+ H D     E  +  
Sbjct: 384 RDEEDELVKRISYRVQAYSGLNMTTSE----DLQVVNYGIGGHYEPHYDFARDGEDKFTS 439

Query: 158 -----RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 212
                R+A+F+ YL+DVE GG T+F  +  TV+P+KG A FWYN   +   D    H+ C
Sbjct: 440 LGTGNRIATFLSYLSDVEAGGGTVFTRVGATVWPQKGDAAFWYNLKRSGDGDSSTRHAAC 499

Query: 213 PVALGNKW 220
           PV +G+KW
Sbjct: 500 PVLVGSKW 507


>gi|432891690|ref|XP_004075614.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oryzias
           latipes]
          Length = 517

 Score =  113 bits (283), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 79/228 (34%), Positives = 116/228 (50%), Gaps = 13/228 (5%)

Query: 1   EIYPLACQGNLSVPEDIKS-NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           + Y   C+   S P   ++  L C Y + NN  L + P+K E L L P VV  H+ I D 
Sbjct: 272 DTYERLCRTQGSQPIHFENPRLYCDYFTNNNPALLLLPVKREVLSLQPYVVIYHNFITDR 331

Query: 60  EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
           E   I   ++  + R  V +  +   V+ R+SK  +L      +   + K+  RI  +T 
Sbjct: 332 EAEEIKGFAQPALRRSVVASGENQATVEYRISKSAWLKG---SESCIVGKLDQRISMLTG 388

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDVELG 172
           L +     Y   LQ+ NYG+GGHY+ H D AT     ++      R+A+FM YL+ VE G
Sbjct: 389 LNV--RPPYAEYLQVVNYGIGGHYEPHFDHATSPSSPVFKLKTGNRVATFMIYLSSVEAG 446

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           G+T F   N +V   K +A+FW+N H N   D    H+GCPV +G+KW
Sbjct: 447 GSTAFIYANFSVPVLKKAAIFWWNLHRNGRGDAETLHAGCPVLIGDKW 494


>gi|195159150|ref|XP_002020445.1| GL13509 [Drosophila persimilis]
 gi|194117214|gb|EDW39257.1| GL13509 [Drosophila persimilis]
          Length = 554

 Score =  113 bits (283), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 69/211 (32%), Positives = 111/211 (52%), Gaps = 14/211 (6%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           S L C Y +    FL++ PL++EEL LDP +V  H+ + D+EI  +  +++  ++R  V 
Sbjct: 322 SRLHCRYNTTTTPFLRLAPLRMEELSLDPYIVVYHNVLSDAEIAEVERVTEPLLKRSVVF 381

Query: 79  N-YGDTIYVDTRLSKVYFLYPEIFGD---HPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
           +   + +    + + +    P+   D      + +I  RI ++T L++      +  +Q+
Sbjct: 382 DGKENKMSTSKKRTALGAWLPDDNMDVSGRAVIQRILRRIHELTGLIMND----RQDMQL 437

Query: 135 NNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
             YG GGHYD+H D    ++P  +    R+A+ +FYL DV+ GG+T F  L L V  E+G
Sbjct: 438 IKYGYGGHYDIHFDYFNTSSPITKARGDRMATVLFYLNDVKHGGSTAFTDLQLKVPSERG 497

Query: 190 SAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
             +FWYN    T  LD R  H  CPV  G K
Sbjct: 498 KVLFWYNMRGETHDLDSRTLHGACPVIDGTK 528


>gi|194765184|ref|XP_001964707.1| GF22906 [Drosophila ananassae]
 gi|190614979|gb|EDV30503.1| GF22906 [Drosophila ananassae]
          Length = 708

 Score =  113 bits (283), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 73/223 (32%), Positives = 113/223 (50%), Gaps = 14/223 (6%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   CQG     E     L C+ +   N +  + PL+VE ++LDP +   H  +   +IN
Sbjct: 462 YTRLCQGKKLPEESTGRPLSCYLDGRTNPYFVLAPLQVEPVHLDPDINVYHRMLSQQQIN 521

Query: 63  RIIELS-KGKVERGKVV-NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            I E + K  + R  V  N G +   D R+S+  +L        P +  I   IQ ++  
Sbjct: 522 SIFEEADKLTMYRSAVAGNAGKSTVADLRVSQQTWLN----YTSPIMKSISRIIQFVSGF 577

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGATI 176
            I   E     +Q+ NYG+GG Y+ H D      P+     R+++ MFYL++VE GG T+
Sbjct: 578 DIAGAEF----MQVANYGVGGQYEPHPDYFEFNLPQQFQGDRISTSMFYLSNVEQGGYTV 633

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           F  LN+ + P +G+ V W+N H +  +D R  H+GCPV +G+K
Sbjct: 634 FTKLNVFLPPIQGAMVMWHNLHRSLDVDARTLHAGCPVLVGSK 676


>gi|241598357|ref|XP_002404731.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215500462|gb|EEC09956.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 218

 Score =  113 bits (283), Expect = 5e-23,   Method: Compositional matrix adjust.
 Identities = 63/218 (28%), Positives = 103/218 (47%), Gaps = 43/218 (19%)

Query: 17  IKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGK 76
           + S L+C Y    + FL +  +K+EE+ L P ++ +HD + D +I +++E ++ ++ER  
Sbjct: 1   MDSQLRCRYYKGQDGFLALQQIKLEEMNLKPYIIVMHDVVQDKDIEKLMEFAEPRLERST 60

Query: 77  VVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINN 136
             N  + +    R S   +L  +                             + P+ + N
Sbjct: 61  TYNGSEVMPTPQRTSSTAWLNED-----------------------------EAPIALAN 91

Query: 137 YGLGGHYDLHCDATPRDEGLW--------------RLASFMFYLTDVELGGATIFPSLNL 182
           YG GGH+  H D        +              R+A+ M Y+TDVE GGAT+FPSL +
Sbjct: 92  YGTGGHFLPHHDFFQDSLNAYNSSADYYLQHGRGDRIATLMIYMTDVEAGGATVFPSLGI 151

Query: 183 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            + P+KG A FW+N  A+   +    H+GCPV  G+KW
Sbjct: 152 RLTPKKGDAAFWWNLKASGEGERLTMHAGCPVLYGSKW 189


>gi|195159311|ref|XP_002020525.1| GL13465 [Drosophila persimilis]
 gi|194117294|gb|EDW39337.1| GL13465 [Drosophila persimilis]
          Length = 578

 Score =  113 bits (282), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 76/225 (33%), Positives = 114/225 (50%), Gaps = 17/225 (7%)

Query: 5   LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
           L C+G    P      L C Y +    FL++ P K E L L P +V  HD I   E   +
Sbjct: 345 LCCRG--GCPYRDMHRLTCSYNTTAAPFLRLAPFKTELLSLAPYMVLYHDVITPLESLTL 402

Query: 65  IELSKGKVERGKVVNYGDTI--YVDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
             LSK  ++R  +      +   +D+ R S   +L      ++  + +++ R+  MTN  
Sbjct: 403 KNLSKPHMKRRAMTFNKQKLRPLIDSGRTSNSVWLTSH---ENAVMERLERRVGVMTNFE 459

Query: 122 IGREERYKGPLQINNYGLGGHYDLHCD--ATPRDE---GLWRLASFMFYLTDVELGGATI 176
           +   E Y    Q+ NYG+GGHY  H D   TP+ E   G  R+A+ +FYL+DV  GGAT+
Sbjct: 460 MENSEVY----QLINYGIGGHYKPHTDHFETPQLEHRGGGDRIATVLFYLSDVPQGGATL 515

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
           FP LN++V P +G A+ WYN +     +    H+ CP+  G+KW 
Sbjct: 516 FPRLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIKGSKWA 560


>gi|313229343|emb|CBY23930.1| unnamed protein product [Oikopleura dioica]
          Length = 542

 Score =  113 bits (282), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 74/230 (32%), Positives = 119/230 (51%), Gaps = 15/230 (6%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNN-TFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           E Y   C+    +P +    LKCFY + N+  FL +GP+K EEL+ +P +++ ++ I D 
Sbjct: 284 EYYEKLCRIPNELPREKADTLKCFYWTNNDHPFLVLGPVKAEELWDEPEIIRFYEIITDE 343

Query: 60  EINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEI-FGDHPFLYKIQTRIQD 116
           E++ I + ++ K     V +   G  +  D R+S+  +L           L + + RI  
Sbjct: 344 ELDIINKQARPKSNLATVQDPITGKLVNADYRISESAWLPANTDSAQDEKLRQFRKRISI 403

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDV 169
           +T L + R E     +Q +NYG+GG Y+ H D +T  D G +      R+A+++ YL + 
Sbjct: 404 ITGLTMERAE----DIQYSNYGIGGQYEPHYDMSTENDAGKFDEEDGNRIATWLTYLNEP 459

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           + GG T+F    +   P   SAVFWYN   +   DYR  H+ CPV +G K
Sbjct: 460 KHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLIGQK 509


>gi|241044303|ref|XP_002407179.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215492129|gb|EEC01770.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 456

 Score =  113 bits (282), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 72/218 (33%), Positives = 110/218 (50%), Gaps = 16/218 (7%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           C+G        + +L C Y+   + + KIGP+KVE++  +P V++ +D ++  EI     
Sbjct: 225 CRGEKIRNASEEKDLFCLYD-VPHPYFKIGPVKVEQMNKNPYVLQFYDVLWPQEIKAFRR 283

Query: 67  LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
           +   ++ER  V +         R+S+V ++ P+       L ++  R+  +T L   R  
Sbjct: 284 MGDPQLERATVRDTARNTVSHARVSQVAWISPD---SDVLLDRVNARVAMLTGLS-HRLR 339

Query: 127 RYKGPLQINNYGLGGHYDLHCDATPR-DE----GLWRLASFMFYLTDVELGGATIFPSLN 181
           +Y      N+YG GGHY+ H D     DE    G  R+A+FMFYL+DV LGG+T+FP   
Sbjct: 340 KY------NSYGPGGHYEPHHDYLEELDEVDKLGGDRIATFMFYLSDVNLGGSTVFPYAK 393

Query: 182 LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
             V P+ GSA FWYN   +   D    H  C V  G K
Sbjct: 394 AGVMPKMGSAAFWYNMREDGSYDRATLHGACSVLHGTK 431


>gi|313241587|emb|CBY33829.1| unnamed protein product [Oikopleura dioica]
          Length = 541

 Score =  113 bits (282), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 74/230 (32%), Positives = 119/230 (51%), Gaps = 15/230 (6%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNN-TFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           E Y   C+    +P +    LKCFY + N+  FL +GP+K EEL+ +P +++ ++ I D 
Sbjct: 283 EYYEKLCRIPNELPREKADTLKCFYWTNNDHPFLVLGPVKAEELWDEPEIIRFYEIITDE 342

Query: 60  EINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEI-FGDHPFLYKIQTRIQD 116
           E++ I + ++ K     V +   G  +  D R+S+  +L           L + + RI  
Sbjct: 343 ELDIINKQARPKSNLATVQDPITGKLVNADYRISESAWLPANTDSAQDEKLRQFRKRISI 402

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDV 169
           +T L + R E     +Q +NYG+GG Y+ H D +T  D G +      R+A+++ YL + 
Sbjct: 403 ITGLTMERAE----DIQYSNYGIGGQYEPHYDMSTENDAGKFDEEDGNRIATWLTYLNEP 458

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           + GG T+F    +   P   SAVFWYN   +   DYR  H+ CPV +G K
Sbjct: 459 KHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLIGQK 508


>gi|313213106|emb|CBY36968.1| unnamed protein product [Oikopleura dioica]
          Length = 541

 Score =  113 bits (282), Expect = 7e-23,   Method: Compositional matrix adjust.
 Identities = 74/230 (32%), Positives = 119/230 (51%), Gaps = 15/230 (6%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNN-TFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           E Y   C+    +P +    LKCFY + N+  FL +GP+K EEL+ +P +++ ++ I D 
Sbjct: 283 EYYEKLCRIPNELPREKADTLKCFYWTNNDHPFLVLGPVKAEELWDEPEIIRFYEIITDE 342

Query: 60  EINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEI-FGDHPFLYKIQTRIQD 116
           E++ I + ++ K     V +   G  +  D R+S+  +L           L + + RI  
Sbjct: 343 ELDIINKQARPKSNLATVQDPITGKLVNADYRISESAWLPANTDSAQDEKLRQFRKRISI 402

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDV 169
           +T L + R E     +Q +NYG+GG Y+ H D +T  D G +      R+A+++ YL + 
Sbjct: 403 ITGLTMERAE----DIQYSNYGIGGQYEPHYDMSTENDAGKFDEEDGNRIATWLTYLNEP 458

Query: 170 ELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           + GG T+F    +   P   SAVFWYN   +   DYR  H+ CPV +G K
Sbjct: 459 KHGGDTVFLGPGIKAEPIHKSAVFWYNLLRDGSCDYRTRHAACPVLIGQK 508


>gi|156370133|ref|XP_001628326.1| predicted protein [Nematostella vectensis]
 gi|156215300|gb|EDO36263.1| predicted protein [Nematostella vectensis]
          Length = 526

 Score =  112 bits (281), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 70/198 (35%), Positives = 113/198 (57%), Gaps = 17/198 (8%)

Query: 33  LKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRL 90
           LK+ P+ +E + ++P++   H+ + + EI +++EL++ ++ R +V N   G+   VD R+
Sbjct: 312 LKLKPVAMEIVSVNPQITLFHNVLSEMEIEQMLELARPRLRRARVNNLETGEIEDVDYRI 371

Query: 91  SKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT 150
           S++ +L      D   + +I  R+  +T L     E     LQ+NNYG+GGHY+ H D +
Sbjct: 372 SQIAWLSD---SDGDIVRRINRRVGFITGLNTNTGE----CLQVNNYGVGGHYEPHFDHS 424

Query: 151 PRDEGL--------WRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL 202
              E           R+A+FMFYL++VE GG+T+F    +   P KG AVFWYN   +  
Sbjct: 425 LDMENSPIASLGQGNRIATFMFYLSEVEAGGSTVFIKTGVKTNPFKGGAVFWYNLKKSGE 484

Query: 203 LDYRMYHSGCPVALGNKW 220
            D+   H+GCPV +GNKW
Sbjct: 485 GDWDSLHAGCPVLIGNKW 502


>gi|221126103|ref|XP_002165259.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Hydra
           magnipapillata]
          Length = 533

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 76/225 (33%), Positives = 117/225 (52%), Gaps = 19/225 (8%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           CQG   + +   + L C Y +    ++ + PLK+E L+ DP +   ++ I D E   II+
Sbjct: 294 CQGREKMAQKDINRLFCKYVAPKAHYI-LKPLKMEVLHHDPYIELYYELITDDEAKHIIK 352

Query: 67  LSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGR 124
            +K  + R  V +   GD IY D R+SK  ++  ++        KI  R+ D+T L +  
Sbjct: 353 FAKPLLRRAFVHDMVTGDLIYADYRVSKNTWIAEDM---DVIAAKIIRRVGDVTGLNM-- 407

Query: 125 EERYKGPLQINNYGLGGHYDLHCDAT----PRDEGLW---RLASFMFYLTDVELGGATIF 177
             RY   LQ+ NYG+ G Y+ H D +    P+    W   R+A+ + YL+DV+ GG T+F
Sbjct: 408 --RYAEHLQVANYGIAGQYEPHFDHSTGTRPKHFDRWGGNRIATMLLYLSDVDWGGRTVF 465

Query: 178 PSL--NLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            +    +   P KG+ VFWYN   N   + +  H+GCPV LG KW
Sbjct: 466 TNTAPGVGTDPIKGAGVFWYNLLRNGKSNPKTQHAGCPVVLGQKW 510


>gi|194905305|ref|XP_001981170.1| GG11767 [Drosophila erecta]
 gi|190655808|gb|EDV53040.1| GG11767 [Drosophila erecta]
          Length = 536

 Score =  112 bits (280), Expect = 1e-22,   Method: Compositional matrix adjust.
 Identities = 78/211 (36%), Positives = 113/211 (53%), Gaps = 14/211 (6%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV- 77
           S L C Y +    FL++ PL++EEL LDP VV  H+ + D EI ++  +S+  +ER KV 
Sbjct: 295 SRLHCRYNTTTRPFLRLVPLRMEELSLDPYVVLYHNVLSDPEIEKLKLMSEPFLERAKVY 354

Query: 78  -VNYGDTIYVDTRLSKVYFLY-PEIF-GDHPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
            V  G      +R +   +L  PE    D   L +I  RI D+T L      +    +Q+
Sbjct: 355 RVEKGSDEVAPSRSADGAWLPDPETEPEDLETLNRIGRRIGDITGLSTCSGSQ----MQL 410

Query: 135 NNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
             YG GGH+  H D     T   E +  R+A+ +FYL +V+ GGAT FP++NL V  +KG
Sbjct: 411 LKYGFGGHFVPHYDYFDSKTSYLEAVGDRIATVLFYLNNVDHGGATAFPNINLAVPTQKG 470

Query: 190 SAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
           SA+FW+N    +   D R +H  CP+  G K
Sbjct: 471 SALFWHNLDGKSYDYDTRTFHGACPLISGTK 501


>gi|442751927|gb|JAA68123.1| Putative prolyl 4-hydroxylase alpha subunit [Ixodes ricinus]
          Length = 522

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 67/232 (28%), Positives = 113/232 (48%), Gaps = 17/232 (7%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G +     + S L+C Y    + F  + P+K+EE+ L P ++ + D + + +I 
Sbjct: 265 YKRLCRGEVLRTPKMDSKLRCRYYKGQDGFFTLRPIKLEEINLKPYIIVMRDVVQERDIE 324

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
            ++  ++ +++R              R S   +L+ +   + P   ++   ++ +  L  
Sbjct: 325 DLMAFAEPRLQRSTTYTGDGNAPSTRRTSSNAWLWDD---EAPIANRMNWYLRALVGLGT 381

Query: 123 GREERYKGPLQINNYGLGG----HYD-----LHCDATPRDEGLW-----RLASFMFYLTD 168
              +      Q+ NYG GG    H+D     LH   +  D  L      RLA+ M Y+TD
Sbjct: 382 SGSDYEAEAYQLANYGSGGYFLPHHDYLQDTLHAHNSTADYYLQNKEGDRLATLMIYMTD 441

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           VE+GGAT+FP L + + P+KG A FW+N  A+   D    H+GCPV  G+KW
Sbjct: 442 VEVGGATVFPRLGVRLVPKKGDAAFWWNLKASGEGDTLTMHAGCPVLYGSKW 493


>gi|194751823|ref|XP_001958223.1| GF23631 [Drosophila ananassae]
 gi|190625505|gb|EDV41029.1| GF23631 [Drosophila ananassae]
          Length = 502

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 75/227 (33%), Positives = 106/227 (46%), Gaps = 17/227 (7%)

Query: 5   LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
           L C+G    P+     L C Y    + FLK+ PLK+E L + P +V  HD +Y+ E   +
Sbjct: 263 LGCRGKW--PKKPSPTLTCRYVRETHDFLKLAPLKMEFLNMQPLIVLYHDVLYEGEFKSM 320

Query: 65  IELSKGKVERGKVVNYGD--TIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
            +++      G    Y D        R  +V  +         F  +I  RI DMT    
Sbjct: 321 RDIAIFNATMGDGWTYVDFDKKGKPKRQDRVVKMITFQGTTAEFTLRINRRIADMT---- 376

Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPR---------DEGLWRLASFMFYLTDVELGG 173
           G E      L + NYGLGGH+  H D             D G  R+A+ + Y +DV LGG
Sbjct: 377 GLEMNENMALHLTNYGLGGHFGKHVDYVELAKRPPNFFGDLGGDRIATALLYASDVPLGG 436

Query: 174 ATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            T+F  L L++ P+KGSA+ W+N +     D    HS CPV LG++W
Sbjct: 437 TTVFTKLKLSIEPKKGSALIWFNLNNAGDPDPMSEHSACPVVLGSRW 483


>gi|195156517|ref|XP_002019146.1| GL25581 [Drosophila persimilis]
 gi|194115299|gb|EDW37342.1| GL25581 [Drosophila persimilis]
          Length = 206

 Score =  112 bits (279), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 73/210 (34%), Positives = 101/210 (48%), Gaps = 33/210 (15%)

Query: 21  LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI-IELSKGKVERGKVVN 79
           L C Y      FL++ PLK EE+  DP +   HD +YDSE  ++ + L++ ++ +G   N
Sbjct: 2   LVCRYNHTTTPFLRLAPLKEEEVSRDPLIWLYHDVLYDSEFEQLTVNLTRAEMVQGYTDN 61

Query: 80  YGDTIYVDTRLSKVYFLYPEIF--GDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
           Y       T   K    Y  IF          +  R+ D++ L+ G   +    L   NY
Sbjct: 62  Y-------TTTEKERIFYVNIFEGSGEKLDRDLVNRMADISGLLTGEHTQ----LGTVNY 110

Query: 138 GLGGHYDLHCD-----ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
           GLG H+  H D     A P              +TDV LGGATIFP +NLT+ P+KGSA+
Sbjct: 111 GLGSHFPEHGDYSDIKANP--------------MTDVPLGGATIFPKINLTIQPKKGSAL 156

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKWGK 222
           FWYN H +        H+ CP   GN+W K
Sbjct: 157 FWYNIHNDWEPHVLTRHAVCPTIEGNRWSK 186


>gi|194871364|ref|XP_001972834.1| GG13661 [Drosophila erecta]
 gi|190654617|gb|EDV51860.1| GG13661 [Drosophila erecta]
          Length = 506

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 72/209 (34%), Positives = 109/209 (52%), Gaps = 22/209 (10%)

Query: 18  KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVER-GK 76
           + +  C YE   + FL+I PLKVE L + P +V  HD IYDSEI+++  +S   +    +
Sbjct: 288 RQHQSCHYEKNTSDFLRIAPLKVETLSVKPHIVLYHDVIYDSEISKVKNISLPSLRSPSR 347

Query: 77  VVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINN 136
           ++   D    + +L+K+         + P    +  RI+DMT    G +      LQI N
Sbjct: 348 ILRAEDH---NLKLAKIR--------EDP-RSPLSLRIKDMT----GEDVEEDTDLQIEN 391

Query: 137 YGLGGHYDLHCDATPRDEGLW----RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
           YG+ G    H D     +       RL S +F++ DV LGGA +F + NLT+FP+KGSA+
Sbjct: 392 YGICGFRFYHNDNLESQDQTAKLGDRLTSILFFMNDVALGGAFVFLNANLTIFPQKGSAL 451

Query: 193 FWYNA-HANTLLDYRMYHSGCPVALGNKW 220
            W N  H+    +  + H  CPV +G+KW
Sbjct: 452 VWRNLDHSLQPKEDLLQHLSCPVIVGSKW 480


>gi|195379216|ref|XP_002048376.1| GJ13933 [Drosophila virilis]
 gi|194155534|gb|EDW70718.1| GJ13933 [Drosophila virilis]
          Length = 521

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 72/224 (32%), Positives = 110/224 (49%), Gaps = 20/224 (8%)

Query: 5   LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
           L C+G    P+    +L C Y S    FL++ PLK+EE+  DP +V  H+ + DSEI  +
Sbjct: 287 LGCRGLFPKPK----SLSCRYNSTTTPFLRLAPLKLEEISHDPYIVMYHNVLSDSEIEEM 342

Query: 65  IELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGR 124
            +LS          N  +       +++  +L        PFL +I  RI DMT    G 
Sbjct: 343 KQLSVLMENGLSATNKPNNTEPLDIVARAGWLVEAT----PFLERINRRITDMT----GF 394

Query: 125 EERYKGPLQINNYGLGGHYDLHCD--------ATPRDEGLWRLASFMFYLTDVELGGATI 176
           +      + + NYG+G ++  H D             E   R+A+ +FY +DV  GGAT 
Sbjct: 395 DVLDMWAVLLANYGIGNYFKPHYDYMYGGRVSGEAVAELGERIATLIFYASDVAQGGATN 454

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FP + + V P+KG+++FWYN   +   D R  HS CP  +G++W
Sbjct: 455 FPDIQVAVQPQKGNSLFWYNMFDDGTPDPRSLHSVCPTIVGSRW 498


>gi|195113263|ref|XP_002001187.1| GI10646 [Drosophila mojavensis]
 gi|193917781|gb|EDW16648.1| GI10646 [Drosophila mojavensis]
          Length = 471

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 78/226 (34%), Positives = 113/226 (50%), Gaps = 41/226 (18%)

Query: 1   EIYPLAC-QGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           E+  LA  +G+ S      ++L C Y  +   FL+I PLK+EEL LDP +V  H AIY+S
Sbjct: 257 EMISLAIIKGHCSASFQRPTHLHCRYNYWMTPFLRIAPLKLEELSLDPLIVLYHKAIYNS 316

Query: 60  EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
           EI  +++  +  +  GK  N   TI+                           R+ DM+ 
Sbjct: 317 EIETLLKRQEFNLISGKD-NMDRTIH--------------------------ERVADMSG 349

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDA-----TPRDEGLWRLASFMFYLTDVELGGA 174
           L + R E     L + N    GH+ L  DA      P+D    R+A+ +FYL DVEL GA
Sbjct: 350 LNLDRSE----VLSVINNDNNGHFQLQEDAPETTERPQD----RIATVLFYLEDVELVGA 401

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           TIFP LNLT+ PEKG+A+ W+N  +      +  ++ CPV   +K+
Sbjct: 402 TIFPRLNLTIKPEKGTALLWHNLESCGSSHPKALYAACPVISSSKY 447


>gi|410975458|ref|XP_003994148.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Felis catus]
          Length = 567

 Score =  111 bits (278), Expect = 2e-22,   Method: Compositional matrix adjust.
 Identities = 78/258 (30%), Positives = 123/258 (47%), Gaps = 42/258 (16%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLY-------------------PE 99
           I  + +L+K ++ R  V +   G       R+SK    +                   PE
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSLVSWGKVQRALLIRSMQVCCERGPE 408

Query: 100 IFGDHPFLYKIQTRIQDMTNLV---------IGREERYKGPLQINNYGLGGHYDLHCDAT 150
              D   +   +  + +++ L          IG  E   G   + NYG+GG Y+ H D  
Sbjct: 409 AAWDGGSM-SAEECLAELSLLAGECSAALVPIGVCESRLGK-GVANYGVGGQYEPHFDFA 466

Query: 151 PRDEGLW--------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL 202
            +DE           R+A+++FY++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+  
Sbjct: 467 RKDEPDAFKELGTGNRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGE 526

Query: 203 LDYRMYHSGCPVALGNKW 220
            DY   H+ CPV +GNKW
Sbjct: 527 GDYSTRHAACPVLVGNKW 544


>gi|195503448|ref|XP_002098656.1| GE23815 [Drosophila yakuba]
 gi|194184757|gb|EDW98368.1| GE23815 [Drosophila yakuba]
          Length = 472

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 73/219 (33%), Positives = 108/219 (49%), Gaps = 33/219 (15%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           C+G    P   KS L+C Y    + FL+   LK+E+L ++P V   HDAI  +E   ++ 
Sbjct: 249 CRGKNLPPS--KSFLRCRYFREGSPFLRWAALKLEQLNIEPFVGLFHDAISPAEQEDLLR 306

Query: 67  LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
           L++ ++E  K  +Y     VDT  S           DH  + +I  RI+D+T   +   E
Sbjct: 307 LTETRLEHRKKDSYSVEANVDTNGS-----------DH--VRRIHQRIEDITGFDLEDSE 353

Query: 127 RYKGPLQINNYGLGGHYDLHCDA-TPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVF 185
               PL ++NYG+GG   +H D   P+             L+DV++GG   FP L     
Sbjct: 354 ----PLTVSNYGIGGQESIHLDCEQPK-------------LSDVQMGGYASFPDLGFGFK 396

Query: 186 PEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 224
           P +GSA+ W+N  +    D R   + CPV LGN+WGK L
Sbjct: 397 PSRGSALVWHNTDSAGNCDTRSLQATCPVLLGNQWGKWL 435


>gi|195341061|ref|XP_002037130.1| GM12749 [Drosophila sechellia]
 gi|194131246|gb|EDW53289.1| GM12749 [Drosophila sechellia]
          Length = 467

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 69/216 (31%), Positives = 108/216 (50%), Gaps = 33/216 (15%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           C+G   +P   KS+L+C Y    + FL++ P+K+E+L  +P V  +HDAI  +E   ++ 
Sbjct: 255 CRGKNLLPN--KSSLRCRYFRGGSPFLRLAPVKLEQLNFEPFVGLVHDAISQAEQEDLLH 312

Query: 67  LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
           L+  ++E  +  +      VDT  S           DH  + +I  RI+D+T   +   E
Sbjct: 313 LTDSRLEHTRKESSSVEAKVDTNAS-----------DH--VRRIHQRIEDITGFDMEESE 359

Query: 127 RYKGPLQINNYGLGGHYDLHCDA-TPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVF 185
               PL ++NYG+GG   +H D   P+             L+DV++GG   FP L     
Sbjct: 360 ----PLIVSNYGIGGQELIHLDCEQPK-------------LSDVQMGGYASFPDLGFGFK 402

Query: 186 PEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
           P +GSA+ W+N   +   D R   + CPV LGN+WG
Sbjct: 403 PRRGSALVWHNTDNSGNCDTRSLQATCPVLLGNQWG 438


>gi|195128347|ref|XP_002008625.1| GI13597 [Drosophila mojavensis]
 gi|193920234|gb|EDW19101.1| GI13597 [Drosophila mojavensis]
          Length = 457

 Score =  111 bits (277), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 73/216 (33%), Positives = 102/216 (47%), Gaps = 39/216 (18%)

Query: 6   ACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           AC+G    PE    +L C Y   N+ +LK+ P+K+E+L L+P V   HD +YDSEI  I 
Sbjct: 262 ACRGLW--PERKTDHLSCRYVYENSAYLKLAPMKLEQLSLEPVVQLYHDVLYDSEIKAIK 319

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
            +S  + +  +V                                I  R+ DMT    G  
Sbjct: 320 NMSVPEAKAKRVE-----------------------------LNINQRVADMTG--YGMM 348

Query: 126 ERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVF 185
           E  K  L + N+ LG   D        D    R+A+ +FY  DV +GGATIFP L L V 
Sbjct: 349 EHNK--LHVLNFALGQGADTKSCKARAD----RIATIVFYANDVAIGGATIFPKLRLLVQ 402

Query: 186 PEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
           P +G+A+ WYN +A+   D    H+ CPV LG++W 
Sbjct: 403 PRRGTALLWYNLNADGAADPLAKHAVCPVVLGSRWA 438


>gi|194905313|ref|XP_001981171.1| GG11766 [Drosophila erecta]
 gi|190655809|gb|EDV53041.1| GG11766 [Drosophila erecta]
          Length = 496

 Score =  110 bits (276), Expect = 3e-22,   Method: Compositional matrix adjust.
 Identities = 69/213 (32%), Positives = 110/213 (51%), Gaps = 23/213 (10%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           S L C Y S  + FL + PLK+E++ L+P +V  HD + + +I+++I L++ ++      
Sbjct: 271 SKLHCRYNSTTSPFLILAPLKMEQISLEPYIVVYHDILPEGDIHQLIALAEPRLR----- 325

Query: 79  NYGDTIYVDTRLSKVYFLYPEIFGD-----HPFLYKIQTRIQDMTNLVIGREERYKGPLQ 133
                 + + +   V+  +   F D      P L ++  R++D+T L I +  R    + 
Sbjct: 326 --ATLAFTEDKSDSVFGAFLP-FKDMNSSGEPVLDRLTQRMRDITGLQIHQRNR----IN 378

Query: 134 INNYGLGGHY----DLHCDATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPEK 188
           I  YG G HY    D   +     EG   R+A+ MFYL D   GGAT+FP +N+ V  E+
Sbjct: 379 IIKYGFGAHYAARHDFFNETNSETEGYGDRMATVMFYLNDAPNGGATVFPRINVKVPAER 438

Query: 189 GSAVFWYNAHANTL-LDYRMYHSGCPVALGNKW 220
           G  +FWYN    T  +D +  H+ CPV  G+KW
Sbjct: 439 GKVLFWYNLDGETHDVDPKTVHAACPVFHGSKW 471


>gi|198449520|ref|XP_002136916.1| GA26928 [Drosophila pseudoobscura pseudoobscura]
 gi|198130644|gb|EDY67474.1| GA26928 [Drosophila pseudoobscura pseudoobscura]
          Length = 532

 Score =  110 bits (276), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 72/213 (33%), Positives = 106/213 (49%), Gaps = 20/213 (9%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVER-GKV 77
           S L C Y +    FL++ PL++EEL LDP +V  H+ + D+EI  +  + +  ++R G+ 
Sbjct: 295 SRLHCRYNATTTPFLRLAPLRMEELSLDPYIVVYHNVLSDAEIAEVERVIEPLLQRIGRY 354

Query: 78  VNYGDTIYVDTRLSKVYFLYPEI-----FGDHPFLYKIQTRIQDMTNLVIGREERYKGPL 132
               +++    R  +  F  P I         P + ++   I+DMT L +         L
Sbjct: 355 DETPNSMSPSKR--RTGFTGPHIDDYMHVSGAPVIERVHRHIRDMTGLFMNEH------L 406

Query: 133 QINNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPE 187
            +  YGLGGH D H D    + P    +  R+A+ +FYL DV+ GG+T F  L L V  E
Sbjct: 407 MMVKYGLGGHCDQHYDFLNASYPSTHAMGDRMATVLFYLNDVKHGGSTAFTDLQLKVPSE 466

Query: 188 KGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
           +G  +FWYN    T  LD R  H  CPV  G K
Sbjct: 467 RGKVLFWYNMRGETHNLDRRTVHGSCPVIDGTK 499


>gi|195471732|ref|XP_002088156.1| GE14021 [Drosophila yakuba]
 gi|194174257|gb|EDW87868.1| GE14021 [Drosophila yakuba]
          Length = 265

 Score =  110 bits (275), Expect = 4e-22,   Method: Compositional matrix adjust.
 Identities = 70/216 (32%), Positives = 109/216 (50%), Gaps = 15/216 (6%)

Query: 6   ACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
            C+GN          L C Y S    F++I PLK EE+  +P +   HD IYDSEI ++ 
Sbjct: 45  GCRGNFPP----HPQLVCRYNSTTTPFMRIAPLKEEEISKEPLIWLYHDVIYDSEIAQLT 100

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
            L++ ++  G   NY        R+++++ +             +  R+ D++ L +G  
Sbjct: 101 NLTREEMILGTTNNY----TTPDRVNRLFHVKVTNDDGGQLDRTLVNRMADISGLDMGNT 156

Query: 126 ERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFM-FYLTDVELGGATIFPSLNLTV 184
                 L   NYGLGG++  H D    D  L   +S +   ++DV +GGATIFP+  L +
Sbjct: 157 TS----LARINYGLGGYFQEHSDYV--DIKLHPASSLLPTSISDVPVGGATIFPAAKLAI 210

Query: 185 FPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            P+KGSA+FWYN H N   +    H+ CP  +G++W
Sbjct: 211 QPKKGSALFWYNLHNNGDPNPLTRHAVCPTIVGSRW 246


>gi|20177113|gb|AAM12259.1| RE23792p [Drosophila melanogaster]
 gi|220948174|gb|ACL86630.1| PH4alphaSG2-PB [synthetic construct]
 gi|220960438|gb|ACL92755.1| PH4alphaSG2-PB [synthetic construct]
          Length = 301

 Score =  110 bits (274), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 71/223 (31%), Positives = 115/223 (51%), Gaps = 14/223 (6%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   CQG     E     L+C+ +   + +  + PL+VE ++LDP +   H  +   +I 
Sbjct: 55  YTRLCQGRRLPEERSGDPLRCYLDGKRHAYFTLAPLQVEPVHLDPDINVYHGMLSSKQIL 114

Query: 63  RIIELS-KGKVERGKVVNYGDTIYV-DTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            I E + K ++ R  V   G    V D R+S+  +L  +     P +  +   IQ ++  
Sbjct: 115 SIFEEADKEEMVRSAVAGSGGEGTVRDLRVSQQTWLDYK----SPVMNSVGRIIQFVSGF 170

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGATI 176
            +   E     +Q+ NYG+GG Y+ H D      P++    R+++ MFYL+DVE GG T+
Sbjct: 171 DMAGAEH----MQVANYGVGGQYEPHPDYFEVNLPKNFEGDRISTSMFYLSDVEQGGYTV 226

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           F  LN+ + P KG+ V W+N H +  +D R  H+GCPV +G+K
Sbjct: 227 FTKLNVFLPPVKGALVMWHNLHRSLHVDARTLHAGCPVIVGSK 269


>gi|198471971|ref|XP_002133305.1| GA28042 [Drosophila pseudoobscura pseudoobscura]
 gi|198139547|gb|EDY70707.1| GA28042 [Drosophila pseudoobscura pseudoobscura]
          Length = 203

 Score =  110 bits (274), Expect = 6e-22,   Method: Compositional matrix adjust.
 Identities = 72/208 (34%), Positives = 100/208 (48%), Gaps = 33/208 (15%)

Query: 21  LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI-IELSKGKVERGKVVN 79
           L C Y      FL++ PLK EE+  DP +   HD +YDSE  ++ + L++ ++ +G   N
Sbjct: 2   LVCRYNHTTTPFLRLAPLKEEEVSRDPLIWLYHDVLYDSEFEQLTVNLTRAEMVQGYTDN 61

Query: 80  YGDTIYVDTRLSKVYFLYPEIF--GDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
           Y       T   K    Y  IF          +  R+ D++ L+ G   +    L   NY
Sbjct: 62  Y-------TTTEKERIFYVNIFEGSGEKLDRDLVNRMADISGLLTGEHTQ----LGTVNY 110

Query: 138 GLGGHYDLHCD-----ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
           GLG H+  H D     A P              +TDV LGGATIFP +NLT+ P+KGSA+
Sbjct: 111 GLGSHFPEHGDYSDIKANP--------------MTDVPLGGATIFPKINLTIQPKKGSAL 156

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FWYN H +        H+ CP   GN+W
Sbjct: 157 FWYNIHNDWEPHVLTRHAVCPTIEGNRW 184


>gi|326436053|gb|EGD81623.1| p4ha2 protein [Salpingoeca sp. ATCC 50818]
          Length = 548

 Score =  110 bits (274), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 74/204 (36%), Positives = 103/204 (50%), Gaps = 12/204 (5%)

Query: 21  LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELS-KGKVERGKVVN 79
           L C  + YN   L + P+KVE L+   + +++       E  R ++ + K ++ER     
Sbjct: 311 LTCELKHYNQPHLFLKPIKVEHLHEGRQRLQVFRQFASPEECRHLQHAGKRRLERAVAWT 370

Query: 80  YGDTIYVDTRLSKVYFLYPEIFGDHPFLYK-IQTRIQDMTNLVIGREERYKGPLQINNYG 138
            G    V+ R+S   +L P    DH  + K I  RI+D T + I     Y   LQI+NYG
Sbjct: 371 DGRFQPVEFRISTAAWLQP----DHDAIVKRIHGRIEDATQVDI----EYAEALQISNYG 422

Query: 139 LGGHYDLHCDATPR--DEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYN 196
           +GG Y+ H D + R  +    RLA+FM YL  V+ GG T FP L   V P  G AVFWYN
Sbjct: 423 MGGFYEPHFDHSSRGTNPDGERLATFMIYLNPVKQGGFTAFPRLGAAVQPGYGDAVFWYN 482

Query: 197 AHANTLLDYRMYHSGCPVALGNKW 220
              + + D    H  CPV  G+KW
Sbjct: 483 LQPSGVGDPLTLHGACPVLRGSKW 506


>gi|328707957|ref|XP_001947811.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Acyrthosiphon
           pisum]
          Length = 507

 Score =  109 bits (273), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 74/205 (36%), Positives = 105/205 (51%), Gaps = 14/205 (6%)

Query: 22  KCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIEL-SKGKVERGKVVNY 80
           KC Y++ N+ +  I P K E++  +P +   HD IYD EI  I ++ SK   +     N 
Sbjct: 285 KCRYQTKNSPYRMIMPFKEEDISSNPNIKLYHDIIYDEEIKTITDMASKDLSDAAYYFNG 344

Query: 81  GDTIYVDTRLSKVYFLYPEIFGDHPFLY-KIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
             T+  D RL ++ +        +P L+ K+  RI+ +T       E Y    Q  NYGL
Sbjct: 345 KITLLDDQRLGQLKWFSENA---NPILFGKLNDRIECITEYTTKTAEGY----QTINYGL 397

Query: 140 GGHYDLHCDA---TPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYN 196
           GGH+ +H DA    P+  G  RL + +FY+TDV   G T+FP+LN      KGSA+ W N
Sbjct: 398 GGHFSVHMDAFTDGPKLNGN-RLVTILFYMTDVPDDGYTVFPNLNYVAHCRKGSALVWLN 456

Query: 197 AHANT-LLDYRMYHSGCPVALGNKW 220
              N   +    +H GCPV  GNKW
Sbjct: 457 LRLNNGSVHSGTFHGGCPVIKGNKW 481


>gi|195452738|ref|XP_002073478.1| GK14139 [Drosophila willistoni]
 gi|194169563|gb|EDW84464.1| GK14139 [Drosophila willistoni]
          Length = 215

 Score =  109 bits (273), Expect = 7e-22,   Method: Compositional matrix adjust.
 Identities = 76/218 (34%), Positives = 115/218 (52%), Gaps = 30/218 (13%)

Query: 9   GNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELS 68
           G   V +++K  L C Y + ++ FL+I P+K+E L LDP +V  HD I  SE      L 
Sbjct: 2   GKCQVSKELK--LYCLYNTKDSYFLRIAPVKMEVLSLDPYIVLYHDFILSSEQEF---LK 56

Query: 69  KGKVERGKVVNYGD----TIYVD-TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIG 123
              +ER  V    D      Y D +R +K  + Y         + +I  RI+++TNL   
Sbjct: 57  AESIERLSVAETVDPDTGKWYADASRTAKAMWFYDT---SSVVIRRINQRIEEITNL--- 110

Query: 124 REERYKGPL-QINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL 182
             +  KG L QI +YG+GG +  H D    +E           L DV  GGAT+F +++L
Sbjct: 111 --DPEKGDLYQIISYGIGGLFQTHYDYLHENE-----------LQDVPQGGATLFNNISL 157

Query: 183 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +VFP+ G+A+FWYN +     ++ + H+GCPV +G+KW
Sbjct: 158 SVFPKAGAALFWYNLNNAGDTEWNVAHTGCPVIVGSKW 195


>gi|344252711|gb|EGW08815.1| Prolyl 4-hydroxylase subunit alpha-2 [Cricetulus griseus]
          Length = 584

 Score =  109 bits (273), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 78/239 (32%), Positives = 113/239 (47%), Gaps = 29/239 (12%)

Query: 7   CQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
           C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D EI RI
Sbjct: 294 CRGEGVKLTPQRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSDEEIERI 353

Query: 65  IELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
            E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q +T L +
Sbjct: 354 KEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQHITGLTV 410

Query: 123 GREERYKGPLQ--INNYGLGGHY-------DLHCDATP----------RDEGLWRLASFM 163
              E  +   Q      G G          DL   + P          R   L+ L S M
Sbjct: 411 KTAELLQSDEQDAFKRLGTGNRVATFLNYGDLRTLSCPQGFVALLSLGRGAKLFALCSQM 470

Query: 164 FYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGK 222
              +DVE GGAT+FP L   ++P+KG+AVFWYN   +   DYR  H+ CPV +G KWGK
Sbjct: 471 ---SDVEAGGATVFPDLGAAIWPKKGTAVFWYNLLRSGEGDYRTRHAACPVLVGCKWGK 526


>gi|195145080|ref|XP_002013524.1| GL24183 [Drosophila persimilis]
 gi|194102467|gb|EDW24510.1| GL24183 [Drosophila persimilis]
          Length = 296

 Score =  109 bits (273), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 73/212 (34%), Positives = 102/212 (48%), Gaps = 20/212 (9%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
           NL C Y    + F ++ PLK+E    DP VV  HD +YD+E+  +I+ ++ ++ R  V  
Sbjct: 55  NLHCRYHKKGSAFSRLAPLKLEIFSHDPYVVIYHDVLYDAEMQGLIDSTRRRMSRSMVQY 114

Query: 80  YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
               I +  + +     + E   D   L +I  R++DMT   + R E     L I  Y  
Sbjct: 115 EIRQIEISEQRTSKEAPFTEK-NDPQLLKRIYDRLKDMTGCDMLRSEH----LSILLYDQ 169

Query: 140 GGHYDLHCDATPRDEGLW------------RLASFMFYLTDVELGGATIFPSLNLTVFPE 187
           GGH+D H D     +  W            R AS +FYL DVE GG T+FP L L + P 
Sbjct: 170 GGHHDPHVDY---HDLYWHPQEYEYHPFGDRQASVVFYLNDVEDGGETVFPKLQLVIPPT 226

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           KGSA+ W+N       D R  H+ CPV  G K
Sbjct: 227 KGSALMWHNLRPWGEGDPRTQHASCPVLSGYK 258


>gi|195574593|ref|XP_002105269.1| GD21390 [Drosophila simulans]
 gi|194201196|gb|EDX14772.1| GD21390 [Drosophila simulans]
          Length = 478

 Score =  109 bits (273), Expect = 8e-22,   Method: Compositional matrix adjust.
 Identities = 69/218 (31%), Positives = 106/218 (48%), Gaps = 31/218 (14%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           C+G   +P   KS L+C Y    + FL++ P+K+E+L  +P V   HDAI  +E   ++ 
Sbjct: 255 CRGKNLLPS--KSYLRCRYLRDGSPFLRLAPVKLEQLNFEPFVGLFHDAISPAEQEDLLH 312

Query: 67  LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
           L+  ++E  +  +      VDT  S           DH  + ++  RI+D+T   +   E
Sbjct: 313 LTDSRLEHTRKESSSVEAKVDTNAS-----------DH--VRRMHQRIEDITGFEMEESE 359

Query: 127 RYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFP 186
               PL + NYG+GG   +H D    +            L+DV++GG   FP L     P
Sbjct: 360 ----PLTVFNYGIGGQELIHLDCEQPE------------LSDVQMGGYASFPDLGFGFKP 403

Query: 187 EKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 224
            +GSA+ W+N   +   D R   + CPV LGN+WGK L
Sbjct: 404 RRGSALVWHNTDNSGNCDTRSLQATCPVLLGNQWGKSL 441


>gi|194906709|ref|XP_001981416.1| GG11627 [Drosophila erecta]
 gi|190656054|gb|EDV53286.1| GG11627 [Drosophila erecta]
          Length = 462

 Score =  109 bits (272), Expect = 9e-22,   Method: Compositional matrix adjust.
 Identities = 71/219 (32%), Positives = 108/219 (49%), Gaps = 33/219 (15%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           C+G    P   KS+L+C Y    + FL++  LK+E+L ++P V   HDAI  +E   ++ 
Sbjct: 239 CRGKNLPPS--KSSLRCRYFREGSPFLRLAALKLEQLNIEPFVGLFHDAILQAEQEDLLR 296

Query: 67  LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
           L++ ++E  K+ +      VDT  S           DH  + +I  RI+D+T   +   E
Sbjct: 297 LTESRLEHKKIESSRVEAKVDTNAS-----------DH--VRRIHQRIEDITGFDLEGSE 343

Query: 127 RYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVF 185
               PL ++N+G+GG   +H D   P+             L DV++GG   FP L     
Sbjct: 344 ----PLTVSNHGIGGQEAIHLDCGQPK-------------LNDVQMGGYASFPDLGFGFK 386

Query: 186 PEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 224
           P +GSA+ W+N       D R   + CPV LGN+WGK L
Sbjct: 387 PVRGSALVWHNTDNCGNCDIRGLQATCPVLLGNQWGKWL 425


>gi|607947|gb|AAA62207.1| prolyl 4-hydroxylase alpha subunit [Caenorhabditis elegans]
          Length = 558

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 74/210 (35%), Positives = 104/210 (49%), Gaps = 18/210 (8%)

Query: 21  LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN- 79
           L C+Y +   +FL   P+KVE    +P  V   D I D E+  I EL+K K+ R  V + 
Sbjct: 302 LYCYYLA-GPSFLVYAPIKVEIKRFNPLAVLFKDVISDDEVAAIQELAKPKLARATVHDS 360

Query: 80  -YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYG 138
             G  +    R+SK  +L  E  GD   +  +  RI  MTNL +   E     LQI NYG
Sbjct: 361 VTGKLVTATYRISKSAWL-KEWEGD--VVETVNKRIGYMTNLEMETAEE----LQIANYG 413

Query: 139 LGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGS 190
           +GGHYD H D   ++E           R+A+ +FY++    GG T+F     T+ P K  
Sbjct: 414 IGGHYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQPSHGGGTVFTEAKSTILPTKND 473

Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           A+FWYN +     +    H+ CPV +G KW
Sbjct: 474 ALFWYNLYKQGDGNPDTRHAACPVLVGIKW 503


>gi|328718391|ref|XP_003246474.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 2
           [Acyrthosiphon pisum]
          Length = 514

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 77/208 (37%), Positives = 109/208 (52%), Gaps = 16/208 (7%)

Query: 22  KCFYESYNNTFLKI-GPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY 80
           KC Y++ NN F +I  P K E++  +P +   HD +YD EI +I  L+  K++  KV + 
Sbjct: 294 KCRYQT-NNLFYRILMPFKEEDINSEPLIKIYHDVLYDDEILKIKTLALEKMKDAKVKSV 352

Query: 81  GDTIYV---DTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
               Y+    TR  +VY+++        +   + TRI+  T       ERY    QI NY
Sbjct: 353 DGKNYLLEEKTRSGQVYWIFE--VDAVEYFDALNTRIESFTGFSTKTAERY----QIVNY 406

Query: 138 GLGGHYDLHCDATPR-DEGLW---RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVF 193
           GLGGHY  H D+  +  E +    RL + +FYLTDV+  G T FP LN+    EKG+A+ 
Sbjct: 407 GLGGHYIPHHDSFAKGAENVKFGNRLVTVLFYLTDVQNDGYTSFPMLNIIAPAEKGAALV 466

Query: 194 WYNAH-ANTLLDYRMYHSGCPVALGNKW 220
           W N H +N    Y   H  CP+  GNKW
Sbjct: 467 WNNLHMSNGQKFYETLHGSCPLLKGNKW 494


>gi|21358309|ref|NP_651801.1| prolyl-4-hydroxylase-alpha SG2 [Drosophila melanogaster]
 gi|20269808|gb|AAM18059.1|AF495537_1 prolyl 4-hydroxylase alpha-related protein PH4[alpha]SG2
           [Drosophila melanogaster]
 gi|10726875|gb|AAG22175.1| prolyl-4-hydroxylase-alpha SG2 [Drosophila melanogaster]
          Length = 527

 Score =  109 bits (272), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 71/223 (31%), Positives = 115/223 (51%), Gaps = 14/223 (6%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   CQG     E     L+C+ +   + +  + PL+VE ++LDP +   H  +   +I 
Sbjct: 281 YTRLCQGRRLPEERSGDPLRCYLDGKRHAYFTLAPLQVEPVHLDPDINVYHGMLSSKQIL 340

Query: 63  RIIELS-KGKVERGKVVNYGDTIYV-DTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            I E + K ++ R  V   G    V D R+S+  +L  +     P +  +   IQ ++  
Sbjct: 341 SIFEEADKEEMVRSAVAGSGGEGTVRDLRVSQQTWLDYK----SPVMNSVGRIIQFVSGF 396

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGATI 176
            +   E     +Q+ NYG+GG Y+ H D      P++    R+++ MFYL+DVE GG T+
Sbjct: 397 DMAGAEH----MQVANYGVGGQYEPHPDYFEVNLPKNFEGDRISTSMFYLSDVEQGGYTV 452

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           F  LN+ + P KG+ V W+N H +  +D R  H+GCPV +G+K
Sbjct: 453 FTKLNVFLPPVKGALVMWHNLHRSLHVDARTLHAGCPVIVGSK 495


>gi|321458081|gb|EFX69155.1| hypothetical protein DAPPUDRAFT_228756 [Daphnia pulex]
          Length = 570

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 76/226 (33%), Positives = 106/226 (46%), Gaps = 22/226 (9%)

Query: 12  SVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGK 71
           S P  +K  LKC   S+ + +  + PLK+EE  L P +   HD + D+E    I  S   
Sbjct: 312 SRPTGLKGRLKCRQISHTHPYFILRPLKLEEHSLVPYIAVFHDFMSDAETE--IFKSLAM 369

Query: 72  VERGKVVNYGDT------IYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
            ER +   +G        +  D R SK  ++     G H  + +I  RI D   L     
Sbjct: 370 AERLERSAHGSKRPGQGGVTSDKRTSKQSWVED---GSHHVVDQISKRISDSVGLNSQPS 426

Query: 126 ERYKGPLQINNYGLGGHYDLHCD--------ATPRDEGLWR---LASFMFYLTDVELGGA 174
                  Q+ NYG+GG Y  H D          P +  L+R   + +FM YL DVE GGA
Sbjct: 427 NVGSEHYQVANYGIGGRYTPHTDHGVLSKSMGGPSEFDLFRGDRILTFMTYLDDVEAGGA 486

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T+F    + V P+KG AVFW+N  +++  D    H GCPV  G+KW
Sbjct: 487 TVFTHAGVVVRPKKGMAVFWWNLKSDSNGDTLTRHGGCPVLHGSKW 532


>gi|410910256|ref|XP_003968606.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Takifugu
           rubripes]
          Length = 540

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 76/229 (33%), Positives = 116/229 (50%), Gaps = 15/229 (6%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTF--LKIGPLKVEELYLDPRVVKIHDAIYD 58
           + Y   CQ   S P   + N + F +++ N    L + P++ E L L P VV  HD I D
Sbjct: 295 DTYERLCQTRGSQPVHFE-NPQLFCDNFANGHPGLLLRPVRREVLSLRPYVVLYHDFISD 353

Query: 59  SEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           SE   I + ++  + R  V         + R+SK  +L       H  + ++  +I  +T
Sbjct: 354 SESEEIKQHAQLGLRRSVVATGDKQATAEYRISKSAWLKGSA---HSTVSRLDQKISMLT 410

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDVEL 171
            L +  +  +   LQ+ NYG+GGHY+ H D AT     ++      R+A+FM YL+ VE 
Sbjct: 411 GLNV--QHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKLKTGNRVATFMIYLSSVEA 468

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GG+T F   N +V   K +A+FW+N H N   D    H+GCPV +G+KW
Sbjct: 469 GGSTAFIYANFSVPVMKNAAIFWWNLHRNGEGDADTLHAGCPVLIGDKW 517


>gi|25012370|gb|AAN71294.1| RE09701p [Drosophila melanogaster]
          Length = 301

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 71/223 (31%), Positives = 115/223 (51%), Gaps = 14/223 (6%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   CQG     E     L+C+ +   + +  + PL+VE ++LDP +   H  +   +I 
Sbjct: 55  YTRLCQGRRLPEERSGDPLRCYLDGKRHAYFTLAPLQVELVHLDPDINVYHGMLSSKQIL 114

Query: 63  RIIELS-KGKVERGKVVNYGDTIYV-DTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            I E + K ++ R  V   G    V D R+S+  +L  +     P +  +   IQ ++  
Sbjct: 115 SIFEEADKEEMVRSAVAGSGGEGTVRDLRVSQQTWLDYK----SPVMNSVGRIIQFVSGF 170

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGATI 176
            +   E     +Q+ NYG+GG Y+ H D      P++    R+++ MFYL+DVE GG T+
Sbjct: 171 DMAGAEH----MQVANYGVGGQYEPHPDYFEVNLPKNFEGDRISTSMFYLSDVEQGGYTV 226

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           F  LN+ + P KG+ V W+N H +  +D R  H+GCPV +G+K
Sbjct: 227 FTKLNVFLPPVKGALVMWHNLHRSLHVDARTLHAGCPVIVGSK 269


>gi|67084101|gb|AAY66985.1| truncated prolyl 4-hydroxylase alpha subunit [Ixodes scapularis]
          Length = 452

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 71/234 (30%), Positives = 115/234 (49%), Gaps = 21/234 (8%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G       + S L+C Y    + F  + P+K+E++ L P ++ + D + + +I 
Sbjct: 195 YRRLCRGEALRTPQMDSKLRCRYYKGQDGFFTLHPIKLEKINLKPYIIVMRDVVQERDIE 254

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTR-LSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
            ++  ++ +++R      GD     TR  S   +L+ +   + P   ++   ++ +  L 
Sbjct: 255 NLMAFAEPRLQRSTTYT-GDGNAPSTRQTSSNAWLWDD---EAPIANRMNWYLRALVGLG 310

Query: 122 IGREERYKGPLQINNYGLGG----HYD-----LHCDATPRD------EGLWRLASFMFYL 166
               E      Q+ NYG GG    HYD     LH   +  D      EG  RLA+ M Y+
Sbjct: 311 TSGSEYEAEAYQLANYGSGGYFLPHYDYLQDTLHAHNSTADYYLQNNEGD-RLATLMIYM 369

Query: 167 TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           TDV+ GGAT+FP L + + P+KG A FW+N  A+   D    H+GCPV  G+KW
Sbjct: 370 TDVKEGGATVFPRLGVRLVPKKGDAAFWWNLKASGEGDTLTMHAGCPVLYGSKW 423


>gi|328718393|ref|XP_001945742.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like isoform 1
           [Acyrthosiphon pisum]
          Length = 511

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 77/208 (37%), Positives = 109/208 (52%), Gaps = 16/208 (7%)

Query: 22  KCFYESYNNTFLKI-GPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY 80
           KC Y++ NN F +I  P K E++  +P +   HD +YD EI +I  L+  K++  KV + 
Sbjct: 291 KCRYQT-NNLFYRILMPFKEEDINSEPLIKIYHDVLYDDEILKIKTLALEKMKDAKVKSV 349

Query: 81  GDTIYV---DTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
               Y+    TR  +VY+++        +   + TRI+  T       ERY    QI NY
Sbjct: 350 DGKNYLLEEKTRSGQVYWIFE--VDAVEYFDALNTRIESFTGFSTKTAERY----QIVNY 403

Query: 138 GLGGHYDLHCDATPR-DEGLW---RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVF 193
           GLGGHY  H D+  +  E +    RL + +FYLTDV+  G T FP LN+    EKG+A+ 
Sbjct: 404 GLGGHYIPHHDSFAKGAENVKFGNRLVTVLFYLTDVQNDGYTSFPMLNIIAPAEKGAALV 463

Query: 194 WYNAH-ANTLLDYRMYHSGCPVALGNKW 220
           W N H +N    Y   H  CP+  GNKW
Sbjct: 464 WNNLHMSNGQKFYETLHGSCPLLKGNKW 491


>gi|198452400|ref|XP_002137470.1| GA26529 [Drosophila pseudoobscura pseudoobscura]
 gi|198131917|gb|EDY68028.1| GA26529 [Drosophila pseudoobscura pseudoobscura]
          Length = 348

 Score =  108 bits (271), Expect = 1e-21,   Method: Compositional matrix adjust.
 Identities = 75/222 (33%), Positives = 106/222 (47%), Gaps = 18/222 (8%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           C+G         S L C Y +  + F ++ PLK+E    DP VV  HD +YD+E+  +I+
Sbjct: 111 CRGAFPTKSHHHS-LHCRYHNKGSAFSRLAPLKLEIFSHDPYVVIYHDVLYDAEMQGLID 169

Query: 67  LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
            ++ ++ R  V      I +  + +     + E   D   L +I  R++DMT   + R E
Sbjct: 170 STRRRMSRSMVQYEIRQIEISEQRTSKEAPFTEK-NDPQLLKRIYDRLKDMTGCDMLRSE 228

Query: 127 RYKGPLQINNYGLGGHYDLHCDATPRDEGLW---------RLASFMFYLTDVELGGATIF 177
                L I  Y  GGH+D H D     +  W         R AS +FYL DVE GG T+F
Sbjct: 229 H----LSILLYDQGGHHDPHVDY---HDLYWEYEYHPFGDRQASVVFYLNDVEDGGETVF 281

Query: 178 PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           P L L + P KGSA+ W+N       D R  H+ CPV  G K
Sbjct: 282 PKLQLVIPPTKGSALMWHNLRPWGEGDPRTQHASCPVLSGYK 323


>gi|348505573|ref|XP_003440335.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Oreochromis
           niloticus]
          Length = 517

 Score =  108 bits (269), Expect = 2e-21,   Method: Compositional matrix adjust.
 Identities = 73/208 (35%), Positives = 105/208 (50%), Gaps = 12/208 (5%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
            L C Y + NN  L + P + E + L P VV  HD + D+E   I  L+   + R  V  
Sbjct: 292 QLFCDYFTNNNPALMLMPARRELVSLQPYVVLYHDFVTDTEAEDIKSLAHPGLRRSVVAA 351

Query: 80  YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
                  D R+SK  +L          + K+  RI  +T L +  +  Y   LQ+ NYG+
Sbjct: 352 GEKQATADYRISKSAWLKGSA---QSIVGKLDQRISLLTGLNV--KHPYGEYLQVVNYGI 406

Query: 140 GGHYDLHCD-ATPRDEGLW------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
           GGHY+ H D AT     ++      R+A+FM YL+ VE GG+T F   N +V   + +A+
Sbjct: 407 GGHYEPHFDHATSPSSPVFKLKTGNRVATFMIYLSPVEAGGSTAFIYANFSVPVVEKAAI 466

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FW+N H N   D    H+GCPV +G+KW
Sbjct: 467 FWWNLHRNGEGDDDTLHAGCPVLIGDKW 494


>gi|449485593|ref|XP_004175686.1| PREDICTED: LOW QUALITY PROTEIN: prolyl 4-hydroxylase subunit
           alpha-3 [Taeniopygia guttata]
          Length = 567

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 72/208 (34%), Positives = 106/208 (50%), Gaps = 12/208 (5%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
           +L C YE+ N+ FL + P K E +++ P V   HD I D+E   I  L+   ++R  V +
Sbjct: 342 HLSCSYETNNSPFLLLQPAKKEMVWIQPHVALYHDFITDAEAETIKGLAGPWLQRSVVAS 401

Query: 80  YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
                  +  +SK  +L   +    P ++ +  RI  +T L +     Y   LQ+ NYGL
Sbjct: 402 GEKQQKAEYWISKSTWLKDTV---DPVVHALDQRIIAVTGLDLWPP--YAEYLQVVNYGL 456

Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
           GGHY+ H D AT     L+R+      A+ M YL+ VE GG+T     N +V   K +A+
Sbjct: 457 GGHYEPHFDHATSTKSPLYRMKSGNRNATVMIYLSAVEAGGSTALIYTNFSVPVVKNAAL 516

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FW+N   N   D    H+GCPV  G+KW
Sbjct: 517 FWWNLRRNGNGDGDTLHAGCPVLAGDKW 544


>gi|195575095|ref|XP_002105515.1| GD21523 [Drosophila simulans]
 gi|194201442|gb|EDX15018.1| GD21523 [Drosophila simulans]
          Length = 527

 Score =  107 bits (268), Expect = 3e-21,   Method: Compositional matrix adjust.
 Identities = 71/223 (31%), Positives = 114/223 (51%), Gaps = 14/223 (6%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   CQG     E     L C+ +   + +  + PL+VE ++LDP +   H  +   +I 
Sbjct: 281 YTRLCQGRRLPEERSGDPLSCYLDGKRHAYFTLAPLQVEPVHLDPDINVYHGMLSSKQIL 340

Query: 63  RIIELS-KGKVERGKVVNYGDTIYV-DTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            I E + K ++ R  V   G    V D R+S+  +L  +     P +  +   IQ ++  
Sbjct: 341 SIFEEADKEEMVRSAVAGDGGKRTVRDLRVSQQTWLDYK----SPVMNSVSRIIQFVSGF 396

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGATI 176
            +   E     +Q+ NYG+GG Y+ H D      P++    R+++ MFYL+DVE GG T+
Sbjct: 397 DMAGAEY----MQVANYGVGGQYEPHPDYFEVNLPKNFEGDRISTSMFYLSDVEQGGYTV 452

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           F  LN+ + P KG+ V W+N H +  +D R  H+GCPV +G+K
Sbjct: 453 FTKLNVFLPPVKGALVMWHNLHRSLDVDARTLHAGCPVIVGSK 495


>gi|195441323|ref|XP_002068462.1| GK20483 [Drosophila willistoni]
 gi|194164547|gb|EDW79448.1| GK20483 [Drosophila willistoni]
          Length = 550

 Score =  107 bits (267), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 73/218 (33%), Positives = 112/218 (51%), Gaps = 28/218 (12%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           +NL C Y    + FL++ P+K+EE+ LDP +V+ HD + D+EI  +    K +  +G ++
Sbjct: 320 TNLVCRYNFTTSPFLQLAPMKLEEISLDPYIVQYHDVLSDNEIEDL----KREGIKGTMI 375

Query: 79  NYGDTIYVD--------TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKG 130
           N   ++           T +++V  + P +      + +I  RI DMT   I   +    
Sbjct: 376 NGWTSLKSSNATENESRTIVARVAIMSPSL----EIVQRINRRIIDMTGFNIEESK---- 427

Query: 131 PLQINNYGLGG----HYDLHCDATPRDEGLW----RLASFMFYLTDVELGGATIFPSLNL 182
            +Q+  + +GG    HYD   D     + L     R+AS +FY  DV  GGAT FP   L
Sbjct: 428 TIQLAAFSVGGFFMPHYDYLYDRLLDTDVLKKLGDRVASVIFYAGDVTEGGATNFPRNQL 487

Query: 183 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            V P+KGSA+FWYN   +   D R  HS CPV +G++W
Sbjct: 488 VVQPKKGSALFWYNKFDDGSPDPRSLHSICPVVVGSRW 525


>gi|26352077|dbj|BAC39675.1| unnamed protein product [Mus musculus]
          Length = 383

 Score =  107 bits (267), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 72/221 (32%), Positives = 111/221 (50%), Gaps = 20/221 (9%)

Query: 1   EIYPLACQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           + Y   CQ   S P   +  +L C YE+ ++ +L + P + E ++L P +   HD + D 
Sbjct: 159 DTYEGLCQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPARKEVVHLRPLIALYHDFVSDE 218

Query: 60  EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
           E  +I EL++  ++R  V +    + V+ R+SK  +L   +    P L  +  RI  +T 
Sbjct: 219 EAQKIRELAEPWLQRSVVASGEKQLQVEYRISKSAWLKDTV---DPMLVTLDHRIAALTG 275

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPS 179
           L I  +  Y   LQ+ NYG+GGHY+ H D                 L+ VE GGAT F  
Sbjct: 276 LDI--QPPYAEYLQVVNYGIGGHYEPHFDHAT--------------LSSVEAGGATAFIY 319

Query: 180 LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            N +V   K +A+FW+N H +   D    H+GCPV +G+KW
Sbjct: 320 GNFSVPVVKNAALFWWNLHRSGEGDGDTLHAGCPVLVGDKW 360


>gi|195505197|ref|XP_002099400.1| GE10884 [Drosophila yakuba]
 gi|194185501|gb|EDW99112.1| GE10884 [Drosophila yakuba]
          Length = 527

 Score =  107 bits (267), Expect = 4e-21,   Method: Compositional matrix adjust.
 Identities = 70/223 (31%), Positives = 110/223 (49%), Gaps = 14/223 (6%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   CQG     E     LKC+ +   + +  + PL+VE ++LDP +   H  +    I 
Sbjct: 281 YTRLCQGRRLPEERSGDPLKCYLDGKRHAYFILAPLQVEPVHLDPDINVYHGMLSSKHIQ 340

Query: 63  RIIELS-KGKVERGKVVNYGDTIYV-DTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            I E + K ++ R  V   G    V D R+S+  +L         +   +   +  +   
Sbjct: 341 SIFEEADKKEMVRSAVAGDGGARTVKDLRVSQQTWL--------DYKSPVMKSVGRIIEF 392

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLWRLASFMFYLTDVELGGATI 176
           V G +      +Q+ NYG+GG Y+ H D      P +    R+++ MFYL+DVE GG T+
Sbjct: 393 VSGFDMAGAEFMQVANYGVGGQYEPHPDYFEVNLPEEFIGDRISTSMFYLSDVEQGGYTV 452

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           F  LN+ + P KG+ V W+N H +  +D R  H+GCPV +G+K
Sbjct: 453 FTKLNVFLPPVKGALVMWHNLHRSLDVDARTLHAGCPVIVGSK 495


>gi|24666354|ref|NP_730347.1| CG32199 [Drosophila melanogaster]
 gi|23093193|gb|AAF49251.3| CG32199 [Drosophila melanogaster]
          Length = 509

 Score =  107 bits (266), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 73/225 (32%), Positives = 106/225 (47%), Gaps = 17/225 (7%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN--RI 64
           C+G    P+     L C Y    + FLK+ PLK+E L + P ++  HD +Y++E    R 
Sbjct: 272 CRGEW--PKKSSPELICRYNRDTSAFLKLAPLKLEFLSVQPMILLYHDVLYENEFKSMRD 329

Query: 65  IELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGR 124
           I +  G +  G      D      +  +V  +        PF   I  R+ DM+    G 
Sbjct: 330 IAMYNGSMIDGWTYVDFDKKGNPKQQDRVVKMIAFQGTTAPFTLSINRRMADMS----GL 385

Query: 125 EERYKGPLQINNYGLGGHYDLHCDATP---------RDEGLWRLASFMFYLTDVELGGAT 175
           E R    L + NYGLGGH+  H D             D G  R+A+ + Y +D+ LGG T
Sbjct: 386 EMRDNMVLYLTNYGLGGHFGKHVDYVELAKRPPDFFADFGGDRIATALIYASDIPLGGTT 445

Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +F  L + V P+KGSA+ W+N +     D    HS CPV LG++W
Sbjct: 446 VFTKLKIAVQPKKGSALIWFNLNHAGEPDPLTEHSVCPVVLGSRW 490


>gi|195352184|ref|XP_002042594.1| GM14981 [Drosophila sechellia]
 gi|194124478|gb|EDW46521.1| GM14981 [Drosophila sechellia]
          Length = 539

 Score =  107 bits (266), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 75/235 (31%), Positives = 110/235 (46%), Gaps = 37/235 (15%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           C+G  S        L C Y    + FLK+ PLK+E L + P ++  HD +Y++E   + +
Sbjct: 302 CRGEWSRKSS--PELICRYNRDTSAFLKLAPLKLEFLSVQPMILLYHDVLYENEFKSMRD 359

Query: 67  LSKGKVERGKVVNYGDTI-----YVD-------TRLSKVYFLYPEIFGDHPFLYKIQTRI 114
           L+           Y D++     YVD        +  +V  +        PF   I  R+
Sbjct: 360 LAM----------YNDSMIDGWTYVDFDKKGNPKQQDRVVKIISFQGTTAPFTLSINRRL 409

Query: 115 QDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATP---------RDEGLWRLASFMFY 165
            DM+    G E R    L + NYGLGGH+  H D             D G  R+A+ +FY
Sbjct: 410 ADMS----GLEMRENMVLYLTNYGLGGHFGKHVDYVELAKRPPDFFADFGGDRIATALFY 465

Query: 166 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            +DV LGG T+F  L + V P+KG+A+ W+N +     D    HS CPV LG++W
Sbjct: 466 ASDVPLGGTTVFTKLKIAVKPKKGNALIWFNLNHAGEPDPLTEHSVCPVVLGSRW 520


>gi|195575139|ref|XP_002105537.1| GD21537 [Drosophila simulans]
 gi|194201464|gb|EDX15040.1| GD21537 [Drosophila simulans]
          Length = 536

 Score =  107 bits (266), Expect = 5e-21,   Method: Compositional matrix adjust.
 Identities = 77/213 (36%), Positives = 107/213 (50%), Gaps = 10/213 (4%)

Query: 15  EDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVER 74
           E   S L C Y +    FL++ P ++EEL LDP VV  H+ + D EI ++  +S+  +ER
Sbjct: 291 ESKPSRLHCRYNTTTTPFLRLAPFRMEELSLDPYVVFYHNVLSDPEIEKLKPMSEPFLER 350

Query: 75  GKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPL 132
            KV  V  G      TR +   +L P    D P   ++  RI      + G   R    +
Sbjct: 351 AKVFRVEKGSDEIAPTRSADGAWL-PHQDTD-PDDLEVLRRIGRRIRDITGLNTRSGSQM 408

Query: 133 QINNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVELGGATIFPSLNLTVFPE 187
           Q   YG GGH+  H D     T   E +  R+A+ +FYL +V+ GGAT FP LNL V  +
Sbjct: 409 QFLKYGFGGHFVPHYDYFNSKTSYLERVGDRMATVLFYLNNVDHGGATAFPKLNLVVPTQ 468

Query: 188 KGSAVFWYNAHANTL-LDYRMYHSGCPVALGNK 219
           KGSA+FW+N    +   D R  H  CP+  G K
Sbjct: 469 KGSALFWHNLDRKSYDYDTRTSHGACPLISGTK 501


>gi|328718395|ref|XP_003246475.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Acyrthosiphon
           pisum]
          Length = 518

 Score =  106 bits (265), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 75/208 (36%), Positives = 107/208 (51%), Gaps = 16/208 (7%)

Query: 22  KCFYESYNNTFLKI-GPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY 80
           KC Y++ NN F +I  P K E++  +P +   HD +YD EI +I  L+   ++   V + 
Sbjct: 294 KCRYQT-NNLFYRILMPFKEEDINSEPLIKIYHDVLYDDEILKIKTLALENMKDATVKSV 352

Query: 81  ---GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
              GD++   TR  +VY++         +L  + TRI+  T       E+Y    QI NY
Sbjct: 353 DGKGDSLIEKTRSGQVYWISK--VDAVEYLDALDTRIESFTGFSTKTAEQY----QIVNY 406

Query: 138 GLGGHYDLHCDATPRDEGLW----RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVF 193
           GLGGHY  H D+  +         RL + +FYLTDV+  G T FP LN+    EKG+A+ 
Sbjct: 407 GLGGHYLPHHDSFAKAINCLQFGNRLVTVLFYLTDVQNDGYTSFPLLNIIAPAEKGAALV 466

Query: 194 WYNAH-ANTLLDYRMYHSGCPVALGNKW 220
           W N H +N    Y   H  CP+  GNKW
Sbjct: 467 WNNLHMSNGQKFYESLHGSCPLLKGNKW 494


>gi|195591304|ref|XP_002085382.1| GD14758 [Drosophila simulans]
 gi|194197391|gb|EDX10967.1| GD14758 [Drosophila simulans]
          Length = 509

 Score =  106 bits (265), Expect = 6e-21,   Method: Compositional matrix adjust.
 Identities = 74/235 (31%), Positives = 109/235 (46%), Gaps = 37/235 (15%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           C+G    P      L C Y    + FLK+ PLK+E L + P ++  HD +Y++E   + +
Sbjct: 272 CRGEW--PRKSSPELICRYNRDTSAFLKLAPLKLEFLSVQPMILLYHDVLYENEFKSMRD 329

Query: 67  LSKGKVERGKVVNYGDTI-----YVD-------TRLSKVYFLYPEIFGDHPFLYKIQTRI 114
                     +  Y D++     YVD        +  +V  +        PF   I  R+
Sbjct: 330 ----------IAMYNDSMIDGWTYVDFDKKGNPKQQDRVVKIISFQGTTAPFTLSINRRL 379

Query: 115 QDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATP---------RDEGLWRLASFMFY 165
            DM+    G E R    L + NYGLGGH+  H D             D G  R+A+ +FY
Sbjct: 380 ADMS----GLEMRENMVLYLTNYGLGGHFGKHVDYVELAKRPPDFFADFGGDRIATAVFY 435

Query: 166 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            +DV LGG T+F  L + V P+KG+A+ W+N +     D    HS CPV LG++W
Sbjct: 436 ASDVPLGGTTVFTKLKIAVQPKKGNALIWFNLNHAGEPDPLTEHSVCPVVLGSRW 490


>gi|198429625|ref|XP_002128613.1| PREDICTED: similar to procollagen-proline, 2-oxoglutarate
           4-dioxygenase (proline 4-hydroxylase), alpha 1
           polypeptide [Ciona intestinalis]
          Length = 195

 Score =  106 bits (264), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 66/174 (37%), Positives = 94/174 (54%), Gaps = 16/174 (9%)

Query: 56  IYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR 113
           + D E+  I  L+K ++ R  V N   G   +   R+SK  +L  E   DHP + ++  R
Sbjct: 1   MSDKEMAMIKSLAKPRLRRATVQNPVTGVLEFAHYRVSKSAWLKDE---DHPVIKRVCQR 57

Query: 114 IQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPR-------DEGLWRLASFMFYL 166
           I D+T L +   E     LQI NYG+GG Y+ H D + +       DE   R+A+F+ Y+
Sbjct: 58  ISDVTGLSMETAEE----LQIANYGVGGQYEPHFDYSRKSDFGKFDDEVGNRIATFLTYM 113

Query: 167 TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           ++VE GG+T+F    + V P KGSAVFWYN   +   D R  H+ CPV  G KW
Sbjct: 114 SNVEQGGSTVFLHPGIAVRPIKGSAVFWYNLLPSGAGDERTRHAACPVLTGVKW 167


>gi|195159299|ref|XP_002020519.1| GL13471 [Drosophila persimilis]
 gi|194117288|gb|EDW39331.1| GL13471 [Drosophila persimilis]
          Length = 238

 Score =  106 bits (264), Expect = 8e-21,   Method: Compositional matrix adjust.
 Identities = 68/228 (29%), Positives = 113/228 (49%), Gaps = 19/228 (8%)

Query: 6   ACQGNLSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
            C G    P +   +L CFY +   + FL +  ++ E L  DP +   +D +  S++  +
Sbjct: 19  CCNGLCKGPRN--RHLHCFYLTKRGSPFLLLARVRTEILSDDPFIALYYDVLTHSDMVSL 76

Query: 65  IELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR----IQDMTNL 120
              S+  +     + Y +     +     +F++ E     P + +   R    + D+T L
Sbjct: 77  RNTSEPLLHPATTIQYFNAPQELSNSRTAHFVWLE-----PTITEATRRADRVLWDVTGL 131

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW---RLASFMFYLTDVELGGATIF 177
            +   E +    Q+NNYG+GG +  H D    +       R+A+ +FYL+DV  GGAT+F
Sbjct: 132 NLSNSEMF----QVNNYGIGGSFMRHSDLLHSERNYLVRERIATAIFYLSDVPQGGATLF 187

Query: 178 PSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLLL 225
             LN+TVFP+ G+ +FWYN   +   D R  H+GCPV +G+KW +  L
Sbjct: 188 TELNVTVFPQAGTVLFWYNLAHSGDHDMRTRHTGCPVIVGSKWSRFSL 235


>gi|241999340|ref|XP_002434313.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215496072|gb|EEC05713.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 267

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 77/227 (33%), Positives = 106/227 (46%), Gaps = 31/227 (13%)

Query: 18  KSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGK 76
           +S L C   +   + FL + P K+E L  DPR+V   D +   E      +S+ K+ R K
Sbjct: 28  QSKLLCKISTIGGHPFLVLQPFKIEVLSEDPRIVVFPDFLNPRECEIFRSISQEKLSRAK 87

Query: 77  VVNYG--DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
           V   G  +  +   R +KV ++  ++   HP L K+  RI   T L +   E Y    Q+
Sbjct: 88  VYLGGPPEGGFSLRRTNKVAWMSDDL---HPLLGKVSRRIALATGLTLTSAEMY----QV 140

Query: 135 NNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVELGGATIFPSLNLTVFP 186
            NYGLGGHY  H D     E           RLA+ + YL DV  GGAT F ++ L V P
Sbjct: 141 ANYGLGGHYIPHPDYAGFGEAQGDIYKSSGNRLATMLIYLADVAGGGATAFINMRLAVKP 200

Query: 187 EKGSAVFWYNA-------------HANTLLDYRMYHSGCPVALGNKW 220
             G+A+FWYN              +     D R +H GCPV  G+KW
Sbjct: 201 TLGTALFWYNLKPYDGPIVNESFWNQRRFGDPRTFHMGCPVLTGSKW 247


>gi|20177086|gb|AAM12247.1| AT28279p [Drosophila melanogaster]
          Length = 509

 Score =  105 bits (263), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 72/225 (32%), Positives = 106/225 (47%), Gaps = 17/225 (7%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN--RI 64
           C+G    P+     L C Y    + FLK+ PLK+E L + P ++  HD +Y++E    R 
Sbjct: 272 CRGEW--PKKSSPELICRYNRDTSAFLKLAPLKLEFLSVQPMILLYHDVLYENEFKSMRD 329

Query: 65  IELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGR 124
           I +  G +  G      D      +  +V  +        PF   I  R+ DM+    G 
Sbjct: 330 IAMYNGSMIDGWTYVDFDKKGNPKQQDRVVKMIAFQGTTAPFTLSINRRMADMS----GL 385

Query: 125 EERYKGPLQINNYGLGGHYDLHCDATP---------RDEGLWRLASFMFYLTDVELGGAT 175
           E R    L + NYGLGGH+  H D             D G  R+A+ + Y +D+ LGG T
Sbjct: 386 EMRDNMVLYLTNYGLGGHFGKHVDYVELAKRPPDFFADFGGDRIATALIYASDIPLGGTT 445

Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +F  L + V P+KG+A+ W+N +     D    HS CPV LG++W
Sbjct: 446 VFTKLKIAVQPKKGNALIWFNLNHAGEPDPLTEHSVCPVVLGSRW 490


>gi|390176894|ref|XP_002136933.2| GA26862 [Drosophila pseudoobscura pseudoobscura]
 gi|388858830|gb|EDY67491.2| GA26862 [Drosophila pseudoobscura pseudoobscura]
          Length = 520

 Score =  105 bits (262), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 67/222 (30%), Positives = 112/222 (50%), Gaps = 19/222 (8%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           C G    P +   +L CFY +   + FL +  ++ E L  DP +   +D +  S++  + 
Sbjct: 282 CNGLCKGPRN--RHLHCFYLTKRGSPFLLLARVRTEILSDDPFIALYYDVLTHSDMVSLR 339

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR----IQDMTNLV 121
             S+  +     + Y +     +     +F++ E     P + +   R    + D+T L 
Sbjct: 340 NTSEPLLHPATTIQYLNAPQELSNSRTAHFVWLE-----PTITEATRRADRVLWDVTGLN 394

Query: 122 IGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW---RLASFMFYLTDVELGGATIFP 178
           +   E++    Q+NNYG+GG +  H D    +       R+A+ +FYL+DV  GGAT+F 
Sbjct: 395 LSNSEKF----QVNNYGIGGSFMRHSDPLHSERNYLVRERIATAIFYLSDVPQGGATLFT 450

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            LN+TVFP+ G+ +FWYN   +   D R  H+GCPV +G+KW
Sbjct: 451 ELNVTVFPQAGTVLFWYNLAHSGDHDMRTRHTGCPVIVGSKW 492


>gi|405967005|gb|EKC32220.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
          Length = 303

 Score =  105 bits (262), Expect = 1e-20,   Method: Compositional matrix adjust.
 Identities = 69/225 (30%), Positives = 106/225 (47%), Gaps = 29/225 (12%)

Query: 17  IKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGK 76
           ++S L+C+      T + I   K E +   PR+   HD I + +I ++ +    K+   +
Sbjct: 52  VESKLRCYLR---KTAIPIYMAKEEVVNYTPRISLFHDVISNDDIRQLKKAGTKKLTHSR 108

Query: 77  VVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKG--PLQI 134
               G       R+S+  ++Y +         ++  RI ++ NL      +     P Q+
Sbjct: 109 T---GGGYVTRLRVSQTGWVYDQAIPQ--VSRRLARRIANIVNLDTTFRSKASPVEPWQV 163

Query: 135 NNYGLGGHYDLHCDATPRDEGLW-------------------RLASFMFYLTDVELGGAT 175
            +Y  GG+Y  H D    DE LW                   R+A++MFYL+DVE GGAT
Sbjct: 164 LSYTTGGYYGEHIDPDIGDEFLWNMTEAVQGPRALWRKHTGQRIATWMFYLSDVEAGGAT 223

Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +FP L   V   KG+A FWYN   +  +D R  H+GCPV LG+KW
Sbjct: 224 VFPKLEARVPVVKGAAAFWYNLTPSGKIDRRTQHAGCPVILGSKW 268


>gi|442762205|gb|JAA73261.1| Putative prolyl 4-hydroxylase alpha subunit, partial [Ixodes
           ricinus]
          Length = 482

 Score =  105 bits (262), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 68/236 (28%), Positives = 112/236 (47%), Gaps = 25/236 (10%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G L     + S L+C Y   +     + P+K+EE+ L P +V +HD + D +I 
Sbjct: 225 YKRLCRGELLRTPKMDSKLRCRYYKGHGGSFTLHPIKLEEVNLKPYIVVMHDVVQDRDIE 284

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM----T 118
            +   ++ +++     +         R S   ++  +   + P   K+   ++ +    T
Sbjct: 285 DLRAFAEPRLQTSLTYDVPGVESPAVRTSSNAWMDEK---NAPVATKLNKFLRSLLGMGT 341

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLH---------CDATPRDEGLW-----RLASFMF 164
           +   G  E+Y    Q+ NYG GGH+  H          D  P +         R+A+ M 
Sbjct: 342 SYSDGEAEKY----QLANYGTGGHFLTHPDYLGDLFENDTDPSEFEFHKKVGDRVATLMI 397

Query: 165 YLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           Y++DVE GGAT+FP L + + P+KG A FW+N  AN   +    H+GCPV  G+KW
Sbjct: 398 YMSDVEEGGATVFPYLGVRLTPQKGDAAFWWNLKANGEGEVLTTHAGCPVLYGSKW 453


>gi|195452772|ref|XP_002073493.1| GK14149 [Drosophila willistoni]
 gi|194169578|gb|EDW84479.1| GK14149 [Drosophila willistoni]
          Length = 496

 Score =  105 bits (261), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 68/213 (31%), Positives = 103/213 (48%), Gaps = 14/213 (6%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           + L C Y +    FL++ P ++EEL LDP +V  ++ + D EI ++  L+   +++   +
Sbjct: 264 TKLHCRYNTTTTPFLRLAPFRMEELSLDPYIVAYYNVLSDQEITQLDRLTATLLKKTFAI 323

Query: 79  NYGDTIYVDTRLSKVYFL----YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
              D    + R +   +      P    +   + +I   + D+T L   + + +    Q 
Sbjct: 324 GPDDDYDDNARTADGAWFPNNETPRTEENIQLIERIINLVSDLTGLQGDKADSF----QA 379

Query: 135 NNYGLGGHYDLHCD--ATPRDEGLW---RLASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
             YG GGHY  H D      D+  +   RLA+  FYL  V+ GGAT+FPSLNL V  EKG
Sbjct: 380 VRYGFGGHYTPHFDYLNMSIDQTAFYGDRLATVFFYLNTVKHGGATVFPSLNLKVPAEKG 439

Query: 190 SAVFWYNAHANTL-LDYRMYHSGCPVALGNKWG 221
             +FWYN    +   D    H GCPV  G K G
Sbjct: 440 KVLFWYNLDGESFDFDENTEHGGCPVVDGIKLG 472


>gi|292621357|ref|XP_691737.4| PREDICTED: prolyl 4-hydroxylase subunit alpha-3 [Danio rerio]
          Length = 538

 Score =  104 bits (260), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 73/226 (32%), Positives = 114/226 (50%), Gaps = 13/226 (5%)

Query: 3   YPLACQGNLSVPEDIKS-NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           Y   CQ   S P+  ++ +L C Y +  +  L + P++ E + L P VV  H  +  +E 
Sbjct: 295 YEQLCQTKGSQPKHFENPSLFCDYFTNGSPALFLQPIRREIISLQPYVVLFHGFVTQAEA 354

Query: 62  NRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
             I + +   + R  V +  +    + R+SK  +L       H  + K+  RI  +T L 
Sbjct: 355 KNIRKYAMPGLRRSVVASGMNQATAEYRISKSAWLKE---SAHEVVGKLDQRITLVTGLN 411

Query: 122 IGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGA 174
           +  +  Y   LQ+ NYG+GGHY+ H D AT     L+RL      A+ M YL+ V+ GG+
Sbjct: 412 V--QPPYAEYLQVVNYGIGGHYEPHFDHATSDSSPLYRLKTGNRVATIMIYLSPVQAGGS 469

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T F   N +V   + +A+FW+N H N   +    H+GCPV +GNKW
Sbjct: 470 TAFIYANFSVPVVQNAALFWWNLHKNGQGNVDTLHAGCPVIVGNKW 515


>gi|47227817|emb|CAG08980.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 285

 Score =  104 bits (260), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 73/229 (31%), Positives = 113/229 (49%), Gaps = 15/229 (6%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTF--LKIGPLKVEELYLDPRVVKIHDAIYD 58
           + Y   C+   S P   + N + F +++ N    L + P + E L L P VV  HD I D
Sbjct: 40  DTYERLCRTRGSQPTHFE-NPQLFCDNFANGHPGLLLRPARRETLSLQPYVVLYHDFISD 98

Query: 59  SEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           +E   I   ++  + R  V      +  + R+SK  +L          + ++  RI  +T
Sbjct: 99  TEAEEIKHHAQLGLRRSVVATRDKQVTAEYRISKSAWLKGSA---QSAVSRLDQRISMLT 155

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------RLASFMFYLTDVEL 171
            L +  +  +   LQ+ NYG+GGHY+ H D AT     ++      R+A+ M YL+ VE 
Sbjct: 156 GLNV--QHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKLKTGNRVATVMIYLSSVEA 213

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GG+T F   N +V   K +A+FW+N H N   D    H+GCPV +G+KW
Sbjct: 214 GGSTAFIYANFSVPVMKNAAIFWWNLHRNGRGDPDTLHAGCPVLIGDKW 262


>gi|195391756|ref|XP_002054526.1| GJ24503 [Drosophila virilis]
 gi|194152612|gb|EDW68046.1| GJ24503 [Drosophila virilis]
          Length = 519

 Score =  104 bits (259), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 72/229 (31%), Positives = 112/229 (48%), Gaps = 21/229 (9%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           Y   CQG  LS P+   S L C+ +   +   ++ PLKVE++ L+P +   +D I D +I
Sbjct: 283 YTQLCQGKRLSEPKPNGSALNCYLDFTRHARFRLAPLKVEQVRLNPDIHIYYDLINDDQI 342

Query: 62  NRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
           + I E+           +   +I  D R+S+  +L         +   I   + ++   +
Sbjct: 343 DDIYEVVDQF--DSFRSSVSSSIVTDWRVSQQVWL--------NYSSPILRSVSNLVGAI 392

Query: 122 IGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-----RLASFMFYLTDVELGGATI 176
            G +      +Q+ NYG+GG Y  H D   +    +     R+A+ MFYL+DV  GG T+
Sbjct: 393 SGFDMENAEQMQVANYGIGGQYAPHTDYLSKIPDSYIPRGNRIATNMFYLSDVLNGGYTV 452

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV-----ALGNKW 220
           FP LN+ + P KG+ V WYN H +   D R  H+GCPV      +GN W
Sbjct: 453 FPKLNVFLKPVKGAMVSWYNLHRSLNKDSRTLHAGCPVIEGVKRIGNIW 501


>gi|195110921|ref|XP_002000028.1| GI24861 [Drosophila mojavensis]
 gi|193916622|gb|EDW15489.1| GI24861 [Drosophila mojavensis]
          Length = 508

 Score =  104 bits (259), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 71/219 (32%), Positives = 119/219 (54%), Gaps = 20/219 (9%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           CQG   +PE    +L C+ +   +   ++ PLKVE+ +L+P +   +D + D +I  +++
Sbjct: 280 CQGK-RLPE--PGSLSCYLDFERHPRFRLSPLKVEQAHLNPDIHIYYDVLTDPQIESVLD 336

Query: 67  L-SKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
           L S+ +  R KV+  GD +  +TR+S+  +L         +   I   + ++   + G +
Sbjct: 337 LASQLESFRSKVL--GDVV-TETRVSQQVWLN--------YTSPIMRTVGNLLGAISGLD 385

Query: 126 ERYKGPLQINNYGLGGHYDLHCD--ATPRDEGLWR---LASFMFYLTDVELGGATIFPSL 180
                 +Q+ NYG+GG Y  H D  +  R++ + R   + + MFYL+DV  GG T+FP L
Sbjct: 386 MTNVEEMQVANYGIGGQYFPHFDYISELREDYIERGNRITTNMFYLSDVLQGGYTVFPFL 445

Query: 181 NLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           N+ + P KGS V W N H +   D R+ H+GCPV  G+K
Sbjct: 446 NVFLRPVKGSLVIWPNVHRSLAPDSRVLHAGCPVLEGSK 484


>gi|443705944|gb|ELU02240.1| hypothetical protein CAPTEDRAFT_227850 [Capitella teleta]
          Length = 475

 Score =  103 bits (258), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 65/219 (29%), Positives = 110/219 (50%), Gaps = 19/219 (8%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
           +L C  +   N++   G  K E L+ +P +   HD I DSEI R+ ++++ + +   V++
Sbjct: 160 DLFCLNKQMRNSY---GLWKTELLHANPEIYLFHDFISDSEIQRLKDMAEPQFQSSAVLD 216

Query: 80  Y--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGP--LQIN 135
              G++ +  +RLS   F    +   +  +  +  R+  +T L     + +     LQ+ 
Sbjct: 217 DTGGESFFDVSRLSSTAF----VNDSNDLVASLNRRVSKLTGLQTEVLDSFSESESLQVL 272

Query: 136 NYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVELGGATIFPSLNLTVFPE 187
            YG GG Y  H D    +  L         R+A+F+ YL     GGAT+FP L +++  +
Sbjct: 273 RYGPGGLYTPHYDTLGSEADLPPYIQHTGDRIATFILYLDIATAGGATVFPLLPMSIPIQ 332

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLLLS 226
           KG+A FW+N H +  LD R  H+ CPV  G KW  +++S
Sbjct: 333 KGAAAFWFNLHPDGSLDRRTLHAACPVIRGTKWECVIVS 371


>gi|195159309|ref|XP_002020524.1| GL13466 [Drosophila persimilis]
 gi|194117293|gb|EDW39336.1| GL13466 [Drosophila persimilis]
          Length = 643

 Score =  103 bits (258), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 67/222 (30%), Positives = 111/222 (50%), Gaps = 19/222 (8%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           C G    P +   +L CFY +   + FL +  ++ E L  DP +   +D +  S++  + 
Sbjct: 405 CNGLCKGPRN--RHLHCFYLTKRGSPFLLLARVRTEILSDDPFIALYYDVLTHSDMVSLR 462

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR----IQDMTNLV 121
             S+  +     + Y +     +     +F++ E     P + +   R    + D+T L 
Sbjct: 463 NTSEPLLHPATTIQYFNAPQELSNSRTAHFVWLE-----PTITEATRRADRVLWDVTGLN 517

Query: 122 IGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW---RLASFMFYLTDVELGGATIFP 178
           +   E +    Q+NNYG+GG +  H D    +       R+A+ +FYL+DV  GGAT+F 
Sbjct: 518 LSNSEMF----QVNNYGIGGSFMRHSDLLHSERNYLVRERIATAIFYLSDVPHGGATLFT 573

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            LN+TVFP+ G+ +FWYN   +   D R  H+GCPV +G+KW
Sbjct: 574 ELNVTVFPQAGTVLFWYNLAHSGDHDMRTRHTGCPVIVGSKW 615


>gi|312385117|gb|EFR29691.1| hypothetical protein AND_01144 [Anopheles darlingi]
          Length = 295

 Score =  103 bits (257), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 71/225 (31%), Positives = 112/225 (49%), Gaps = 21/225 (9%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           C+G    P  + S L+C+Y++ N+  + IGP KVE L  +P V   +D I+DSEI R+ E
Sbjct: 45  CKGTYQRPVGLTSWLRCWYDARNDHSV-IGPRKVEMLNYEPFVALFYDVIHDSEITRLQE 103

Query: 67  LSKGKVE-RGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
           L  G ++  G   +    +Y +    + Y L      D P + ++  R + M+ L     
Sbjct: 104 LGDGVIKVSGATTDGWLPVYYENH--QTYTLQNR---DDPVVKRLSQRTERMSGLSCDTA 158

Query: 126 ERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-------RLASFMFYLTDV---ELGGAT 175
           E     L++    +G +     D   +            RLA+ +F+++DV   E GG  
Sbjct: 159 ED----LKVIYNEVGAYKSFIVDGKKKSSVAQQFAFAGKRLATVLFFMSDVDGAEGGGRI 214

Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            FP L L+V P+KG+A+FWYN H +   D RM +S CP+   N+W
Sbjct: 215 AFPYLGLSVLPQKGAALFWYNLHDSGRPDERMTYSICPLLADNRW 259


>gi|313243209|emb|CBY39868.1| unnamed protein product [Oikopleura dioica]
          Length = 430

 Score =  103 bits (257), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 68/216 (31%), Positives = 108/216 (50%), Gaps = 17/216 (7%)

Query: 18  KSNLKCFYESYNN--TFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERG 75
           KSNLKCFY +  +  + L+  P+K EEL+ DP VV+ ++ I D E   I  L+   + R 
Sbjct: 191 KSNLKCFYWTGPSPVSPLQWAPVKTEELHDDPLVVQFYEVISDEEERAIQFLAGEHLNRA 250

Query: 76  KVVN--YGDTIYVDTRLSKVYFLYP-EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPL 132
            + +   G  +  D R+ K  +L   + F  +  + K   ++  +T L     E     +
Sbjct: 251 TIQDPATGKLVNADYRIQKTAWLTEFDKFDVNGTIAKYNAKLTKITGLDADHAEL----V 306

Query: 133 QINNYGLGGHYDLHCD--ATPRDEGLW------RLASFMFYLTDVELGGATIFPSLNLTV 184
           Q+ NYG+ G Y+ H D  + P  E  W      R+A+++ Y+++  +GG T+F    +  
Sbjct: 307 QVGNYGVAGQYEPHWDHQSYPGAENRWDPIEGSRIATWLAYMSEPNMGGGTVFIQAGIQA 366

Query: 185 FPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            P + SAVFWYN   +   D    H+ CPV  G KW
Sbjct: 367 RPIRNSAVFWYNLLPSGESDDNTQHAACPVLSGTKW 402


>gi|195505241|ref|XP_002099419.1| GE10893 [Drosophila yakuba]
 gi|194185520|gb|EDW99131.1| GE10893 [Drosophila yakuba]
          Length = 508

 Score =  103 bits (257), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 70/231 (30%), Positives = 115/231 (49%), Gaps = 20/231 (8%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y   C+ + S P    S L C Y S  + FL + P K+EE+ L+P +V  HD + D +
Sbjct: 262 EDYKRLCRSSFS-PR--PSKLLCRYNSDTSPFLILAPFKMEEISLEPYIVVYHDILPDKD 318

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDH-----PFLYKIQTRIQ 115
           + ++I L++ ++   +V     +    +  S +    P  F D      P L ++  R++
Sbjct: 319 MQQLIALAEPRLRPTEVFEEDKSEARTSDRSALGTFLP--FKDMNPSGGPLLDRLTQRMR 376

Query: 116 DMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDEGLW-RLASFMFYLTDVE 170
           D+T + I    R++    I  YG G  Y  + D         EG   R+A+ +FYL D  
Sbjct: 377 DITGIQI----RHENTFNIIKYGFGSQYATNFDFFNGTNSEMEGYGDRMATVLFYLNDAP 432

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNKW 220
            GGAT+FP +++ V  E+G  +FW+N +  T  ++    H+ CPV  G+KW
Sbjct: 433 NGGATVFPRIDVKVTAERGKVLFWHNLNGETHDVEPNTLHAACPVFQGSKW 483


>gi|198477148|ref|XP_002136736.1| GA29214 [Drosophila pseudoobscura pseudoobscura]
 gi|198145041|gb|EDY71753.1| GA29214 [Drosophila pseudoobscura pseudoobscura]
          Length = 520

 Score =  103 bits (257), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 68/222 (30%), Positives = 111/222 (50%), Gaps = 19/222 (8%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           C G    P +   +L CFY +   + FL +  ++ E L  DP +V  +D +  S++  + 
Sbjct: 282 CNGLCKGPRN--RHLHCFYLTKRGSPFLLLARVRTEILSDDPFIVLYYDVLTHSDMVSLR 339

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR----IQDMTNLV 121
             S+  +     + Y +     +     +F++ E     P + +   R    + D+T L 
Sbjct: 340 NTSEPLLHPATTIQYLNAPQELSNSRTAHFVWLE-----PTITEATRRADRVLWDVTGLN 394

Query: 122 IGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW---RLASFMFYLTDVELGGATIFP 178
           +   E +    Q+NNYG+GG +  H D    +       R+A+ +FYL+DV  GGAT+F 
Sbjct: 395 LSNSEMF----QVNNYGIGGSFMRHSDLLHSERNYLVRERIATAIFYLSDVPQGGATLFT 450

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            LN+TVFP+ G+ +FWYN   +   D R  H+GCPV  G+KW
Sbjct: 451 ELNVTVFPQAGTVLFWYNLAHSGDHDMRTRHTGCPVIGGSKW 492


>gi|195341582|ref|XP_002037385.1| GM12897 [Drosophila sechellia]
 gi|194131501|gb|EDW53544.1| GM12897 [Drosophila sechellia]
          Length = 467

 Score =  103 bits (257), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 69/225 (30%), Positives = 113/225 (50%), Gaps = 26/225 (11%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y   C+ + S      S L C Y S  + FL +  LK+EE+ L+P +V  HD + D +
Sbjct: 261 EDYKRLCRSSFS---PTPSKLHCRYNSTTSRFLILASLKMEEISLEPYIVAYHDILPDKD 317

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           I ++I L++  ++  +V +        +  + +           P L ++  R++D+T L
Sbjct: 318 IQQLITLAEPLLKPIEVFDENKNEAKSSDRTSL---------GGPLLDRLTERMRDITGL 368

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-RLASFMFYLTDVELGGATIFPS 179
            I +      P+ I  YG G H +         EG   R+A+ MFYL D   GGAT+FP 
Sbjct: 369 QIPQ----GNPINIIKYGFGAHSET--------EGYGDRMATVMFYLNDAPYGGATVFPR 416

Query: 180 LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 224
           LN+ V  E+G  + WYN + ++  D    H+ CPV  G+K+G+++
Sbjct: 417 LNVKVPAERGKVLLWYNLNGDS-QDVTTVHAVCPVFHGSKYGEIV 460


>gi|195159303|ref|XP_002020521.1| GL13468 [Drosophila persimilis]
 gi|194117290|gb|EDW39333.1| GL13468 [Drosophila persimilis]
          Length = 415

 Score =  103 bits (257), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 72/223 (32%), Positives = 107/223 (47%), Gaps = 28/223 (12%)

Query: 5   LACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRI 64
           L C+G    P      L C Y +    FL++ P K E L L P +V  HD I   E   +
Sbjct: 197 LCCRGG--CPYRDMHRLTCSYNTTAAPFLRLAPFKTELLSLSPYMVLYHDVITPLESLTL 254

Query: 65  IELSKGKVERGKVV---NYGDTIYVDT-RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
             LSK  ++R  +V   N     ++D+ R S   +L      ++  + +++ R+  MTN 
Sbjct: 255 KNLSKPLMKRRAMVMVNNLKVRPFIDSGRTSNSVWLTSH---ENAVMERLERRVGVMTNF 311

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCD--ATPRDEGLWRLASFMFYLTDVELGGATIFP 178
            +   E Y    Q+ NYG+GGHY  H D   TP+             L+DV  GGAT+FP
Sbjct: 312 EMENSEVY----QLINYGIGGHYKPHTDHFETPQ-------------LSDVPQGGATLFP 354

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
            LN++V P +G A+ WYN +     +    H+ CP+  G+KW 
Sbjct: 355 RLNISVQPRQGDALLWYNLNDRGQGEIGTVHTSCPIIKGSKWA 397


>gi|328718387|ref|XP_001952104.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-3-like [Acyrthosiphon
           pisum]
          Length = 293

 Score =  103 bits (257), Expect = 6e-20,   Method: Compositional matrix adjust.
 Identities = 77/208 (37%), Positives = 107/208 (51%), Gaps = 16/208 (7%)

Query: 22  KCFYESYNNTFLKI-GPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY 80
           KC Y++ NN F +I  P K E++  +P +   HD +YD EI +I  L+   +    V + 
Sbjct: 73  KCRYQT-NNLFYRILMPFKEEDINSEPLIKIYHDVLYDDEILKIKTLALENMNDAHVKSV 131

Query: 81  G---DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNY 137
               D +   TR  +VY++  E+     F   + TRI+  T       E+Y    QI NY
Sbjct: 132 DGKDDVLEEKTRSGQVYWI-SEVDAVEYFD-ALNTRIESFTGFSTKTAEQY----QIVNY 185

Query: 138 GLGGHYDLHCDA----TPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVF 193
           GLGGHY  H D+    T   E   RL + +FYLTDV+  G T FP LN+    +KG+A+ 
Sbjct: 186 GLGGHYLPHHDSFAKGTENVEFGNRLVTVLFYLTDVQNDGYTSFPLLNINAPVDKGAALV 245

Query: 194 WYNAH-ANTLLDYRMYHSGCPVALGNKW 220
           W N H +N  L Y   H  CP+  GNKW
Sbjct: 246 WNNLHMSNGQLFYESLHGSCPLLKGNKW 273


>gi|313217217|emb|CBY38368.1| unnamed protein product [Oikopleura dioica]
 gi|313239835|emb|CBY17758.1| unnamed protein product [Oikopleura dioica]
          Length = 521

 Score =  103 bits (256), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 68/216 (31%), Positives = 107/216 (49%), Gaps = 17/216 (7%)

Query: 18  KSNLKCFYESYNNTF--LKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERG 75
           KSNLKCFY +  +    L+  P+K EEL+ DP VV+ ++ I D E   I  L+   + R 
Sbjct: 282 KSNLKCFYWTGPSPLSPLQWAPVKTEELHGDPLVVQFYEVISDEEERAIQFLAGEHLNRA 341

Query: 76  KVVN--YGDTIYVDTRLSKVYFLYP-EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPL 132
            + +   G  +  D R+ K  +L   E    +  + K   ++  +T    G +  Y   +
Sbjct: 342 TIQDPATGKLVNADYRIQKTAWLTEFEKLDVNGTIAKYNEKLTKIT----GLDADYAELV 397

Query: 133 QINNYGLGGHYDLHCD--ATPRDEGLW------RLASFMFYLTDVELGGATIFPSLNLTV 184
           Q+ NYG+ G Y+ H D  + P  E  W      R+A+++ Y+++  +GG T+F    +  
Sbjct: 398 QVGNYGVAGQYEPHWDHQSYPGAENRWDPIEGSRIATWLAYMSEPNMGGGTVFIQAGIQA 457

Query: 185 FPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            P + SAVFWYN   +   D    H+ CPV  G KW
Sbjct: 458 RPIRNSAVFWYNLLPSGESDDNTQHAACPVLSGTKW 493


>gi|393903732|gb|EFO16802.2| hypothetical protein LOAG_11701 [Loa loa]
          Length = 531

 Score =  103 bits (256), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 73/230 (31%), Positives = 111/230 (48%), Gaps = 43/230 (18%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           + Y   C+  + V    +S L C+Y+  +  +L++ P+KVE +Y +P  V  HD + D E
Sbjct: 283 DTYQALCRQEMPVNIKAQSRLYCYYK-MDRPYLRLAPIKVEIVYQNPLAVLFHDIMSDEE 341

Query: 61  INRIIE-LSKGKVERGKV--VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
            +RIIE L+  K++R  V  V  G+      R+SK  +L      +H  + +I  R+   
Sbjct: 342 -SRIIEMLAVPKLDRATVHNVETGNLETASYRISKSAWLRS---TEHEVVNRINRRLDLA 397

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-------RLASFMFYLTDVE 170
           TNL I   E     LQ+ NYG+GGHY+ H D + RDE  +       R+A+ + Y     
Sbjct: 398 TNLEIATAEE----LQVQNYGIGGHYEPHLDCS-RDEDAFERTGTGNRIATILIY----- 447

Query: 171 LGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
                              +A+FWYN   +  +D R YH+ CPV  G KW
Sbjct: 448 ------------------NAALFWYNLMRSGAVDMRSYHAACPVLTGTKW 479


>gi|195069797|ref|XP_001997029.1| GH12978 [Drosophila grimshawi]
 gi|193891498|gb|EDV90364.1| GH12978 [Drosophila grimshawi]
          Length = 518

 Score =  102 bits (255), Expect = 9e-20,   Method: Compositional matrix adjust.
 Identities = 73/227 (32%), Positives = 116/227 (51%), Gaps = 23/227 (10%)

Query: 3   YPLACQGNLSVPEDIKSNL---KCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           Y   CQG   +PE IK+N    +C+ +S  + + K+ PLKVE++ L P +   +D + D+
Sbjct: 281 YVRLCQGK-RLPE-IKTNQSSPRCYLDSNQHAYFKLSPLKVEQVNLAPDINIYYDVLNDN 338

Query: 60  EINRIIELS-KGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           +I  I+ELS + +  R  V  Y  T   D R+S+  +L         +   I    + + 
Sbjct: 339 QIKSILELSTEFESFRSSVNKYNVT---DKRVSQQVWL--------NYSSPIMRTYRQLV 387

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELG 172
             + G        +Q+ NYG+GG Y+ H D +  +          R+++ M YL+DV+ G
Sbjct: 388 GAISGFNMTNAEIMQVANYGIGGQYEPHHDFSGANLAARYANFGDRISTNMIYLSDVQQG 447

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           G T+FP+ N+ V P KG+ V W+N   +   D R  H+GCPV  G K
Sbjct: 448 GYTVFPTQNVFVKPIKGAMVMWHNLLRSLDGDRRTLHAGCPVIEGTK 494


>gi|194751827|ref|XP_001958225.1| GF23630 [Drosophila ananassae]
 gi|190625507|gb|EDV41031.1| GF23630 [Drosophila ananassae]
          Length = 431

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 72/220 (32%), Positives = 109/220 (49%), Gaps = 36/220 (16%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y L C+G       +K+ L C Y  +   FLKI PLK E L LDP +   H+ +Y+ E++
Sbjct: 244 YELGCRGLFP----LKNKLFCQYNFHTTPFLKIAPLKQEILSLDPFISMFHEVLYEYELH 299

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
            + E  K  ++  K   Y   I    R S+                    R+ D+T L  
Sbjct: 300 GLKEDLKNPIKSKK---YKKNI--TNRFSQ--------------------RLTDITGLHF 334

Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL 182
            + ++    + I+NYGL    ++H +   +D G   + + +F+++D   GGAT+FP L +
Sbjct: 335 SKRDQ----INIDNYGLENQAEVHYNY--KDIG-GPVGAILFFISDDVQGGATVFPKLKV 387

Query: 183 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGK 222
           +VFP+KGS + WYN   +  LD R  HS CPV  GN  GK
Sbjct: 388 SVFPKKGSCLVWYNIKDDGRLDPRTTHSICPVLEGNSLGK 427


>gi|195069795|ref|XP_001997028.1| GH12977 [Drosophila grimshawi]
 gi|193891497|gb|EDV90363.1| GH12977 [Drosophila grimshawi]
          Length = 517

 Score =  102 bits (254), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 75/228 (32%), Positives = 114/228 (50%), Gaps = 25/228 (10%)

Query: 3   YPLACQGNLSVPEDIKSNL---KCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           Y   CQG   +PE IK+N    +C+ +S  + + K+ PLKVE++ L P +   +D + D+
Sbjct: 280 YVRLCQGK-RLPE-IKTNQSSPRCYLDSNQHAYFKLSPLKVEQVNLAPDINIYYDVLNDN 337

Query: 60  EINRIIELSKG-KVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           +I  I+ELS      R  V  Y  T   D R+S+  +L         +   I    + + 
Sbjct: 338 QIKSILELSTEFDSFRSSVNKYNVT---DKRVSQQVWL--------NYSSPIMRTYRQLV 386

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-------ATPRDEGLWRLASFMFYLTDVEL 171
             + G        +Q+ NYG+GG Y+ H D       A     G  R+++ M YL+DV+ 
Sbjct: 387 GAISGFNMTNAETMQVANYGIGGQYEPHHDFFGINLPANSVKRGD-RISTNMIYLSDVQQ 445

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           GG T+FP+ N+ V P KG+ V W+N   +   D R  H+GCPV  G K
Sbjct: 446 GGYTVFPTQNVFVKPIKGAMVMWHNLLRSLDGDRRTLHAGCPVIEGTK 493


>gi|195452770|ref|XP_002073492.1| GK14148 [Drosophila willistoni]
 gi|194169577|gb|EDW84478.1| GK14148 [Drosophila willistoni]
          Length = 444

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 73/226 (32%), Positives = 109/226 (48%), Gaps = 19/226 (8%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           CQ +   P+  K +L C Y +    FL++ P ++EEL L+P +V  H+ + D EI ++  
Sbjct: 226 CQSS-HKPKPTK-HLYCRYNTTTTPFLRLAPFRMEELSLNPYMVAYHNVLSDEEIRQLNR 283

Query: 67  LSKGKVERGKVVNYGDTIYVDTRLSKVYF---LYPEIFGDHPFLYKIQTRIQDMTNLVIG 123
           +S   +++   V+  D  Y    +   +F     P    +   + +I   + D+T L   
Sbjct: 284 MSAPLLKKAFPVSAVDIDYDVRTVDTAWFPNSETPHTKENDRLIKRIVNIVSDLTGLNAD 343

Query: 124 REERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-------RLASFMFYLTDVELGGATI 176
             + +    Q   YG GGHY  H D    +E +        RLA+ +FYL  V+ GGAT+
Sbjct: 344 VADSF----QAVRYGFGGHYSPHHDYF--NESIHQTAVNGDRLATVLFYLNTVKHGGATV 397

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMYHSGCPVALGNKWG 221
           FP LNL V  EKG  +FWYN    +L  D    H  CPV  G K G
Sbjct: 398 FPLLNLKVPAEKGKVLFWYNLDGESLDFDENTEHGVCPVVDGIKLG 443


>gi|15808763|gb|AAL08488.1| prolyl-4-hydroxylase alpha subunit-like protein [Onchocerca
           volvulus]
          Length = 571

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 71/229 (31%), Positives = 116/229 (50%), Gaps = 18/229 (7%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           IY   C+  + V   ++S L C+Y++ +  +L++ P KVE +  +P  V  +  I D + 
Sbjct: 294 IYEALCRREVPVNTKVQSQLYCYYKT-DRPYLRLAPFKVEIVRQNPLNVLFYGIISDEQA 352

Query: 62  NRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
             I  L+  K+   ++ N   G       R+ K   L      ++  + +I  R++  TN
Sbjct: 353 RIIQMLAVPKLNGSRIYNDLTGSFELPSFRILKSARLRS---TEYETVKRIDKRLELATN 409

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-------RLASFMFYLTDVELG 172
           L I   E     L + NYG+GG ++ H D   + +  +       R+A+F+ YLT+ E+G
Sbjct: 410 LEIETAE----DLAVLNYGIGGQFEPHFDCALKGDQCFEKLGTGNRIATFLIYLTEPEIG 465

Query: 173 GATIFPS-LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           G T+F S L ++V   K +A+FWYN   N  +D R  H+ CPVA G KW
Sbjct: 466 GRTVFTSNLKISVPCVKNAALFWYNLMRNGEVDTRSLHAACPVATGIKW 514


>gi|15808767|gb|AAL08490.1|AF369789_1 prolyl-4-hydroxylase alpha subunit-like protein [Onchocerca
           volvulus]
          Length = 571

 Score =  101 bits (252), Expect = 2e-19,   Method: Compositional matrix adjust.
 Identities = 74/230 (32%), Positives = 119/230 (51%), Gaps = 20/230 (8%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           IY   C+  + V   ++S L C+Y++ +  +L++ P KVE +  +P  V  +  I D + 
Sbjct: 294 IYEALCRREVPVNTKVQSQLYCYYKT-DRPYLRLAPFKVEIVRQNPLNVLFYGIISDEQA 352

Query: 62  NRIIE-LSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
            RIIE L+  K+   ++ N   G       R+ K   L      ++  + +I  R++  T
Sbjct: 353 -RIIEMLAVPKLNGSRIYNDLTGSFELPSFRILKSARLRS---TEYETVKRIDKRLELAT 408

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-------RLASFMFYLTDVEL 171
           NL I   E     L + NYG+GG ++ H D   + +  +       R+A+F+ YLT+ E+
Sbjct: 409 NLEIETAE----DLAVLNYGIGGQFEPHFDCALKGDQCFEKLGTGNRIATFLIYLTEPEI 464

Query: 172 GGATIFPS-LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GG T+F S L ++V   K +A+FWYN   N  +D R  H+ CPVA G KW
Sbjct: 465 GGRTVFTSNLKISVPCVKNAALFWYNLMRNGEVDTRSLHAACPVATGIKW 514


>gi|194871344|ref|XP_001972830.1| GG13666 [Drosophila erecta]
 gi|190654613|gb|EDV51856.1| GG13666 [Drosophila erecta]
          Length = 539

 Score =  101 bits (251), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 72/228 (31%), Positives = 109/228 (47%), Gaps = 23/228 (10%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN--RI 64
           C+G    P+     L C Y    + FLK+ PLK+E L + P +   HD +Y+ E    R 
Sbjct: 302 CRGEW--PKKSSPELICRYSRDTSAFLKLAPLKLEFLSVQPMIHLYHDVLYEKEFKSMRD 359

Query: 65  IELSKGKVERGKV-VNYGDTIYVDTRLSKVYFLYPEIFGD--HPFLYKIQTRIQDMTNLV 121
           + +    +  G+   ++   I   T+   V  +    F D   P+   I  RI DM+   
Sbjct: 360 VAVFNATMIDGRTYFDFHKKIKPKTQDRVVKMI---DFKDTTAPYTLSINRRIADMS--- 413

Query: 122 IGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLW------RLASFMFYLTDVELG 172
            G E R    L ++NYGLGG +  H D      R    +      R+A+ + Y +DV LG
Sbjct: 414 -GLEMRENMVLYLSNYGLGGDFGKHVDYVELAKRPSDFFADFKGDRIATAVLYASDVPLG 472

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           G T+FP L + V P+KG+A+ W+N +     D    HS CP+ LG++W
Sbjct: 473 GTTVFPKLKIAVQPKKGNALVWFNLNHAGEPDPLTEHSVCPIVLGSRW 520


>gi|194765140|ref|XP_001964685.1| GF23318 [Drosophila ananassae]
 gi|190614957|gb|EDV30481.1| GF23318 [Drosophila ananassae]
          Length = 412

 Score =  100 bits (250), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 66/209 (31%), Positives = 94/209 (44%), Gaps = 46/209 (22%)

Query: 18  KSN--LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERG 75
           KSN  L C+Y S    FL+I P K E++ LDP VV  HD +   EI+++I L+  K+ + 
Sbjct: 217 KSNNRLMCYYNSSTTPFLRIAPFKTEQIGLDPYVVVFHDVLSPREISKLISLTDRKLVQA 276

Query: 76  KVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQIN 135
             VN   +     R +K +++Y    G      +I  RI DM+   +   E         
Sbjct: 277 VTVN-KKSFKEMVRTAKAHWVY---RGYQELTKRIYRRIHDMSGFELADAEN-------- 324

Query: 136 NYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL----TVFPEKGSA 191
                                       F L+DVE GGAT+FP ++     TV+P  G+A
Sbjct: 325 ----------------------------FQLSDVEQGGATVFPGISADSAYTVYPRAGTA 356

Query: 192 VFWYNAHANTLLDYRMYHSGCPVALGNKW 220
             WYN H + L D    H  CPV +G+KW
Sbjct: 357 AMWYNLHTDGLGDPTTLHVACPVIVGSKW 385


>gi|431904119|gb|ELK09541.1| Prolyl 4-hydroxylase subunit alpha-1 [Pteropus alecto]
          Length = 507

 Score =  100 bits (250), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 64/198 (32%), Positives = 105/198 (53%), Gaps = 19/198 (9%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 289 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 348

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  V +   G       R+SK  +L      ++P + +I  RIQD+T
Sbjct: 349 IEIVKDLAKPRLSRATVHDPETGKLTTAQYRVSKSAWLSGY---ENPVVSRINMRIQDLT 405

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVE 170
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY++DV 
Sbjct: 406 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYMSDVS 461

Query: 171 LGGATIFPSLNLTVFPEK 188
            GGAT+FP +  +V+P+K
Sbjct: 462 AGGATVFPEVGASVWPKK 479


>gi|443719426|gb|ELU09607.1| hypothetical protein CAPTEDRAFT_229373 [Capitella teleta]
          Length = 576

 Score =  100 bits (250), Expect = 3e-19,   Method: Compositional matrix adjust.
 Identities = 60/194 (30%), Positives = 103/194 (53%), Gaps = 18/194 (9%)

Query: 41  EELY-LDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
           EE++  +PR+  I++ I + +IN + + +   +   +V +   +   + R+SK  +L+  
Sbjct: 359 EEIFNFNPRIALIYNVIKNRDINMLKDKATAGLSSSRVGDPAKSKLSNERISKTSWLWD- 417

Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGRE--ERYKGPLQINNYGLGGHYDLHCD--------- 148
              +   ++K+  ++ D+T L         +  P Q+ NYG+GG Y  H D         
Sbjct: 418 --TEDERIFKLSKQVADITGLSTQYSTLHSHAEPFQLVNYGIGGQYQPHFDYYENDMLRN 475

Query: 149 --ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYR 206
             A  +D G  R+A+FMFYL+ V+ GGAT+FP L++ +   KG+A FW+N   +   +  
Sbjct: 476 VPAFIQDTGD-RVATFMFYLSSVKAGGATVFPKLHVRIPAVKGAAAFWFNIRRSGDREPL 534

Query: 207 MYHSGCPVALGNKW 220
             H+GCPV LG KW
Sbjct: 535 TQHAGCPVLLGEKW 548


>gi|319943342|ref|ZP_08017624.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
 gi|319743157|gb|EFV95562.1| 2OG-Fe(II) oxygenase [Lautropia mirabilis ATCC 51599]
          Length = 311

 Score =  100 bits (248), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 61/184 (33%), Positives = 97/184 (52%), Gaps = 14/184 (7%)

Query: 46  DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHP 105
           +P +  I   + D E + +I LS+GK++  +VV+       ++ + K    + E  G++ 
Sbjct: 120 NPNIAVIRGLLSDEECDEVIRLSRGKMKTSQVVDRESGGSYESSVRKSEGSHFE-RGENE 178

Query: 106 FLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDA-TPRDEGL-------- 156
            + +I+ R+  + +L + R E    PLQI +YG GG Y  H D   P+D G         
Sbjct: 179 LVRRIEARLSALVDLPVNRGE----PLQILHYGPGGEYKAHQDFFEPKDPGSAVLTRVGG 234

Query: 157 WRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVAL 216
            R+ + + YL DV  GG T FP +  +  P KGSAV++   +A+  LDYR  H+G PV  
Sbjct: 235 QRIGTVVMYLNDVPEGGETAFPDIGFSAKPIKGSAVYFEYQNADGQLDYRCLHAGMPVIR 294

Query: 217 GNKW 220
           G+KW
Sbjct: 295 GDKW 298


>gi|339236275|ref|XP_003379692.1| prolyl 4-hydroxylase subunit alpha-2 [Trichinella spiralis]
 gi|316977629|gb|EFV60704.1| prolyl 4-hydroxylase subunit alpha-2 [Trichinella spiralis]
          Length = 441

 Score = 99.8 bits (247), Expect = 8e-19,   Method: Compositional matrix adjust.
 Identities = 78/254 (30%), Positives = 109/254 (42%), Gaps = 53/254 (20%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           C+G   + E  +S L C+Y+  +  FL + P+KVE ++  P++V     I  +EI  +  
Sbjct: 176 CRGEYLLTEKQRSRLYCYYKR-DTPFLSLAPIKVEVMHWKPKIVIFRQVISANEIAVLKT 234

Query: 67  LSKGKVERGKVVNY-----------------------------GDTIYVDTRLSKVYFLY 97
           L+  ++ R  V N                              G   +   R+SK  +L 
Sbjct: 235 LAYPRLSRATVQNSETGELETAKYRISKRCRTLRRATVHNKETGQLEHASYRISKSAWLK 294

Query: 98  PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE--- 154
                +HP + +I  RI DMTNL +   E     LQI NYGLGGHYD H D   RDE   
Sbjct: 295 EH---EHPVVDRIVKRIHDMTNLNMETAE----DLQIANYGLGGHYDPHFDHARRDEVDP 347

Query: 155 ----GLWRLASFMFYLTDVELGGATIFPSLN----LTVFPEKGSAVFWYNAHANTLLDYR 206
                  R+A+ +FY  +V       F SLN    +      G A FW+N   N   D  
Sbjct: 348 YEHGHGNRIATTLFYKEEV-----NAFKSLNTGNRIATVLFYGDAAFWFNLKPNGEGDMS 402

Query: 207 MYHSGCPVALGNKW 220
             H+ CPV  G KW
Sbjct: 403 TRHAACPVLAGVKW 416


>gi|167519971|ref|XP_001744325.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777411|gb|EDQ91028.1| predicted protein [Monosiga brevicollis MX1]
          Length = 492

 Score = 99.8 bits (247), Expect = 9e-19,   Method: Compositional matrix adjust.
 Identities = 69/206 (33%), Positives = 98/206 (47%), Gaps = 14/206 (6%)

Query: 21  LKCFYESYNNTFLKIGPLKVEELYL-DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
           L C  + +N   L + P++VE ++  + R+    +     E   + E  + K+ R     
Sbjct: 277 LSCRLQHFNKPHLFLKPIRVEYVHEGNNRLQIFRNFASAQECAHLREEGRKKLSRAVAWT 336

Query: 80  YGDTIYVDTRLSKVYFLYPEIFGDHP-FLYKIQTRIQDMTNLVIGREERYKGPLQINNYG 138
            G    V+ R+S   +L P    DH   +  + TRI D T L +     +   LQ++NYG
Sbjct: 337 DGAFRPVEFRISTAAWLQP----DHDDVVTNLHTRIADATQLDL----EFAEALQVSNYG 388

Query: 139 LGGHYDLHCDA-TPRDEGLW---RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFW 194
           +GG Y+ H D    R+  L    R+A+FM YL  VE GG T FP L   V P  G AVFW
Sbjct: 389 IGGFYETHYDHHASRERELPEGDRIATFMIYLNQVEQGGYTAFPRLGAAVEPGHGDAVFW 448

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
           YN   +   D    H  CPV  G+KW
Sbjct: 449 YNLLPDGESDNNTLHGACPVLQGSKW 474


>gi|17861644|gb|AAL39299.1| GH17175p [Drosophila melanogaster]
          Length = 187

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 56/138 (40%), Positives = 82/138 (59%), Gaps = 14/138 (10%)

Query: 89  RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD 148
           R+SK  +L    +  HP +  +   ++D T L    +  +   LQ+ NYG+GGHY+ H D
Sbjct: 25  RVSKNAWL---AYESHPTMVGMLRDLKDATGL----DTTFCEQLQVANYGVGGHYEPHWD 77

Query: 149 ------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL 202
                   P +EG  R+A+ +FYL++VE GGAT FP L++ V P+ G+ +FWYN H +  
Sbjct: 78  FFRDPNHYPAEEGN-RIATAIFYLSEVEQGGATAFPFLDIAVKPQLGNVLFWYNLHRSLD 136

Query: 203 LDYRMYHSGCPVALGNKW 220
            DYR  H+GCPV  G+KW
Sbjct: 137 KDYRTKHAGCPVLKGSKW 154


>gi|195069799|ref|XP_001997030.1| GH12979 [Drosophila grimshawi]
 gi|193891499|gb|EDV90365.1| GH12979 [Drosophila grimshawi]
          Length = 517

 Score = 99.4 bits (246), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 78/246 (31%), Positives = 123/246 (50%), Gaps = 54/246 (21%)

Query: 3   YPLACQGNLSVPEDIKSNL---KCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           Y   CQG   +PE IK+N    +C+ +S  + + K+ PLKVE++ LDP +   +  + D+
Sbjct: 280 YVRLCQGK-RLPE-IKTNQSSPRCYLDSNRHAYFKLSPLKVEQVNLDPDINIYYGVLNDN 337

Query: 60  EINRIIELSKG-----KVERGKVVNYGDTIYVDTRLSK-VYFLYPEIFGDHPFLYKIQTR 113
           +I  I+ LS          R  V++       D R+S+ V+  Y       P +   +  
Sbjct: 338 QIKSILRLSDELDSFRSTHRKYVIS-------DMRISQQVWLNYSS-----PIMRTYRQL 385

Query: 114 IQ-----DMTNLVIGREERYKGPLQINNYGLGGHYDLHCD--ATP-------RDEGLWRL 159
           +      +MTN+ I         +Q+ NYG+GGHY+ H D   +P       R +   R+
Sbjct: 386 VGAISGFNMTNVEI---------MQLANYGIGGHYEPHIDYMGSPLPPYYAKRGD---RI 433

Query: 160 ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV----- 214
           ++ M YL+DV+ GG T+FP+ N+ V P KGS + WYN   +   D+R  H+GC V     
Sbjct: 434 STSMIYLSDVQQGGYTVFPTQNVFVKPVKGSMILWYNQLRSLNPDHRTLHAGCAVIEGIK 493

Query: 215 ALGNKW 220
            +GN W
Sbjct: 494 RIGNIW 499


>gi|195575103|ref|XP_002105519.1| GD17002 [Drosophila simulans]
 gi|194201446|gb|EDX15022.1| GD17002 [Drosophila simulans]
          Length = 793

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 58/173 (33%), Positives = 96/173 (55%), Gaps = 15/173 (8%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           ++Y   C+G L      + NL+C+       +  + P K+E+L +DP V  +H+ ++DSE
Sbjct: 255 KLYTQVCRGELHQSPRDQRNLRCWLSHQGVPYYHLSPFKIEQLNIDPYVAYVHEVLWDSE 314

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           I+ I+E  KG +ER KV    ++   + R+S+  +L+   +  +P+L KI+ R++D+T L
Sbjct: 315 IDTIMEHGKGNMERSKVGQIENSTTTEVRISRNTWLW---YDANPWLSKIKQRLEDVTGL 371

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL----W---RLASFMFYL 166
                E    PLQ+ NYG+GG Y+ H D    D+G     W   RL + +FYL
Sbjct: 372 STESAE----PLQLVNYGIGGQYEPHFDFV-EDDGQNVFSWKGNRLLTALFYL 419


>gi|195055777|ref|XP_001994789.1| GH14121 [Drosophila grimshawi]
 gi|193892552|gb|EDV91418.1| GH14121 [Drosophila grimshawi]
          Length = 517

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 78/247 (31%), Positives = 121/247 (48%), Gaps = 56/247 (22%)

Query: 3   YPLACQGNLSVPEDIKSNL---KCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           Y   CQG   +PE IK+N    +C+ +S  + + K+ PLKVE++ LDP +   +  + D+
Sbjct: 280 YVRLCQGK-RLPE-IKTNQSSPRCYLDSNRHAYFKLSPLKVEQVNLDPDINIYYGVLNDN 337

Query: 60  EINRIIELSKG-----KVERGKVVNYGDTIYVDTRLSK-VYFLYPEIFGDHPFLYKIQTR 113
           +I  I+ LS          R  V++       D R+S+ V+  Y       P +   +  
Sbjct: 338 QIKSILRLSDELDSFRSTHRKYVIS-------DMRISQQVWLNYSS-----PIMRTYRQL 385

Query: 114 IQ-----DMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----------ATPRDEGLWR 158
           +      +MTN+ I         +Q+ NYG+GGHY+ H D          A   D    R
Sbjct: 386 VGAISGFNMTNVEI---------MQLANYGIGGHYEPHIDYMGSPLPPYYAKRGD----R 432

Query: 159 LASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV---- 214
           +++ M YL+DV+ GG T+FP+ N+ V P KGS + WYN   +   D+R  H+GC V    
Sbjct: 433 ISTSMIYLSDVQQGGYTVFPTQNVFVKPVKGSMILWYNQLRSLNPDHRTLHAGCAVIEGI 492

Query: 215 -ALGNKW 220
             +GN W
Sbjct: 493 KRIGNIW 499


>gi|319652240|ref|ZP_08006358.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
 gi|317396063|gb|EFV76783.1| prolyl 4-hydroxylase [Bacillus sp. 2_A_57_CT2]
          Length = 216

 Score = 99.0 bits (245), Expect = 1e-18,   Method: Compositional matrix adjust.
 Identities = 60/180 (33%), Positives = 98/180 (54%), Gaps = 14/180 (7%)

Query: 46  DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDH 104
           +P +V + + + D E +++I+ SK +++R KV N   ++ VD  R S   F +    G++
Sbjct: 37  EPLIVILGNVLSDEECDQLIQQSKDRMQRSKVAN---SLEVDELRTSSSTFFHE---GEN 90

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLAS 161
             + +I+ RI  + N+ +   E     LQI NY +G  Y  H D   +T R     R+++
Sbjct: 91  EIVARIEKRISQIMNIPVEHGE----GLQILNYKIGQEYKAHFDFFSSTSRAASNPRIST 146

Query: 162 FMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
            + YL DVE GG T FP LN +V P+KG AV++   + +  L+    H G PV +G+KW 
Sbjct: 147 LVMYLNDVEQGGETYFPKLNFSVSPQKGMAVYFEYFYNDQNLNDLTLHGGAPVVMGDKWA 206


>gi|423669823|ref|ZP_17644852.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
 gi|423673973|ref|ZP_17648912.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
 gi|401298950|gb|EJS04550.1| hypothetical protein IKO_03520 [Bacillus cereus VDM034]
 gi|401309524|gb|EJS14857.1| hypothetical protein IKS_01516 [Bacillus cereus VDM062]
          Length = 216

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 64/180 (35%), Positives = 91/180 (50%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ER KV +  D    D R S   FL      +
Sbjct: 36  FEEPLIVVLANVLSDEECDELIELSKSKMERSKVGSSRDV--NDIRTSSGAFLE-----E 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  +TN+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSITNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + + LL+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204


>gi|403274090|ref|XP_003928822.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Saimiri
           boliviensis boliviensis]
          Length = 149

 Score = 98.6 bits (244), Expect = 2e-18,   Method: Compositional matrix adjust.
 Identities = 52/126 (41%), Positives = 75/126 (59%), Gaps = 15/126 (11%)

Query: 110 IQTRIQDMTNLVIG-REERYKGP------LQINNYGLGGHYDLHCDATPRDE-------G 155
           + +R    T   +G R  +  GP      LQ+ NYG+GG Y+ H D   +DE       G
Sbjct: 1   MHSRNNGGTPRAVGLRRAQGSGPECAVLGLQVANYGVGGQYEPHFDFARKDEPDAFKELG 60

Query: 156 LW-RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 214
              R+A+++FY++DV  GGAT+FP +  +V+P+KG+AVFWYN  A+   DY   H+ CPV
Sbjct: 61  TGNRIATWLFYMSDVSAGGATVFPEVGASVWPKKGTAVFWYNLFASGEGDYSTRHAACPV 120

Query: 215 ALGNKW 220
            +GNKW
Sbjct: 121 LVGNKW 126


>gi|421749438|ref|ZP_16186877.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
 gi|409771699|gb|EKN53918.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator HPC(L)]
          Length = 319

 Score = 98.2 bits (243), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 65/186 (34%), Positives = 89/186 (47%), Gaps = 18/186 (9%)

Query: 46  DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGD 103
            PR+      +   E   +I LS+G++ R  VVN   GD   +D R S          G+
Sbjct: 126 SPRIALFQRLLMPDECEALIALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQ---VGE 182

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDE 154
           HP + +++ RI  +T + +   E  +G LQI NY  G  Y  H D         A     
Sbjct: 183 HPLIERLEARIAAVTGVPV---EHGEG-LQILNYKPGAEYQPHYDFFNPQRPGEARQLRV 238

Query: 155 GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 214
           G  R+A+ + YL DV  GGAT FP L L V P +G+AVF+     +  LD R  H+G PV
Sbjct: 239 GGQRMATLVIYLNDVPAGGATAFPKLGLRVNPVQGNAVFFAYLGEDGSLDERTLHAGLPV 298

Query: 215 ALGNKW 220
             G KW
Sbjct: 299 EQGEKW 304


>gi|405964867|gb|EKC30309.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
          Length = 591

 Score = 97.8 bits (242), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 73/248 (29%), Positives = 113/248 (45%), Gaps = 41/248 (16%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           +Y   C+      +++ + L+CF      T +     K E +  +PR+   HD I  + I
Sbjct: 326 MYEALCREEQKSLQEL-AKLRCFLR---ETVIPYYKAKEEVVNYEPRIAIFHDVISPTSI 381

Query: 62  NRIIELSKGKVERGKVV--NYGDTIYV------DTRLSKVYFLYPEIFGDHPFLYKIQTR 113
             +  ++     R  V   N G   +V      + R+S+  +L  +   ++P L +++ R
Sbjct: 382 EHLKSVASKGFTRSTVFLENTGPDGHVTYGKLDNVRVSQTSWLGTD---EYPELSRLENR 438

Query: 114 IQDMTNLVIGREERYKG------PLQINNYGLGGHYDLHCDATP---------------R 152
           I+    L  G    YK         Q+ NYG+GG Y +H D T                R
Sbjct: 439 IK----LTTGLSAEYKSVRSHSEKFQVLNYGVGGMYTVHYDYTGYMLGIPSNPLDSDDIR 494

Query: 153 DEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 212
             G  R+A++MFYL DV+ GGAT+FP +   +   KG A FWYN   +   D R  H GC
Sbjct: 495 TSGE-RMATWMFYLNDVKAGGATVFPEVKTRIPVAKGGAAFWYNVRPSGATDPRTLHGGC 553

Query: 213 PVALGNKW 220
           PV +G+KW
Sbjct: 554 PVLVGSKW 561


>gi|390363005|ref|XP_797519.3| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like
           [Strongylocentrotus purpuratus]
          Length = 579

 Score = 97.8 bits (242), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 68/235 (28%), Positives = 116/235 (49%), Gaps = 24/235 (10%)

Query: 3   YPLACQGNLSVPEDIKSNL-KCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           Y   C+G+    + +   L KC Y+ YN+ FL + P K E ++ DPR+V   + + D EI
Sbjct: 329 YEALCRGDPGALKVVDHRLLKCQYQHYNHPFLYLQPAKEEVIFDDPRLVFYRNILNDKEI 388

Query: 62  NRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
             +  L+  +++R  + N   G+  + D R+SK  ++  E   +   +  I+ R+Q  T 
Sbjct: 389 AFVKRLASPRLQRATIQNAITGNLEFADYRISKSAWVKQE---EDQLIRSIRFRVQAYTG 445

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTDVEL 171
           L +   E     LQ+ NYG+GGHY+ H D    +E           R+A+ +FY++    
Sbjct: 446 LELDTAE----DLQVVNYGIGGHYEPHFDFARAEETNAFQSLGTGNRIATALFYVSITCP 501

Query: 172 GGATIFPSLN------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
             ++ +   +      L++    G+AVFWYN   +   +Y   H+ CPV  G+KW
Sbjct: 502 DMSSTYEPRDEIRNGFLSLVYPSGTAVFWYNLRKSGQGNYDTRHAACPVLSGSKW 556


>gi|198466405|ref|XP_001353987.2| GA16752 [Drosophila pseudoobscura pseudoobscura]
 gi|198150585|gb|EAL29723.2| GA16752 [Drosophila pseudoobscura pseudoobscura]
          Length = 510

 Score = 97.8 bits (242), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 67/211 (31%), Positives = 100/211 (47%), Gaps = 15/211 (7%)

Query: 21  LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN--RIIELSKGKVERGKVV 78
           L C Y    + FL + PLK+E L   P +V  H+ +Y+ E+   R I      ++ G   
Sbjct: 285 LACRYNREYSAFLLLAPLKMEVLNQQPLIVLYHEVLYEKELRAMRDIANKNATMQDGWTR 344

Query: 79  NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYG 138
            + D         +V  L+        F   I  RI DMT    G E +    L ++NYG
Sbjct: 345 MHSDQRVKPEPEDRVLKLHIFQGNSESFSPSINRRIADMT----GLEVQGNNALHLSNYG 400

Query: 139 LGGHYDLHCD---ATPRDEGLWR------LASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
           LGG+++ H D    T R    +       LA+ + Y +DV LGGA +FP L ++V P+KG
Sbjct: 401 LGGYFNAHYDYVELTKRPANYFTEWGGDVLATVLLYASDVRLGGAVVFPKLKISVEPKKG 460

Query: 190 SAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +A+ W N +     D    H+ CPV +G+ W
Sbjct: 461 NALIWDNLNNAGNPDKLSKHAVCPVVMGSHW 491


>gi|113869198|ref|YP_727687.1| prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
 gi|113527974|emb|CAJ94319.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia eutropha H16]
          Length = 297

 Score = 97.8 bits (242), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 65/185 (35%), Positives = 90/185 (48%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           P+V      + D E + ++ LS+G++ R  VVN   GD   +D R S           +H
Sbjct: 105 PQVQLFQQLLTDDECDALVALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQ---VAEH 161

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
           P + +I+ RI  +T +     E  +G LQI NY  GG Y  H D         A     G
Sbjct: 162 PLITRIEARIAAVTGVPA---EHGEG-LQILNYKPGGEYQPHFDYFNPQRPGEARQLSVG 217

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+A+ + YL   E GGAT FP + L V P KG+AV++     +  LD R  H+G PVA
Sbjct: 218 GQRIATLVIYLNTPEAGGATAFPRVGLEVAPVKGNAVYFSYLLPDGALDERTLHAGLPVA 277

Query: 216 LGNKW 220
            G KW
Sbjct: 278 FGEKW 282


>gi|194373965|dbj|BAG62295.1| unnamed protein product [Homo sapiens]
          Length = 604

 Score = 97.4 bits (241), Expect = 3e-18,   Method: Compositional matrix adjust.
 Identities = 62/172 (36%), Positives = 96/172 (55%), Gaps = 12/172 (6%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
           +L C YE+ +N +L + P++ E ++L+P +   HD + DSE  +I EL++  ++R  V +
Sbjct: 319 SLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIRELAEPWLQRSVVAS 378

Query: 80  YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
               + V+ R+SK  +L   +    P L  +  RI  +T L +     Y   LQ+ NYG+
Sbjct: 379 GEKQLQVEYRISKSAWLKDTV---DPKLVTLNHRIAALTGLDV--RPPYAEYLQVVNYGI 433

Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTV 184
           GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F   NL+V
Sbjct: 434 GGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSV 485


>gi|355752458|gb|EHH56578.1| hypothetical protein EGM_06023, partial [Macaca fascicularis]
          Length = 586

 Score = 97.4 bits (241), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 61/172 (35%), Positives = 95/172 (55%), Gaps = 12/172 (6%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
           +L C YE+ +N +L + P++ E ++L+P +   HD + DSE  +I E ++  ++R  V +
Sbjct: 306 SLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVAS 365

Query: 80  YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
               + V+ R+SK  +L   +    P L  +  RI  +T L +     Y   LQ+ NYG+
Sbjct: 366 GEKQLQVEYRISKSAWLKDTV---DPMLVTLNHRIAALTGLDV--RPPYAEYLQVVNYGI 420

Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTV 184
           GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F   NL+V
Sbjct: 421 GGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSV 472


>gi|355566863|gb|EHH23242.1| hypothetical protein EGK_06672, partial [Macaca mulatta]
          Length = 583

 Score = 97.4 bits (241), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 61/172 (35%), Positives = 95/172 (55%), Gaps = 12/172 (6%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
           +L C YE+ +N +L + P++ E ++L+P +   HD + DSE  +I E ++  ++R  V +
Sbjct: 303 SLYCSYETNSNAYLLLQPIRKEVIHLEPYIALYHDFVSDSEAQKIREFAEPWLQRSVVAS 362

Query: 80  YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
               + V+ R+SK  +L   +    P L  +  RI  +T L +     Y   LQ+ NYG+
Sbjct: 363 GEKQLQVEYRISKSAWLKDTV---DPMLVTLNHRIAALTGLDV--RPPYAEYLQVVNYGI 417

Query: 140 GGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTV 184
           GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F   NL+V
Sbjct: 418 GGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFIYANLSV 469


>gi|423612451|ref|ZP_17588312.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
 gi|401246040|gb|EJR52392.1| hypothetical protein IIM_03166 [Bacillus cereus VD107]
          Length = 254

 Score = 97.1 bits (240), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 64/181 (35%), Positives = 92/181 (50%), Gaps = 16/181 (8%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYV-DTRLSKVYFLYPEIFG 102
           + +P +V + + + D E + +IELSK K+ER K+   G +  V D R S   FL      
Sbjct: 74  FEEPLIVVLANVLSDEECDELIELSKNKMERSKI---GSSRNVNDIRTSSGAFL-----E 125

Query: 103 DHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRL 159
           ++ F  KI+ RI  +TN+ +   E     L I NY +   Y  H D      R     R+
Sbjct: 126 ENEFTSKIEKRISSITNVPVAHGE----GLHILNYAVDQEYKAHYDYFAEHSRSAANNRI 181

Query: 160 ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           ++ + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G K
Sbjct: 182 STLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEK 241

Query: 220 W 220
           W
Sbjct: 242 W 242


>gi|405964866|gb|EKC30308.1| KRR1 small subunit processome component-like protein [Crassostrea
           gigas]
          Length = 885

 Score = 96.7 bits (239), Expect = 7e-18,   Method: Compositional matrix adjust.
 Identities = 66/234 (28%), Positives = 106/234 (45%), Gaps = 44/234 (18%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV- 77
           + L+CF     +T +     K E +  +PR+   HD I  + I  +  ++   + R  V 
Sbjct: 634 AKLRCFL---RDTVIPYYKAKEEVVNYEPRIAIFHDVISSTSIEHLKSIASKGLTRSTVF 690

Query: 78  -----------VNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
                      + YG    +  R+S+  ++  +   ++P L +++ RIQ    L+ G   
Sbjct: 691 LENTGPNGQVTITYGKQDNI--RVSQTCWIRTD---EYPELLRLENRIQ----LITGLSA 741

Query: 127 RYK------GPLQINNYGLGGHYDLHCDATPRDEGLW--------------RLASFMFYL 166
            YK         Q+ NYG+GG Y  H D T    G+               R+A++MFY+
Sbjct: 742 EYKPVRSHSEKFQVVNYGVGGMYTAHHDYTGYKLGIISNPMDSEDISTSGDRMATWMFYM 801

Query: 167 TDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            D + GGAT+FP +   +   KG A FW+N   +   D R  H GCPV +G+KW
Sbjct: 802 NDAKAGGATVFPEVRTRIPVAKGGAAFWFNLRPSGATDPRTLHGGCPVLVGSKW 855


>gi|255607134|ref|XP_002538686.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223510975|gb|EEF23697.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 318

 Score = 96.3 bits (238), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 64/169 (37%), Positives = 90/169 (53%), Gaps = 20/169 (11%)

Query: 38  LKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV-NYGDTIYVD-TRLS-KVY 94
           +KV  +   PR+    D + D+E + +I  S+ +++R KVV N G   +VD TR S   Y
Sbjct: 117 IKVVMVCTAPRIALFDDVLSDAECDALIAASRSRLQRSKVVANRGSGEFVDDTRTSYGAY 176

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD------ 148
           F      G++  +  IQ RI ++T   +   E    PLQI NYGLGG Y  H D      
Sbjct: 177 FNK----GENSLVATIQRRIAELTRWPLTHAE----PLQILNYGLGGEYLPHFDYFEPQQ 228

Query: 149 ---ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFW 194
               +P + G  R+A+ + YL DVE GG TIFP LNL   P KG A+++
Sbjct: 229 PGLPSPLESGGQRIATVVMYLNDVEAGGGTIFPHLNLETRPRKGGAIYF 277


>gi|260806889|ref|XP_002598316.1| hypothetical protein BRAFLDRAFT_261183 [Branchiostoma floridae]
 gi|229283588|gb|EEN54328.1| hypothetical protein BRAFLDRAFT_261183 [Branchiostoma floridae]
          Length = 531

 Score = 96.3 bits (238), Expect = 8e-18,   Method: Compositional matrix adjust.
 Identities = 70/214 (32%), Positives = 113/214 (52%), Gaps = 22/214 (10%)

Query: 18  KSNLKCFYESYNNTFLKIGPLKVEELY-LDPRVVKIHDAIYDSEINRIIELSKGKVERGK 76
           +S+  C Y    + +  +GP+K+E L+  +P +   HD + +SE  R+ E++  K  R  
Sbjct: 310 RSSASCRY-FRPSPYFYLGPIKMEVLHETNPVIHLFHDIVSESEAARMREMAIPKFHRSV 368

Query: 77  VV--NYGDTIYVDTRLSKV--YFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPL 132
           VV  + GD I ++ R+S+   +F Y     D P + K+  R+   T L     E      
Sbjct: 369 VVGDDGGDAIILN-RVSETAWHFDY-----DDPVVAKLSRRVDYATGLSTA--EGTAEAF 420

Query: 133 QINNYGLGGHYDLHCD------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFP 186
           Q+ NYGLGG Y  H D       T   +   R+ +F+ YL+DV+ GGAT+FP +++ V P
Sbjct: 421 QVVNYGLGGQYIPHTDYFEGDHVTRHIQNGNRVVTFLLYLSDVDAGGATVFPIVDVAV-P 479

Query: 187 EKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
              +AVFW    +  ++   + H+GCPV +G+KW
Sbjct: 480 INSAAVFWSMERSGAVVPNSL-HAGCPVLIGSKW 512


>gi|163941996|ref|YP_001646880.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
 gi|229013455|ref|ZP_04170592.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
 gi|423495146|ref|ZP_17471790.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
 gi|423498060|ref|ZP_17474677.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
 gi|163864193|gb|ABY45252.1| 2OG-Fe(II) oxygenase [Bacillus weihenstephanensis KBAB4]
 gi|228747867|gb|EEL97733.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides DSM 2048]
 gi|401151239|gb|EJQ58691.1| hypothetical protein IEW_04044 [Bacillus cereus CER057]
 gi|401161347|gb|EJQ68714.1| hypothetical protein IEY_01287 [Bacillus cereus CER074]
          Length = 216

 Score = 96.3 bits (238), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 63/180 (35%), Positives = 90/180 (50%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ER KV +  D    D R S   FL      +
Sbjct: 36  FEEPLIVVLANVLSDEECDELIELSKSKMERSKVGSSRDV--NDIRTSSGAFLE-----E 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  +TN+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSITNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|326435474|gb|EGD81044.1| hypothetical protein PTSG_10986 [Salpingoeca sp. ATCC 50818]
          Length = 264

 Score = 96.3 bits (238), Expect = 9e-18,   Method: Compositional matrix adjust.
 Identities = 65/193 (33%), Positives = 96/193 (49%), Gaps = 27/193 (13%)

Query: 43  LYLDPRVVKIHDAIYDSEINRIIELSKGKVERG------KVVNYGDTIYVDTRLSKVYFL 96
           L  DP V++ ++ I    I+ I+  +K K  R       +V NY        R S   ++
Sbjct: 65  LSEDPPVIQFNNFISQERIDAILHFAKPKFARSTSGIEREVSNY--------RTSSTAWM 116

Query: 97  YPEIFGDHPF---LYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD 153
            P++ G+ P    L  ++  I  +  L +  +E +    Q+  Y    +Y +H D     
Sbjct: 117 LPDVLGNDPMQAHLKDMEEEIARIVRLPVENQEHF----QVLQYQKNQYYKVHSDYIEEQ 172

Query: 154 E----GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL-LDYRMY 208
                G+ R+A+F  YL DVE GG T FP+LNLTV P KG+AV WY+A+ NT  +D R  
Sbjct: 173 RQQPCGI-RVATFFLYLNDVEEGGGTRFPNLNLTVQPAKGNAVLWYSAYPNTTRMDSRTD 231

Query: 209 HSGCPVALGNKWG 221
           H   PVA G K+G
Sbjct: 232 HEAMPVAKGMKYG 244


>gi|423489423|ref|ZP_17466105.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
 gi|402431659|gb|EJV63723.1| hypothetical protein IEU_04046 [Bacillus cereus BtB2-4]
          Length = 216

 Score = 95.9 bits (237), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 63/180 (35%), Positives = 90/180 (50%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ER KV +  D    D R S   FL      +
Sbjct: 36  FEEPLIVVLANVLSDEECDELIELSKSKMERSKVGSSRDV--NDIRTSSGAFLE-----E 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  +TN+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSITNVPVSHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|423604110|ref|ZP_17580003.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
 gi|401245796|gb|EJR52149.1| hypothetical protein IIK_00691 [Bacillus cereus VD102]
          Length = 216

 Score = 95.5 bits (236), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 63/180 (35%), Positives = 88/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ R KV +  D    D R S   FL      D
Sbjct: 36  FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTAKIEKRISSIMNVPVSHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   H +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFHQDQSLNELTLHGGAPVTKGEKW 204


>gi|229168980|ref|ZP_04296697.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
 gi|423591765|ref|ZP_17567796.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
 gi|228614572|gb|EEK71680.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH621]
 gi|401231898|gb|EJR38400.1| hypothetical protein IIG_00633 [Bacillus cereus VD048]
          Length = 216

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 90/180 (50%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E   +IELSK  ++R KV +  D    D R S   FL      +
Sbjct: 36  FEEPLIVVLANVLSDEECAELIELSKSNMKRSKVGSSRDV--NDIRTSSGAFL-----EE 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +   +KI+ RI  +TN+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTWKIEKRISSITNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + + LL+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204


>gi|195575137|ref|XP_002105536.1| GD21536 [Drosophila simulans]
 gi|194201463|gb|EDX15039.1| GD21536 [Drosophila simulans]
          Length = 465

 Score = 95.1 bits (235), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 69/221 (31%), Positives = 111/221 (50%), Gaps = 26/221 (11%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y   C+ + S P  +K  L C Y S  + FL + PLK+EE+ L+P +V  HD + D +
Sbjct: 261 EDYKRLCRSSFS-PTPLK--LHCRYNSTTSPFLILAPLKMEEISLEPYIVMYHDILPDKD 317

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           I ++I L++  ++  ++ +       + + S      P + G    L ++  R+ D+T L
Sbjct: 318 IQQLITLAEPLLKPTEMFDENKN---EAKSSD----RPALGG--LLLDRLNERMGDITGL 368

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-RLASFMFYLTDVELGGATIFPS 179
            I +      P+ I  Y  G H +         EG   R+ + MFYL D   GGAT+FP 
Sbjct: 369 QIPQ----GNPINIIKYAFGAHSET--------EGYGDRMDTVMFYLNDAPYGGATVFPH 416

Query: 180 LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           LN+ V  E+G  + WYN + +T  D    H+ CPV  G+++
Sbjct: 417 LNVKVPAERGKVLLWYNLNGDT-QDVTTVHAACPVFHGSEY 456


>gi|229192445|ref|ZP_04319408.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
 gi|228591022|gb|EEK48878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10876]
          Length = 216

 Score = 94.7 bits (234), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + I D E + +IE+SK K+ER K+ +  D    D R S   FL      D
Sbjct: 36  FEEPLIVVLANVISDEECDELIEMSKNKMERSKIGSSRDV--NDIRTSSGAFL-----ED 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSIMNVPVAHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|229111709|ref|ZP_04241257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
 gi|296504733|ref|YP_003666433.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis BMB171]
 gi|423585282|ref|ZP_17561369.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
 gi|423640681|ref|ZP_17616299.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
 gi|228671703|gb|EEL26999.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-15]
 gi|296325785|gb|ADH08713.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis BMB171]
 gi|401233925|gb|EJR40411.1| hypothetical protein IIE_00694 [Bacillus cereus VD045]
 gi|401279742|gb|EJR85664.1| hypothetical protein IK9_00626 [Bacillus cereus VD166]
          Length = 248

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IE+SK K+ER K+ +  D    D R S   FL      D
Sbjct: 68  FEEPLIVVLANVLSDEECDELIEMSKNKMERSKIGSSRDV--NDIRTSSGAFL-----ED 120

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           + F  KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 121 NEFTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 176

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236


>gi|423598444|ref|ZP_17574444.1| hypothetical protein III_01246 [Bacillus cereus VD078]
 gi|423660914|ref|ZP_17636083.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
 gi|401236714|gb|EJR43171.1| hypothetical protein III_01246 [Bacillus cereus VD078]
 gi|401300955|gb|EJS06544.1| hypothetical protein IKM_01311 [Bacillus cereus VDM022]
          Length = 216

 Score = 94.7 bits (234), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 90/180 (50%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K++R KV +  D    D R S   FL      +
Sbjct: 36  FEEPLIVVLANVLSDEECDELIELSKSKMKRSKVGSSRDV--NDIRTSSGAFLE-----E 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  +TN+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSITNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|187930127|ref|YP_001900614.1| procollagen-proline dioxygenase [Ralstonia pickettii 12J]
 gi|187727017|gb|ACD28182.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12J]
          Length = 288

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 60/185 (32%), Positives = 89/185 (48%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           PR+V     + D+E + +I + + +++R  VVN   G+   +  R S+         G+H
Sbjct: 96  PRIVLFQHFLSDAECDELIAIGRNRLKRSPVVNPDTGEENLISARTSQGGMFQ---VGEH 152

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
           P + KI+ RI     + +   E +    Q+ NY  GG Y  H D         A   + G
Sbjct: 153 PLIAKIEVRIAQAVGVPVEHGEGF----QVLNYQPGGEYQPHFDFFNPGRSGEARQLEVG 208

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+A+ + YL  V+ GGAT FP L L V P KG+AVF+     +  LD    H+G PV 
Sbjct: 209 GQRVATMVIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVE 268

Query: 216 LGNKW 220
            G KW
Sbjct: 269 RGEKW 273


>gi|229019457|ref|ZP_04176278.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
 gi|229025700|ref|ZP_04182104.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
 gi|423417837|ref|ZP_17394926.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
 gi|228735575|gb|EEL86166.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1272]
 gi|228741812|gb|EEL91991.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1273]
 gi|401107008|gb|EJQ14965.1| hypothetical protein IE3_01309 [Bacillus cereus BAG3X2-1]
          Length = 216

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 90/180 (50%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K++R KV +  D    D R S   FL      +
Sbjct: 36  FEEPLIVVLANVLSDEECDELIELSKNKMKRSKVGSSRDV--NDIRTSSGAFLE-----E 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  +TN+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSITNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|299065638|emb|CBJ36810.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           CMR15]
          Length = 289

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 60/185 (32%), Positives = 90/185 (48%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           PR+V     + D E +++I L + +++R  VVN   G+   +  R S+         G+H
Sbjct: 97  PRIVLFQHFLSDEECDQLITLGRHRLKRSPVVNPETGEENLISARTSQGAMFQ---VGEH 153

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
           P + +I+ RI   T + +   E +    Q+ +Y  GG Y  H D         A   + G
Sbjct: 154 PLIARIEARIAQATGVPVEHGEGF----QVLHYQPGGEYQPHFDYFNPGRSGEARQLEVG 209

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+A+ + YL  V  GGAT FP L L V P KG+AVF+     +  LD +  H+G PV 
Sbjct: 210 GQRVATLVIYLNSVPAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDKTLHAGLPVE 269

Query: 216 LGNKW 220
            G KW
Sbjct: 270 RGEKW 274


>gi|241664232|ref|YP_002982592.1| procollagen-proline dioxygenase [Ralstonia pickettii 12D]
 gi|309783051|ref|ZP_07677770.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
 gi|404397139|ref|ZP_10988932.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
 gi|240866259|gb|ACS63920.1| Procollagen-proline dioxygenase [Ralstonia pickettii 12D]
 gi|308918159|gb|EFP63837.1| procollagen-proline dioxygenase [Ralstonia sp. 5_7_47FAA]
 gi|348610674|gb|EGY60360.1| hypothetical protein HMPREF0989_00773 [Ralstonia sp. 5_2_56FAA]
          Length = 288

 Score = 94.4 bits (233), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 60/185 (32%), Positives = 88/185 (47%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           PR+V     + D E + +I + + +++R  VVN   G+   +  R S+         G+H
Sbjct: 96  PRIVLFQHFLSDQECDELIAIGRNRLKRSPVVNPDTGEENLISARTSQGGMFQ---VGEH 152

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
           P + KI+ RI     + +   E +    Q+ NY  GG Y  H D         A   + G
Sbjct: 153 PLIAKIEARIAQAVGVPVEHGEGF----QVLNYQPGGEYQPHFDFFNPGRSGEARQLEVG 208

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+A+ + YL  V+ GGAT FP L L V P KG+AVF+     +  LD    H+G PV 
Sbjct: 209 GQRVATMVIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDEDTLHAGLPVE 268

Query: 216 LGNKW 220
            G KW
Sbjct: 269 RGEKW 273


>gi|443730626|gb|ELU16050.1| hypothetical protein CAPTEDRAFT_114796, partial [Capitella teleta]
          Length = 150

 Score = 94.4 bits (233), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 50/121 (41%), Positives = 66/121 (54%), Gaps = 12/121 (9%)

Query: 109 KIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATP---------RDEGLWRL 159
           K+  R+   T L     E+Y    Q++ YG+GGHY+ H D +           ++   R+
Sbjct: 13  KLSRRVSSATKL---DAEKYAELFQVSTYGIGGHYEPHFDFSKVKYFTNPVLNEQMGDRI 69

Query: 160 ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           A+FM YL DVE GG T+FP LNL + P K SAVFW+N   +   D R  H  CPV LG K
Sbjct: 70  ATFMIYLNDVEAGGRTVFPRLNLVIEPIKNSAVFWHNLLDDGQQDDRTIHGACPVVLGRK 129

Query: 220 W 220
           W
Sbjct: 130 W 130


>gi|365158975|ref|ZP_09355162.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
 gi|363625964|gb|EHL76973.1| hypothetical protein HMPREF1014_00625 [Bacillus sp. 7_6_55CFAA_CT2]
          Length = 248

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IE+SK K+ER K+ +  D    D R S   FL      D
Sbjct: 68  FEEPLIVVLANVLSDEECDELIEMSKNKMERSKIGSSRDV--NDIRTSSGAFL-----ED 120

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 121 NELTSKIEKRISSIMNVPVAHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 176

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236


>gi|195494561|ref|XP_002094890.1| GE19962 [Drosophila yakuba]
 gi|194180991|gb|EDW94602.1| GE19962 [Drosophila yakuba]
          Length = 539

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 69/229 (30%), Positives = 104/229 (45%), Gaps = 25/229 (10%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           C+G    P      L C Y    + FLK+ PLK+E L + P ++  HD +Y++E   + +
Sbjct: 302 CRGEW--PPKSSPELICRYNRDTSAFLKLAPLKLEILSVQPVILLYHDVLYENEFKSMRD 359

Query: 67  LSKGKVERGKVVNY------GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            +           Y      G+  + D  +  + F         PF   I  R+  M+  
Sbjct: 360 AAIFNASMIDGWTYYDFDQKGNPKWQDRVVKTIGFQ----GTTAPFTLSINRRLGYMS-- 413

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATP---------RDEGLWRLASFMFYLTDVEL 171
             G E R    L + NYGLGG++  H D             D G   +A+ + Y +DV L
Sbjct: 414 --GLEMRENMMLYLTNYGLGGNFRKHFDYVELAKRPPNFFADSGGDHIATAVLYASDVPL 471

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GG T+F  L L V P+KG+A+ W+N + +   D    HS CPV LG++W
Sbjct: 472 GGTTVFSKLKLAVQPKKGNALVWFNLNHDGKPDPLTEHSVCPVVLGSRW 520


>gi|207744371|ref|YP_002260763.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum IPO1609]
 gi|206595776|emb|CAQ62703.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
           solanacearum IPO1609]
          Length = 280

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 64/217 (29%), Positives = 102/217 (47%), Gaps = 20/217 (9%)

Query: 17  IKSNLKCFYESYNNTFLKIGPLKVEELYL--DPRVVKIHDAIYDSEINRIIELSKGKVER 74
           + S  +   E+ N+  ++    ++  L+    PR+V     + D E + +I L + +++R
Sbjct: 56  VPSPAQAEPEAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKR 115

Query: 75  GKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPL 132
             VVN   G+   +  R S+         G+HP + +I+ RI   T + +   E +    
Sbjct: 116 SPVVNPETGEENLISARTSEGAMFQ---VGEHPLVARIEARIAQATGVPVEHGEGF---- 168

Query: 133 QINNYGLGGHYDLHCD---------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLT 183
           Q+ +Y  GG Y  H D         A   + G  R+A+ + YL  V+ GGAT FP L L 
Sbjct: 169 QVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFPKLGLE 228

Query: 184 VFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           V P KG+AVF+     +  LD    H+G PV  G KW
Sbjct: 229 VAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKW 265


>gi|344169181|emb|CCA81504.1| putative Prolyl 4-hydroxylase alpha subunit [blood disease
           bacterium R229]
          Length = 289

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 63/208 (30%), Positives = 99/208 (47%), Gaps = 20/208 (9%)

Query: 26  ESYNNTFLKIGPLKVEELYL--DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YG 81
           E+ N+  ++    ++  L+    PR+V     + D E + +I L + +++R  VVN   G
Sbjct: 74  EAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETG 133

Query: 82  DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG 141
           +   +  R S+         G+HP + +I+ RI   T + +   E +    Q+ +Y  GG
Sbjct: 134 EENLISARTSQGAMFQ---VGEHPLIARIEARIAQATGVPVEHGEGF----QVLHYQPGG 186

Query: 142 HYDLHCD---------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
            Y  H D         A   + G  R+A+ + YL  V+ GGAT FP L L V P KG+AV
Sbjct: 187 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAV 246

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           F+     +  LD    H+G PV  G KW
Sbjct: 247 FFVYKRPDGTLDDNTLHAGLPVERGEKW 274


>gi|300690371|ref|YP_003751366.1| prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum PSI07]
 gi|299077431|emb|CBJ50057.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           PSI07]
          Length = 289

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 63/208 (30%), Positives = 99/208 (47%), Gaps = 20/208 (9%)

Query: 26  ESYNNTFLKIGPLKVEELYL--DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YG 81
           E+ N+  ++    ++  L+    PR+V     + D E + +I L + +++R  VVN   G
Sbjct: 74  EAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETG 133

Query: 82  DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG 141
           +   +  R S+         G+HP + +I+ RI   T + +   E +    Q+ +Y  GG
Sbjct: 134 EENLISARTSQGAMFQ---VGEHPLIARIEARIAQATGVPVEHGEGF----QVLHYQPGG 186

Query: 142 HYDLHCD---------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
            Y  H D         A   + G  R+A+ + YL  V+ GGAT FP L L V P KG+AV
Sbjct: 187 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAV 246

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           F+     +  LD    H+G PV  G KW
Sbjct: 247 FFVYKRPDGTLDDNTLHAGLPVERGEKW 274


>gi|194290782|ref|YP_002006689.1| prolyl 4-hydroxylase subunit alpha [Cupriavidus taiwanensis LMG
           19424]
 gi|193224617|emb|CAQ70628.1| putative Prolyl 4-hydroxylase alpha subunit [Cupriavidus
           taiwanensis LMG 19424]
          Length = 296

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 63/185 (34%), Positives = 87/185 (47%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           P+V      + D E + ++ LS+G++ R  VVN   GD   +D R S           +H
Sbjct: 104 PQVQLFQQLLSDDECDALVALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQ---VAEH 160

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
             + +I+ RI  +T +     E     LQI NY  GG Y  H D         A     G
Sbjct: 161 ALIARIEARIAAVTGVPADHGEG----LQILNYKPGGEYQPHFDYFNPQRPGEARQLSVG 216

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+A+ + YL   E GGAT FP + L V P KG+AV++     +  LD R  H+G PVA
Sbjct: 217 GQRIATLVIYLNTPEAGGATAFPRVGLEVAPVKGNAVYFSYLLPDGTLDDRTLHAGLPVA 276

Query: 216 LGNKW 220
            G KW
Sbjct: 277 AGEKW 281


>gi|339327280|ref|YP_004686973.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
 gi|338167437|gb|AEI78492.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus necator N-1]
          Length = 297

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 64/185 (34%), Positives = 89/185 (48%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           P+V      + D E + ++ LS+G++ R  VVN   GD   +D R S           +H
Sbjct: 105 PQVQLFQQLLTDDECDALVALSRGRLARSPVVNPDTGDENLIDARTSMGAMFQ---VAEH 161

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
             + +I+ RI  +T +     E  +G LQI NY  GG Y  H D         A     G
Sbjct: 162 ALIARIEARIAAVTGVPA---EHGEG-LQILNYKPGGEYQPHFDYFNPQRPGEARQLSVG 217

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+A+ + YL   E GGAT FP + L V P KG+AV++     +  LD R  H+G PVA
Sbjct: 218 GQRIATLVIYLNTPEAGGATAFPRVGLEVAPVKGNAVYFSYLLPDGTLDERTLHAGLPVA 277

Query: 216 LGNKW 220
            G KW
Sbjct: 278 SGEKW 282


>gi|344172475|emb|CCA85118.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia syzygii R24]
          Length = 289

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 63/208 (30%), Positives = 99/208 (47%), Gaps = 20/208 (9%)

Query: 26  ESYNNTFLKIGPLKVEELYL--DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YG 81
           E+ N+  ++    ++  L+    PR+V     + D E + +I L + +++R  VVN   G
Sbjct: 74  EAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRHRLKRSPVVNPETG 133

Query: 82  DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG 141
           +   +  R S+         G+HP + +I+ RI   T + +   E +    Q+ +Y  GG
Sbjct: 134 EENLISARTSQGAMFQ---VGEHPLIARIEARIAQATGVPVEHGEGF----QVLHYQPGG 186

Query: 142 HYDLHCD---------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
            Y  H D         A   + G  R+A+ + YL  V+ GGAT FP L L V P KG+AV
Sbjct: 187 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAV 246

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           F+     +  LD    H+G PV  G KW
Sbjct: 247 FFVYKRPDGTLDDNTLHAGLPVERGEKW 274


>gi|83746819|ref|ZP_00943867.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
 gi|83726588|gb|EAP73718.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum UW551]
          Length = 289

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 64/217 (29%), Positives = 102/217 (47%), Gaps = 20/217 (9%)

Query: 17  IKSNLKCFYESYNNTFLKIGPLKVEELYL--DPRVVKIHDAIYDSEINRIIELSKGKVER 74
           + S  +   E+ N+  ++    ++  L+    PR+V     + D E + +I L + +++R
Sbjct: 65  VPSPAQAEPEAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKR 124

Query: 75  GKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPL 132
             VVN   G+   +  R S+         G+HP + +I+ RI   T + +   E +    
Sbjct: 125 SPVVNPETGEENLISARTSEGAMFQ---VGEHPLVARIEARIAQATGVPVEHGEGF---- 177

Query: 133 QINNYGLGGHYDLHCD---------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLT 183
           Q+ +Y  GG Y  H D         A   + G  R+A+ + YL  V+ GGAT FP L L 
Sbjct: 178 QVLHYHPGGEYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFPKLGLE 237

Query: 184 VFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           V P KG+AVF+     +  LD    H+G PV  G KW
Sbjct: 238 VAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVERGEKW 274


>gi|421895470|ref|ZP_16325871.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
           solanacearum MolK2]
 gi|206586635|emb|CAQ17221.1| prolyl 4-hydroxylase alpha subunit homologue protein [Ralstonia
           solanacearum MolK2]
          Length = 283

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 60/185 (32%), Positives = 90/185 (48%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           PR+V     + D E + +I L + +++R  VVN   G+   +  R S+         G+H
Sbjct: 91  PRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETGEENLISARTSEGAMFQ---VGEH 147

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
           P + +I+ RI   T + +   E +    Q+ +Y  GG Y  H D         A   + G
Sbjct: 148 PLVARIEARIAQATGVPVEHGEGF----QVLHYHPGGEYQPHFDYFNPGRGGEARQLEVG 203

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+A+ + YL  V+ GGAT FP L L V P KG+AVF+     + +LD    H+G PV 
Sbjct: 204 GQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGMLDDNTLHAGLPVE 263

Query: 216 LGNKW 220
            G KW
Sbjct: 264 RGEKW 268


>gi|423389445|ref|ZP_17366671.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
 gi|401641536|gb|EJS59253.1| hypothetical protein ICG_01293 [Bacillus cereus BAG1X1-3]
          Length = 216

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E   +IELSK K++R KV +  D    D R S   FL      +
Sbjct: 36  FEEPLIVVLANVLSDEECEELIELSKNKMKRSKVGSSRDV--NDIRTSSGAFLE-----E 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  +TN+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSITNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|229180513|ref|ZP_04307855.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
 gi|228602937|gb|EEK60416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 172560W]
          Length = 232

 Score = 94.0 bits (232), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IE+SK K+ER K+ +  D    D R S   FL      D
Sbjct: 52  FEEPLIVVLANVLSDEECDELIEMSKNKMERSKIGSSRDV--NDIRTSSGAFL-----ED 104

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 105 NELTSKIEKRISSIMNVPVAHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 160

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 161 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220


>gi|198466393|ref|XP_001353986.2| GA18007 [Drosophila pseudoobscura pseudoobscura]
 gi|198150579|gb|EAL29722.2| GA18007 [Drosophila pseudoobscura pseudoobscura]
          Length = 455

 Score = 94.0 bits (232), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 62/215 (28%), Positives = 109/215 (50%), Gaps = 32/215 (14%)

Query: 11  LSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKG 70
           +S P  + +++ C Y   +  FL++ P++ E L  +  V   HD     EI  +  L++ 
Sbjct: 256 MSYPRKV-NDVHCRYLR-STPFLQLAPIRQENLDNEAHVYLYHDLFNHEEIEALKSLARP 313

Query: 71  KVERGKVV-NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYK 129
           K++R K+  N+   I    +LS               +  +  RIQD++ + +  +E   
Sbjct: 314 KLKRQKISSNFTCKI---AQLSN---------SAQDIIRTVNRRIQDVSGMDMNEKEM-- 359

Query: 130 GPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
             LQ+ NYG+ G YDL       D+     A+ + ++++V+ GG T+FP L+L V P+KG
Sbjct: 360 --LQVVNYGIAGRYDL-------DDSAGSAATALIFMSNVQQGGETVFPFLSLRVKPQKG 410

Query: 190 SAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 224
           S + W N       D+ + H+ CP+ +GN WG+L+
Sbjct: 411 SLLLWRNT------DWSVLHNSCPLIIGNMWGELI 439


>gi|423518940|ref|ZP_17495421.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
 gi|401159995|gb|EJQ67374.1| hypothetical protein IG7_04010 [Bacillus cereus HuA2-4]
          Length = 216

 Score = 94.0 bits (232), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E   +IELSK  ++R KV +  D    D R S   FL      +
Sbjct: 36  FEEPLIVVLANVLSDEECAELIELSKNNMKRSKVGSSRDV--NDIRTSSGAFLE-----E 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  +TN+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSITNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + + LL+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPQLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204


>gi|423426372|ref|ZP_17403403.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
 gi|401111119|gb|EJQ19018.1| hypothetical protein IE5_04061 [Bacillus cereus BAG3X2-2]
          Length = 248

 Score = 94.0 bits (232), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IE+SK K+ER K+ +  D    D R S   FL      D
Sbjct: 68  FEEPLIVVLANVLSDEECDELIEISKNKMERSKIGSSRDV--NDIRTSSGAFL-----ED 120

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 121 NELTSKIEKRISSIMNVPVAHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 176

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236


>gi|421890664|ref|ZP_16321519.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           K60-1]
 gi|378964031|emb|CCF98267.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           K60-1]
          Length = 288

 Score = 94.0 bits (232), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 64/208 (30%), Positives = 99/208 (47%), Gaps = 20/208 (9%)

Query: 26  ESYNNTFLKIGPLKVEELYL--DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YG 81
           E+ N+  ++    ++  L+    PR+V     + D E + +I L + +++R  VVN   G
Sbjct: 73  EAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETG 132

Query: 82  DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG 141
           +   +  R S+         G+HP + +I+ RI   T + +   E +    Q+ +Y  GG
Sbjct: 133 EENLISARTSEGAMFQ---VGEHPLVARIEARIAQATGVPVEHGEGF----QVLHYHPGG 185

Query: 142 HYDLHCD---------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
            Y  H D         A   D G  R+A+ + YL  V+ GGAT FP L L V P KG+AV
Sbjct: 186 EYQPHFDYFNPGRSGEARQLDVGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAV 245

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           F+     +  LD    H+G PV  G KW
Sbjct: 246 FFVYKRPDGTLDDNTLHAGLPVERGEKW 273


>gi|423512354|ref|ZP_17488885.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
 gi|402449325|gb|EJV81162.1| hypothetical protein IG3_03851 [Bacillus cereus HuA2-1]
          Length = 216

 Score = 94.0 bits (232), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E   +IELSK  ++R KV +  D    D R S   FL      +
Sbjct: 36  FEEPLIVVLANVLSDEECAELIELSKSNMKRSKVGSSRDV--NDIRTSSGAFLE-----E 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  +TN+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSITNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + + LL+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204


>gi|423400914|ref|ZP_17378087.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
 gi|401653904|gb|EJS71447.1| hypothetical protein ICW_01312 [Bacillus cereus BAG2X1-2]
          Length = 216

 Score = 94.0 bits (232), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K++R KV +  D    D R S   FL      D
Sbjct: 36  FEEPLIVVLGNVLSDEECDELIELSKSKMKRSKVGSSRDV--NDIRTSSGAFL-----DD 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTAKIEKRISSIMNVPVSHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|206971296|ref|ZP_03232247.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           AH1134]
 gi|229081494|ref|ZP_04213993.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
 gi|423411965|ref|ZP_17389085.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
 gi|423432249|ref|ZP_17409253.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
 gi|206734068|gb|EDZ51239.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           AH1134]
 gi|228701801|gb|EEL54288.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-2]
 gi|401104033|gb|EJQ12010.1| hypothetical protein IE1_01269 [Bacillus cereus BAG3O-2]
 gi|401117005|gb|EJQ24843.1| hypothetical protein IE7_04065 [Bacillus cereus BAG4O-1]
          Length = 216

 Score = 93.6 bits (231), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IE+SK K+ER K+ +  D    D R S   FL      D
Sbjct: 36  FEEPLIVVLANVLSDEECDELIEMSKNKMERSKIGSSRDV--NDIRTSSGAFL-----ED 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSIMNVPVAHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|229135058|ref|ZP_04263863.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
 gi|228648443|gb|EEL04473.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST196]
          Length = 216

 Score = 93.6 bits (231), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E   +IELSK  ++R KV +  D    D R S   FL      +
Sbjct: 36  FEEPLIVVLANVLSDEECAELIELSKSNMKRSKVGSSRDV--NDIRTSSGAFLE-----E 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  +TN+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSITNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + + LL+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204


>gi|229071739|ref|ZP_04204954.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
 gi|228711334|gb|EEL63294.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus F65185]
          Length = 232

 Score = 93.6 bits (231), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IE+SK K+ER K+ +  D    D R S   FL      D
Sbjct: 52  FEEPLIVVLANVLSDEECDELIEMSKNKMERSKIGSSRDV--NDIRTSSGAFL-----ED 104

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 105 NELTSKIEKRISSIMNVPVAHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 160

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 161 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220


>gi|423478381|ref|ZP_17455096.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
 gi|402428543|gb|EJV60640.1| hypothetical protein IEO_03839 [Bacillus cereus BAG6X1-1]
          Length = 216

 Score = 93.6 bits (231), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K++R KV +  D    D R S   FL      D
Sbjct: 36  FEEPLIVVLGNVLSDEECDELIELSKSKMKRSKVGSSRDV--NDIRTSSGAFL-----DD 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTAKIEKRISSIMNVPVSHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|217961727|ref|YP_002340297.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
           [Bacillus cereus AH187]
 gi|222097680|ref|YP_002531737.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           Q1]
 gi|229198365|ref|ZP_04325071.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
 gi|375286242|ref|YP_005106681.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
           [Bacillus cereus NC7401]
 gi|423354732|ref|ZP_17332357.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
 gi|423566803|ref|ZP_17543050.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
 gi|423574080|ref|ZP_17550199.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
 gi|217067199|gb|ACJ81449.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           AH187]
 gi|221241738|gb|ACM14448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           Q1]
 gi|228585065|gb|EEK43177.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1293]
 gi|358354769|dbj|BAL19941.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           NC7401]
 gi|401086280|gb|EJP94507.1| hypothetical protein IAU_02806 [Bacillus cereus IS075]
 gi|401212649|gb|EJR19392.1| hypothetical protein II9_01301 [Bacillus cereus MSX-D12]
 gi|401215318|gb|EJR22035.1| hypothetical protein II7_00026 [Bacillus cereus MSX-A12]
          Length = 216

 Score = 93.6 bits (231), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E +++IELSK K+ R KV +  D    D R S   FL      D
Sbjct: 36  FEEPLIVVLGNVLSDEECDKLIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTAKIEKRISSIMNVPVSHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|374370415|ref|ZP_09628419.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
 gi|373098067|gb|EHP39184.1| prolyl 4-hydroxylase alpha subunit [Cupriavidus basilensis OR16]
          Length = 454

 Score = 93.6 bits (231), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 62/185 (33%), Positives = 90/185 (48%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           PRV      + D+E + ++ L++G++ R  V+N   GD   ++ R S          G+H
Sbjct: 132 PRVTLFQQLLTDAECDALVALARGRLARSPVINPDTGDENLIEARTSLGAMFQ---VGEH 188

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
           P + +I+  I  +T +     ER +G LQI NY  GG Y  H D         A     G
Sbjct: 189 PLIERIEDCIAAVTGIAA---ERGEG-LQILNYKPGGEYQPHYDFFNPQRPGEARQLKVG 244

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+ + + YL     GGAT FP L L V P KG+AV++    ++  LD R  H+G PV 
Sbjct: 245 GQRVGTLVIYLNSPLAGGATAFPKLGLEVAPVKGNAVYFSYRKSDGALDERTLHAGLPVE 304

Query: 216 LGNKW 220
            G KW
Sbjct: 305 AGEKW 309


>gi|229140971|ref|ZP_04269515.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
 gi|228642547|gb|EEK98834.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST26]
          Length = 232

 Score = 93.6 bits (231), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E +++IELSK K+ R KV +  D    D R S   FL      D
Sbjct: 52  FEEPLIVVLGNVLSDEECDKLIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 104

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 105 NELTAKIEKRISSIMNVPVSHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 160

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 161 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220


>gi|423368291|ref|ZP_17345723.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
 gi|401081042|gb|EJP89322.1| hypothetical protein IC3_03392 [Bacillus cereus VD142]
          Length = 216

 Score = 93.6 bits (231), Expect = 5e-17,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E   +IELSK  ++R KV +  D    D R S   FL      +
Sbjct: 36  FEEPLIVVLANVLSDEECAELIELSKNNMKRSKVGSSRDV--NDIRTSSGAFLE-----E 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  +TN+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSITNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + + LL+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204


>gi|17547533|ref|NP_520935.1| hypothetical protein RSc2814 [Ralstonia solanacearum GMI1000]
 gi|17429837|emb|CAD16521.1| putative prolyl 4-hydroxylase alpha subunit homologue
           oxidoreductase protein [Ralstonia solanacearum GMI1000]
          Length = 289

 Score = 93.6 bits (231), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 60/185 (32%), Positives = 89/185 (48%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           PR+V     + D E +++I L + +++R  VVN   G+   +  R S+         G+H
Sbjct: 97  PRIVLFQHFLSDEECDQLIALGRHRLKRSPVVNPETGEENLISARTSQGAMFQ---VGEH 153

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
           P + +I+ RI   T + +   E +    Q+ +Y  GG Y  H D         A   + G
Sbjct: 154 PLVARIEARIAQATGVPVEHGEGF----QVLHYQPGGEYQPHFDYFNPGRSGEARQLEVG 209

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+A+ + YL  V  GGAT FP L L V P KG+AVF+     +  LD    H+G PV 
Sbjct: 210 GQRVATLVIYLNSVPAGGATGFPKLGLEVAPVKGNAVFFVYKRPDGTLDDNTLHAGLPVE 269

Query: 216 LGNKW 220
            G KW
Sbjct: 270 RGEKW 274


>gi|423395462|ref|ZP_17372663.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
 gi|401654873|gb|EJS72412.1| hypothetical protein ICU_01156 [Bacillus cereus BAG2X1-1]
          Length = 216

 Score = 93.6 bits (231), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 63/180 (35%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E +++IELSK K+ R KV +  D    D R SK  FL      D
Sbjct: 36  FEEPLIVVLGNVLSDEECDKLIELSKNKLARSKVGSSRDV--NDIRTSKGAFL-----DD 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTAKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|229174912|ref|ZP_04302432.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
 gi|228608580|gb|EEK65882.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus MM3]
          Length = 216

 Score = 93.6 bits (231), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 63/180 (35%), Positives = 88/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ R KV +  D    D R SK  FL      D
Sbjct: 36  FEEPLIVVLGNVLSDEECDELIELSKSKLARSKVGSSRDV--NDIRTSKGAFL-----DD 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTVKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|228916870|ref|ZP_04080433.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pulsiensis BGSC 4CC1]
 gi|228842793|gb|EEM87878.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pulsiensis BGSC 4CC1]
          Length = 232

 Score = 93.6 bits (231), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 63/180 (35%), Positives = 88/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ R KV +  D    D R SK  FL      D
Sbjct: 52  FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSKGAFL-----DD 104

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 105 NELTAKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 160

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 161 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220


>gi|229157835|ref|ZP_04285910.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
 gi|228625792|gb|EEK82544.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 4342]
          Length = 232

 Score = 93.2 bits (230), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 63/180 (35%), Positives = 88/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ R KV +  D    D R SK  FL      D
Sbjct: 52  FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSKGAFL-----DD 104

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 105 NELTEKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 160

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 161 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220


>gi|228987427|ref|ZP_04147547.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           tochigiensis BGSC 4Y1]
 gi|228772399|gb|EEM20845.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           tochigiensis BGSC 4Y1]
          Length = 232

 Score = 93.2 bits (230), Expect = 6e-17,   Method: Compositional matrix adjust.
 Identities = 63/180 (35%), Positives = 88/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ R KV +  D    D R SK  FL      D
Sbjct: 52  FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSKGAFL-----DD 104

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 105 NELTEKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 160

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 161 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220


>gi|347966278|ref|XP_003435891.1| AGAP013377-PA [Anopheles gambiae str. PEST]
 gi|333470133|gb|EGK97522.1| AGAP013377-PA [Anopheles gambiae str. PEST]
          Length = 290

 Score = 93.2 bits (230), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 67/221 (30%), Positives = 103/221 (46%), Gaps = 16/221 (7%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           C+G    P  + S+L C+Y+   N    I P KVE L  DP V   H+ ++D EI ++  
Sbjct: 52  CRGVYVPPPSLTSSLYCWYD-VRNAHSVISPSKVEALSNDPFVALFHEFVHDGEIAQLQA 110

Query: 67  LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
           L    +++    N    + V     + Y L+     DHP + ++  RI+  T L     E
Sbjct: 111 LGSMHIKQSGPSN-DSWLPVFYENHQTYTLHDR---DHPVVERLTKRIERRTGLSCDTAE 166

Query: 127 RYKGPLQINNYGLGGHYDLHCDATPRDEGLWR-------LASFMFYLTDVELGGATIFPS 179
                L++    +G       DA  + E   R       LA+ +F+L+DV  GG TIFP 
Sbjct: 167 ----DLKVIYNEVGAFKTAALDAIHKKEDAQRFAYAGDRLATMLFFLSDVTNGGYTIFPK 222

Query: 180 LNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           L + + P+KG+A FWYN       + +M +S CP+    KW
Sbjct: 223 LRVAIRPQKGTAAFWYNLKDTGEGNVQMKYSICPLQDDQKW 263


>gi|47567794|ref|ZP_00238502.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
 gi|47555471|gb|EAL13814.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus G9241]
          Length = 216

 Score = 93.2 bits (230), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 63/180 (35%), Positives = 88/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ R KV +  D    D R SK  FL      D
Sbjct: 36  FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSKGAFL-----DD 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTEKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|423406337|ref|ZP_17383486.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
 gi|401660331|gb|EJS77813.1| hypothetical protein ICY_01022 [Bacillus cereus BAG2X1-3]
          Length = 216

 Score = 93.2 bits (230), Expect = 7e-17,   Method: Compositional matrix adjust.
 Identities = 63/180 (35%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E +++IELSK K+ R KV +  D    D R SK  FL      D
Sbjct: 36  FEEPLIVVLGNVLSDEECDKLIELSKNKLARSKVGSSRDV--NDIRTSKGAFL-----DD 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTAKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|206978009|ref|ZP_03238895.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           H3081.97]
 gi|423373947|ref|ZP_17351286.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
 gi|206743809|gb|EDZ55230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           H3081.97]
 gi|401094762|gb|EJQ02832.1| hypothetical protein IC5_03002 [Bacillus cereus AND1407]
          Length = 216

 Score = 93.2 bits (230), Expect = 8e-17,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 88/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E +++IELSK K+ R KV +  D    D R S   FL      D
Sbjct: 36  FEEPLIVVLGNVLSDEECDKLIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
                KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  DELTAKIEKRISSIMNVPVSHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|423470454|ref|ZP_17447198.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
 gi|402436583|gb|EJV68613.1| hypothetical protein IEM_01760 [Bacillus cereus BAG6O-2]
          Length = 216

 Score = 92.8 bits (229), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ER K+ +  D    D R S   FL      +
Sbjct: 36  FEEPLIVVLANVLSDEECDGLIELSKNKIERSKIGSSRDV--NDIRTSSGAFLE-----E 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSIMNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|423521903|ref|ZP_17498376.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
 gi|401176565|gb|EJQ83760.1| hypothetical protein IGC_01286 [Bacillus cereus HuA4-10]
          Length = 216

 Score = 92.8 bits (229), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 90/180 (50%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E +++IELSK  ++R KV +  D    D R S   FL      +
Sbjct: 36  FEEPLIVVLANVLSDEECDKLIELSKNNMKRSKVGSSRDV--NDIRTSSGAFLE-----E 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  +TN+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSITNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|423452458|ref|ZP_17429311.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
 gi|401140096|gb|EJQ47653.1| hypothetical protein IEE_01202 [Bacillus cereus BAG5X1-1]
          Length = 216

 Score = 92.8 bits (229), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ER K+ +  D    D R S   FL      +
Sbjct: 36  FEEPLIVVLANVLSDEECDGLIELSKNKIERSKIGSSRDV--NDIRTSSGAFLE-----E 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSIMNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|42783360|ref|NP_980607.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 10987]
 gi|42739288|gb|AAS43215.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           ATCC 10987]
          Length = 216

 Score = 92.8 bits (229), Expect = 9e-17,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 88/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ R KV +  D    D R S   FL      D
Sbjct: 36  FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTAKIEKRISSIMNVPVSHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|229163182|ref|ZP_04291137.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
 gi|228620245|gb|EEK77116.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus R309803]
          Length = 229

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 63/180 (35%), Positives = 88/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ R KV +  D    D R SK  FL      D
Sbjct: 49  FEEPLIVVLGNVLSDEECDELIELSKSKLARSKVGSSRDV--NDIRTSKGAFL-----DD 101

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 102 NELTAKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 157

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 158 TLVMYLNDVEEGGETFFPKLNLSVNPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 217


>gi|386332363|ref|YP_006028532.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
 gi|334194811|gb|AEG67996.1| Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum Po82]
          Length = 292

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 63/208 (30%), Positives = 99/208 (47%), Gaps = 20/208 (9%)

Query: 26  ESYNNTFLKIGPLKVEELYL--DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YG 81
           E+ N+  ++    ++  L+    PR+V     + D E + +I L + +++R  VVN   G
Sbjct: 77  EAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETG 136

Query: 82  DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG 141
           +   +  R S+         G+HP + +I+ RI   T + +   E +    Q+ +Y  GG
Sbjct: 137 EENLISARTSEGAMFQ---VGEHPLVARIEARIAQATGVPVEHGEGF----QVLHYHPGG 189

Query: 142 HYDLHCD---------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
            Y  H D         A   + G  R+A+ + YL  V+ GGAT FP L L V P KG+AV
Sbjct: 190 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAV 249

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           F+     +  LD    H+G PV  G KW
Sbjct: 250 FFVYKRPDGTLDDNTLHAGLPVERGEKW 277


>gi|229031885|ref|ZP_04187873.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
 gi|228729503|gb|EEL80492.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH1271]
          Length = 216

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 63/180 (35%), Positives = 87/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E   +IELSK K+ R KV +  D    D R SK  FL      D
Sbjct: 36  FEEPLIVVLGNVLSDEECGELIELSKSKLARSKVGSSRDV--NDIRTSKGAFL-----DD 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTTKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|195352178|ref|XP_002042591.1| GM14978 [Drosophila sechellia]
 gi|194124475|gb|EDW46518.1| GM14978 [Drosophila sechellia]
          Length = 467

 Score = 92.8 bits (229), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 71/221 (32%), Positives = 102/221 (46%), Gaps = 54/221 (24%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           + L CQG    P+  KSNL C Y S  N FL++ PLK+EE+  DP +V  H+ + D EI 
Sbjct: 284 HNLGCQG--LFPK--KSNLVCRYNSSTNAFLQLAPLKMEEVSRDPYIVMFHEVVSDKEIE 339

Query: 63  RIIELSKGKV---ERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
            +    KG++   E GK                          +  F  +I  RI DMT 
Sbjct: 340 EM----KGEITEMENGK--------------------------ESSFSKRINQRISDMTG 369

Query: 120 LVIGREERYKGPLQINNYGLGG----HYDLHCDATPR---DEGLW-RLASFMFYLTDVEL 171
             +   E +   +Q  N+G+GG    HYD + D       +  L  R+ S +FY  +V  
Sbjct: 370 FKL---EEFPA-IQSANFGVGGYFKPHYDYYTDRLKEVDVNNTLGDRIGSIIFYAGEVSQ 425

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 212
           GG T+FP   + V P+KG+A+ W+NA       +R  H  C
Sbjct: 426 GGQTVFPDSKVMVEPKKGNALLWFNAFI-----HRQIHEPC 461


>gi|195166671|ref|XP_002024158.1| GL22696 [Drosophila persimilis]
 gi|194107513|gb|EDW29556.1| GL22696 [Drosophila persimilis]
          Length = 491

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 61/215 (28%), Positives = 109/215 (50%), Gaps = 32/215 (14%)

Query: 11  LSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKG 70
           +S P  + +++ C Y   +  FL++ P++ E L  +  V   HD     EI  +  L++ 
Sbjct: 292 MSYPRKV-NDVHCRYLR-STPFLQLAPIRQENLDNEAHVYLYHDLFNHEEIEALKSLARP 349

Query: 71  KVERGKVV-NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYK 129
           +++R K+  N+   I    +LS               +  +  RIQD++ + +  +E   
Sbjct: 350 RLKRQKISSNFTCKI---AQLSN---------SAQDIIRTVNRRIQDVSGMDMNEKE--- 394

Query: 130 GPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
             LQ+ NYG+ G YDL       D+     A+ + ++++V+ GG T+FP L+L V P+KG
Sbjct: 395 -VLQVVNYGIAGRYDL-------DDSAGSAATALIFMSNVQQGGETVFPFLSLRVKPQKG 446

Query: 190 SAVFWYNAHANTLLDYRMYHSGCPVALGNKWGKLL 224
           S + W N       D+ + H+ CP+ +GN WG+L+
Sbjct: 447 SLLLWRNT------DWSVLHNSCPLIIGNMWGELI 475


>gi|198449528|ref|XP_002136919.1| GA26870 [Drosophila pseudoobscura pseudoobscura]
 gi|198130648|gb|EDY67477.1| GA26870 [Drosophila pseudoobscura pseudoobscura]
          Length = 491

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 73/211 (34%), Positives = 103/211 (48%), Gaps = 31/211 (14%)

Query: 18  KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
           KS L C + S+  +F     LKVEE+ LDP +V  HD +   E+    EL K        
Sbjct: 286 KSTLHCRF-SWRPSF--YARLKVEEVLLDPYIVLYHDVVSGKEM----ELLK-------- 330

Query: 78  VNYGDTIY----VDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQ 133
            +YG T      + + LS  +   PE     P +  +  R+ DMT L +   E +     
Sbjct: 331 -DYGRTNLTHDPLRSGLSAKHCALPESL---PLVQSLHQRLWDMTGLSLNGSESWL---- 382

Query: 134 INNYGLGGHYDLHCDATPRDE----GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
           I NYG+GG   LH D     E    G  RL +   +L++V  GG T+FP+L + V P+ G
Sbjct: 383 ITNYGIGGFLGLHKDYFDEIEEELQGDNRLFTIQIFLSNVSQGGYTVFPNLEVAVKPQAG 442

Query: 190 SAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +A+ +YN   + + D R  H GCPV  GNKW
Sbjct: 443 TALVFYNLLDSLVGDTRTRHFGCPVIDGNKW 473


>gi|423657194|ref|ZP_17632493.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
 gi|401289937|gb|EJR95641.1| hypothetical protein IKG_04182 [Bacillus cereus VD200]
          Length = 248

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 88/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IE+SK K+ER K+ +  D    D R S   FL      D
Sbjct: 68  FEEPLIVVLANVLSDEECDELIEMSKNKMERSKIGSSRDV--NDIRTSSGAFL-----ED 120

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 121 NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 176

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236


>gi|229061929|ref|ZP_04199257.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
 gi|228717372|gb|EEL69042.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH603]
          Length = 216

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E   +IELSK  ++R KV +  D    D R S   FL      +
Sbjct: 36  FEEPLIVVLANVLSDEECAELIELSKSNMKRSKVGSSRDV--NDIRTSSGAFLE-----E 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  +TN+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSITNVPVVHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + + LL+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQLLNELTLHGGAPVTKGEKW 204


>gi|253575459|ref|ZP_04852796.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
 gi|251845106|gb|EES73117.1| prolyl 4-hydroxylase [Paenibacillus sp. oral taxon 786 str. D14]
          Length = 215

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 58/182 (31%), Positives = 97/182 (53%), Gaps = 15/182 (8%)

Query: 43  LYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFG 102
           L+ +P +++    + D E  ++IE +  ++   K+VN    +  + R S+  F   E   
Sbjct: 26  LHKEPLIMRFERLLTDDECRQLIEAAAPRLRESKLVN---KVVSEIRTSRGMFFEEE--- 79

Query: 103 DHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG----GHYDLHCDATPRDEGLWR 158
           ++PF+++I+ RI  + N+ I   E  +G LQ+ +YG G     HYD     +P      R
Sbjct: 80  ENPFIHRIEKRISALMNVPI---EHAEG-LQVLHYGPGQEYQAHYDFFGPNSPSASN-NR 134

Query: 159 LASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGN 218
           +++ + YL DVE GG T+FP L+L V PE+GSA+++   +    L+    HS  PV  G 
Sbjct: 135 ISTLIIYLNDVEAGGETVFPLLDLEVKPERGSALYFEYFYRQQELNNLTLHSSVPVVRGE 194

Query: 219 KW 220
           KW
Sbjct: 195 KW 196


>gi|423527903|ref|ZP_17504348.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
 gi|402451566|gb|EJV83385.1| hypothetical protein IGE_01455 [Bacillus cereus HuB1-1]
          Length = 248

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E +++IE+SK K++R KV +  D    D R S   FL      D
Sbjct: 68  FEEPLIVVLANVLSDEECDKLIEMSKNKMKRSKVGSSRDV--NDIRTSSGAFL-----ED 120

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 121 NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 176

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236


>gi|300702992|ref|YP_003744594.1| prolyl 4-hydroxylase subunit alpha [Ralstonia solanacearum
           CFBP2957]
 gi|299070655|emb|CBJ41950.1| putative Prolyl 4-hydroxylase alpha subunit [Ralstonia solanacearum
           CFBP2957]
          Length = 289

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 63/208 (30%), Positives = 99/208 (47%), Gaps = 20/208 (9%)

Query: 26  ESYNNTFLKIGPLKVEELYL--DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YG 81
           E+ N+  ++    ++  L+    PR+V     + D E + +I L + +++R  VVN   G
Sbjct: 74  EAENSNAVRTSDREIPILFAIETPRIVLFQHFLSDEECDELIALGRYRLKRSPVVNPETG 133

Query: 82  DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG 141
           +   +  R S+         G+HP + +I+ RI   T + +   E +    Q+ +Y  GG
Sbjct: 134 EENLISARTSEGAMFQ---VGEHPLVARIEARIAQATGVPVEHGEGF----QVLHYHPGG 186

Query: 142 HYDLHCD---------ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
            Y  H D         A   + G  R+A+ + YL  V+ GGAT FP L L V P KG+AV
Sbjct: 187 EYQPHFDYFNPGRSGEARQLEVGGQRVATLVIYLNSVQAGGATGFPKLGLEVAPVKGNAV 246

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           F+     +  LD    H+G PV  G KW
Sbjct: 247 FFVYKRPDGTLDDNTLHAGLPVERGEKW 274


>gi|384182063|ref|YP_005567825.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           finitimus YBT-020]
 gi|324328147|gb|ADY23407.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           finitimus YBT-020]
          Length = 216

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 88/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ R KV +  D    D R S   FL      D
Sbjct: 36  FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTAKIEKRISSIMNVPVSHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDRSLNELTLHGGAPVTKGEKW 204


>gi|75760922|ref|ZP_00740932.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           israelensis ATCC 35646]
 gi|423385740|ref|ZP_17362996.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
 gi|423561293|ref|ZP_17537569.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
 gi|74491592|gb|EAO54798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           israelensis ATCC 35646]
 gi|401201550|gb|EJR08415.1| hypothetical protein II5_00697 [Bacillus cereus MSX-A1]
 gi|401635796|gb|EJS53551.1| hypothetical protein ICE_03486 [Bacillus cereus BAG1X1-2]
          Length = 248

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E +++IE+SK K++R KV +  D    D R S   FL      D
Sbjct: 68  FEEPLIVVLANVLSDEECDKLIEMSKNKMKRSKVGSSRDV--NDIRTSSGAFL-----ED 120

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 121 NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 176

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236


>gi|423358724|ref|ZP_17336227.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
 gi|401084596|gb|EJP92842.1| hypothetical protein IC1_00704 [Bacillus cereus VD022]
          Length = 248

 Score = 92.4 bits (228), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E +++IE+SK K++R KV +  D    D R S   FL      D
Sbjct: 68  FEEPLIVVLANVLSDEECDKLIEMSKNKMKRSKVGSSRDV--NDIRTSSGAFL-----ED 120

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 121 NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 176

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236


>gi|417402369|gb|JAA48034.1| Putative prolyl 4-hydroxylase alpha subunit [Desmodus rotundus]
          Length = 529

 Score = 92.0 bits (227), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 66/196 (33%), Positives = 102/196 (52%), Gaps = 13/196 (6%)

Query: 7   CQGNLSVPEDIKS-NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           CQ   S P   ++ +L C YE+  + +L + P++ E ++L+P VV  HD + D E  +I 
Sbjct: 305 CQTLGSQPTHYQNPSLHCSYETGASPYLLLQPIRKEVVHLEPYVVLYHDFVNDLEAQKIR 364

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
             ++  ++R  V +    + V+ R+SK  +L   +    P L  +  RI  +T L    +
Sbjct: 365 GFAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTV---DPMLVTLDRRIAALTGL--DTQ 419

Query: 126 ERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFP 178
             Y   LQ+ NYG+GGHY+ H D AT     L+R+      A+FM YL+ VE GGAT F 
Sbjct: 420 PPYAEHLQVVNYGIGGHYEPHFDHATSPSSPLYRMKSGNRVATFMIYLSSVEAGGATAFI 479

Query: 179 SLNLTVFPEKGSAVFW 194
             N +V   K S+  W
Sbjct: 480 YANFSVPVVKCSSPRW 495


>gi|30022316|ref|NP_833947.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
 gi|229129515|ref|ZP_04258486.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
 gi|29897873|gb|AAP11148.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus ATCC 14579]
 gi|228654120|gb|EEL09987.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-Cer4]
          Length = 232

 Score = 92.0 bits (227), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 88/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IE+SK K+ER K+ +  D    D R S   FL      D
Sbjct: 52  FEEPLIVVLANVLSDEECDELIEMSKNKMERSKIGSSRDV--NDIRTSSGAFL-----ED 104

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 105 NKLTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 160

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 161 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220


>gi|402555628|ref|YP_006596899.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus FRI-35]
 gi|401796838|gb|AFQ10697.1| prolyl 4-hydroxylase alpha subunit [Bacillus cereus FRI-35]
          Length = 216

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E   +IELSK K+ R KV +  D    D R S   FL      D
Sbjct: 36  FEEPLIVVLGNVLSDEECGELIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTAKIEKRISSIMNVPVSHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|430751569|ref|YP_007214477.1| 2OG-Fe(II) oxygenase [Thermobacillus composti KWC4]
 gi|430735534|gb|AGA59479.1| 2OG-Fe(II) oxygenase superfamily enzyme [Thermobacillus composti
           KWC4]
          Length = 215

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 58/189 (30%), Positives = 100/189 (52%), Gaps = 15/189 (7%)

Query: 36  GPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYF 95
           G ++   L+ +P +V+    + D E  ++IE +  +++  K+VN    +  D R S+  F
Sbjct: 19  GVVEATVLHQEPLIVRFERLLSDDECRQLIETAAPRLKESKLVN---KVVSDIRTSRGMF 75

Query: 96  LYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----ATP 151
              E   + PF+++I+ RI  + N+ I   E  +G LQ+ +YG G  Y  H D     +P
Sbjct: 76  FEEE---ESPFIHRIERRIAQLMNVPI---EHAEG-LQVLHYGPGQEYKAHHDFFAPGSP 128

Query: 152 RDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSG 211
                 R+++ + YL DVE GG T+FP L + + P++G+A+++   + N  L+    HS 
Sbjct: 129 AARN-NRISTLIVYLNDVEEGGETVFPLLGIAMKPKRGAALYFEYFYRNQALNDLTLHSS 187

Query: 212 CPVALGNKW 220
            PV  G KW
Sbjct: 188 VPVVRGEKW 196


>gi|94312029|ref|YP_585239.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
 gi|93355881|gb|ABF09970.1| prolyl 4-hydroxylase [Cupriavidus metallidurans CH34]
          Length = 293

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 59/185 (31%), Positives = 89/185 (48%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           PR++ + + + D+E + ++ L++ +++R  VVN   GD   +D R S          G+H
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPVVNPDTGDENLIDARTSMGAMFQ---VGEH 157

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
             L +I+ RI  +T   +   E +    Q+ NY  GG Y  H D         A     G
Sbjct: 158 ALLQRIEARIAAVTGWPVEHGEGF----QVLNYKPGGEYQPHFDFFNPKRPGEARQLRVG 213

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+A+ + YL     GGAT FP + L V P KG+AV +     +  LD R  H+G PV 
Sbjct: 214 GQRVATMVIYLNSPASGGATAFPRIGLEVAPVKGNAVLFSYGLPDGALDERTLHAGLPVE 273

Query: 216 LGNKW 220
            G KW
Sbjct: 274 AGEKW 278


>gi|430808003|ref|ZP_19435118.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
 gi|429499635|gb|EKZ98045.1| prolyl 4-hydroxylase [Cupriavidus sp. HMR-1]
          Length = 293

 Score = 92.0 bits (227), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 59/185 (31%), Positives = 89/185 (48%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           PR++ + + + D+E + ++ L++ +++R  VVN   GD   +D R S          G+H
Sbjct: 101 PRILLLQNLLDDAECDAVVALARDRLQRSPVVNPDTGDENLIDARTSMGAMFQ---VGEH 157

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
             L +I+ RI  +T   +   E +    Q+ NY  GG Y  H D         A     G
Sbjct: 158 ALLQRIEARIAAVTGWPVEHGEGF----QVLNYKPGGEYQPHFDFFNPKRPGEARQLRVG 213

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+A+ + YL     GGAT FP + L V P KG+AV +     +  LD R  H+G PV 
Sbjct: 214 GQRVATMVIYLNSPASGGATAFPRIGLEVAPVKGNAVLFSYGLPDGALDERTLHAGLPVE 273

Query: 216 LGNKW 220
            G KW
Sbjct: 274 AGEKW 278


>gi|228902749|ref|ZP_04066896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
           4222]
 gi|228967277|ref|ZP_04128313.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           sotto str. T04001]
 gi|402564350|ref|YP_006607074.1| prolyl 4-hydroxylase subunit alpha domain-containing protein
           [Bacillus thuringiensis HD-771]
 gi|434377355|ref|YP_006611999.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           thuringiensis HD-789]
 gi|228792646|gb|EEM40212.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           sotto str. T04001]
 gi|228856936|gb|EEN01449.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL
           4222]
 gi|401793002|gb|AFQ19041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           thuringiensis HD-771]
 gi|401875912|gb|AFQ28079.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           thuringiensis HD-789]
          Length = 216

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E +++IE+SK K++R KV +  D    D R S   FL      D
Sbjct: 36  FEEPLIVVLANVLSDEECDKLIEMSKNKMKRSKVGSSRDV--NDIRTSSGAFL-----ED 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|289526401|gb|ADD01323.1| FI13021p [Drosophila melanogaster]
 gi|373432715|gb|AEY70761.1| FI17809p1 [Drosophila melanogaster]
          Length = 193

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 54/145 (37%), Positives = 76/145 (52%), Gaps = 20/145 (13%)

Query: 89  RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD 148
           R +K ++L  E    +    +I  RI DMT   +   E +    Q+ NYG+GGHY LH D
Sbjct: 28  RTAKGFWLKKE---SNELTKRITRRIMDMTGFDLADSEGF----QVINYGIGGHYFLHMD 80

Query: 149 ----ATPRDEGLW---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWY 195
               A+                R+A+ +FYLTDVE GGAT+F  +   V P+ G+A+FWY
Sbjct: 81  YFDFASSNHTDTRSRYSIDLGDRIATVLFYLTDVEQGGATVFGDVGYYVSPQAGTAIFWY 140

Query: 196 NAHANTLLDYRMYHSGCPVALGNKW 220
           N   +   D R  H+ CPV +G+KW
Sbjct: 141 NLDTDGNGDPRTRHAACPVIVGSKW 165


>gi|312385412|gb|EFR29925.1| hypothetical protein AND_00803 [Anopheles darlingi]
          Length = 468

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 58/169 (34%), Positives = 84/169 (49%), Gaps = 16/169 (9%)

Query: 7   CQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           C+G         + L+C Y S    FLKI PLK+EE+ LDP +V  H  I D+EI  IIE
Sbjct: 284 CRGESPRTASEMAKLRCRYVSNRVPFLKIAPLKLEEVSLDPFIVVYHQVISDNEIKTIIE 343

Query: 67  LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
           +S+  + R  V +         R S   +L   +   HP +  +  R +DMT L +   E
Sbjct: 344 ISRDSLRRAMVGDVAKQEVSKARTSSNAWLDDPM---HPHVRSLSRRTEDMTGLTMWAAE 400

Query: 127 RYKGPLQINNYGLGGHYDLHCDATPRDEGLW---------RLASFMFYL 166
           +    LQ+ NYG+GGHY  H D    +EG+          R+A+ M+Y+
Sbjct: 401 Q----LQVGNYGIGGHYLPHFDYGTPEEGVELYPNIEKGNRIATVMYYV 445


>gi|49187135|ref|YP_030387.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. Sterne]
 gi|228947951|ref|ZP_04110238.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           monterrey BGSC 4AJ1]
 gi|49181062|gb|AAT56438.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Sterne]
 gi|228811938|gb|EEM58272.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           monterrey BGSC 4AJ1]
          Length = 232

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ R KV +  D    D R S   FL      D
Sbjct: 52  FEEPLIVVLGNVLSDEECDELIELSKSKLARSKVGSSRDV--NDIRTSSGAFL-----DD 104

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 105 NELTAKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 160

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 161 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220


>gi|340787855|ref|YP_004753320.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
           [Collimonas fungivorans Ter331]
 gi|340553122|gb|AEK62497.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit
           [Collimonas fungivorans Ter331]
          Length = 289

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 62/185 (33%), Positives = 90/185 (48%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDH 104
           PR +   + +   E +++I LSK K+ R  VV++  G+T   + R S   F +    G  
Sbjct: 100 PRAILFGNVLSHDECDQLIALSKTKLLRSGVVDHQTGNTKLHEHRTSSGTFFH---RGTT 156

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
           PF+  I  R+  +  +     E +   LQI NY +GG Y  H D         A     G
Sbjct: 157 PFIAMIDKRLAALMQV----PESHGEGLQILNYQMGGEYRPHYDYFRPDAPGSAKHLARG 212

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R A+ + YL DV+ GG TIFP   L++ P KGSA+++   +A   LD   +H G PV 
Sbjct: 213 GQRTATLIIYLNDVDGGGETIFPRNGLSIVPAKGSAIYFSYTNAENQLDSLSFHGGSPVI 272

Query: 216 LGNKW 220
            G KW
Sbjct: 273 EGEKW 277


>gi|229075940|ref|ZP_04208916.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
 gi|229117732|ref|ZP_04247101.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
 gi|407706764|ref|YP_006830349.1| alpha/beta fold family hydrolase [Bacillus thuringiensis MC28]
 gi|423377905|ref|ZP_17355189.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
 gi|423464099|ref|ZP_17440867.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
 gi|423547540|ref|ZP_17523898.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
 gi|423622677|ref|ZP_17598455.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
 gi|228665709|gb|EEL21182.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock1-3]
 gi|228707255|gb|EEL59452.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock4-18]
 gi|401179261|gb|EJQ86434.1| hypothetical protein IGO_03975 [Bacillus cereus HuB5-5]
 gi|401260797|gb|EJR66965.1| hypothetical protein IK3_01275 [Bacillus cereus VD148]
 gi|401636171|gb|EJS53925.1| hypothetical protein IC9_01258 [Bacillus cereus BAG1O-2]
 gi|402420366|gb|EJV52637.1| hypothetical protein IEK_01286 [Bacillus cereus BAG6O-1]
 gi|407384449|gb|AFU14950.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis MC28]
          Length = 216

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 60/180 (33%), Positives = 88/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + I D E N +IE+SK K++R  + +  D    D R S   FL      +
Sbjct: 36  FEEPLIVVLGNVISDEECNELIEMSKNKIKRSTIGSARDV--NDIRTSSGAFLE-----E 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSIMNVPVTHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|30264308|ref|NP_846685.1| prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. Ames]
 gi|47529753|ref|YP_021102.1| prolyl 4-hydroxylase subunit alpha [Bacillus anthracis str. 'Ames
           Ancestor']
 gi|65321616|ref|ZP_00394575.1| hypothetical protein Bant_01005109 [Bacillus anthracis str. A2012]
 gi|165873278|ref|ZP_02217887.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0488]
 gi|167634610|ref|ZP_02392930.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0442]
 gi|167638693|ref|ZP_02396969.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0193]
 gi|170687507|ref|ZP_02878724.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0465]
 gi|170709341|ref|ZP_02899757.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0389]
 gi|177655890|ref|ZP_02937082.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0174]
 gi|190566156|ref|ZP_03019075.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Tsiankovskii-I]
 gi|196034803|ref|ZP_03102210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           W]
 gi|227817011|ref|YP_002817020.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus
           anthracis str. CDC 684]
 gi|228929280|ref|ZP_04092307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pondicheriensis BGSC 4BA1]
 gi|228935557|ref|ZP_04098373.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           andalousiensis BGSC 4AW1]
 gi|229123754|ref|ZP_04252949.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
 gi|229604260|ref|YP_002868528.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0248]
 gi|254683996|ref|ZP_05147856.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. CNEVA-9066]
 gi|254721830|ref|ZP_05183619.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A1055]
 gi|254736344|ref|ZP_05194050.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Western North America USA6153]
 gi|254741382|ref|ZP_05199069.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Kruger B]
 gi|254753983|ref|ZP_05206018.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Vollum]
 gi|254757854|ref|ZP_05209881.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Australia 94]
 gi|386738126|ref|YP_006211307.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
 gi|421506493|ref|ZP_15953416.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
 gi|421638315|ref|ZP_16078911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
 gi|30258953|gb|AAP28171.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Ames]
 gi|47504901|gb|AAT33577.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. 'Ames Ancestor']
 gi|164710995|gb|EDR16563.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0488]
 gi|167513541|gb|EDR88911.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0193]
 gi|167530062|gb|EDR92797.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0442]
 gi|170125767|gb|EDS94678.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0389]
 gi|170668702|gb|EDT19448.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0465]
 gi|172079923|gb|EDT65028.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0174]
 gi|190563075|gb|EDV17041.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. Tsiankovskii-I]
 gi|195992342|gb|EDX56303.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           W]
 gi|227005734|gb|ACP15477.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. CDC 684]
 gi|228659889|gb|EEL15534.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus 95/8201]
 gi|228824095|gb|EEM69911.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           andalousiensis BGSC 4AW1]
 gi|228830570|gb|EEM76180.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pondicheriensis BGSC 4BA1]
 gi|229268668|gb|ACQ50305.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus
           anthracis str. A0248]
 gi|384387978|gb|AFH85639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. H9401]
 gi|401823486|gb|EJT22633.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. UR-1]
 gi|403394741|gb|EJY91981.1| Prolyl 4-hydroxylase alpha subunit [Bacillus anthracis str. BF1]
          Length = 216

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ R KV +  D    D R S   FL      D
Sbjct: 36  FEEPLIVVLGNVLSDEECDELIELSKSKLARSKVGSSRDV--NDIRTSSGAFL-----DD 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTAKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|196046329|ref|ZP_03113555.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           03BB108]
 gi|376268135|ref|YP_005120847.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
           F837/76]
 gi|196022799|gb|EDX61480.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           03BB108]
 gi|364513935|gb|AEW57334.1| Peptidyl prolyl 4- hydroxylase like protein [Bacillus cereus
           F837/76]
          Length = 216

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ R KV +  D    D R S   FL      D
Sbjct: 36  FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTAKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|196041590|ref|ZP_03108882.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           NVH0597-99]
 gi|218905373|ref|YP_002453207.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           AH820]
 gi|225866219|ref|YP_002751597.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           03BB102]
 gi|423550018|ref|ZP_17526345.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
 gi|196027578|gb|EDX66193.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           NVH0597-99]
 gi|218537435|gb|ACK89833.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           AH820]
 gi|225786013|gb|ACO26230.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           03BB102]
 gi|401189634|gb|EJQ96684.1| hypothetical protein IGW_00649 [Bacillus cereus ISP3191]
          Length = 216

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ R KV +  D    D R S   FL      D
Sbjct: 36  FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTAKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|301055727|ref|YP_003793938.1| prolyl 4-hydroxylase subunit alpha [Bacillus cereus biovar
           anthracis str. CI]
 gi|300377896|gb|ADK06800.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus biovar
           anthracis str. CI]
          Length = 216

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ R KV +  D    D R S   FL      D
Sbjct: 36  FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTAKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|229186477|ref|ZP_04313640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
 gi|228596991|gb|EEK54648.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BGSC 6E1]
          Length = 216

 Score = 91.7 bits (226), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ R KV +  D    D R S   FL      D
Sbjct: 36  FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTAKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVIYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|423457579|ref|ZP_17434376.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
 gi|401147963|gb|EJQ55456.1| hypothetical protein IEI_00719 [Bacillus cereus BAG5X2-1]
          Length = 216

 Score = 91.3 bits (225), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ R KV +  D    D R S   FL      D
Sbjct: 36  FEEPLIVVLGNVLSDEECDELIELSKSKLARSKVGSSRDV--NDIRTSSGAFLE-----D 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTVKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|52141260|ref|YP_085568.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
 gi|51974729|gb|AAU16279.1| prolyl 4-hydroxylase, alpha subunit [Bacillus cereus E33L]
          Length = 232

 Score = 91.3 bits (225), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ R KV +  D    D R S   FL      D
Sbjct: 52  FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 104

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 105 NELTAKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 160

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 161 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220


>gi|281307110|pdb|3ITQ|A Chain A, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
           Anthracis
 gi|281307111|pdb|3ITQ|B Chain B, Crystal Structure Of A Prolyl 4-Hydroxylase From Bacillus
           Anthracis
          Length = 216

 Score = 91.3 bits (225), Expect = 2e-16,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ R KV +  D    D R S   FL      D
Sbjct: 36  FEEPLIVVLGNVLSDEECDELIELSKSKLARSKVGSSRDV--NDIRTSSGAFL-----DD 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTAKIEKRISSIXNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVXYLNDVEEGGETFFPKLNLSVHPRKGXAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|118479416|ref|YP_896567.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis str. Al
           Hakam]
 gi|118418641|gb|ABK87060.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis str. Al
           Hakam]
          Length = 232

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ R KV +  D    D R S   FL      D
Sbjct: 52  FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 104

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 105 NELTAKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 160

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 161 TLVIYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220


>gi|49480949|ref|YP_038297.1| prolyl 4-hydroxylase subunit alpha [Bacillus thuringiensis serovar
           konkukian str. 97-27]
 gi|49332505|gb|AAT63151.1| prolyl 4-hydroxylase, alpha subunit [Bacillus thuringiensis serovar
           konkukian str. 97-27]
          Length = 232

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ R KV +  D    D R S   FL      D
Sbjct: 52  FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 104

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 105 NELTEKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 160

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 161 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220


>gi|229093299|ref|ZP_04224414.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
 gi|228690082|gb|EEL43879.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-42]
          Length = 232

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 62/180 (34%), Positives = 87/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K+ R KV +  D    D R S   FL      D
Sbjct: 52  FEEPLIVVLGNVLSDEECDELIELSKNKLARSKVGSSRDV--NDIRTSSGAFL-----DD 104

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 105 NELTEKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 160

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 161 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 220


>gi|229146822|ref|ZP_04275187.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
 gi|228636650|gb|EEK93115.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus BDRD-ST24]
          Length = 216

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 87/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IE+SK K+ER K+ +  D    D R S   FL      D
Sbjct: 36  FEEPLIVVLANVLSDEECDELIEMSKNKMERSKIGSSRDV--NDIRTSSGAFLE-----D 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   +    L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQGQSLNELTLHGGAPVTKGEKW 204


>gi|228960501|ref|ZP_04122151.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pakistani str. T13001]
 gi|229047930|ref|ZP_04193506.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
 gi|423630961|ref|ZP_17606708.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
 gi|423650103|ref|ZP_17625673.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
 gi|228723387|gb|EEL74756.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus AH676]
 gi|228799198|gb|EEM46165.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           pakistani str. T13001]
 gi|401264328|gb|EJR70440.1| hypothetical protein IK5_03811 [Bacillus cereus VD154]
 gi|401282521|gb|EJR88420.1| hypothetical protein IKA_03890 [Bacillus cereus VD169]
          Length = 248

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 88/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IE+SK K++R KV +  D    D R S   FL      D
Sbjct: 68  FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSRDV--NDIRTSSGAFL-----ED 120

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 121 NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 176

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236


>gi|423558182|ref|ZP_17534484.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
 gi|401191450|gb|EJQ98472.1| hypothetical protein II3_03386 [Bacillus cereus MC67]
          Length = 216

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 60/180 (33%), Positives = 89/180 (49%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IELSK K++R K+ +  D    D R S   FL      +
Sbjct: 36  FEEPLIVVLANVLSDEECDGLIELSKNKIKRSKIGSSRDV--NDIRTSSGAFL-----EE 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSIMNVPVAHGE----GLHILNYEVDQEYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|389770666|ref|ZP_10192118.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
 gi|388429637|gb|EIL86932.1| procollagen-proline dioxygenase [Rhodanobacter sp. 115]
          Length = 286

 Score = 91.3 bits (225), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 65/194 (33%), Positives = 97/194 (50%), Gaps = 22/194 (11%)

Query: 38  LKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYF 95
           L+VE+    P +  +   +   E + +I  +  K++R  +V+   G    +  R S+  F
Sbjct: 90  LRVEQ----PVLAVLDGVLSHEECDELIRRAAAKLQRSTIVDPTTGKHETIADRSSEGTF 145

Query: 96  LYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE- 154
              EI  D  F+ ++  RI  + NL +   E     LQI +YG GG Y  H D  P  + 
Sbjct: 146 F--EINADD-FIARLDRRISALMNLPVDHGEG----LQILHYGPGGEYKPHFDFFPPGDP 198

Query: 155 --------GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYR 206
                   G  R+++ + YL +VE GGATIFP L L+V P+KGSAV++   ++   LD R
Sbjct: 199 GSAVQMATGGQRVSTLVMYLNEVEDGGATIFPELGLSVLPKKGSAVYFEYTNSRGQLDPR 258

Query: 207 MYHSGCPVALGNKW 220
             H G PV  G KW
Sbjct: 259 TLHGGAPVLRGEKW 272


>gi|423437685|ref|ZP_17414666.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
 gi|423503075|ref|ZP_17479667.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
 gi|401120840|gb|EJQ28636.1| hypothetical protein IE9_03866 [Bacillus cereus BAG4X12-1]
 gi|402459296|gb|EJV91033.1| hypothetical protein IG1_00641 [Bacillus cereus HD73]
          Length = 248

 Score = 90.9 bits (224), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 88/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IE+SK K++R KV +  D    D R S   FL      D
Sbjct: 68  FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSARDV--NDIRTSSGAFL-----ED 120

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 121 NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 176

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236


>gi|228954520|ref|ZP_04116545.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. T03a001]
 gi|449091198|ref|YP_007423639.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. HD73]
 gi|228805177|gb|EEM51771.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. T03a001]
 gi|449024955|gb|AGE80118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           kurstaki str. HD73]
          Length = 216

 Score = 90.9 bits (224), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 88/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IE+SK K++R KV +  D    D R S   FL      D
Sbjct: 36  FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSARDV--NDIRTSSGAFL-----ED 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|195159168|ref|XP_002020454.1| GL13504 [Drosophila persimilis]
 gi|194117223|gb|EDW39266.1| GL13504 [Drosophila persimilis]
          Length = 491

 Score = 90.9 bits (224), Expect = 3e-16,   Method: Compositional matrix adjust.
 Identities = 72/211 (34%), Positives = 103/211 (48%), Gaps = 31/211 (14%)

Query: 18  KSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
           KS L C + S+  +F     LKVEE+ LDP +V  HD +   E+    EL K        
Sbjct: 286 KSTLHCRF-SWRPSF--YARLKVEEVLLDPYIVLYHDVVSGKEM----ELLK-------- 330

Query: 78  VNYGDTIY----VDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQ 133
            +YG T      + + LS  +   PE     P +  +  R+ DMT L +   E +     
Sbjct: 331 -DYGRTNLTHDPLRSGLSAKHCALPESL---PLVQSLHQRLWDMTGLSLNGSESWL---- 382

Query: 134 INNYGLGGHYDLHCDATPRDE----GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
           I NYG+GG   LH D     E    G  RL +   +L++V  GG T+FP+L + V P+ G
Sbjct: 383 ITNYGIGGFLGLHKDYFDEIEEELQGDNRLFTIQIFLSNVSQGGYTVFPNLEVAVKPQAG 442

Query: 190 SAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +A+ +YN   + + D R  H GCPV  G+KW
Sbjct: 443 TALVFYNLLDSLVGDTRTRHFGCPVIDGDKW 473


>gi|149180354|ref|ZP_01858859.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
 gi|148852546|gb|EDL66691.1| prolyl 4-hydroxylase, alpha subunit [Bacillus sp. SG-1]
          Length = 212

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 58/176 (32%), Positives = 92/176 (52%), Gaps = 10/176 (5%)

Query: 46  DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHP 105
           +P +V + + + D E + +I LSK K++R K+ N  +    D R S   F+     G+  
Sbjct: 36  EPLIVVLGNVLSDEECDALIGLSKDKLKRSKIGNTRNE--NDMRTSSSTFMEE---GESE 90

Query: 106 FLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFY 165
            + +++ RI  + N+     E     LQI NY +G  Y  H D   ++    R+++ + Y
Sbjct: 91  VVTRVEKRISQIMNIPYENGE----GLQILNYKIGQEYKAHFDFF-KNASNPRISTLVMY 145

Query: 166 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
           L DVE GG T FP LN +V P+KG AV++   + N  L+    H G PV +G+KW 
Sbjct: 146 LNDVEEGGETYFPKLNFSVSPQKGMAVYFEYFYDNQELNDLTLHGGAPVIIGDKWA 201


>gi|423582447|ref|ZP_17558558.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
 gi|401213326|gb|EJR20067.1| hypothetical protein IIA_03962 [Bacillus cereus VD014]
          Length = 248

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 87/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IE+SK K++R KV +  D    D R S   FL      D
Sbjct: 68  FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSRDV--NDIRTSSGAFL-----ED 120

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
                KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 121 SELTLKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 176

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236


>gi|423634936|ref|ZP_17610589.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
 gi|401278922|gb|EJR84852.1| hypothetical protein IK7_01345 [Bacillus cereus VD156]
          Length = 248

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 87/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IE+SK K++R KV +  D    D R S   FL      D
Sbjct: 68  FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSRDV--NDIRTSSGAFL-----ED 120

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
                KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 121 SELTLKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 176

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236


>gi|229152436|ref|ZP_04280628.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
 gi|228631044|gb|EEK87681.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus m1550]
          Length = 248

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 60/180 (33%), Positives = 87/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E   +IE+SK K+ER K+ +  D    D R S   FL      D
Sbjct: 68  FEEPLIVVLANVLSDEECGELIEMSKNKMERSKIGSSRDV--NDIRTSSGAFL-----ED 120

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 121 NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 176

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  ++    H G PV  G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSINELTLHGGAPVTKGEKW 236


>gi|377811809|ref|YP_005044249.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
 gi|357941170|gb|AET94726.1| ProCollegen-proline dioxygenase [Burkholderia sp. YI23]
          Length = 283

 Score = 90.9 bits (224), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 58/186 (31%), Positives = 89/186 (47%), Gaps = 18/186 (9%)

Query: 46  DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGD 103
           +P V  + D +   E +R+IE+ + +V R  VV+   G  + +D R S+  F+       
Sbjct: 90  EPVVALLADVLSPRECDRLIEIGRERVRRSSVVDPDSGGEVLIDARKSEGAFVNGST--- 146

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE--------- 154
            P +  I  RI ++    +   E     L I  YG GG Y  H D  P ++         
Sbjct: 147 DPLVATIDRRIAELVQQPVENGE----DLHILRYGAGGEYRPHFDYFPEEQAGSKHHMQR 202

Query: 155 GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 214
           G  R+A+ + YL  VE GG T FP + LT+ P +G+A+++   +A    D R  H+G PV
Sbjct: 203 GGQRIATLILYLNQVEEGGDTTFPDIGLTIHPRRGAALYFEYVNALGQTDPRTLHAGMPV 262

Query: 215 ALGNKW 220
             G KW
Sbjct: 263 ERGEKW 268


>gi|402813396|ref|ZP_10862991.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
 gi|402509339|gb|EJW19859.1| hypothetical protein PAV_1c08470 [Paenibacillus alvei DSM 29]
          Length = 215

 Score = 90.5 bits (223), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 61/181 (33%), Positives = 94/181 (51%), Gaps = 16/181 (8%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFG 102
           Y +P +V + + + + E + +IE SK +++R K+   G+   V+  R S   F       
Sbjct: 33  YEEPLIVILGNVLSNEECDELIEHSKERLQRSKI---GEERSVNQIRTSSGVFCE----- 84

Query: 103 DHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRL 159
           ++  + KI+ RI  + N+ I     +   LQ+  Y  G  Y  H D    T R     R+
Sbjct: 85  ENETVAKIEKRISQIMNIPI----EHGDGLQVLLYAPGQEYKPHFDFFADTSRASANNRI 140

Query: 160 ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           ++ + YL DVE GG T FP LNL+VFP KG AV++   ++N  L+ R  H+G PV  G K
Sbjct: 141 STLVMYLNDVEEGGETTFPMLNLSVFPSKGMAVYFEYFYSNHELNERTLHAGAPVRKGEK 200

Query: 220 W 220
           W
Sbjct: 201 W 201


>gi|423483822|ref|ZP_17460512.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
 gi|401141373|gb|EJQ48928.1| hypothetical protein IEQ_03600 [Bacillus cereus BAG6X1-2]
          Length = 216

 Score = 90.5 bits (223), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 59/180 (32%), Positives = 88/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + I D E + +IE+SK K++R  + +  D    D R S   FL      +
Sbjct: 36  FEEPLIVVLGNVISDEECDELIEMSKNKIKRSTIGSSRDV--NDIRTSSGAFLE-----E 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSIMNVPVAHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|228922987|ref|ZP_04086280.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           huazhongensis BGSC 4BD1]
 gi|228836620|gb|EEM81968.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           huazhongensis BGSC 4BD1]
          Length = 216

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 87/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IE+SK K++R KV +  D    D R S   FL      D
Sbjct: 36  FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSRDV--NDIRTSSGAFL-----ED 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
                KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 89  SELTLKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|423615424|ref|ZP_17591258.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
 gi|401259961|gb|EJR66134.1| hypothetical protein IIO_00750 [Bacillus cereus VD115]
          Length = 216

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 59/180 (32%), Positives = 88/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + I D E + +IE+SK K++R  + +  D    D R S   FL      +
Sbjct: 36  FEEPLIVVLGNVISDEECDELIEMSKNKIKRSTIGSSRDV--NDIRTSSGAFLE-----E 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSIMNVPVAHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|218231188|ref|YP_002369041.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           B4264]
 gi|218159145|gb|ACK59137.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           B4264]
          Length = 216

 Score = 90.5 bits (223), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 60/180 (33%), Positives = 87/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E   +IE+SK K+ER K+ +  D    D R S   FL      D
Sbjct: 36  FEEPLIVVLANVLSDEECGELIEMSKNKMERSKIGSSRDV--NDIRTSSGAFLE-----D 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  ++    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSINELTLHGGAPVTKGEKW 204


>gi|390352104|ref|XP_003727818.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like
           [Strongylocentrotus purpuratus]
          Length = 121

 Score = 90.1 bits (222), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 48/93 (51%), Positives = 57/93 (61%), Gaps = 5/93 (5%)

Query: 132 LQINNYGLGGHYDLHCDATPRDEGLW----RLASFMFYLTDVELGGATIFPSLNLTVFPE 187
           LQI NYGLGGHY  H D T RD        R+AS +FYL+DV  GG T+F      + PE
Sbjct: 7   LQIANYGLGGHYLPHFDFT-RDVATHKNGNRIASMLFYLSDVAKGGDTVFIDAGAKIKPE 65

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           KGSA+FWYN   N  +D R  H+ CPV  G+KW
Sbjct: 66  KGSAIFWYNLFKNGKVDERTKHASCPVISGSKW 98


>gi|228941395|ref|ZP_04103947.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           berliner ATCC 10792]
 gi|228974327|ref|ZP_04134896.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           thuringiensis str. T01001]
 gi|228980919|ref|ZP_04141223.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
 gi|384188306|ref|YP_005574202.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           chinensis CT-43]
 gi|410676625|ref|YP_006928996.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
 gi|452200698|ref|YP_007480779.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
           thuringiensis serovar thuringiensis str. IS5056]
 gi|228778855|gb|EEM27118.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
 gi|228785377|gb|EEM33387.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           thuringiensis str. T01001]
 gi|228818321|gb|EEM64394.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           berliner ATCC 10792]
 gi|326942015|gb|AEA17911.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis serovar
           chinensis CT-43]
 gi|409175754|gb|AFV20059.1| prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis Bt407]
 gi|452106091|gb|AGG03031.1| Peptidyl prolyl 4-hydroxylase-like protein, alpha subunit [Bacillus
           thuringiensis serovar thuringiensis str. IS5056]
          Length = 216

 Score = 90.1 bits (222), Expect = 6e-16,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 87/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E   +IE+SK K++R KV +  D    D R S   FL      D
Sbjct: 36  FEEPLIVVLANVLSDEECGELIEMSKNKMKRSKVGSSRDV--NDIRTSSGAFLE-----D 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|73542634|ref|YP_297154.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
           eutropha JMP134]
 gi|72120047|gb|AAZ62310.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Ralstonia
           eutropha JMP134]
          Length = 282

 Score = 89.7 bits (221), Expect = 7e-16,   Method: Compositional matrix adjust.
 Identities = 58/176 (32%), Positives = 85/176 (48%), Gaps = 18/176 (10%)

Query: 56  IYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR 113
           + D+E + ++EL++G++ R  V+N   GD   +D R S          G+H  + +I+ R
Sbjct: 99  LSDAECDALVELARGRLARSPVINPDTGDENLIDARTSMGAMFQ---VGEHTLIQRIEDR 155

Query: 114 IQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEGLWRLASFMF 164
           I  +  + +   E     LQI NY  GG Y  H D         A     G  R A+ + 
Sbjct: 156 IAAVLGVPVDHGEG----LQILNYKPGGEYQPHFDFFNPKRPGEARQLRVGGQRTATLVI 211

Query: 165 YLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           YL   + GGAT FP + L V P KG+AV++     +  LD R  H+G PV  G KW
Sbjct: 212 YLNTPQAGGATAFPRIGLEVAPVKGNAVYFSYLQPDGKLDERTLHAGLPVQSGEKW 267


>gi|423541303|ref|ZP_17517694.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
 gi|401172491|gb|EJQ79712.1| hypothetical protein IGK_03395 [Bacillus cereus HuB4-10]
          Length = 216

 Score = 89.7 bits (221), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 59/180 (32%), Positives = 88/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + I D E + +IE+SK K++R  + +  D    D R S   FL      +
Sbjct: 36  FEEPLIVVLGNVISDEECDELIEMSKNKIKRSTIGSSRDV--NDIRTSSGAFLE-----E 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSIMNVPVTHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|228910069|ref|ZP_04073889.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
 gi|228849586|gb|EEM94420.1| Prolyl 4-hydroxylase alpha subunit [Bacillus thuringiensis IBL 200]
          Length = 248

 Score = 89.7 bits (221), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 88/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IE+SK K++R KV +  D    D R S   FL      D
Sbjct: 68  FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSRDV--NDIRTSSGAFL-----ED 120

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 121 NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAVNNRIS 176

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 177 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 236


>gi|241598365|ref|XP_002404734.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215500465|gb|EEC09959.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 524

 Score = 89.7 bits (221), Expect = 8e-16,   Method: Compositional matrix adjust.
 Identities = 68/240 (28%), Positives = 104/240 (43%), Gaps = 33/240 (13%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           E Y   C+G       + S L+C Y S  + F K+ P+K+EE  L P VV + D + D +
Sbjct: 274 ENYKRLCRGEQLRTPKMDSQLRCRYYSGESGFFKLQPIKLEEYNLKPYVVVLRDLLQDRD 333

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIF---------GDHPFLYKIQ 111
           +  +I  +K +V +              +LS+   +Y + +          D P   ++ 
Sbjct: 334 LADMIAFAKPRVRK-------------LQLSRRILVYSKHYCDTSTWLNDDDAPVAARVN 380

Query: 112 TRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLW------RLA 160
             +Q +  L     +      Q+ NYG+GGHY  H D      T R   +       R+A
Sbjct: 381 QYLQSLLGLGTLYSKDEAEKYQLANYGIGGHYVPHHDYLEETLTSRHVSIVTRLFGDRVA 440

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + M Y++DVE GGAT+FPSL + V P+K S  F            R        A+ NKW
Sbjct: 441 TLMIYMSDVEEGGATVFPSLGVRVSPKKVSMQFIRAVMRWVAFTLREVCVSFCCAVANKW 500


>gi|402584932|gb|EJW78873.1| hypothetical protein WUBG_10221 [Wuchereria bancrofti]
          Length = 187

 Score = 89.7 bits (221), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 45/124 (36%), Positives = 67/124 (54%), Gaps = 10/124 (8%)

Query: 103 DHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW----- 157
           +H  + +I  R+   TNL    E      LQ+ NYG+GGHY+ H D + R+         
Sbjct: 14  EHEVVNRINKRLDLATNL----ETETAEELQVQNYGIGGHYEPHYDCSRRESVFEKTKNG 69

Query: 158 -RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVAL 216
            R+A+ + Y+T  E+GG T+F  L  ++   K +A+FWYN   +  +D R YH+ CPV  
Sbjct: 70  NRIATILIYMTKPEIGGGTVFIDLKTSISCTKNAALFWYNLMRSGAVDIRSYHAACPVLT 129

Query: 217 GNKW 220
           G KW
Sbjct: 130 GTKW 133


>gi|403238305|ref|ZP_10916891.1| procollagen-proline dioxygenase [Bacillus sp. 10403023]
          Length = 296

 Score = 89.7 bits (221), Expect = 9e-16,   Method: Compositional matrix adjust.
 Identities = 51/172 (29%), Positives = 91/172 (52%), Gaps = 14/172 (8%)

Query: 56  IYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR 113
           + + E +++IE+S+ +++   V++   G+      R SK    Y     ++ F+ K++ R
Sbjct: 118 LSEEECDQLIEMSRERLKPSTVIDPKTGEEKAATGRTSKGMSFY---LQENEFIKKVEKR 174

Query: 114 IQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPR-----DEGLWRLASFMFYLTD 168
           I ++    +   E     LQ+ NYG+G  Y  H D  P+     ++G  R+ +F+ YL D
Sbjct: 175 IAELIEFPVENGEG----LQVLNYGIGEEYKSHFDYFPQSKVVPEKGGQRVGTFLIYLND 230

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           V  GG T+FP   +++ P+KGSAV++   ++   +D    HS  PV+ G KW
Sbjct: 231 VPAGGETVFPKAGVSIVPKKGSAVYFQYGNSKGEVDRMSLHSSIPVSEGEKW 282


>gi|194745802|ref|XP_001955376.1| GF16267 [Drosophila ananassae]
 gi|190628413|gb|EDV43937.1| GF16267 [Drosophila ananassae]
          Length = 385

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 67/218 (30%), Positives = 100/218 (45%), Gaps = 31/218 (14%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           I+   C+G  +VP+  K  L+C Y +  + FL++ PLK+E+L LDP +   HD I   E 
Sbjct: 46  IHLETCRGRNTVPK--KFYLRCRYFTEGDPFLQLAPLKLEQLNLDPFIGIFHDVISIGEQ 103

Query: 62  NRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
             +I L++ ++             V+   SK              + +I  RI+DMT L 
Sbjct: 104 KNLINLTRNRLRLQNPQRAVMEAEVELNASKE-------------VERIHRRIEDMTGLN 150

Query: 122 IGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLN 181
           +  EE    PL I NYG+GG + +H D               F L+DV++GG   FP L 
Sbjct: 151 L--EE--SPPLTILNYGIGGQHPIHLDCE------------QFMLSDVQMGGYASFPELG 194

Query: 182 LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
               P +GSA+  +N       D R   + CP A+  K
Sbjct: 195 FGFKPSRGSALVVHNMDNAANCDIRSLQATCPGAVTFK 232


>gi|218899396|ref|YP_002447807.1| prolyl 4-hydroxylase subunit alpha domain protein [Bacillus cereus
           G9842]
 gi|218542449|gb|ACK94843.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus cereus
           G9842]
          Length = 216

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 61/180 (33%), Positives = 88/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + + D E + +IE+SK K++R KV +  D    D R S   FL      D
Sbjct: 36  FEEPLIVVLANVLSDEECDELIEMSKNKMKRSKVGSSRDV--NDIRTSSGAFL-----ED 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+     E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSIMNVPASHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAVNNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETYFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|196011906|ref|XP_002115816.1| hypothetical protein TRIADDRAFT_59903 [Trichoplax adhaerens]
 gi|190581592|gb|EDV21668.1| hypothetical protein TRIADDRAFT_59903 [Trichoplax adhaerens]
          Length = 444

 Score = 89.4 bits (220), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 60/197 (30%), Positives = 96/197 (48%), Gaps = 16/197 (8%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           Y   C+ + +    + + LKC+Y  +N +  L   P+ VEE+   P +   HD I   E 
Sbjct: 238 YTKLCRSHKNYQTSLNNGLKCYY--FNQSPLLHFNPVAVEEISYSPVIRLYHDIISHQEA 295

Query: 62  NRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQT----RIQDM 117
             +  +S  K+   +       + +    S+    Y   F  H +L  I      R+  +
Sbjct: 296 EILKNISSKKLTVARTF-----VQIMPNNSEAEGEYR--FAKHAWLGDIDNQVVRRLSVL 348

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE--GLWRLASFMFYLTDVELGGAT 175
           +  + G +  Y   LQ+ NYG+GGHY  H D+   D+  G  RLA+ MFYL+DV++GGAT
Sbjct: 349 SEELTGLDLSYAEKLQVANYGVGGHYSPHYDSASIDDDTGKPRLATIMFYLSDVDIGGAT 408

Query: 176 IFPSLNLTVFPEKGSAV 192
           +FP +   +FP K S +
Sbjct: 409 VFPDIGKAIFPRKTSEI 425


>gi|156333122|ref|XP_001619372.1| hypothetical protein NEMVEDRAFT_v1g151555 [Nematostella vectensis]
 gi|156202442|gb|EDO27272.1| predicted protein [Nematostella vectensis]
          Length = 144

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 48/116 (41%), Positives = 68/116 (58%), Gaps = 8/116 (6%)

Query: 109 KIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW---RLASFMF 164
           +I  R+Q  + L +   E     LQ+ NYG+GGHY+ H D A  +   L    R+A+F+ 
Sbjct: 16  RISYRVQAYSGLNMTTSE----DLQVVNYGIGGHYEPHYDFARDKFTSLGTGNRIATFLS 71

Query: 165 YLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           YL+DVE GG T+F  +  TV+P+KG A FWYN   +   D    H+ CPV +G+KW
Sbjct: 72  YLSDVEAGGGTVFTRVGATVWPQKGDAAFWYNLKRSGDGDSSTRHAACPVLVGSKW 127


>gi|229104864|ref|ZP_04235524.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
 gi|228678581|gb|EEL32798.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-28]
          Length = 216

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 59/180 (32%), Positives = 87/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + I D E   +IE+SK K++R  + +  D    D R S   FL      +
Sbjct: 36  FEEPLIVVLGNVISDEECGELIEMSKNKIKRSTIGSSRDV--NDIRTSSGAFLE-----E 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSIMNVPVTHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G PV  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGAPVTKGEKW 204


>gi|89096248|ref|ZP_01169141.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
           NRRL B-14911]
 gi|89089102|gb|EAR68210.1| prolyl 4-hydroxylase, alpha subunit domain protein [Bacillus sp.
           NRRL B-14911]
          Length = 217

 Score = 89.0 bits (219), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 56/179 (31%), Positives = 92/179 (51%), Gaps = 12/179 (6%)

Query: 46  DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHP 105
           +P +V + + + D E   +I +S+ K++R K+   G+T  VD   +     + E  G++ 
Sbjct: 38  EPLIVILGNVLSDEECEGLIRMSEDKLKRSKI---GNTRTVDDIRTSSSMFFEE--GENE 92

Query: 106 FLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW---RLASF 162
            + +I+ R+  + N+ +   E     LQ+ NY +G  Y  H D            R+++ 
Sbjct: 93  LVARIERRLSQIMNIPVEHGE----GLQMLNYHIGQEYKAHFDFFSSSSRAASNPRISTL 148

Query: 163 MFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
           + YL DVE GG T FP LN +V P+KGSAV++   + N  L+    H G PV  G+KW 
Sbjct: 149 VMYLNDVEEGGETYFPKLNFSVNPQKGSAVYFEYFYDNQDLNDLTLHGGAPVIKGSKWA 207


>gi|372266874|ref|ZP_09502922.1| peptidyl prolyl 4-hydroxylase-like protein subunit alpha
           [Alteromonas sp. S89]
          Length = 294

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 61/210 (29%), Positives = 104/210 (49%), Gaps = 22/210 (10%)

Query: 25  YESYNNTFLKIGPLKVEELYL--DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--Y 80
           + ++N   + +G  +VE  +    P +V   + + + E + ++E+S+  +   +VVN  +
Sbjct: 79  FPTFNTGVIPLGDQQVEARFAIRQPNIVLFANFLAEWECDALVEMSRPNLSPSRVVNTQH 138

Query: 81  GDTIYVDTRLSK-VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
           G      +R S   +F      G+ P +  I+ RI  +  +     E +  PLQI +Y +
Sbjct: 139 GAFELKPSRTSGGTHFAR----GETPLIADIEARIASLLKV----PEAHGEPLQILHYPV 190

Query: 140 GG----HYDLHCDATPRDE-----GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGS 190
            G    HYD      P ++     G  R+ + + YL+DVE GGAT+FP + L V P+KG+
Sbjct: 191 SGEYRPHYDFFDPEKPGNQEVLAAGGQRVGTLIMYLSDVESGGATVFPRVGLEVQPQKGA 250

Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           A+F+     +  LD +  H G PV  G KW
Sbjct: 251 ALFFSYVGEHGKLDLQSLHGGSPVLAGEKW 280


>gi|229098707|ref|ZP_04229647.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
 gi|423441025|ref|ZP_17417931.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
 gi|423533441|ref|ZP_17509859.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
 gi|228684786|gb|EEL38724.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-29]
 gi|402417686|gb|EJV49986.1| hypothetical protein IEA_01355 [Bacillus cereus BAG4X2-1]
 gi|402463660|gb|EJV95360.1| hypothetical protein IGI_01273 [Bacillus cereus HuB2-9]
          Length = 216

 Score = 88.6 bits (218), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 59/180 (32%), Positives = 87/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + I D E N +IE+SK K++R  + +  D    D R S   FL      +
Sbjct: 36  FEEPLIVVLGNVISDEECNELIEMSKNKIKRSTIGSARDV--NDIRTSSGAFLE-----E 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSIMNVPVTHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G  V  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGASVTKGEKW 204


>gi|403234403|ref|ZP_10912989.1| Procollagen-proline dioxygenase [Bacillus sp. 10403023]
          Length = 217

 Score = 88.2 bits (217), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 57/180 (31%), Positives = 94/180 (52%), Gaps = 15/180 (8%)

Query: 46  DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDH 104
           +P +V + + + D E + +I LSK ++ R K+ N      VD  R S   F+      ++
Sbjct: 38  EPLIVVLGNVLSDEECDELIRLSKDRINRSKIANAN----VDNMRTSSSTFIEE---NEN 90

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD--ATPRDE-GLWRLAS 161
             + +I+ RI  + N+       Y   LQI NY +G  Y  H D  ++P +     R+++
Sbjct: 91  IIVSRIEKRISQIMNI----PTEYGEGLQILNYQVGQEYKSHFDFFSSPHNAINNPRIST 146

Query: 162 FMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
            + YL+DVE GG T FP L+ +V P+KG AV++   + +  L+    H G PV +G+KW 
Sbjct: 147 LVMYLSDVEQGGETYFPKLHFSVSPQKGMAVYFEYFYNDQTLNELTLHGGAPVIVGDKWA 206


>gi|433460968|ref|ZP_20418587.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
 gi|432190746|gb|ELK47751.1| prolyl 4-hydroxylase alpha subunit [Halobacillus sp. BAB-2008]
          Length = 211

 Score = 87.4 bits (215), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 57/182 (31%), Positives = 88/182 (48%), Gaps = 14/182 (7%)

Query: 46  DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHP 105
           +P++  + + + + E   +I LSK KV R K+ +  D    D R S   FL      D  
Sbjct: 33  EPKIAILGNVVSEEECEALIRLSKDKVNRSKIGSDHDV--SDIRTSSSAFL-----PDDE 85

Query: 106 FLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLASF 162
              +I+ R+  + N+ +   E     + I +Y  G  Y  H D   +T R     R+++ 
Sbjct: 86  LTGRIEKRLAQIMNVPVEHGE----GIHILHYKPGQEYKAHHDYFRSTSRAAKNPRISTL 141

Query: 163 MFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGK 222
           + YL DVE GG T FP +NLTV P KG AV++   + +  ++ R  H G PV  G KW  
Sbjct: 142 VLYLNDVEEGGETYFPEMNLTVSPHKGMAVYFEYFYNDPAINERTLHGGSPVTAGEKWAA 201

Query: 223 LL 224
            +
Sbjct: 202 TM 203


>gi|444517246|gb|ELV11441.1| Prolyl 4-hydroxylase subunit alpha-2 [Tupaia chinensis]
          Length = 466

 Score = 87.4 bits (215), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 59/192 (30%), Positives = 94/192 (48%), Gaps = 25/192 (13%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 288 DVYESLCRGEGVKLTPRRQKRLFCRYHHGNRAPQLLIAPFKEEDEWDSPHIVRYYDVMSD 347

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 348 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 404

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATI 176
           +T L +   E     LQ+ NYG+GG Y+ H D +               ++DVE GGAT+
Sbjct: 405 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFS--------------RMSDVEAGGATV 446

Query: 177 FPSLNLTVFPEK 188
           FP L   ++P+K
Sbjct: 447 FPDLGAAIWPKK 458


>gi|242003035|ref|XP_002436120.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215499456|gb|EEC08950.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 173

 Score = 87.0 bits (214), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 53/137 (38%), Positives = 71/137 (51%), Gaps = 24/137 (17%)

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL--W---- 157
           HP + K+  RI   T L     E     LQ+ NYG+GGHY  H D + +D+ L  W    
Sbjct: 11  HPVVKKLSRRIAAATGLSTSSAEH----LQVVNYGVGGHYSPHFDFSTKDKPLRGWETFA 66

Query: 158 --RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYN---AHANTLL--------- 203
             R A+++ YL+ VE GGAT+F  L + V PE G A+FW+N      N+L          
Sbjct: 67  GQRQATWLVYLSSVERGGATLFKRLRVRVQPEAGMALFWHNLPPGSTNSLPSCCVHRSVG 126

Query: 204 DYRMYHSGCPVALGNKW 220
           D R  H  CPV +G+KW
Sbjct: 127 DERTEHGACPVLVGSKW 143


>gi|424863736|ref|ZP_18287648.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
           SAR86A]
 gi|400757057|gb|EJP71269.1| prolyl 4-hydroxylase subunit alpha-2 [SAR86 cluster bacterium
           SAR86A]
          Length = 205

 Score = 87.0 bits (214), Expect = 5e-15,   Method: Compositional matrix adjust.
 Identities = 55/186 (29%), Positives = 91/186 (48%), Gaps = 17/186 (9%)

Query: 46  DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHP 105
           DP V  +++ + D E    +E+ KGK+ER KV++  ++ +  +R +   +L         
Sbjct: 16  DPIVYVVNNFLSDDECEAFVEMGKGKMERAKVISDDESEFHASRTNDFCWLE---HSASD 72

Query: 106 FLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDA----TPRDEGLW---- 157
            ++++  R   +  + I   E++    Q+  YG G  Y  H DA    T   +  W    
Sbjct: 73  VIHEVSKRFSVLVKMPINNAEQF----QLVYYGPGNEYKPHFDAFDKTTKEGQNNWFPGG 128

Query: 158 -RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNA-HANTLLDYRMYHSGCPVA 215
            R+ + + YL DVE GGAT FP +N++V P KG  V ++N     T ++ +  H G PV 
Sbjct: 129 QRMVTALAYLNDVEEGGATDFPKINVSVKPNKGDVVVFHNCIEGTTEINPQALHGGSPVV 188

Query: 216 LGNKWG 221
            G KW 
Sbjct: 189 AGEKWA 194


>gi|423448819|ref|ZP_17425698.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
 gi|401129413|gb|EJQ37096.1| hypothetical protein IEC_03427 [Bacillus cereus BAG5O-1]
          Length = 216

 Score = 86.7 bits (213), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 58/180 (32%), Positives = 87/180 (48%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P +V + + I D E + +IE+SK K++R  + +  D    D R S   FL      +
Sbjct: 36  FEEPLIVVLGNVISDEECDELIEMSKNKIKRSTIGSSRDV--NDIRTSSGAFLE-----E 88

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +    KI+ RI  + N+ +   E     L I NY +   Y  H D      R     R++
Sbjct: 89  NELTSKIEKRISSIMNVPVTHGE----GLHILNYEVDQQYKAHYDYFAEHSRSAANNRIS 144

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP LNL+V P KG AV++   + +  L+    H G  V  G KW
Sbjct: 145 TLVMYLNDVEEGGETFFPKLNLSVHPRKGMAVYFEYFYQDQSLNELTLHGGASVTKGEKW 204


>gi|205374182|ref|ZP_03226981.1| prolyl 4-hydroxylase alpha subunit [Bacillus coahuilensis m4-4]
          Length = 210

 Score = 86.7 bits (213), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 56/183 (30%), Positives = 94/183 (51%), Gaps = 12/183 (6%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P V  + + + D E + +I LSK ++ R K+    +    D R S   FL PE   +
Sbjct: 30  FHEPFVAVLGNVLSDEECDELISLSKDRMNRSKIAGNQEN---DIRTSTSVFL-PEDASE 85

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--RLAS 161
              + +++ RI  + N+ +   E     LQ+ NY +G  Y  H D     + +   R+++
Sbjct: 86  --VVQRVEKRISQIMNIPVEHGE----GLQLLNYQIGQEYKAHFDFFSPKKLIENPRIST 139

Query: 162 FMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
            + YL DVE GG T FP+L L+V P KG AV++   + + +L+    H G PV +G+KW 
Sbjct: 140 LVLYLNDVEEGGDTYFPNLKLSVSPHKGMAVYFEYFYDDPMLNELTLHGGAPVTIGDKWA 199

Query: 222 KLL 224
             +
Sbjct: 200 ATM 202


>gi|156398644|ref|XP_001638298.1| predicted protein [Nematostella vectensis]
 gi|156225417|gb|EDO46235.1| predicted protein [Nematostella vectensis]
          Length = 495

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 51/141 (36%), Positives = 76/141 (53%), Gaps = 16/141 (11%)

Query: 89  RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD 148
           R+SK  +L     G+   + +++ RI  MT L +   E +    Q+ NYGL G YD H D
Sbjct: 339 RISKNCWLSGREHGE--VIDRVERRIAAMTRLNLETAEGF----QVQNYGLAGQYDPHFD 392

Query: 149 ATPRDEGLW---------RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHA 199
            + RD             R+A+ + +++ VE GGAT+FP +   + P+KG AVFW+N   
Sbjct: 393 FS-RDLANSSLGSLGTGNRIATVLVWMSQVESGGATVFPYVGARILPQKGDAVFWHNLLR 451

Query: 200 NTLLDYRMYHSGCPVALGNKW 220
           +   D+R  H+GCPV  G KW
Sbjct: 452 SGDGDFRTRHAGCPVLSGIKW 472


>gi|195352180|ref|XP_002042592.1| GM14979 [Drosophila sechellia]
 gi|194124476|gb|EDW46519.1| GM14979 [Drosophila sechellia]
          Length = 461

 Score = 86.3 bits (212), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 65/219 (29%), Positives = 96/219 (43%), Gaps = 53/219 (24%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y + C+G        + N  C Y      FL++ PLK E L LDP +V  H+ + D EI+
Sbjct: 295 YEIGCRGQFLR----RRNHVCTYNFTITEFLRLAPLKQEVLNLDPYIVIYHNILNDDEID 350

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
           ++ + S       +VVN                              I+ RI ++T L  
Sbjct: 351 KLKQHSNDNT--AEVVN-----------------------------PIEKRINELTRLSF 379

Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL 182
              ++    L ++  G G         T +    +   + +F+L +VELGGAT+FP L +
Sbjct: 380 LNSDQ----LIVSKNGPG---------TQKHIKEYSKGTLLFFLNNVELGGATVFPKLKI 426

Query: 183 TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
           +VFP+KGS + WYN       D R     CPV  GNKWG
Sbjct: 427 SVFPQKGSCLIWYNTP-----DPRSDPLECPVLQGNKWG 460


>gi|449284064|gb|EMC90646.1| Prolyl 4-hydroxylase subunit alpha-3, partial [Columba livia]
          Length = 174

 Score = 85.9 bits (211), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 56/156 (35%), Positives = 81/156 (51%), Gaps = 12/156 (7%)

Query: 72  VERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGP 131
           ++R  V +       + R+SK  +L       HP +  ++ R+  +T L +     Y   
Sbjct: 1   LQRSVVASGEKQQKAEYRISKSAWLKDTA---HPVVQTLEKRMAAVTGLDL--RPPYAEY 55

Query: 132 LQINNYGLGGHYDLHCD-ATPRDEGLWRL------ASFMFYLTDVELGGATIFPSLNLTV 184
           LQ+ NYGLGGHY+ H D AT R   L+R+      A+ M YL+ V  GG+T F   NL+V
Sbjct: 56  LQVVNYGLGGHYEPHFDHATSRKSPLYRMKSGNRIATLMIYLSAVGAGGSTAFVHANLSV 115

Query: 185 FPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
              K +A+FW+N   N   D    H+GCPV  G+KW
Sbjct: 116 PVVKNAALFWWNLRRNGDGDGDTLHAGCPVLAGDKW 151


>gi|363806698|ref|NP_001242522.1| uncharacterized protein LOC100806046 [Glycine max]
 gi|255647110|gb|ACU24023.1| unknown [Glycine max]
          Length = 289

 Score = 85.9 bits (211), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 67/243 (27%), Positives = 110/243 (45%), Gaps = 34/243 (13%)

Query: 4   PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
           P + +GNL  P D+ S  +   E+ ++   + G   VE +  +PR    H+ +   E   
Sbjct: 44  PSSSRGNLPKPNDLASIARNTIETSDSD--ERGEQWVEVVSWEPRAFVYHNFLTKEECEY 101

Query: 64  IIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
           +I+++K  + +  VV+       D+R+  S   FL     G    +  I+ +I D T + 
Sbjct: 102 LIDIAKPSMHKSTVVDSETGKSKDSRVRTSSGTFL---ARGRDKIVRNIEKKISDFTFIP 158

Query: 122 IGREERYKGPLQINNYGLGG----HYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIF 177
           +   E     LQ+ +Y +G     HYD   D      G  R+A+ + YLTDVE GG T+F
Sbjct: 159 VEHGEG----LQVLHYEVGQKYEPHYDYFLDDFNTKNGGQRIATVLMYLTDVEEGGETVF 214

Query: 178 PSLN-------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGN 218
           P+                     L++ P++G A+ +++   +  LD    H GCPV  GN
Sbjct: 215 PAAKGNFSFVPWWNELFECGKKGLSIKPKRGDALLFWSMKPDASLDPSSLHGGCPVIKGN 274

Query: 219 KWG 221
           KW 
Sbjct: 275 KWS 277


>gi|47204411|emb|CAF95476.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 284

 Score = 85.9 bits (211), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 57/176 (32%), Positives = 88/176 (50%), Gaps = 18/176 (10%)

Query: 52  IHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQ 111
           +HDA+        +  S  K+ R  V      +  + R+SK  +L          + ++ 
Sbjct: 97  LHDALDH------LAFSHFKLRRSVVATRDKQVTAEYRISKSAWLKGSA---QSAVSRLD 147

Query: 112 TRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGLW------RLASFMF 164
            RI  +T L +  +  +   LQ+ NYG+GGHY+ H D AT     ++      R+A+ M 
Sbjct: 148 QRISMLTGLNV--QHPHGEYLQVVNYGIGGHYEPHFDHATSPSSPVFKLKTGNRVATVMI 205

Query: 165 YLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           YL+ VE GG+T F   N +V   K +A+FW+N H N   D    H+GCPV +G+KW
Sbjct: 206 YLSSVEAGGSTAFIYANFSVPVMKNAAIFWWNLHRNGRGDPDTLHAGCPVLIGDKW 261


>gi|159487419|ref|XP_001701720.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280939|gb|EDP06695.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 274

 Score = 85.5 bits (210), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 61/208 (29%), Positives = 98/208 (47%), Gaps = 33/208 (15%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV-NYGDTIYVDTRLSKVYFLYP 98
           V+++ L PR    H+ +  +E   +++L+  K++R  VV N G+ +  + R S  Y ++ 
Sbjct: 1   VQQVGLHPRAYYFHNFLTKAERGHLVKLAAPKLKRSTVVGNDGEGVVDNIRTS--YGMFI 58

Query: 99  EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---G 155
               D P + +I+ RI   T+L +  +E     +Q+  Y  G  Y  H D+  +      
Sbjct: 59  RRLQD-PVVARIEKRISLWTHLPVEHQED----IQVLRYAHGQTYGAHYDSGDKSNEPGP 113

Query: 156 LWRLASFMFYLTDVELGGATIFP----------------------SLNLTVFPEKGSAVF 193
            WRLA+F+ YL+DVE GG T FP                        N+   P+ G AV 
Sbjct: 114 KWRLATFLMYLSDVEEGGETAFPHNSVWADPSIPEKVGDKFSDCAKGNVAAKPKAGDAVL 173

Query: 194 WYNAHANTLLDYRMYHSGCPVALGNKWG 221
           +Y+ + N  +D    H+GCPV  G KW 
Sbjct: 174 FYSFYPNMTMDPAAMHTGCPVIKGVKWA 201


>gi|317127314|ref|YP_004093596.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
           2522]
 gi|315472262|gb|ADU28865.1| Procollagen-proline dioxygenase [Bacillus cellulosilyticus DSM
           2522]
          Length = 229

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 53/179 (29%), Positives = 88/179 (49%), Gaps = 13/179 (7%)

Query: 46  DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHP 105
           +P +V + + + + E +++I LSK ++ER K+ N       D R S   F       ++ 
Sbjct: 43  EPLIVLLGNVLSEEECDQLISLSKDRIERSKISNKS---VHDLRTSSSMFFDD---AEND 96

Query: 106 FLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW---RLASF 162
            +  ++ R+  +  + +   E     +QI NY +G  Y  H D            R+++ 
Sbjct: 97  VVSTVEKRVSQIMKIPVDHGE----GIQILNYAIGQEYKAHYDYFSSGNSKVNNPRISTL 152

Query: 163 MFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
           + YL DVE GG T FP LN  V P+KG AV++   + +T L+    H G PV +G+KW 
Sbjct: 153 VMYLNDVEAGGETYFPKLNFYVAPKKGMAVYFEYFYNDTTLNELTLHGGAPVVIGDKWA 211


>gi|398818543|ref|ZP_10577128.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
 gi|398027481|gb|EJL21031.1| 2OG-Fe(II) oxygenase superfamily enzyme [Brevibacillus sp. BC25]
          Length = 220

 Score = 85.5 bits (210), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 59/180 (32%), Positives = 92/180 (51%), Gaps = 14/180 (7%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           Y +P VV + + + DSE + +IE S+ +++R K+   G    + T  S V+    E    
Sbjct: 38  YEEPLVVVLGNVLSDSECDELIEHSRERLQRSKIGEDGSVNSIRTS-SGVFCEQTET--- 93

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
              + +I+ RI  + N+ I     +   LQ+  Y  G  Y  H D    T R     R++
Sbjct: 94  ---ITRIEKRISQIMNIPI----EHGDGLQVLRYTPGQEYKPHYDFFAETSRASTNNRIS 146

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T+FP L+L+VFP KG AV++   ++N  L+    H+G  V  G KW
Sbjct: 147 TLVMYLNDVEQGGETVFPLLHLSVFPTKGMAVYFEYFYSNQELNDFTLHAGTQVIHGEKW 206


>gi|194871352|ref|XP_001972832.1| GG13663 [Drosophila erecta]
 gi|190654615|gb|EDV51858.1| GG13663 [Drosophila erecta]
          Length = 420

 Score = 85.1 bits (209), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 62/202 (30%), Positives = 90/202 (44%), Gaps = 45/202 (22%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
           NL C Y      FL++ PLK E L  DP +V  H+ + D EI ++ + S          N
Sbjct: 263 NLVCTYNFTATEFLRLSPLKQEVLNWDPYIVLYHEVLNDDEIEKLKQHSND--------N 314

Query: 80  YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL 139
             + I                   +PF  +I  RI  MT L I   ++    L ++    
Sbjct: 315 SAEEI-------------------NPFKKRIFQRISHMTRLRIPHSDQ----LIVSENVS 351

Query: 140 GGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHA 199
             H   H   TP+        + +F+L +V+ GGAT+FP+L + VFP++GS +FW+    
Sbjct: 352 ETHR--HKGKTPK-------GTLLFFLDNVKQGGATVFPNLKIAVFPQRGSCLFWHKT-- 400

Query: 200 NTLLDYRMYHSGCPVALGNKWG 221
              LD R     CPV  GNKW 
Sbjct: 401 ---LDTRNEPLECPVLQGNKWS 419


>gi|156370183|ref|XP_001628351.1| predicted protein [Nematostella vectensis]
 gi|156215325|gb|EDO36288.1| predicted protein [Nematostella vectensis]
          Length = 478

 Score = 85.1 bits (209), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 60/176 (34%), Positives = 87/176 (49%), Gaps = 19/176 (10%)

Query: 23  CFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGD 82
           C+Y++  +  L + P KVE++  DPRVV     + D E  RI +++   + R  V N   
Sbjct: 312 CWYDNRGDARLLLKPNKVEQVNDDPRVVIFRGLVTDRETARIKQIASPMLNRATVYNIDT 371

Query: 83  TI--YVDTRLSKVYFLYPEIFGDH--PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYG 138
            +  Y D R+SK  +L      DH    +  +  RI  +T L +   E+    LQI NYG
Sbjct: 372 GVLEYADYRVSKSAWL-----EDHLDETIATVNKRIAMVTGLDVQTAEK----LQIANYG 422

Query: 139 LGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGATIFPSLNLTVFPEK 188
           +GG Y+ H D    D  L       R+A+ + YL DV LGGAT+F    + V P K
Sbjct: 423 MGGQYEQHTDHGEPDSPLANDPLGNRIATLLIYLNDVALGGATVFLKAGVHVPPTK 478


>gi|413963357|ref|ZP_11402584.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
 gi|413929189|gb|EKS68477.1| ProCollegen-proline dioxygenase [Burkholderia sp. SJ98]
          Length = 286

 Score = 84.7 bits (208), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 55/185 (29%), Positives = 89/185 (48%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           P +  + D + D+E +R+IE+ +  V+R  VV+   G  I ++ R S+  F+        
Sbjct: 94  PVIALVADVLDDTECDRLIEIGREHVQRSSVVDPDSGKEITIEERRSEGAFVNASTDA-- 151

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---------G 155
             +  I  RI ++    +   E     L I  YG+GG Y  H D  P ++         G
Sbjct: 152 -LVETIDRRIAELFRQPVENGE----DLHILRYGMGGEYRPHYDYFPEEQAGSKHHMQRG 206

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+A+ + YL +VE GG T FP + L + P +GSA+++   +     D +  H+G PV 
Sbjct: 207 GQRIATVILYLNEVEQGGDTTFPDIGLAIHPRRGSALYFEYVNELGQSDPKTLHAGTPVE 266

Query: 216 LGNKW 220
            G KW
Sbjct: 267 KGEKW 271


>gi|241778760|ref|XP_002399787.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215508519|gb|EEC17973.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 427

 Score = 84.7 bits (208), Expect = 2e-14,   Method: Compositional matrix adjust.
 Identities = 65/244 (26%), Positives = 112/244 (45%), Gaps = 34/244 (13%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G       + S L+C Y    + F K+ P+K+EE  L P +V +HD I D ++ 
Sbjct: 168 YKRLCRGEQLRTLKMDSQLRCRYYKGQDGFFKLQPIKLEEFNLKPYIVVLHDVIQDRDLE 227

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFL--YPEIFGDHPFLYK----IQTRIQD 116
            +I  +K +          +TI +   +    FL  +  +     +L++    I +R+  
Sbjct: 228 DLIAFAKPRAR--------NTIPLFRNVKWCTFLKRFCSLLAASTWLFEQNATIASRLNR 279

Query: 117 MTNLVIG---REERYKG-PLQINNYGLGGH--------YDLHCDATPRDEGLW------R 158
               ++G    +  ++  P Q+ NYG GGH        YD++ D+   D+         R
Sbjct: 280 YLTALLGMGTSDSNFEAEPYQLANYGTGGHYLPHHDYLYDVYEDSDETDDFSQFPSYGDR 339

Query: 159 LASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV--AL 216
           LA+ M Y++DVE GGAT+FP L + + P+K   V +     ++    R     C +  ++
Sbjct: 340 LATLMIYMSDVEEGGATVFPKLGVRLTPKKVKMVIYKVQPDSSAQKLRALGDCCHLRSSV 399

Query: 217 GNKW 220
            NKW
Sbjct: 400 ANKW 403


>gi|319652187|ref|ZP_08006306.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
 gi|317396176|gb|EFV76895.1| hypothetical protein HMPREF1013_02919 [Bacillus sp. 2_A_57_CT2]
          Length = 283

 Score = 84.7 bits (208), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 57/182 (31%), Positives = 88/182 (48%), Gaps = 14/182 (7%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYG--DTIYVDTRLSKVYFLYPEIFGDH 104
           P V+ +   +   E + +I LS+ +++   VV+ G  +      R SK          ++
Sbjct: 96  PFVLHLDQVLSSEECDELISLSRSRLQPSLVVDRGSGEERAGSGRTSKSMAFR---LKEN 152

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATP-----RDEGLWRL 159
             + +I+TRI ++T       E     LQI NYGLG  Y  H D  P       +G  R+
Sbjct: 153 ELVERIETRIAELTGYPAENGE----GLQILNYGLGEEYKPHFDFFPPHMADASKGGQRV 208

Query: 160 ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
            +F+ YL DVE GG T+F    L+  P+KG+A++++  +A   LD    HS  PV  G K
Sbjct: 209 GTFLIYLNDVEDGGETVFSKAGLSFVPKKGAAIYFHYGNAQGQLDRLSVHSSVPVRKGEK 268

Query: 220 WG 221
           W 
Sbjct: 269 WA 270


>gi|251794605|ref|YP_003009336.1| procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
 gi|247542231|gb|ACS99249.1| Procollagen-proline dioxygenase [Paenibacillus sp. JDR-2]
          Length = 209

 Score = 84.7 bits (208), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 52/176 (29%), Positives = 96/176 (54%), Gaps = 10/176 (5%)

Query: 46  DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHP 105
           +P ++ + + +  +E + +I+L+  +++R K+ +  D   V T  S ++F   E    + 
Sbjct: 31  EPLILILDNVLSWAECDLLIDLASARMQRAKIGSSHDVSEVRTS-SSMFFEESE----NE 85

Query: 106 FLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-RLASFMF 164
            + +++ R+ ++ N+ +   E    PLQ+  Y  G  Y  H D   +   +  R+++ + 
Sbjct: 86  CIGQVEARVAELMNIPVSHAE----PLQVLRYQPGEQYHPHFDYFTQGSSMNNRISTLVM 141

Query: 165 YLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           YL DVE GG T FPSL+ +V P+KGSAV++   + +T L+    H+G PV  G KW
Sbjct: 142 YLNDVEEGGETYFPSLHFSVTPKKGSAVYFEYFYNDTRLNELTLHAGHPVEAGEKW 197


>gi|329913962|ref|ZP_08276011.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
           IMCC9480]
 gi|327545257|gb|EGF30515.1| hypothetical protein IMCC9480_1311 [Oxalobacteraceae bacterium
           IMCC9480]
          Length = 280

 Score = 84.3 bits (207), Expect = 3e-14,   Method: Compositional matrix adjust.
 Identities = 56/190 (29%), Positives = 89/190 (46%), Gaps = 23/190 (12%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTI--YVDTRLSKVYFLYPEIFGDH 104
           PR+V + + + D E + I  +S+ +  R   ++    I  + D+R S+   +     G+ 
Sbjct: 92  PRIVVLGNVLSDDECDAIAAMSRTRFARSTTIDNASGINRFDDSRTSESAHIQ---RGET 148

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
             + +I  R+  ++   +   E    PLQ+  Y  G  Y  H D         A   ++ 
Sbjct: 149 ELIARIDARLAALSGWPVDHGE----PLQLQKYQAGNEYRPHFDWFDPALAGTAKHLEKS 204

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             RLA+ + YLTDVE GG T FP + L V P+KG A+F+ N     + D +  H+G PV 
Sbjct: 205 GQRLATIILYLTDVEEGGGTSFPGIGLDVHPQKGGALFFRNTTPYGVPDRKTQHAGLPVE 264

Query: 216 LG-----NKW 220
            G     NKW
Sbjct: 265 KGTKIIANKW 274


>gi|302844247|ref|XP_002953664.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
           nagariensis]
 gi|300261073|gb|EFJ45288.1| prolyl 4-hydroxylase alpha subunit-like protein [Volvox carteri f.
           nagariensis]
          Length = 364

 Score = 84.3 bits (207), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 63/234 (26%), Positives = 106/234 (45%), Gaps = 32/234 (13%)

Query: 13  VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKV 72
           +PE +  +    +   +  F +     VE++ L PR    H+ +  +E   ++ L+  K+
Sbjct: 21  LPERLLESALVMHTEADKQFDEEATPWVEQVGLHPRAYLFHNFLTKAERAHMVRLAAPKL 80

Query: 73  ERGKVV-NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGP 131
           +R  VV + G+ +  + R S  + ++     D P + +I+ RI   T+L I  +E     
Sbjct: 81  KRSTVVGSKGEGVVDNIRTS--FGMFIRRLSD-PIIARIEKRISLWTHLPIEHQED---- 133

Query: 132 LQINNYGLGGHYDLHCDATPRDEGL---WRLASFMFYLTDVELGGATIFPSLN------- 181
           +Q+  Y  G  Y  H D+    + +   WRLA+F+ YL+DVE GG T FP  +       
Sbjct: 134 IQVLRYAHGQTYGAHYDSGASSDHVGPKWRLATFLMYLSDVEEGGETAFPQNSVWYDPTI 193

Query: 182 --------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
                         +   P+ G AV +Y+   N  +D    H+GCPV  G KW 
Sbjct: 194 PERIGPVSECAKGHVAAKPKAGDAVLFYSFLPNNTMDPAAMHTGCPVIKGIKWA 247


>gi|357496283|ref|XP_003618430.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
 gi|217073992|gb|ACJ85356.1| unknown [Medicago truncatula]
 gi|355493445|gb|AES74648.1| Prolyl 4-hydroxylase subunit alpha-2 [Medicago truncatula]
 gi|388494436|gb|AFK35284.1| unknown [Medicago truncatula]
          Length = 313

 Score = 84.3 bits (207), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 59/208 (28%), Positives = 90/208 (43%), Gaps = 23/208 (11%)

Query: 33  LKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRL 90
           +K  P +V +L   PR     + + D E + +IELSK K+E+  V +   G +I  + R 
Sbjct: 44  VKFDPTRVTQLSWSPRAFLYKNFLTDEECDHLIELSKDKLEKSMVADNESGKSIQSEVRT 103

Query: 91  SKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDAT 150
           S   FL  +       +  I+ RI   T L +   E  +    +N      H+D   D  
Sbjct: 104 SSGMFLNKQ---QDEIVSGIEARIAAWTFLPVENGESMQVLHYMNGEKYEPHFDFFHDKA 160

Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAV 192
            +  G  R+A+ + YL++VE GG TIFP                       V P KG A+
Sbjct: 161 NQRLGGHRVATVLMYLSNVEKGGETIFPHAEGKLSQPKDESWSECAHKGYAVKPRKGDAL 220

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
            +++ H +   D +  H  CPV  G KW
Sbjct: 221 LFFSLHLDATTDSKSLHGSCPVIEGEKW 248


>gi|260812289|ref|XP_002600853.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
 gi|229286143|gb|EEN56865.1| hypothetical protein BRAFLDRAFT_214927 [Branchiostoma floridae]
          Length = 281

 Score = 84.0 bits (206), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 50/156 (32%), Positives = 81/156 (51%), Gaps = 11/156 (7%)

Query: 71  KVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKG 130
           K+ R ++ N    +    R+S+  +L+ +   D   + ++  RI  +T L          
Sbjct: 108 KMFRSRIGNSFSEVESHIRISQQAWLHDK---DDEIVARVSKRIGLLTGL--NTTPTSTE 162

Query: 131 PLQINNYGLGGHYDLHCDATPRDEGLW------RLASFMFYLTDVELGGATIFPSLNLTV 184
            LQ+ NYGLGG Y+ H D    +E +W      R+A+F+ YL+DV  GGAT+FP  N+TV
Sbjct: 163 LLQVLNYGLGGQYEPHHDYMTAEEKMWGTILGNRMATFLMYLSDVTAGGATVFPVANVTV 222

Query: 185 FPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
              K + + + +   +   D    H+GCPV +G+KW
Sbjct: 223 PVVKNAGLLFMDLLRSGRGDVNSLHAGCPVVIGSKW 258


>gi|195591300|ref|XP_002085380.1| GD14756 [Drosophila simulans]
 gi|194197389|gb|EDX10965.1| GD14756 [Drosophila simulans]
          Length = 477

 Score = 83.6 bits (205), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 67/226 (29%), Positives = 97/226 (42%), Gaps = 51/226 (22%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y + C+G        + N  C Y      FL++ PLK E L  DP +V  H+ + D EI+
Sbjct: 295 YEIGCRGQFLG----RRNHVCSYNFTITEFLRLAPLKQEVLNWDPYIVIYHNVLKDDEID 350

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
           ++ + S       +VVN                         P   +I  RI ++T L  
Sbjct: 351 KLKQHSNDNA--AEVVN-------------------------PIEKRIFQRINELTRL-- 381

Query: 123 GREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMF-------YLTDVELGGAT 175
                +   L ++  G G     H     +   L+ +++F F        L +VELGGAT
Sbjct: 382 ----SFLNQLIVSKNGPGTQK--HIKEYSKGTLLFFVSTFSFAIYIYISLLNNVELGGAT 435

Query: 176 IFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
           +FP L ++VFP+KGS + WYN       D R     CPV  GNKWG
Sbjct: 436 VFPKLKISVFPQKGSCLIWYNTP-----DPRSEPLECPVLQGNKWG 476


>gi|449432777|ref|XP_004134175.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 303

 Score = 83.6 bits (205), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 59/213 (27%), Positives = 91/213 (42%), Gaps = 34/213 (15%)

Query: 35  IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
           + P KV+++   PR       + D E + +I L+K +++R  V +   G +   + R S 
Sbjct: 36  VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSEVRTSS 95

Query: 93  VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH----CD 148
             F++       P +  I+ +I   T L     E     +Q+  Y  G  YD H     D
Sbjct: 96  GAFIHK---AKDPIVSGIEDKIAAWTFLPKDNGE----DIQVLRYEYGQKYDAHFDYFAD 148

Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN---------------------LTVFPE 187
                 G  R+A+ + YL+DVE GG T+FPS                       + V P 
Sbjct: 149 KVNIARGGHRMATVLMYLSDVEKGGETVFPSAEESQRRQASETNEDLSDCAKKGIAVKPR 208

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           KG A+ +++ H N + D    H GCPV  G KW
Sbjct: 209 KGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKW 241


>gi|386712780|ref|YP_006179102.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
           2266]
 gi|384072335|emb|CCG43825.1| prolyl 4-hydroxylase alpha subunit [Halobacillus halophilus DSM
           2266]
          Length = 211

 Score = 83.6 bits (205), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 55/182 (30%), Positives = 89/182 (48%), Gaps = 14/182 (7%)

Query: 46  DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHP 105
           +P +  + + + + E   +I LSK K+ R K+ +  +    D R S   FL PE      
Sbjct: 33  NPLIAILGNVVSEEECEELIFLSKNKMNRSKIGSQHEV--SDIRTSSSTFL-PE----DD 85

Query: 106 FLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLASF 162
              +I+ R+  + N+ +   E     L I NY  G  Y  H D   +  +     R+++ 
Sbjct: 86  LTNRIEKRVAQIMNVPVEHGE----GLHILNYKQGQEYKAHYDYFRSKAKAANNPRISTL 141

Query: 163 MFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWGK 222
           + YL DVE GG T FP +NL++ P KG AV++   +++ L++ R  H G PV  G KW  
Sbjct: 142 VLYLNDVEEGGETYFPHMNLSISPHKGMAVYFEYFYSDPLINERTLHGGSPVTSGEKWAA 201

Query: 223 LL 224
            +
Sbjct: 202 TM 203


>gi|356517655|ref|XP_003527502.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
          Length = 290

 Score = 83.2 bits (204), Expect = 8e-14,   Method: Compositional matrix adjust.
 Identities = 65/207 (31%), Positives = 94/207 (45%), Gaps = 32/207 (15%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLY 97
            E L  +PR    H+ +   E   +IEL+K ++ +  VV+   G +     R S   FL 
Sbjct: 79  TEILSWEPRAFIYHNFLSKEECEYLIELAKPQMVKSSVVDSKTGKSTESRVRTSSGMFLK 138

Query: 98  PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
               G    +  I+ RI D T +    EE  +G LQI +Y +G     HYD   D     
Sbjct: 139 R---GKDKIVQNIEKRIADFTFIP---EENGEG-LQILHYEVGQKYEPHYDYFLDEFNTK 191

Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
            G  R+A+ + YL+DVE GG T+FP+ N                   L+V P+ G A+ +
Sbjct: 192 NGGQRIATVLMYLSDVEEGGETVFPAANANFSSVPWWNDLSQCARKGLSVKPKMGDALLF 251

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKWG 221
           ++   +  LD    H GCPV  GNKW 
Sbjct: 252 WSMRPDATLDPSSLHGGCPVIKGNKWS 278


>gi|260787668|ref|XP_002588874.1| hypothetical protein BRAFLDRAFT_235878 [Branchiostoma floridae]
 gi|229274045|gb|EEN44885.1| hypothetical protein BRAFLDRAFT_235878 [Branchiostoma floridae]
          Length = 151

 Score = 83.2 bits (204), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 49/124 (39%), Positives = 71/124 (57%), Gaps = 9/124 (7%)

Query: 103 DHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE------GL 156
           +H  + K+  R++ +T L +     Y    Q+ NYGLGG Y+ H D   RDE        
Sbjct: 8   EHTVIAKLSRRVEYITGLDVNWP--YGEAFQVLNYGLGGFYEPHVDYF-RDEQPALLTNG 64

Query: 157 WRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVAL 216
            R+ +F+FYL+DVE GGAT+F  LNLTV   K SAV +++   +   +    H+GCPV +
Sbjct: 65  QRIVTFLFYLSDVEAGGATVFTRLNLTVPAVKNSAVLFHDLKRSLEFEKDSEHAGCPVLM 124

Query: 217 GNKW 220
           G+KW
Sbjct: 125 GSKW 128


>gi|363543301|ref|NP_001241866.1| prolyl 4-hydroxylase 6 precursor [Zea mays]
 gi|195624808|gb|ACG34234.1| oxidoreductase [Zea mays]
 gi|347978818|gb|AEP37751.1| prolyl 4-hydroxylase 6 [Zea mays]
          Length = 297

 Score = 82.8 bits (203), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 61/209 (29%), Positives = 87/209 (41%), Gaps = 31/209 (14%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
           P  V +L   PR       + D+E + I+ L+KG +E+  V +   G ++    R S   
Sbjct: 32  PASVTQLSSRPRAFLYSGFLSDTECDHIVSLAKGSMEKSMVADNDSGKSVASQARTSSGT 91

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----AT 150
           FL      +   +  I+ R+   T L     E     LQ+  Y  G  YD H D      
Sbjct: 92  FLAKR---EDEIVSAIEKRVAAWTFL----PEENAESLQVLRYETGQKYDAHFDYFHDRN 144

Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAV 192
               G  R+A+ + YLTDV+ GG T+FP+                    L V P+KG A+
Sbjct: 145 NLKLGGQRVATVLMYLTDVKKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDAL 204

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKWG 221
            ++N H N   D    H  CPV  G KW 
Sbjct: 205 LFFNLHVNATADTGSLHGSCPVIEGEKWS 233


>gi|427795421|gb|JAA63162.1| Putative prolyl-4-hydroxylase-alpha efb, partial [Rhipicephalus
           pulchellus]
          Length = 568

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 52/155 (33%), Positives = 79/155 (50%), Gaps = 9/155 (5%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           IY   C+G    P      L C Y + N  +L + P K E ++  PR+V  HD + + E+
Sbjct: 356 IYERLCRGEKFPPLFHDRELTCQYRTNNRPYLLLQPAKEEVMFPKPRIVIYHDVLSEHEM 415

Query: 62  NRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
           N I  L++ ++ R  V NY  G+      R+SK  +L  E   +H  + ++  RI+D+T 
Sbjct: 416 NVIKTLAQPRLRRATVQNYKSGELETASYRISKSAWLKNE---EHGVIARVTRRIEDITG 472

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE 154
           L     E     LQ+ NYG+GGHY+ H D   R+E
Sbjct: 473 LTADTAEE----LQVVNYGIGGHYEPHFDFARREE 503


>gi|241044301|ref|XP_002407178.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215492128|gb|EEC01769.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 554

 Score = 82.8 bits (203), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 65/224 (29%), Positives = 100/224 (44%), Gaps = 35/224 (15%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G       + S L+C Y    + F K+ P+KVEE  L P +V +H+ I D +I 
Sbjct: 291 YKRLCRGEQLRTPKMDSKLRCRYYKGQHGFFKLQPIKVEEANLKPYIVVMHNVIQDRDIE 350

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM----T 118
            ++  +K +++R              R S   +L      D P   ++   ++ +    T
Sbjct: 351 DLMAFAKPRLQRSTHYGVRGMEASQVRTSSNAWLND---LDAPVATRLNRFLRSLLGLGT 407

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCD-----------------ATPRDEGLWRLAS 161
             + G  E+Y    Q+ NYG+GG Y  H D                  T  D    R+A+
Sbjct: 408 TYLGGEAEQY----QLANYGIGGQYMSHHDYLQDTYHIPNRVTDDFEKTSGD----RIAT 459

Query: 162 FMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDY 205
            M Y++DVE GGAT+FPSL + + P+K   V   N    +LL Y
Sbjct: 460 LMVYMSDVEEGGATVFPSLGVRLTPKK---VISPNQSRTSLLSY 500


>gi|363807286|ref|NP_001242363.1| uncharacterized protein LOC100796794 precursor [Glycine max]
 gi|255641119|gb|ACU20838.1| unknown [Glycine max]
          Length = 297

 Score = 82.4 bits (202), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 60/213 (28%), Positives = 91/213 (42%), Gaps = 34/213 (15%)

Query: 35  IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
           I P KV+++   PR       + D E + +I L+K +++R  V +   G++   D R S 
Sbjct: 31  INPSKVKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLSDVRTSS 90

Query: 93  VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH----CD 148
             F+        P +  I+ +I   T L     E     +Q++ Y  G  YD H     D
Sbjct: 91  GMFISKN---KDPIVAGIEDKISSWTFLPKENGE----DIQVSRYEHGQKYDPHYDYFTD 143

Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN---------------------LTVFPE 187
                 G  R+A+ + YLTDV  GG T+FPS                       + V P 
Sbjct: 144 KVNIARGGHRIATVLMYLTDVAKGGETVFPSAEEPPRRRGAETSSDLSECAKKGIAVKPR 203

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +G A+ +++ H N   D    H+GCPV  G KW
Sbjct: 204 RGDALLFFSLHTNATPDTSSLHAGCPVIEGEKW 236


>gi|445499353|ref|ZP_21466208.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
 gi|444789348|gb|ELX10896.1| prolyl 4-hydroxylase alpha subunit [Janthinobacterium sp. HH01]
          Length = 272

 Score = 82.4 bits (202), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 61/207 (29%), Positives = 93/207 (44%), Gaps = 26/207 (12%)

Query: 33  LKIGPLKVEELYL---DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGD--TIYVD 87
           L   P +V E+      P+++ + + + D E + II     +  R  V    D  ++  +
Sbjct: 66  LVAAPDRVAEVLFVLKQPQIILLGNVLSDEECDAIIAHCGTRYTRSTVTGEADGSSMVHE 125

Query: 88  TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHC 147
            R S++ F+     G+     +I+ R+  + +      E    P Q+  Y     Y  H 
Sbjct: 126 GRTSEMAFIQ---RGEAEVAERIERRLAALAHWPAECSE----PFQLQKYDATQEYRPHY 178

Query: 148 DATPRD---------EGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAH 198
           D    D          G  RLA+F+ YL+DVE GG T+FP L L V+P+KGSA+++ N  
Sbjct: 179 DWLDPDSSGHRSHLARGGQRLATFILYLSDVEQGGGTVFPGLGLEVYPKKGSALWFLNTD 238

Query: 199 ANTLLDYRMYHSGCPVALG-----NKW 220
            N   D R  H G PV  G     NKW
Sbjct: 239 INHQPDKRTLHGGAPVVRGTKIIANKW 265


>gi|168046048|ref|XP_001775487.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162673157|gb|EDQ59684.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 263

 Score = 82.4 bits (202), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 60/209 (28%), Positives = 96/209 (45%), Gaps = 30/209 (14%)

Query: 35  IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
           I P +V++L   PR     + + D+E + +I L+K K+E+  V +   G ++  + R S 
Sbjct: 1   IDPTRVKQLSWKPRAFLYSNFLSDAECDHMISLAKDKLEKSMVADNESGKSVKSEIRTSS 60

Query: 93  VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCD 148
             FL   + G    + +I+ RI   T L     E     +Q+  Y  G     H+D   D
Sbjct: 61  GMFL---MKGQDDIISRIEDRIAAWTFLPKENGE----AIQVLRYQDGEKYEPHFDYFHD 113

Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPS-----------------LNLTVFPEKGSA 191
              +  G  R+A+ + YL+DV  GG T+FPS                   + V P KG A
Sbjct: 114 KNNQALGGHRIATVLMYLSDVVKGGETVFPSSEDRGGPKDDSWSACGKTGVAVKPRKGDA 173

Query: 192 VFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + +++ H + + D    H+GCPV  G KW
Sbjct: 174 LLFFSLHPSAVPDESSLHTGCPVIEGEKW 202


>gi|91778899|ref|YP_554107.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
 gi|91691559|gb|ABE34757.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
          Length = 292

 Score = 82.0 bits (201), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 58/185 (31%), Positives = 88/185 (47%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           P+V+   D +   E   +IE S+ +++R   VN   G    +  R S+  +      G+ 
Sbjct: 103 PQVIVFADVLSPDECAEMIERSRHRLKRSTTVNPATGKEDVIRNRTSEGIWYQ---RGED 159

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---------G 155
           PF+ ++  RI  + N  +   E  +G LQI +YG  G Y  H D  P D+         G
Sbjct: 160 PFIERMDRRISSLMNWPV---ENGEG-LQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQG 215

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+A+ + YL DV  GG TIFP   ++V   +G AV++   +    LD    H G PV 
Sbjct: 216 GQRVATLVIYLNDVPDGGETIFPEAGMSVAASQGGAVYFRYMNDRRQLDPLTLHGGAPVL 275

Query: 216 LGNKW 220
            G+KW
Sbjct: 276 AGDKW 280


>gi|356540840|ref|XP_003538892.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Glycine max]
          Length = 290

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 65/243 (26%), Positives = 109/243 (44%), Gaps = 33/243 (13%)

Query: 4   PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
           P + +GNL  P D+ S  +    + ++  ++ G   VE +  +PR    H+ +   E   
Sbjct: 44  PSSSRGNLPKPNDLASIARNTIHTSDDDDVR-GEQWVEVVSWEPRAFVYHNFLTKEECEY 102

Query: 64  IIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
           +I+++K  + +  VV+       D+R+  S   FL     G    +  I+ RI   + + 
Sbjct: 103 LIDIAKPNMHKSSVVDSETGKSKDSRVRTSSGTFL---ARGRDKIVRDIEKRIAHYSFIP 159

Query: 122 IGREERYKGPLQINNYGLGG----HYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIF 177
           +   E     LQ+ +Y +G     HYD   D      G  R+A+ + YLTDVE GG T+F
Sbjct: 160 VEHGEG----LQVLHYEVGQKYEPHYDYFLDDFNTKNGGQRIATVLMYLTDVEEGGETVF 215

Query: 178 PSLN-------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGN 218
           P+                     L++ P++G A+ +++   +  LD    H GCPV  GN
Sbjct: 216 PAAKGNFSSVPWWNELSECGKKGLSIKPKRGDALLFWSMKPDATLDPSSLHGGCPVIKGN 275

Query: 219 KWG 221
           KW 
Sbjct: 276 KWS 278


>gi|449529555|ref|XP_004171765.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 284

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 64/212 (30%), Positives = 95/212 (44%), Gaps = 32/212 (15%)

Query: 34  KIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLS 91
           K G   VE +  +PR    H+ +   E   +I L+K  +E+  VV+   G+++    R S
Sbjct: 68  KRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDSKTGESVDSRVRTS 127

Query: 92  KVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG----GHYDLHC 147
              FL     G    +  I+ RI D T + I   E     LQI +Y +G     HYD   
Sbjct: 128 SGMFLNR---GQDKIIRNIEKRIADFTFIPIEHGE----GLQILHYEVGQKYDAHYDYFV 180

Query: 148 DATPRDEGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEK 188
           D     +G  R+A+ + YL+DVE GG T+FP+                     L+V P+ 
Sbjct: 181 DEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSECGKGGLSVKPKM 240

Query: 189 GSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           G A+ +++   +  LD    H  CPV  GNKW
Sbjct: 241 GDALLFWSMKPDATLDPTSLHGACPVIRGNKW 272


>gi|295699617|ref|YP_003607510.1| procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
 gi|295438830|gb|ADG17999.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1002]
          Length = 286

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 57/186 (30%), Positives = 92/186 (49%), Gaps = 20/186 (10%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK-VYFLYPEIFGD 103
           P++V   D +  +E   +IE S+ +++R   VN   G    +  R S+ V++      G+
Sbjct: 97  PQLVVFADVLSAAECAELIERSRHRLKRSTTVNPLTGREDVIRNRTSEGVWYRR----GE 152

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDE 154
              + +++ RI  +TN  +   E  +G LQ+ +YG  G Y  H D         A    +
Sbjct: 153 DQLIARVERRIASLTNWPL---ENGEG-LQVLHYGTSGEYSPHFDFFAPDQPGSAVHTTQ 208

Query: 155 GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 214
           G  R+A+ + YL DV  GG T+FP+  L+V  + G AV++   +A   LD    H G PV
Sbjct: 209 GGQRVATLIIYLNDVADGGETVFPTAGLSVAAQAGGAVYFRYMNAERQLDPSTLHGGAPV 268

Query: 215 ALGNKW 220
             G+KW
Sbjct: 269 LAGDKW 274


>gi|302844281|ref|XP_002953681.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
           nagariensis]
 gi|300261090|gb|EFJ45305.1| hypothetical protein VOLCADRAFT_63898 [Volvox carteri f.
           nagariensis]
          Length = 304

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 60/203 (29%), Positives = 90/203 (44%), Gaps = 28/203 (13%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
           +E +   PRV   H+ I D E   +IEL+  +++R  VV  G     D+    +Y     
Sbjct: 1   IEHVAWKPRVFIYHNFITDMEAKHMIELAAPQMKRSTVVGAGGQSVEDS-YRTLYTAGVR 59

Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRL 159
            + D   + +I+ R+   T + +  +E     +QI  YG+G  Y +H D    DE   R+
Sbjct: 60  RYQDD-VVERIENRVAAWTQISVLHQED----MQILRYGIGQQYKVHADTLRDDEAGVRV 114

Query: 160 ASFMFYLTDVELGGATIFP--------------------SLNLTVF-PEKGSA-VFWYNA 197
           A+ + YL + E GG T FP                    + N   F P++G A +FW   
Sbjct: 115 ATVLIYLNEPEAGGETAFPDSQWVNPKLAETIGANFSACAKNHVAFAPKRGDALLFWSIG 174

Query: 198 HANTLLDYRMYHSGCPVALGNKW 220
              T  DY   H+GCPV  G KW
Sbjct: 175 PDGTTEDYHASHTGCPVLSGVKW 197


>gi|389795384|ref|ZP_10198508.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
 gi|388430823|gb|EIL87950.1| procollagen-proline dioxygenase [Rhodanobacter fulvus Jip2]
          Length = 293

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 59/185 (31%), Positives = 93/185 (50%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIY--VDTRLSKVYFLYPEIFGDH 104
           P +  +   + D E + +I  S  K++R   V+  +  Y  +  R S+  F +P    D 
Sbjct: 97  PTIAVLDQVLDDEECDELIRRSADKLQRSTTVDPVNGGYEVIAARSSEGTF-FPVNADD- 154

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRDEGL------- 156
            F+ ++  RI ++ N  +   E  +G LQ+ +YG GG Y  H D  +P D G        
Sbjct: 155 -FIARLDRRIAELMNCPV---ENGEG-LQVLHYGEGGEYQPHFDYFSPGDPGSEAQMVVG 209

Query: 157 -WRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+++ + YL DV  GGAT+FP+L L V P KG AV++  ++ +  +D    H G PV 
Sbjct: 210 GQRVSTLLIYLNDVAQGGATVFPTLGLRVLPRKGMAVYFEYSNRDGQVDPLTLHGGEPVE 269

Query: 216 LGNKW 220
            G KW
Sbjct: 270 KGEKW 274


>gi|307110383|gb|EFN58619.1| hypothetical protein CHLNCDRAFT_19485 [Chlorella variabilis]
          Length = 328

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 53/185 (28%), Positives = 91/185 (49%), Gaps = 11/185 (5%)

Query: 39  KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYP 98
           +VE +   PR    H+ + + E + I+ L+K  ++R  VV  G    V+ ++   Y  + 
Sbjct: 31  RVEPVSWKPRAFVFHNFMTEEEADHIVALAKPFMKRSTVVGAGGA-SVEDQIRTSYGTFL 89

Query: 99  EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWR 158
           +   D P +  ++ R+   T L +  +E     +QI  YG+G  Y  H D+   D    R
Sbjct: 90  KRLQD-PIVTAVEQRLATWTKLNVSHQED----MQILRYGIGQKYGAHYDSLDNDSP--R 142

Query: 159 LASFMFYLTDVEL--GGATIFPSLN-LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
           + + + YL+DV    GG T FP +    ++P+KG A+ +Y+   +   D    H+GCP+ 
Sbjct: 143 VCTVLLYLSDVPADGGGETAFPGVRRQALYPKKGDALLFYSLKPDGTSDAYSLHTGCPII 202

Query: 216 LGNKW 220
            G KW
Sbjct: 203 SGVKW 207


>gi|229086310|ref|ZP_04218488.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
 gi|228697005|gb|EEL49812.1| Prolyl 4-hydroxylase alpha subunit [Bacillus cereus Rock3-44]
          Length = 220

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 58/179 (32%), Positives = 88/179 (49%), Gaps = 16/179 (8%)

Query: 46  DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFGDH 104
           +P +V + + + D E   +IELSK  ++R K+   G +  VD  R S   FL      ++
Sbjct: 42  EPLIVVLENVLSDEECESLIELSKDSMKRSKI---GASREVDNIRTSSGTFLE-----EN 93

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLAS 161
             +  I+ R+  + N+ +   E     L I  Y  G  Y  H D      R     R+++
Sbjct: 94  ETVAIIEKRVSSIMNIPVEHGE----GLHILKYTPGQEYKAHYDYFAEHSRAAENNRIST 149

Query: 162 FMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            + YL DVE GG T FP LNL++ P+KGSAV++   + +  L+    H G PV  G KW
Sbjct: 150 LVMYLNDVEEGGETFFPKLNLSIAPKKGSAVYFEYFYNDKSLNELTLHGGAPVIKGEKW 208


>gi|226314793|ref|YP_002774689.1| hypothetical protein BBR47_52080 [Brevibacillus brevis NBRC 100599]
 gi|226097743|dbj|BAH46185.1| conserved hypothetical protein [Brevibacillus brevis NBRC 100599]
          Length = 215

 Score = 81.6 bits (200), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 58/181 (32%), Positives = 92/181 (50%), Gaps = 16/181 (8%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYPEIFG 102
           Y +P VV + + + DSE + +IE S+ +++R K+   G+   V++ R S   F       
Sbjct: 33  YEEPLVVVLGNVLSDSECDELIEHSRERLQRSKI---GEDRSVNSIRTSSGVFC-----E 84

Query: 103 DHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRL 159
               + +I+ RI  + N+ I     +   LQ+  Y  G  Y  H D    T R     R+
Sbjct: 85  QTETITRIEKRISQIMNIPI----EHGDGLQVLRYTPGQEYKPHYDFFAETSRASTNNRI 140

Query: 160 ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           ++ + YL DVE GG T+FP L+L+VFP KG AV++   + N  ++    H+G  V  G K
Sbjct: 141 STLVMYLNDVEQGGETVFPLLHLSVFPTKGMAVYFEYFYRNQEVNEFTLHAGAQVIHGEK 200

Query: 220 W 220
           W
Sbjct: 201 W 201


>gi|224141327|ref|XP_002324025.1| predicted protein [Populus trichocarpa]
 gi|222867027|gb|EEF04158.1| predicted protein [Populus trichocarpa]
          Length = 239

 Score = 81.3 bits (199), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 61/228 (26%), Positives = 98/228 (42%), Gaps = 38/228 (16%)

Query: 17  IKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGK 76
           I+S    F ++++       P +  +L   PR       + D E + +I L+KGK+ +  
Sbjct: 2   IRSKTGAFTKAFD-------PTRAAQLSWQPRAFVYKGFLSDEECDHLINLAKGKLVKSM 54

Query: 77  VVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
           V N   G+++    R S   F++     +   +  I+ RI   T L     E    P+QI
Sbjct: 55  VANDETGESMESQERTSSGMFIFKT---EDEIVNGIEARIAAWTFL----PEENGEPIQI 107

Query: 135 NNYGLGGHYDLH----CDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL-------- 182
             Y  G  Y+ H     D   ++EG  R A+ + YL+DV+ GG T+FP+           
Sbjct: 108 LRYEHGQKYEAHIDYFVDKANQEEGGHRAATVLMYLSDVKKGGETVFPTSEAEGSQAKDD 167

Query: 183 ----------TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
                      V P KG A+ +++ H +   D    H+ CPV  G KW
Sbjct: 168 SWSDCAKKGYAVKPNKGDALLFFSLHPDATPDPGSLHASCPVIEGEKW 215


>gi|413932756|gb|AFW67307.1| oxidoreductase [Zea mays]
          Length = 297

 Score = 81.3 bits (199), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 61/208 (29%), Positives = 88/208 (42%), Gaps = 31/208 (14%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
           P  V +L   PR       + D+E + ++ L+KG +E+  V +   G ++    R S   
Sbjct: 32  PASVTQLSSRPRAFLYSGFLSDTECDHLVSLAKGSMEKSMVADNDSGKSVASQARTSSGT 91

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-ATPRD 153
           FL      +   +  I+ R+   T L     E     LQ+  Y  G  YD H D    R+
Sbjct: 92  FLAKR---EDEIVSAIEKRVAAWTFL----PEENAESLQVLRYETGQKYDAHFDYFHDRN 144

Query: 154 E---GLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAV 192
               G  R+A+ + YLTDV  GG T+FP+                    L V P+KG A+
Sbjct: 145 NLKLGGQRVATVLMYLTDVNKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDAL 204

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
            ++N H N   D    H  CPV  G KW
Sbjct: 205 LFFNLHVNATADTGSLHGSCPVIEGEKW 232


>gi|241029040|ref|XP_002406378.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215491954|gb|EEC01595.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 539

 Score = 81.3 bits (199), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 64/241 (26%), Positives = 115/241 (47%), Gaps = 28/241 (11%)

Query: 1   EIYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           + Y   C+G L     ++S L+C Y    + F  + P+K+EE+ L P ++ +HD + D +
Sbjct: 282 QSYKRLCRGKLLRSPKMESQLRCRYYKGQDGFFALQPIKLEEMNLKPYIIVMHDVLQDKD 341

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           I  ++  ++ +V   K + Y    ++ T  S   +L  +   + P   ++ + ++ +  +
Sbjct: 342 IKELMAFAEPRVR--KTLPYLFICHIHTFYSA--WLNED---EAPIAVRMNSYLRALLGM 394

Query: 121 VIGREERYKGPLQINNYGLGG----HYDLHCDA-----TPRDEGLW-----RLASFMFYL 166
                +      Q+ NYG GG    H+D   D+     +  D  L      R+A+ M YL
Sbjct: 395 GTSDTDEEAEAYQLANYGTGGQFLPHHDFLQDSFHSYNSSADYYLQYGTGDRVATLMIYL 454

Query: 167 TDVELGGATIFPSLNLTVFPEKGSAVF--WYNAHANTLLDYRMYHSGCPV-----ALGNK 219
           TDVE GGAT+FP+L L + P+K +  F    N+    +L + ++     V     A+ NK
Sbjct: 455 TDVEEGGATVFPTLGLRLTPKKVNLFFISLRNSDGARILHWVVFTVCIKVTFFCLAVANK 514

Query: 220 W 220
           W
Sbjct: 515 W 515


>gi|242032633|ref|XP_002463711.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
 gi|241917565|gb|EER90709.1| hypothetical protein SORBIDRAFT_01g004670 [Sorghum bicolor]
          Length = 297

 Score = 81.3 bits (199), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 59/208 (28%), Positives = 87/208 (41%), Gaps = 31/208 (14%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
           P +V +L   PR       + D+E + +I L+KG +E+  V +   G ++    R S   
Sbjct: 32  PARVTQLSWRPRAFLYSGFLSDTECDHLINLAKGSMEKSMVADNDSGKSLMSQVRTSSGA 91

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----AT 150
           FL      +   +  I+ R+   T L     E     +Q+  Y +G  YD H D      
Sbjct: 92  FLAKH---EDEIVSAIEKRVAAWTFL----PEENAESMQVLRYEIGQKYDAHFDYFHDKN 144

Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAV 192
               G  R A+ + YLTDV+ GG T+FP+                    L V P+KG A+
Sbjct: 145 NVKHGGQRFATVLMYLTDVKKGGETVFPNAEGSHLQYKDETWSECSRSGLAVKPKKGDAL 204

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
            ++  H N   D    H  CPV  G KW
Sbjct: 205 LFFGLHLNATTDTSSLHGSCPVIEGEKW 232


>gi|221512814|ref|NP_649045.2| CG18231 [Drosophila melanogaster]
 gi|220902637|gb|AAF49253.3| CG18231 [Drosophila melanogaster]
          Length = 470

 Score = 81.3 bits (199), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 63/221 (28%), Positives = 93/221 (42%), Gaps = 57/221 (25%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y + C+G        + N  C Y      FLK+ PLK E L  DP +V  HD + D EI+
Sbjct: 278 YEIGCRGQFLR----RRNHVCTYNFTITEFLKLAPLKQEVLNWDPYIVIYHDVLNDDEID 333

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
           ++             +N  D + V+                 P   +I  RI ++T L  
Sbjct: 334 KL----------KNHLNDTDAVEVN-----------------PIEKRIFQRINELTRLSF 366

Query: 123 GREERY----KGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFP 178
              ++      GP                  T + +  +   + +F+L +VELGGA +FP
Sbjct: 367 EHSDQQIVSKNGP-----------------RTHKHKKEYLKGTLLFFLNNVELGGAMVFP 409

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
            L ++VFP+KGS +FW+N      LD R     CPV  GNK
Sbjct: 410 KLKISVFPQKGSCLFWHNT-----LDPRSEPLECPVLQGNK 445


>gi|195505253|ref|XP_002099424.1| GE23369 [Drosophila yakuba]
 gi|194185525|gb|EDW99136.1| GE23369 [Drosophila yakuba]
          Length = 164

 Score = 81.3 bits (199), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 50/181 (27%), Positives = 78/181 (43%), Gaps = 39/181 (21%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
           +E++ L+P VV  HD I   E  ++IEL+   ++   V     + +   R  K  ++  E
Sbjct: 1   MEQVGLNPYVVLYHDVISPQESAQLIELAASDLKASGVFQAKGSTFKRLRTVKARWIKKE 60

Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRL 159
               +    +I  RI+DMT   +   E+                                
Sbjct: 61  F---NELTKRITRRIRDMTGFDLKEGEK-------------------------------- 85

Query: 160 ASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
               F L+DVE GGAT+FP    T++P  G+A+ WYN H +   D    H+ CPV +G+K
Sbjct: 86  ----FQLSDVEQGGATVFPMSGYTIYPRAGTALLWYNLHTDGHCDPSTLHAACPVMVGSK 141

Query: 220 W 220
           W
Sbjct: 142 W 142


>gi|228990015|ref|ZP_04149988.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
           12442]
 gi|228769681|gb|EEM18271.1| Prolyl 4-hydroxylase alpha subunit [Bacillus pseudomycoides DSM
           12442]
          Length = 219

 Score = 80.9 bits (198), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 62/217 (28%), Positives = 98/217 (45%), Gaps = 16/217 (7%)

Query: 9   GNLSVPEDIKSN--LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           G +S   +   N  L  F  + N    +   +++     +P +V + + + D E   +IE
Sbjct: 2   GQMSTKNETVKNTELTIFNHTGNTIVTEDREIQIISRLEEPLIVVLANVLSDEECETLIE 61

Query: 67  LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
           +SK K++R K+     T   D R S   FL      +     +I+ RI  + N+     E
Sbjct: 62  MSKNKMKRSKIGVSRKT--NDIRTSSGAFLE-----ESEITTRIERRIASIMNVPAPHGE 114

Query: 127 RYKGPLQINNYGLGGHYDLHCDATPRDEGLW---RLASFMFYLTDVELGGATIFPSLNLT 183
                LQI  Y +G  Y  H D    +       R+++ + YL  VE GG T FP LNL+
Sbjct: 115 ----GLQILKYTVGQEYQAHYDFFVENSAAASNNRMSTLVMYLNHVEEGGETFFPKLNLS 170

Query: 184 VFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           V P+KG AV++   + +  ++    H G PV  G KW
Sbjct: 171 VSPKKGMAVYFEYFYQDESINKLTLHGGAPVIKGEKW 207


>gi|357467085|ref|XP_003603827.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492875|gb|AES74078.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 280

 Score = 80.9 bits (198), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 60/207 (28%), Positives = 91/207 (43%), Gaps = 32/207 (15%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLY 97
            E L  +PR    H+ +   E   +I L+K  + +  VV+   G +     R S   FL 
Sbjct: 69  TEILSWEPRAFVYHNFLSKEECEHLINLAKPFLAKSSVVDSKTGKSTESRVRTSSGMFLK 128

Query: 98  PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
               G    +  I+ RI D T + +   E     LQ+ +YG+G     HYD   D     
Sbjct: 129 R---GKDKIIQNIERRIADFTFIPVENGE----GLQVLHYGVGEKYEPHYDYFLDEFNTK 181

Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
            G  R+A+ + YL+DVE GG T+FP+                     L++ P+ G A+ +
Sbjct: 182 NGGQRVATVLMYLSDVEEGGETVFPAAKANFSSVPWWNDLSECARKGLSLKPKMGDALLF 241

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKWG 221
           ++   +  LD    H GCPV +GNKW 
Sbjct: 242 WSMRPDATLDASSLHGGCPVIVGNKWS 268


>gi|66771505|gb|AAY55064.1| IP12044p [Drosophila melanogaster]
          Length = 484

 Score = 80.9 bits (198), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 63/221 (28%), Positives = 93/221 (42%), Gaps = 57/221 (25%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y + C+G        + N  C Y      FLK+ PLK E L  DP +V  HD + D EI+
Sbjct: 292 YEIGCRGQFLR----RRNHVCTYNFTITEFLKLAPLKQEVLNWDPYIVIYHDVLNDDEID 347

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
           ++             +N  D + V+                 P   +I  RI ++T L  
Sbjct: 348 KL----------KNHLNDTDAVEVN-----------------PIEKRIFQRINELTRLSF 380

Query: 123 GREERY----KGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFP 178
              ++      GP                  T + +  +   + +F+L +VELGGA +FP
Sbjct: 381 EHSDQQIVSKNGP-----------------RTHKHKKEYLKGTLLFFLNNVELGGAMVFP 423

Query: 179 SLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
            L ++VFP+KGS +FW+N      LD R     CPV  GNK
Sbjct: 424 KLKISVFPQKGSCLFWHNT-----LDPRSEPLECPVLQGNK 459


>gi|229002593|ref|ZP_04160640.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
 gi|229003816|ref|ZP_04161625.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
 gi|228757417|gb|EEM06653.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock1-4]
 gi|228758520|gb|EEM07660.1| Prolyl 4-hydroxylase alpha subunit [Bacillus mycoides Rock3-17]
          Length = 219

 Score = 80.9 bits (198), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 62/217 (28%), Positives = 98/217 (45%), Gaps = 16/217 (7%)

Query: 9   GNLSVPEDIKSN--LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIE 66
           G +S   +   N  L  F  + N    +   +++     +P +V + + + D E   +IE
Sbjct: 2   GQMSTKNETVENTELTIFNHTGNTIVTEDREIQIISRLEEPLIVVLANVLSDEECETLIE 61

Query: 67  LSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
           +SK K++R K+     T   D R S   FL      +     +I+ RI  + N+     E
Sbjct: 62  MSKNKMKRSKIGISRKT--NDIRTSSGAFLE-----ESEITTRIERRIASIMNVPAPHGE 114

Query: 127 RYKGPLQINNYGLGGHYDLHCDATPRDEGLW---RLASFMFYLTDVELGGATIFPSLNLT 183
                LQI  Y +G  Y  H D    +       R+++ + YL  VE GG T FP LNL+
Sbjct: 115 ----GLQILKYTVGQEYQAHYDFFVENSAAASNNRMSTLVMYLNHVEEGGETFFPKLNLS 170

Query: 184 VFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           V P+KG AV++   + +  ++    H G PV  G KW
Sbjct: 171 VSPKKGMAVYFEYFYQDESINKLTLHGGAPVIKGEKW 207


>gi|6437556|gb|AAF08583.1|AC011623_16 unknown protein [Arabidopsis thaliana]
          Length = 278

 Score = 80.9 bits (198), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 55/188 (29%), Positives = 88/188 (46%), Gaps = 13/188 (6%)

Query: 39  KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFL 96
           KV+++   PR       + D E + +I L+K  ++R  V +   G++   D R S   F+
Sbjct: 37  KVKQVSSKPRAFVYEGFLTDLECDHLISLAKENLQRSAVADNDNGESQVSDVRTSSGTFI 96

Query: 97  YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----ATPR 152
                G  P +  I+ ++   T L     E     LQ+  Y  G  YD H D        
Sbjct: 97  SK---GKDPIVSGIEDKLSTWTFLPKENGE----DLQVLRYEHGQKYDAHFDYFHDKVNI 149

Query: 153 DEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGC 212
             G  R+A+ + YL++V  GG T+FP   + + P+KG+A+ ++N   + + D    H GC
Sbjct: 150 ARGGHRIATVLLYLSNVTKGGETVFPDAQVCLKPKKGNALLFFNLQQDAIPDPFSLHGGC 209

Query: 213 PVALGNKW 220
           PV  G KW
Sbjct: 210 PVIEGEKW 217


>gi|449520146|ref|XP_004167095.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 249

 Score = 80.5 bits (197), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 65/212 (30%), Positives = 94/212 (44%), Gaps = 32/212 (15%)

Query: 34  KIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLS 91
           K G   VE +  +PR    H+ +   E   +I L+K  +E+  VV+   G  +    R S
Sbjct: 33  KRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDNETGKNVEDSVRTS 92

Query: 92  KVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATP 151
              FL     G    +  I+ RI D T + I   E     LQI +Y +G  YD H D   
Sbjct: 93  SGMFLNR---GQDKIVSNIEKRIADFTFIPIEHGE----GLQILHYEVGQKYDAHYDFFD 145

Query: 152 RDEGL----WRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEK 188
            +  L     R+A+ + YL+DVE GG T+FP+                     L+V P+ 
Sbjct: 146 DEFNLKEIGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSKCGKGGLSVKPKM 205

Query: 189 GSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           G A+ +++   +T LD    H  CPV  GNKW
Sbjct: 206 GDALLFWSMKPDTTLDPTSLHGACPVIRGNKW 237


>gi|218193936|gb|EEC76363.1| hypothetical protein OsI_13952 [Oryza sativa Indica Group]
          Length = 1062

 Score = 80.5 bits (197), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 60/209 (28%), Positives = 89/209 (42%), Gaps = 31/209 (14%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
           P +V +L   PR       +   E + ++ L+KG++E+  V +   G +I    R S   
Sbjct: 34  PARVTQLSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQVRTSSGT 93

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----AT 150
           FL      +   +  I+ R+   T L     E     +QI +Y LG  YD H D      
Sbjct: 94  FLSKH---EDDIVSGIEKRVAAWTFL----PEENAESIQILHYELGQKYDAHFDYFHDKN 146

Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSL------------------NLTVFPEKGSAV 192
               G  R+A+ + YLTDV+ GG T+FP+                    L V P+KG A+
Sbjct: 147 NLKRGGHRVATVLMYLTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDAL 206

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKWG 221
            +++ H N   D    H  CPV  G KW 
Sbjct: 207 LFFSLHVNATTDPASLHGSCPVIEGEKWS 235


>gi|356550516|ref|XP_003543632.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 318

 Score = 80.5 bits (197), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 62/214 (28%), Positives = 92/214 (42%), Gaps = 31/214 (14%)

Query: 31  TFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDT 88
           + +K  P +V +L   PR       + D E + +I L+K K+E+  V +   G +I  + 
Sbjct: 47  SSVKFDPTRVTQLSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVADNESGKSIMSEV 106

Query: 89  RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYD 144
           R S   FL          +  I+ RI   T L I   E     +QI +Y  G     H+D
Sbjct: 107 RTSSGMFLNK---AQDEIVAGIEARIAAWTFLPIENGES----MQILHYENGQKYEPHFD 159

Query: 145 LHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL------------------TVFP 186
              D   +  G  R+A+ + YL+DVE GG TIFP+                      V P
Sbjct: 160 YFHDKANQVMGGHRIATVLMYLSDVEKGGETIFPNAKAKLLQPKDESWSECAHKGYAVKP 219

Query: 187 EKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            KG A+ +++ H +   D +  H  CPV  G KW
Sbjct: 220 RKGDALLFFSLHLDASTDNKSLHGSCPVIEGEKW 253


>gi|168002780|ref|XP_001754091.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162694645|gb|EDQ80992.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 214

 Score = 80.5 bits (197), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 61/207 (29%), Positives = 92/207 (44%), Gaps = 32/207 (15%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLY 97
           VE L  +PR    H  + + E N +IE+++  + +  VV+       D+RL  S   FL 
Sbjct: 3   VEVLSWEPRAFLYHHFLTEEECNHLIEVARPSLVKSTVVDSDTGKSKDSRLRTSSGTFL- 61

Query: 98  PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
             + G  P + +I+ RI D T +   + E     LQ+  Y        HYD   DA    
Sbjct: 62  --MRGQDPVIKRIEKRIADFTFIPAEQGE----GLQVLQYKESEKYEPHYDYFHDAYNTK 115

Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
            G  R+A+ + YL++VE GG T+FP+                     L+V P  G A+ +
Sbjct: 116 NGGQRIATVLMYLSNVEEGGETVFPAAQVNKTEVPDWDKLSECAQKGLSVRPRMGDALLF 175

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKWG 221
           ++   +  LD    H GCPV  G KW 
Sbjct: 176 WSMKPDATLDSTSLHGGCPVIKGTKWS 202


>gi|302844249|ref|XP_002953665.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
 gi|300261074|gb|EFJ45289.1| prolyl 4-hydroxylase [Volvox carteri f. nagariensis]
          Length = 245

 Score = 80.5 bits (197), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 61/226 (26%), Positives = 102/226 (45%), Gaps = 32/226 (14%)

Query: 13  VPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKV 72
           +PE +  +    +   +  F +     VE++ L PR    H+ +  +E   ++ L+  K+
Sbjct: 27  LPERLLPSALVMHHEADKQFDEEATPWVEQVGLHPRAYLFHNFLTKAERAHMVRLAAPKL 86

Query: 73  ERGKVV-NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGP 131
           +R  VV N G+ +  + R S  Y ++     D P + +I+ RI   T+L I  +E     
Sbjct: 87  KRSTVVGNDGEGVVDEIRTS--YGMFIRRLAD-PVITRIEKRISLWTHLPIEHQED---- 139

Query: 132 LQINNYGLGGHYDLHCDATPRDE---GLWRLASFMFYLTDVELGGATIFPSLN------- 181
           +Q+  Y  G  Y  H D+  +       WRLA+F+ YL+DVE GG T FP  +       
Sbjct: 140 IQVLRYAHGQTYGAHYDSGDKSNEPGPKWRLATFLMYLSDVEEGGETAFPQNSVWYDPTI 199

Query: 182 --------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCP 213
                         +   P+ G AV +Y+ + N  +D    H+GCP
Sbjct: 200 PERIGPVSECAKGHVAAKPKAGDAVLFYSFYPNLTMDPAAMHTGCP 245


>gi|302793288|ref|XP_002978409.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
 gi|300153758|gb|EFJ20395.1| hypothetical protein SELMODRAFT_418273 [Selaginella moellendorffii]
          Length = 256

 Score = 80.1 bits (196), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 61/209 (29%), Positives = 95/209 (45%), Gaps = 32/209 (15%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVY 94
           P+  E +   PR    H+ +   E + +I L++  ++R  VV+       D+R+  S   
Sbjct: 43  PVWTETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSRVRTSSGT 102

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----AT 150
           FL     G    + +I+ RI   T +    +E  +G LQ+ +Y +G  YD H D      
Sbjct: 103 FLR---RGQDEIISRIEERIAKFTFIP---KEHGEG-LQVLHYEVGQKYDAHHDYFHDKV 155

Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLNL-------------------TVFPEKGSA 191
               G  R+A+ + YL+DVE GG T+FPS  +                   +V P KG A
Sbjct: 156 NTKNGGQRVATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECGKKGVSVKPRKGDA 215

Query: 192 VFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + +++   +  LD    H GCPV  GNKW
Sbjct: 216 LLFWSMSPDAELDPFSLHGGCPVIKGNKW 244


>gi|302773668|ref|XP_002970251.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
 gi|300161767|gb|EFJ28381.1| hypothetical protein SELMODRAFT_411114 [Selaginella moellendorffii]
          Length = 256

 Score = 80.1 bits (196), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 61/209 (29%), Positives = 95/209 (45%), Gaps = 32/209 (15%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVY 94
           P+  E +   PR    H+ +   E + +I L++  ++R  VV+       D+R+  S   
Sbjct: 43  PVWTETISWQPRASVFHNFLSSEECDHLIRLAQPNMKRSAVVDNQTGKSKDSRVRTSSGT 102

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----AT 150
           FL     G    + +I+ RI   T +    +E  +G LQ+ +Y +G  YD H D      
Sbjct: 103 FLR---RGQDEIISRIEERIAKFTFIP---KEHGEG-LQVLHYEVGQKYDAHHDYFHDKV 155

Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLNL-------------------TVFPEKGSA 191
               G  R+A+ + YL+DVE GG T+FPS  +                   +V P KG A
Sbjct: 156 NTKNGGQRVATVLMYLSDVEEGGETVFPSAKVNSSSVPWWDELSECAKKGVSVKPRKGDA 215

Query: 192 VFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + +++   +  LD    H GCPV  GNKW
Sbjct: 216 LLFWSMSPDAELDPFSLHGGCPVIKGNKW 244


>gi|255083627|ref|XP_002508388.1| predicted protein [Micromonas sp. RCC299]
 gi|226523665|gb|ACO69646.1| predicted protein [Micromonas sp. RCC299]
          Length = 253

 Score = 80.1 bits (196), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 61/208 (29%), Positives = 94/208 (45%), Gaps = 42/208 (20%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDH 104
           PR   +H+ +   E +RI+E+++ +V R  V++   G +     R S+  FL     G  
Sbjct: 5   PRAFHLHNFMSHEECDRILEIARPRVRRSTVIDSVTGQSKVDPIRTSEQTFLN---RGTW 61

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGP-LQINNYGLGGHYDLHCD------ATPRD---E 154
             + K++ R+  +T L       Y G  +QI  YGLG  YD H D      A+ +    E
Sbjct: 62  DIVTKVEERLAVVTQLPA-----YHGEDMQILKYGLGQKYDAHHDVGELTSASGKQLAAE 116

Query: 155 GLWRLASFMFYLTDVELGGATIFPSL----------------------NLTVFPEKGSAV 192
           G  R+A+ + YL+DVE GG T FP                        N+ V P KG  +
Sbjct: 117 GGHRVATVLLYLSDVEEGGETAFPDSEWMTPELRKWAEGQKWSDCAEGNVAVKPRKGDGL 176

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
            +++ +    +D    H+GCPV  G KW
Sbjct: 177 LFWSVNNENAIDPHSMHAGCPVIRGEKW 204


>gi|215490181|dbj|BAG86624.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
          Length = 294

 Score = 80.1 bits (196), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 62/214 (28%), Positives = 90/214 (42%), Gaps = 34/214 (15%)

Query: 35  IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
           I P K +++   PR       + D E N +I L+K +++R  V +   G++   + R S 
Sbjct: 28  INPSKAKQISWKPRAFVYEGFLTDEECNHLISLAKSELKRSAVADNESGNSKTSEVRTSS 87

Query: 93  VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCD 148
             F+ P+     P +  I+ +I   T L     E     +Q+  Y  G     HYD   D
Sbjct: 88  GMFI-PK--AKDPIVSGIEEKIATWTFLPKENGEE----IQVLRYEEGQKYEPHYDYFVD 140

Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN---------------------LTVFPE 187
                 G  RLA+ + YLT+VE GG T+FP                        + V P 
Sbjct: 141 KVNIARGGHRLATVLMYLTNVEKGGETVFPKAEESPRRRSMIADDSLSECAKKGIPVKPR 200

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
           KG A+ +Y+ H N   D    H GCPV  G KW 
Sbjct: 201 KGDALLFYSLHPNATPDPLSLHGGCPVIQGEKWS 234


>gi|356572148|ref|XP_003554232.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Glycine max]
          Length = 319

 Score = 80.1 bits (196), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 61/212 (28%), Positives = 92/212 (43%), Gaps = 31/212 (14%)

Query: 33  LKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRL 90
           +K  P +V +L   PR       + + E + +I L+K K+E+  V +   G +I  D R 
Sbjct: 50  VKFDPTRVTQLSWSPRAFLYKGFLSEEECDHLIVLAKDKLEKSMVADNDSGKSIMSDIRT 109

Query: 91  SKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLH 146
           S   FL          +  I+ RI   T L +   E     +QI +Y  G     H+D  
Sbjct: 110 SSGMFLNK---AQDEIVAGIEARIAAWTFLPVENGES----MQILHYENGQKYEPHFDYF 162

Query: 147 CDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL------------------TVFPEK 188
            D   +  G  R+A+ + YL+DVE GG TIFP+                      V P+K
Sbjct: 163 HDKANQVMGGHRIATVLMYLSDVEKGGETIFPNAEAKLLQPKDESWSECAHKGYAVKPQK 222

Query: 189 GSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           G A+ +++ H +   D +  H  CPV  G KW
Sbjct: 223 GDALLFFSLHLDASTDTKSLHGSCPVIEGEKW 254


>gi|449443243|ref|XP_004139389.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 284

 Score = 80.1 bits (196), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 64/212 (30%), Positives = 93/212 (43%), Gaps = 32/212 (15%)

Query: 34  KIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLS 91
           K G   VE +  +PR    H+ +   E   +I L+K  +E+  VV+   G  +    R S
Sbjct: 68  KRGDQWVEFISWEPRAFVYHNFLSKEECLYLISLAKPHMEKSTVVDNETGKNVEDSVRTS 127

Query: 92  KVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG----GHYDLHC 147
              FL     G    +  I+ RI D T + I   E     LQI +Y +G     HYD   
Sbjct: 128 SGMFLNR---GQDKIVSNIEKRIADFTFIPIEHGE----GLQILHYEVGQKYDAHYDYFV 180

Query: 148 DATPRDEGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEK 188
           D     +G  R+A+ + YL+DVE GG T+FP+                     L+V P+ 
Sbjct: 181 DEYNIKKGGQRMATLLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSKCGKGGLSVKPKM 240

Query: 189 GSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           G A+ +++   +  LD    H  CPV  GNKW
Sbjct: 241 GDALLFWSMKPDATLDPTSLHGACPVIRGNKW 272


>gi|115456019|ref|NP_001051610.1| Os03g0803500 [Oryza sativa Japonica Group]
 gi|29150365|gb|AAO72374.1| putative oxidoreductase [Oryza sativa Japonica Group]
 gi|108711618|gb|ABF99413.1| oxidoreductase, 2OG-Fe oxygenase family protein, putative,
           expressed [Oryza sativa Japonica Group]
 gi|113550081|dbj|BAF13524.1| Os03g0803500 [Oryza sativa Japonica Group]
 gi|215765410|dbj|BAG87107.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222625993|gb|EEE60125.1| hypothetical protein OsJ_13003 [Oryza sativa Japonica Group]
          Length = 299

 Score = 80.1 bits (196), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 60/208 (28%), Positives = 89/208 (42%), Gaps = 31/208 (14%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
           P +V +L   PR       +   E + ++ L+KG++E+  V +   G +I    R S   
Sbjct: 34  PARVTQLSWRPRAFLYSGFLSHDECDHLVNLAKGRMEKSMVADNDSGKSIMSQVRTSSGT 93

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----AT 150
           FL      +   +  I+ R+   T L     E     +QI +Y LG  YD H D      
Sbjct: 94  FLSKH---EDDIVSGIEKRVAAWTFL----PEENAESIQILHYELGQKYDAHFDYFHDKN 146

Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSL------------------NLTVFPEKGSAV 192
               G  R+A+ + YLTDV+ GG T+FP+                    L V P+KG A+
Sbjct: 147 NLKRGGHRVATVLMYLTDVKKGGETVFPNAAGRHLQLKDETWSDCARSGLAVKPKKGDAL 206

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
            +++ H N   D    H  CPV  G KW
Sbjct: 207 LFFSLHVNATTDPASLHGSCPVIEGEKW 234


>gi|385205097|ref|ZP_10031967.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
 gi|385184988|gb|EIF34262.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
          Length = 292

 Score = 80.1 bits (196), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 56/185 (30%), Positives = 87/185 (47%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           P+++   D +   E   +IE S+ +++R   VN   G    +  R S+  +      G+ 
Sbjct: 103 PQMIVFADVLSPDECAEMIERSRHRLKRSTTVNPATGKEDVIRNRTSEGIWYQ---RGED 159

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---------G 155
           PF+ ++  RI  + N  +   E  +G LQ+  YG  G Y  H D  P D+         G
Sbjct: 160 PFIERMDRRISSLMNWPV---ENGEG-LQLLRYGTTGEYRPHFDYFPPDQPGSTVHTAQG 215

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+A+ + YL DV  GG TIFP   ++V   +G AV++   +    LD    H G PV 
Sbjct: 216 GQRVATLVIYLNDVPDGGETIFPEAGMSVAASQGGAVYFRYMNGRRQLDPLTLHGGAPVL 275

Query: 216 LGNKW 220
            G+KW
Sbjct: 276 SGDKW 280


>gi|297802350|ref|XP_002869059.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297314895|gb|EFH45318.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 290

 Score = 80.1 bits (196), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 67/236 (28%), Positives = 107/236 (45%), Gaps = 34/236 (14%)

Query: 12  SVPEDIKSNLKCF--YESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSK 69
           S P D+ + ++     ESY +     G   +E +  +PR    H+ + + E   +I L+K
Sbjct: 50  SRPMDLTTIVQTIEERESYGDEEDGNGDRWLEVISWEPRAFVYHNFLTNEECEHLISLAK 109

Query: 70  GKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREER 127
             + + KVV+   G +I    R S   FL     G    + +I+ RI D T + I   E 
Sbjct: 110 PSMVKSKVVDVKTGKSIDSRVRTSSGTFLKR---GHDEIVEEIENRISDFTFIPIENGEG 166

Query: 128 YKGPLQINNYGLGG----HYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLN-- 181
               LQ+ +Y +G     H+D   D     +G  R+A+ + YL+DV+ GG T+FP+    
Sbjct: 167 ----LQVLHYEVGQKYEPHHDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGN 222

Query: 182 -----------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
                            L+V P+K  A+ +++   +  LD    H GCPV  GNKW
Sbjct: 223 ISDVPWWDELSQCGKEGLSVLPKKRDALLFWSMKPDASLDPSSLHGGCPVIKGNKW 278


>gi|218192156|gb|EEC74583.1| hypothetical protein OsI_10158 [Oryza sativa Indica Group]
          Length = 299

 Score = 79.7 bits (195), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 62/206 (30%), Positives = 92/206 (44%), Gaps = 36/206 (17%)

Query: 42  ELYLDPRVVKIHDAIYDSEINRIIELSK-GKVERGKVVN--YGDTIYVDTRLSKVYFLYP 98
           ++   PRV      + D+E   +I L+K G++ER  VVN   G+++   TR S   FL  
Sbjct: 39  DVSWSPRVFLYEGFLSDAECEHLIALAKQGRMERSTVVNGKSGESVMSKTRTSSGMFL-- 96

Query: 99  EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD------ATPR 152
            I      + +I+ RI   T       E     +Q+  YG G  Y+ H D      A+ R
Sbjct: 97  -IRKQDEVVARIEERIAAWTMFPAENGE----SMQMLRYGQGEKYEPHFDYIRGRQASAR 151

Query: 153 DEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAVFW 194
             G  R+A+ + YL++V++GG T+FP                       V P KGSAV +
Sbjct: 152 --GGHRIATVLMYLSNVKMGGETVFPDAEARLSQPKDETWSDCAEQGFAVKPTKGSAVLF 209

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
           ++ + N   D    H  CPV  G KW
Sbjct: 210 FSLYPNATFDPGSLHGSCPVIQGEKW 235


>gi|407708877|ref|YP_006792741.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
 gi|407237560|gb|AFT87758.1| prolyl 4-hydroxylase [Burkholderia phenoliruptrix BR3459a]
          Length = 300

 Score = 79.7 bits (195), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 56/185 (30%), Positives = 91/185 (49%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           P+V+   + +   E + +IE S+ +++R  +V+   G    +  R S+  +      G+ 
Sbjct: 111 PQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEGVIRNRTSEGIWYQ---RGED 167

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---------G 155
            F+ ++  RI  + N  +   E  +G LQI +YG  G Y  H D  P D+         G
Sbjct: 168 AFIERLDRRIASLMNWPV---ENGEG-LQILHYGPTGEYRPHFDYFPPDQPGSAVHTARG 223

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+A+ + YL DV  GG TIFP+  L+V  ++G AV++   +    LD    H G PV 
Sbjct: 224 GQRVATLVVYLNDVADGGETIFPAAGLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGAPVR 283

Query: 216 LGNKW 220
            G+KW
Sbjct: 284 AGDKW 288


>gi|323528042|ref|YP_004230194.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
 gi|323385044|gb|ADX57134.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1001]
          Length = 300

 Score = 79.7 bits (195), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 56/185 (30%), Positives = 91/185 (49%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           P+V+   + +   E + +IE S+ +++R  +V+   G    +  R S+  +      G+ 
Sbjct: 111 PQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEGVIRNRTSEGIWYQ---RGED 167

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---------G 155
            F+ ++  RI  + N  +   E  +G LQI +YG  G Y  H D  P D+         G
Sbjct: 168 AFIERLDQRIASLMNWPV---ENGEG-LQILHYGPTGEYRPHFDYFPPDQPGSAVHTARG 223

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+A+ + YL DV  GG TIFP+  L+V  ++G AV++   +    LD    H G PV 
Sbjct: 224 GQRVATLVVYLNDVADGGETIFPAAGLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGAPVH 283

Query: 216 LGNKW 220
            G+KW
Sbjct: 284 AGDKW 288


>gi|295704991|ref|YP_003598066.1| 2OG-Fe(II) oxygenase [Bacillus megaterium DSM 319]
 gi|294802650|gb|ADF39716.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium DSM 319]
          Length = 219

 Score = 79.3 bits (194), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 56/201 (27%), Positives = 97/201 (48%), Gaps = 12/201 (5%)

Query: 23  CFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGD 82
            F  S N   L+   + +   + +P V+ + + + + E + +I LSK K++R K+   G 
Sbjct: 15  IFNHSGNKIKLEDREINIVARFEEPLVLVLGNVLSNEECDELIRLSKDKMQRSKI---GA 71

Query: 83  TIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGH 142
              V++  +     + E   ++  +++I+ R+      ++G    Y   LQI  Y     
Sbjct: 72  AREVNSIRTSSGMFFDE--SENELVHQIERRLSK----IMGPSIEYAEGLQILKYLPDQE 125

Query: 143 YDLHCD---ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHA 199
           Y  H D   +  +     R+++ + YL DVE GG T FP L L+V P KG AV++   ++
Sbjct: 126 YKAHHDYFTSASKASKNNRISTLVMYLNDVEEGGETYFPKLGLSVSPTKGMAVYFEYFYS 185

Query: 200 NTLLDYRMYHSGCPVALGNKW 220
           +  L+ R  H G PV  G KW
Sbjct: 186 DAELNDRTLHGGAPVIKGEKW 206


>gi|341878860|gb|EGT34795.1| hypothetical protein CAEBREN_10065 [Caenorhabditis brenneri]
          Length = 163

 Score = 79.3 bits (194), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 43/112 (38%), Positives = 59/112 (52%), Gaps = 12/112 (10%)

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYLTD 168
           MTNL +   E     LQI NYG+GGHYD H D   ++E           R+A+ +FY++ 
Sbjct: 1   MTNLEMETAEE----LQIANYGIGGHYDPHFDHAKKEESKSFESLGTGNRIATVLFYMSQ 56

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
              GG T+F  +  TV P K  A+FWYN +     +    H+ CPV +G KW
Sbjct: 57  PSHGGGTVFTEVKSTVLPTKNDALFWYNLYKQGDGNPDTRHAACPVLVGIKW 108


>gi|170690448|ref|ZP_02881615.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
 gi|170144883|gb|EDT13044.1| Procollagen-proline dioxygenase [Burkholderia graminis C4D1M]
          Length = 307

 Score = 79.3 bits (194), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 56/185 (30%), Positives = 90/185 (48%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           P+V+   + +   E + +IE S+ +++R  +V+   G    +  R S+  +      G+ 
Sbjct: 118 PQVIVFANVLSPEECDEVIERSRHRLKRSTIVDPATGQEDVIRNRTSEGIWYQ---RGED 174

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---------G 155
            F+ ++  RI  + N  +   E  +G LQI +YG  G Y  H D  P D+         G
Sbjct: 175 AFIERLDQRIASLMNWPV---ENGEG-LQILHYGPTGEYRPHFDYFPPDQPGSMVHTARG 230

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+A+ + YL DV  GG TIFP   L+V  ++G AV++   +    LD    H G PV 
Sbjct: 231 GQRVATLVIYLNDVPDGGETIFPEAGLSVAAKQGGAVYFRYMNGQRQLDPLTLHGGAPVR 290

Query: 216 LGNKW 220
            G+KW
Sbjct: 291 AGDKW 295


>gi|357483925|ref|XP_003612249.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355513584|gb|AES95207.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 289

 Score = 79.3 bits (194), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 65/241 (26%), Positives = 106/241 (43%), Gaps = 33/241 (13%)

Query: 6   ACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           +   NL  P D+ S +    +  ++   K G   VE +  +PR    H+ +   E   +I
Sbjct: 45  SSNQNLPKPNDLTSIVHNTVDRNDDEEGK-GEQWVEVVSWEPRAFVYHNFLTKEECEYLI 103

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIG 123
           +++K  + +  VV+       D+R+  S   FL     G    +  I+ +I D T + + 
Sbjct: 104 DIAKPSMHKSTVVDSETGKSKDSRVRTSSGTFL---ARGRDKIVRNIEKKIADFTFIPVE 160

Query: 124 REERYKGPLQINNYGLGG----HYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPS 179
             E     LQ+ +Y +G     HYD   D      G  R+A+ + YLTDVE GG T+FP+
Sbjct: 161 HGEG----LQVLHYEVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLTDVEEGGETVFPA 216

Query: 180 LN-------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
                                L++ P++G A+ +++   +  LD    H GCPV  GNKW
Sbjct: 217 AKGNFSNVPWYNELSDCGKKGLSIKPKRGDALLFWSMKPDATLDASSLHGGCPVIKGNKW 276

Query: 221 G 221
            
Sbjct: 277 S 277


>gi|297832394|ref|XP_002884079.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297329919|gb|EFH60338.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 291

 Score = 79.3 bits (194), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 62/210 (29%), Positives = 94/210 (44%), Gaps = 32/210 (15%)

Query: 36  GPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKV 93
           G   VE +  +PR V  H+ + + E   +I L+K  + +  VV+       D+R+  S  
Sbjct: 76  GERWVEVISWEPRAVVYHNFLSNEECEHLINLAKPSMVKSTVVDEKTGGSKDSRVRTSSG 135

Query: 94  YFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDA 149
            FL     G    +  I+ RI D T + +   E     LQ+ +Y +G     HYD   D 
Sbjct: 136 TFLRR---GHDEVVEVIEKRISDFTFIPVENGE----GLQVLHYQVGQKYEPHYDYFLDE 188

Query: 150 TPRDEGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGS 190
                G  R+A+ + YL+DV+ GG T+FP+                     L+V P+K  
Sbjct: 189 FNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRD 248

Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           A+ ++N   +  LD    H GCPV  GNKW
Sbjct: 249 ALLFWNMRPDASLDPSSLHGGCPVVKGNKW 278


>gi|15227885|ref|NP_179363.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|25411813|pir||F84555 similar to prolyl 4-hydroxylase alpha subunit [imported] -
           Arabidopsis thaliana
 gi|89274129|gb|ABD65585.1| At2g17720 [Arabidopsis thaliana]
 gi|110738861|dbj|BAF01353.1| similar to prolyl 4-hydroxylase alpha subunit [Arabidopsis
           thaliana]
 gi|330251579|gb|AEC06673.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 291

 Score = 79.3 bits (194), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 62/210 (29%), Positives = 94/210 (44%), Gaps = 32/210 (15%)

Query: 36  GPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKV 93
           G   VE +  +PR V  H+ + + E   +I L+K  + +  VV+       D+R+  S  
Sbjct: 76  GERWVEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSRVRTSSG 135

Query: 94  YFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDA 149
            FL     G    +  I+ RI D T + +   E     LQ+ +Y +G     HYD   D 
Sbjct: 136 TFLR---RGHDEVVEVIEKRISDFTFIPVENGE----GLQVLHYQVGQKYEPHYDYFLDE 188

Query: 150 TPRDEGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGS 190
                G  R+A+ + YL+DV+ GG T+FP+                     L+V P+K  
Sbjct: 189 FNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKKRD 248

Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           A+ ++N   +  LD    H GCPV  GNKW
Sbjct: 249 ALLFWNMRPDASLDPSSLHGGCPVVKGNKW 278


>gi|384046522|ref|YP_005494539.1| prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
 gi|345444213|gb|AEN89230.1| Prolyl 4-hydroxylase alpha subunit [Bacillus megaterium WSH-002]
          Length = 219

 Score = 79.3 bits (194), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 55/201 (27%), Positives = 98/201 (48%), Gaps = 12/201 (5%)

Query: 23  CFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGD 82
            F  S N   L+   + +   + +P V+ + + + + E + +I+LSK K++R K+   G 
Sbjct: 15  IFNHSGNKIKLEDREIDIVARFEEPLVLVLGNVLSNEECDELIQLSKDKMQRSKI---GA 71

Query: 83  TIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGH 142
              V++  +     + E   ++  +++I+ R+      ++G    Y   LQI  Y     
Sbjct: 72  EREVNSIRTSSGMFFEE--SENELVHQIERRLSK----IMGPSIEYAEGLQILKYLPDQE 125

Query: 143 YDLHCD---ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHA 199
           Y  H D   +  +     R+++ + YL DVE GG T FP L L++ P KG AV++   ++
Sbjct: 126 YKAHHDYFTSASKASKNNRISTLVMYLNDVEEGGETYFPKLGLSISPTKGMAVYFEYFYS 185

Query: 200 NTLLDYRMYHSGCPVALGNKW 220
           +  L+ R  H G PV  G KW
Sbjct: 186 DAELNDRTLHGGAPVIKGEKW 206


>gi|444512226|gb|ELV10078.1| Prolyl 4-hydroxylase subunit alpha-1 [Tupaia chinensis]
          Length = 474

 Score = 79.3 bits (194), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 58/179 (32%), Positives = 92/179 (51%), Gaps = 20/179 (11%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 262 YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 321

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  + N   GD   V  R+SK  +L      ++P + +I  RIQD+T
Sbjct: 322 IEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLSG---YENPVVSRINMRIQDLT 378

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFY-LTD 168
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY LTD
Sbjct: 379 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYDLTD 433


>gi|255071007|ref|XP_002507585.1| predicted protein [Micromonas sp. RCC299]
 gi|226522860|gb|ACO68843.1| predicted protein [Micromonas sp. RCC299]
          Length = 433

 Score = 79.3 bits (194), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 59/208 (28%), Positives = 99/208 (47%), Gaps = 29/208 (13%)

Query: 37  PLKVEELYLD-PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKV 93
           P  ++ + LD PR       + + E + ++E ++  + +  VV+   G + + + R S  
Sbjct: 155 PRNIQVVSLDNPRAFMHIGFLSERECDLLVEYARPNMYKSGVVDASNGGSSFSNIRTSTG 214

Query: 94  YFLYPEIF--GDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATP 151
            F+ P +F  G +  + +I+ RI   T +     E    P+Q+  Y +G  Y  H D   
Sbjct: 215 SFV-PTVFPLGMNDVVRRIERRIAAWTQIPAAHGE----PIQVLRYQIGQEYQSHFDYFF 269

Query: 152 RDEGLW--RLASFMFYLTDVELGGATIFPSLN-----------------LTVFPEKGSAV 192
            + G+   R+A+ + YL+DV+ GG T+FPS                   +TV P+KG A+
Sbjct: 270 HEGGMKNNRIATVLMYLSDVKDGGETVFPSAESLQVKPEPIHHACAKNGITVIPKKGDAI 329

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
            ++N      LD    H+GCPV LG KW
Sbjct: 330 LFWNMKVGGDLDGGSTHAGCPVVLGEKW 357


>gi|356502610|ref|XP_003520111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Glycine max]
          Length = 286

 Score = 79.0 bits (193), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 58/209 (27%), Positives = 95/209 (45%), Gaps = 32/209 (15%)

Query: 38  LKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYF 95
           L++E +   PR    H+ +   E   +I ++   +++  V +   G ++  D R S   F
Sbjct: 72  LRMEVISWQPRAFLYHNFLTKEECEYLINIATPHMQKSTVADNQSGQSVVHDVRKSTGAF 131

Query: 96  LYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD-- 153
           L     G    +  I+ RI D+T + I   E    P+ + +Y +G +YD H D    D  
Sbjct: 132 LD---RGQDEIVRNIEKRIADVTFIPIENGE----PIYVIHYEVGQYYDPHYDYFIDDFN 184

Query: 154 --EGLWRLASFMFYLTDVELGGATIFP-------------------SLNLTVFPEKGSAV 192
              G  R+A+ + YL++VE GG T+FP                    + L++ P+ G A+
Sbjct: 185 IENGGQRIATMLMYLSNVEEGGETMFPRAKANFSSVPWWNELSNCGKMGLSIKPKMGDAL 244

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKWG 221
            +++   N  LD    HS CPV  GNKW 
Sbjct: 245 LFWSMKPNATLDALTLHSACPVIKGNKWS 273


>gi|333981907|ref|YP_004511117.1| procollagen-proline dioxygenase [Methylomonas methanica MC09]
 gi|333805948|gb|AEF98617.1| Procollagen-proline dioxygenase [Methylomonas methanica MC09]
          Length = 286

 Score = 79.0 bits (193), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 61/238 (25%), Positives = 104/238 (43%), Gaps = 32/238 (13%)

Query: 5   LACQGNLSVPEDIKSNLKCFYESYNNTFLKIG--------PLKVEELYLDPRVVKIHDAI 56
           +AC  N   P+   SN       Y  + +  G         +KV      P +V + + +
Sbjct: 46  IACVANDVSPQPEPSNKAKLPYQYETSLVAAGNNIDLFDRSVKVSLRVSRPDIVVVDEFM 105

Query: 57  YDSEINRIIELSKGKVERGKVVNYGD---TIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR 113
              E  ++IE S+ K+    +V+       +  D      YF      G+ P + ++  R
Sbjct: 106 SGEECEQLIEQSRRKLTPSAIVDPQTGKFQVIADRSSEGTYFQR----GESPLISRLDRR 161

Query: 114 IQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEGLWRLASFMF 164
           I ++ N      E +   +QI +YG+G  Y  H D         A    +   R+A+ + 
Sbjct: 162 ISELMNW----PEDHGEGIQILHYGVGAQYKPHFDYFLENESGGALQMTQSGQRVATLVM 217

Query: 165 YLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTL--LDYRMYHSGCPVALGNKW 220
           YL +V  GG T+FP + +++ P++GSA ++  A+ N+L  +D    H G PV  G KW
Sbjct: 218 YLNEVTEGGETVFPDVGISITPKRGSAAYF--AYCNSLGQVDPATLHGGAPVLTGEKW 273


>gi|91789558|ref|YP_550510.1| procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
           JS666]
 gi|91698783|gb|ABE45612.1| Procollagen-proline,2-oxoglutarate-4-dioxygenase [Polaromonas sp.
           JS666]
          Length = 277

 Score = 79.0 bits (193), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 56/186 (30%), Positives = 87/186 (46%), Gaps = 22/186 (11%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY---GDTIYVDTRLSKVYFLYPEIFGD 103
           P +V   + + DSE   ++E+++ ++ R   VN    G+    D     ++F      G+
Sbjct: 90  PDLVVFGNLLSDSECEALMEVAQPRLARSLTVNIKTGGEERNRDRTSQGMFFAR----GE 145

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-------ATPR--DE 154
           +P + +++ RI  +    + R E     LQ+  Y  G  Y  H D        TP     
Sbjct: 146 NPLVQRVEARIARLVGWPVDRGEG----LQVLRYRQGAQYKPHYDYFDPAEPGTPAILQR 201

Query: 155 GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 214
           G  R+A+ + YL + E GGAT+FP + L V P +G+AVF+    AN     R  H G PV
Sbjct: 202 GGQRVATLIMYLNEPEQGGATVFPDIGLQVTPRRGTAVFFSYPAANPASLTR--HGGEPV 259

Query: 215 ALGNKW 220
             G KW
Sbjct: 260 KAGEKW 265


>gi|357467075|ref|XP_003603822.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492870|gb|AES74073.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 683

 Score = 79.0 bits (193), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 59/194 (30%), Positives = 87/194 (44%), Gaps = 26/194 (13%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           PR    H+ +   E   +I L+K  + R  VV+   G+     +R S   FL     G  
Sbjct: 119 PRASMYHNFLSKEECEHLINLAKPFMARSLVVDGVTGEVKESSSRTSSGMFLDR---GKD 175

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDEGLWRLA 160
             +  I+ RI D+T++ I   E     L + +YG+G     HYD   D      G  R+A
Sbjct: 176 KIVQNIERRIADITSVPIENGE----GLHVIHYGVGQKCEPHYDYTSDGVVTKNGGPRVA 231

Query: 161 SFMFYLTDVELGGATIFPSLN-------------LTVFPEKGSAVFWYNAHANTLLDYRM 207
           + + YL+DVE GG T+FP                L+V P+ G A+ +++   +  LD   
Sbjct: 232 TVLMYLSDVEEGGETVFPDAQPNFTSVSKCSGDGLSVKPKMGDALLFWSMKPDGTLDTSS 291

Query: 208 YHSGCPVALGNKWG 221
            H G PV  GNKW 
Sbjct: 292 LHGGSPVIRGNKWA 305



 Score = 54.7 bits (130), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 48/177 (27%), Positives = 71/177 (40%), Gaps = 33/177 (18%)

Query: 60  EINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
           E   +I L+K  + R  VV+   G       R S   FL     G    +  I+ RI D+
Sbjct: 377 ECEHLINLAKPFMTRSLVVDGLTGKGRESSARTSSGRFLER---GKDKIVQNIEQRIADI 433

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIF 177
           T++          P    ++ L               G  R+A+ + YL+DVE GG T+F
Sbjct: 434 TSI----------PRMARDFML-----FTAGGVVTKNGGPRVATVLMYLSDVEEGGETVF 478

Query: 178 PSLN-------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
           P+               L+V P+ G A+ + +   +  LD    H G PV  GNKW 
Sbjct: 479 PNAKPNINSVSKYPEKGLSVKPKMGDALLFRSMKPDGTLDTSSLHGGSPVIRGNKWA 535


>gi|405970696|gb|EKC35577.1| Prolyl 4-hydroxylase subunit alpha-1 [Crassostrea gigas]
          Length = 171

 Score = 79.0 bits (193), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 47/130 (36%), Positives = 73/130 (56%), Gaps = 17/130 (13%)

Query: 107 LYKIQTRIQDMTNLVIGREERYKGP--LQINNYGLGG----HYDL----------HCDAT 150
           L+ +  RI+ +T L     + +      +IN++G+GG    H+D           + + +
Sbjct: 16  LFPLTKRIEIITGLSTSVSKLFSDSENYEINHFGIGGMMKPHFDFLNISLGEYQKNVERS 75

Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHS 210
            R  G  R+A++MFYLTDVE GGAT+FP   + V   KG+A+FWYN   N+  D R  ++
Sbjct: 76  VRMSGD-RVATWMFYLTDVEKGGATVFPEAKVRVPVTKGAALFWYNIKRNSEKDQRSLNA 134

Query: 211 GCPVALGNKW 220
            CPV LG+K+
Sbjct: 135 DCPVILGSKF 144


>gi|18405808|ref|NP_566838.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
 gi|21617881|gb|AAM66931.1| prolyl 4-hydroxylase, putative [Arabidopsis thaliana]
 gi|332643929|gb|AEE77450.1| prolyl 4-hydroxylase [Arabidopsis thaliana]
          Length = 316

 Score = 79.0 bits (193), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 58/209 (27%), Positives = 89/209 (42%), Gaps = 31/209 (14%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
           P +V +L   PRV      + D E +  I+L+KGK+E+  V +   G+++  + R S   
Sbjct: 53  PTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGM 112

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
           FL          +  ++ ++   T L     E     +QI +Y  G     H+D   D  
Sbjct: 113 FLSKR---QDDIVSNVEAKLAAWTFL----PEENGESMQILHYENGQKYEPHFDYFHDQA 165

Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFP------------------SLNLTVFPEKGSAV 192
             + G  R+A+ + YL++VE GG T+FP                       V P KG A+
Sbjct: 166 NLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDAL 225

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKWG 221
            ++N H N   D    H  CPV  G KW 
Sbjct: 226 LFFNLHPNATTDSNSLHGSCPVVEGEKWS 254


>gi|294499597|ref|YP_003563297.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
 gi|294349534|gb|ADE69863.1| 2OG-Fe(II) oxygenase family protein [Bacillus megaterium QM B1551]
          Length = 219

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 51/180 (28%), Positives = 91/180 (50%), Gaps = 12/180 (6%)

Query: 44  YLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGD 103
           + +P V+ + + + + E + +I+LSK K++R K+   G    V++  +     + E   +
Sbjct: 36  FEEPLVLVLGNVLSNEECDELIQLSKDKMQRSKI---GAAREVNSIRTSSGMFFEE--SE 90

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDEGLWRLA 160
           +  +++I+ R+      ++G    Y   LQ+  Y     Y  H D   +  +     R++
Sbjct: 91  NELVHQIERRLSK----IMGPSIEYAEGLQVLKYLPDQEYKAHHDYFTSASKASKNNRIS 146

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + + YL DVE GG T FP L L+V P KG AV++   +++  L+ R  H G PV  G KW
Sbjct: 147 TLVMYLNDVEEGGETYFPKLGLSVSPTKGMAVYFEYFYSDAELNDRTLHGGAPVIKGEKW 206


>gi|242001766|ref|XP_002435526.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
 gi|215498862|gb|EEC08356.1| prolyl 4-hydroxylase alpha subunit, putative [Ixodes scapularis]
          Length = 559

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 54/200 (27%), Positives = 93/200 (46%), Gaps = 17/200 (8%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           Y   C+G +     + S L+C Y    + F  + P+K+EE+ L P ++ + D + + +I 
Sbjct: 295 YRRLCRGEVLRTPQMDSKLRCRYYKGQDGFFTLHPIKLEEINLKPYIIVMRDVVQERDIE 354

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
            ++  ++ +++R              R S   +L+ +   + P   ++   ++ +  L  
Sbjct: 355 DLMAFAEPRLQRSTTYTGDGNAPSTRRTSSNAWLWDD---EAPIANRMNWYLRALVGLGT 411

Query: 123 GREERYKGPLQINNYGLGG----HYD-----LHCDATPRDEGLW-----RLASFMFYLTD 168
              E      Q+ NYG GG    HYD     LH   +  D  L      RLA+ M Y+TD
Sbjct: 412 LGSEYEAEAYQLANYGSGGYFLPHYDYLQDTLHAHNSTADYYLQNNEGDRLATLMIYMTD 471

Query: 169 VELGGATIFPSLNLTVFPEK 188
           VE GGAT+FP L + + P+K
Sbjct: 472 VEEGGATVFPRLGVRLVPKK 491


>gi|108706361|gb|ABF94156.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|222624253|gb|EEE58385.1| hypothetical protein OsJ_09545 [Oryza sativa Japonica Group]
          Length = 299

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 62/206 (30%), Positives = 91/206 (44%), Gaps = 36/206 (17%)

Query: 42  ELYLDPRVVKIHDAIYDSEINRIIELSK-GKVERGKVVN--YGDTIYVDTRLSKVYFLYP 98
           ++   PRV      + D E   +I L+K G++ER  VVN   G+++   TR S   FL  
Sbjct: 39  DVSWSPRVFLYEGFLSDVECEHLIALAKQGRMERSTVVNGKSGESVMSKTRTSSGMFL-- 96

Query: 99  EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD------ATPR 152
            I      + +I+ RI   T       E     +Q+  YG G  Y+ H D      A+ R
Sbjct: 97  -IRKQDEVVARIEERIAAWTMFPAENGE----SMQMLRYGQGEKYEPHFDYIRGRQASAR 151

Query: 153 DEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAVFW 194
             G  R+A+ + YL++V++GG T+FP                       V P KGSAV +
Sbjct: 152 --GGHRIATVLMYLSNVKMGGETVFPDAEARLSQPKDETWSDCAEQGFAVKPTKGSAVLF 209

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
           ++ + N   D    H  CPV  G KW
Sbjct: 210 FSLYPNATFDPGSLHGSCPVIQGEKW 235


>gi|148684485|gb|EDL16432.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide III [Mus musculus]
          Length = 396

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 60/197 (30%), Positives = 93/197 (47%), Gaps = 21/197 (10%)

Query: 7   CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           CQ   S P   +  +L C YE+ ++ +L + P + E ++L P +   HD + D E  +I 
Sbjct: 146 CQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPARKEVVHLRPLIALYHDFVSDEEAQKIR 205

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
           EL++  ++R  V +    + V+ R+SK  +L   +    P L  +  RI  +T L I  +
Sbjct: 206 ELAEPWLQRSVVASGEKQLQVEYRISKSAWLKDTV---DPMLVTLDHRIAALTGLDI--Q 260

Query: 126 ERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIF-------P 178
             Y   LQ+ NYG+GGHY+ H D      G          L+ VE GGAT F       P
Sbjct: 261 PPYAEYLQVVNYGIGGHYEPHFDHATVTMG--------SMLSSVEAGGATAFIYGNFSVP 312

Query: 179 SLNLTVFPEKGSAVFWY 195
            + L+     G+  F Y
Sbjct: 313 VVKLSSVEAGGATAFIY 329


>gi|18086437|gb|AAL57673.1| AT3g28480/MFJ20_16 [Arabidopsis thaliana]
 gi|24796986|gb|AAN64505.1| At3g28480/MFJ20_16 [Arabidopsis thaliana]
          Length = 316

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 58/209 (27%), Positives = 89/209 (42%), Gaps = 31/209 (14%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
           P +V +L   PRV      + D E +  I+L+KGK+E+  V +   G+++  + R S   
Sbjct: 53  PTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGM 112

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
           FL          +  ++ ++   T L     E     +QI +Y  G     H+D   D  
Sbjct: 113 FLSKR---QDDIVNNVEAKLAAWTFL----PEENGESMQILHYENGQKYEPHFDYFHDQA 165

Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFP------------------SLNLTVFPEKGSAV 192
             + G  R+A+ + YL++VE GG T+FP                       V P KG A+
Sbjct: 166 NLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDAL 225

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKWG 221
            ++N H N   D    H  CPV  G KW 
Sbjct: 226 LFFNLHPNATTDSNSLHGSCPVVEGEKWS 254


>gi|9294583|dbj|BAB02864.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
           thaliana]
          Length = 332

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 58/208 (27%), Positives = 89/208 (42%), Gaps = 31/208 (14%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
           P +V +L   PRV      + D E +  I+L+KGK+E+  V +   G+++  + R S   
Sbjct: 69  PTRVTQLSWTPRVFLYEGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGM 128

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
           FL          +  ++ ++   T L     E     +QI +Y  G     H+D   D  
Sbjct: 129 FLSKR---QDDIVSNVEAKLAAWTFL----PEENGESMQILHYENGQKYEPHFDYFHDQA 181

Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFP------------------SLNLTVFPEKGSAV 192
             + G  R+A+ + YL++VE GG T+FP                       V P KG A+
Sbjct: 182 NLELGGHRIATVLMYLSNVEKGGETVFPMWKGKATQLKDDSWTECAKQGYAVKPRKGDAL 241

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
            ++N H N   D    H  CPV  G KW
Sbjct: 242 LFFNLHPNATTDSNSLHGSCPVVEGEKW 269


>gi|240256489|ref|NP_201407.4| iron ion binding / oxidoreductase/ oxidoreductase protein
           [Arabidopsis thaliana]
 gi|332010770|gb|AED98153.1| iron ion binding / oxidoreductase/ oxidoreductase protein
           [Arabidopsis thaliana]
          Length = 289

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 61/206 (29%), Positives = 93/206 (45%), Gaps = 32/206 (15%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLY 97
           VE +  +PR    H+ +   E   +IEL+K  +E+  VV+       D+R+  S   FL 
Sbjct: 78  VEIISWEPRASVYHNFLTKEECKYLIELAKPHMEKSTVVDEKTGKSTDSRVRTSSGTFL- 136

Query: 98  PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
               G    + +I+ RI D T + +   E     LQ+ +Y +G     HYD   D     
Sbjct: 137 --ARGRDKTIREIEKRISDFTFIPVEHGE----GLQVLHYEIGQKYEPHYDYFMDEYNTR 190

Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
            G  R+A+ + YL+DVE GG T+FP+                     L+V P+ G A+ +
Sbjct: 191 NGGQRIATVLMYLSDVEEGGETVFPAAKGNYSAVPWWNELSECGKGGLSVKPKMGDALLF 250

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
           ++   +  LD    H GC V  GNKW
Sbjct: 251 WSMTPDATLDPSSLHGGCAVIKGNKW 276


>gi|194871369|ref|XP_001972835.1| GG15736 [Drosophila erecta]
 gi|190654618|gb|EDV51861.1| GG15736 [Drosophila erecta]
          Length = 476

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 55/200 (27%), Positives = 95/200 (47%), Gaps = 24/200 (12%)

Query: 21  LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY 80
           L C Y  +   FLK+ PLK+EEL ++  +   +  +   +I+ +  +S+ K++R + ++ 
Sbjct: 294 LVCRYVDWT-PFLKLAPLKMEELSMETHISIFYGVLRQKDIDELKNVSRPKLQRIEHLSG 352

Query: 81  GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG 140
             +  +    S            H  + K+   I D+T    G   +    L++ NYG+ 
Sbjct: 353 NCSCKIGNLSS----------SSHDVVRKVNELILDIT----GFPSKGNQMLEVINYGIA 398

Query: 141 GHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHAN 200
           G+Y+    A PR +     A+ + +L + E GG  +FPS +L V P KGS + W N   +
Sbjct: 399 GNYNPDDTARPRKQNK---ANALIFLDNAERGGEIVFPSRHLKVRPRKGSMLVWMNLERS 455

Query: 201 TLLDYRMYHSGCPVALGNKW 220
            +     YH  CP+  GN W
Sbjct: 456 VI-----YHQ-CPILKGNMW 469


>gi|297818458|ref|XP_002877112.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
 gi|297322950|gb|EFH53371.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
          Length = 289

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 62/209 (29%), Positives = 89/209 (42%), Gaps = 32/209 (15%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV---NYGDTIYVDTRLSKV 93
           P +V +L   PR    +  + D E + +I L+KGK+E+  VV   N G++I  + R S  
Sbjct: 29  PTRVTQLSWTPRAFLYNGFLSDEECDHLINLAKGKLEKSMVVADDNSGESIDSEERTSSG 88

Query: 94  YFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD 153
            FL          +  ++ ++   T L     E     LQI +Y  G  YD H D     
Sbjct: 89  VFLTKR---QDDIVANVEAKLATWTFL----PEENGEALQILHYENGQKYDPHFDYYYDK 141

Query: 154 EGL----WRLASFMFYLTDVELGGATIFP------------------SLNLTVFPEKGSA 191
           E L     R+A+ + YL++V  GG T+FP                       V P KG A
Sbjct: 142 ETLKLGGHRIATVLMYLSNVTKGGETVFPMWKGKTPQLKDDTWSECAKQGYAVKPRKGDA 201

Query: 192 VFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + ++N H N   D    H  CPV  G KW
Sbjct: 202 LLFFNLHPNATTDPTSLHGSCPVIEGEKW 230


>gi|147800995|emb|CAN64470.1| hypothetical protein VITISV_014644 [Vitis vinifera]
          Length = 288

 Score = 78.6 bits (192), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 60/205 (29%), Positives = 92/205 (44%), Gaps = 32/205 (15%)

Query: 41  EELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLYP 98
           E +  +PR    H+ +   E   +I+L+K  +++  VV+       D+R+  S   FL  
Sbjct: 78  EVISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTVVDSSTGKSKDSRVRTSSGTFL-- 135

Query: 99  EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDE 154
              G    +  I+ R+ D T L +   E     LQI +Y +G     HYD   D      
Sbjct: 136 -TRGQDKIIRGIEKRLSDFTFLPVEHGEG----LQILHYEVGQKYEPHYDYFLDDYNTKN 190

Query: 155 GLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWY 195
           G  R+A+ + YL+DVE GG T+FP+                     L+V P+ G A+ ++
Sbjct: 191 GGQRMATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSXCGKEGLSVKPKMGDALLFW 250

Query: 196 NAHANTLLDYRMYHSGCPVALGNKW 220
           +   +  LD    H GCPV  GNKW
Sbjct: 251 SMKPDASLDPSSLHGGCPVIKGNKW 275


>gi|325922187|ref|ZP_08183974.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
           19865]
 gi|325547306|gb|EGD18373.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas gardneri ATCC
           19865]
          Length = 285

 Score = 78.2 bits (191), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 69/228 (30%), Positives = 102/228 (44%), Gaps = 26/228 (11%)

Query: 6   ACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELY--LDPRVVKIHDAIYDSEINR 63
           A   +L VP  + + L    +  + + L +G  +V  L   L PRVV + D + D+E + 
Sbjct: 57  AVTHSLPVPVRVPTVL----QDNDASLLDLGDRQVRVLVSLLLPRVVVLGDFLSDAECDA 112

Query: 64  IIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
           +I L++ ++ R + V+   G  I    R S    L     G      +I+ RI  + +  
Sbjct: 113 LIALAQPRLARSRTVDNDNGAQIVHAARTSDSMCLQ---LGQDALCQRIEARIARLLDWP 169

Query: 122 IGREERYKGPLQINNYGLGGHYDLHCD-------ATP--RDEGLWRLASFMFYLTDVELG 172
           +   E     LQ+  Y  G  Y  H D        TP     G  RLAS + YL   E G
Sbjct: 170 VDHGEG----LQVLRYATGAEYQPHYDYFDPTAAGTPVLLQAGGQRLASLVMYLNTPERG 225

Query: 173 GATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GAT FP ++L V   KG+AVF+     + +   R  H+G PV  G KW
Sbjct: 226 GATRFPDVHLDVAAVKGNAVFFSYDRPHPM--TRSLHAGAPVLAGEKW 271


>gi|187920106|ref|YP_001889137.1| procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
 gi|187718544|gb|ACD19767.1| Procollagen-proline dioxygenase [Burkholderia phytofirmans PsJN]
          Length = 295

 Score = 78.2 bits (191), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 57/185 (30%), Positives = 87/185 (47%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           P+V+   D +   E   +IE S+ +++R   VN   G    +  R S+  +      G+ 
Sbjct: 106 PQVIVFGDVLSPDECAEMIERSRHRLKRSTTVNPETGKEDVIRNRTSEGIWYQ---RGED 162

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---------G 155
            F+ ++  RI  + N  +   E  +G LQI +YG  G Y  H D  P D+         G
Sbjct: 163 AFIERMDRRISSLMNWPV---ENGEG-LQILHYGTTGEYRPHFDYFPPDQPGSAVHTAQG 218

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+A+ + YL DV  GG TIFP   ++V   +G AV++   +    LD    H G PV 
Sbjct: 219 GQRVATLVIYLNDVPDGGETIFPEAGISVAARQGGAVYFRYMNGQRQLDPLTLHGGAPVL 278

Query: 216 LGNKW 220
            G+KW
Sbjct: 279 GGDKW 283


>gi|359806348|ref|NP_001241485.1| uncharacterized protein LOC100783075 precursor [Glycine max]
 gi|255645457|gb|ACU23224.1| unknown [Glycine max]
          Length = 298

 Score = 78.2 bits (191), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 58/213 (27%), Positives = 90/213 (42%), Gaps = 34/213 (15%)

Query: 35  IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
           + P KV+++   PR       + D E + +I L+K +++R  V +   G++   D R S 
Sbjct: 32  VNPSKVKQISWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNLSGESQLSDVRTSS 91

Query: 93  VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH----CD 148
             F+        P +  I+ +I   T L     E     +Q+  Y  G  YD H     D
Sbjct: 92  GMFISKN---KDPIISGIEDKISSWTFLPKENGED----IQVLRYEHGQKYDPHYDYFTD 144

Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN---------------------LTVFPE 187
                 G  R+A+ + YLT+V  GG T+FPS                       + V P 
Sbjct: 145 KVNIARGGHRIATVLMYLTNVTKGGETVFPSAEEPPRRRGTETSSDLSECAKKGIAVKPH 204

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +G A+ +++ H N   D    H+GCPV  G KW
Sbjct: 205 RGDALLFFSLHTNATPDTSSLHAGCPVIEGEKW 237


>gi|242039227|ref|XP_002467008.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
 gi|241920862|gb|EER94006.1| hypothetical protein SORBIDRAFT_01g018200 [Sorghum bicolor]
          Length = 307

 Score = 78.2 bits (191), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 57/204 (27%), Positives = 92/204 (45%), Gaps = 28/204 (13%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
            E L  +PR    H+ +   E + +I L+K  +++  VV+       D+R+     ++  
Sbjct: 96  TEVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGASKDSRVRTSSGMFLR 155

Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD----EG 155
             G    +  I+ RI D T + +   E     LQ+ +Y +G  Y+ H D    D     G
Sbjct: 156 R-GQDKIIQTIEKRIADFTFIPVEHGE----GLQVLHYEVGQKYEPHFDYFHDDYNTKNG 210

Query: 156 LWRLASFMFYLTDVELGGATIFPSL-------------------NLTVFPEKGSAVFWYN 196
             R+A+ + YL+DVE GG T+FPS                     L+V P+ G A+ +++
Sbjct: 211 GQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALLFWS 270

Query: 197 AHANTLLDYRMYHSGCPVALGNKW 220
              +  +D    H GCPV  GNKW
Sbjct: 271 MKPDGSMDSTSLHGGCPVIKGNKW 294


>gi|225468574|ref|XP_002263060.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296084059|emb|CBI24447.3| unnamed protein product [Vitis vinifera]
          Length = 288

 Score = 78.2 bits (191), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 60/205 (29%), Positives = 92/205 (44%), Gaps = 32/205 (15%)

Query: 41  EELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLYP 98
           E +  +PR    H+ +   E   +I+L+K  +++  VV+       D+R+  S   FL  
Sbjct: 78  EVISWEPRAFVYHNFLSKDECEYLIKLAKPHMQKSTVVDSSTGKSKDSRVRTSSGTFL-- 135

Query: 99  EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDE 154
              G    +  I+ R+ D T L +   E     LQI +Y +G     HYD   D      
Sbjct: 136 -TRGQDKIIRGIEKRLSDFTFLPVEHGEG----LQILHYEVGQKYEPHYDYFLDDYNTKN 190

Query: 155 GLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWY 195
           G  R+A+ + YL+DVE GG T+FP+                     L+V P+ G A+ ++
Sbjct: 191 GGQRMATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKEGLSVKPKMGDALLFW 250

Query: 196 NAHANTLLDYRMYHSGCPVALGNKW 220
           +   +  LD    H GCPV  GNKW
Sbjct: 251 SMKPDASLDPSSLHGGCPVIKGNKW 275


>gi|224085946|ref|XP_002307750.1| predicted protein [Populus trichocarpa]
 gi|222857199|gb|EEE94746.1| predicted protein [Populus trichocarpa]
          Length = 288

 Score = 78.2 bits (191), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 62/242 (25%), Positives = 102/242 (42%), Gaps = 28/242 (11%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           + L      S P D+ S  +   ES  +   K      E L  +PR    H+ +   E  
Sbjct: 40  FSLPVSSEDSSPNDLNSYRRIASESDGDGMGKREEQWTEILSWEPRAFLYHNFLSKEECE 99

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVI 122
            +I L+K  + +  VV+       D+R+     ++    G    + +I+ RI D + + +
Sbjct: 100 YLINLAKPHMMKSTVVDSKTGRSKDSRVRTSSGMFLRR-GRDRVIREIEKRIADFSFIPV 158

Query: 123 GREERYKGPLQINNYGLG----GHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFP 178
              E     LQ+ +Y +G     H+D   D      G  R A+ + YL+DVE GG T+FP
Sbjct: 159 EHGE----GLQVLHYEVGQKYEAHFDYFLDEFNTKNGGQRTATLLMYLSDVEEGGETVFP 214

Query: 179 SLN-------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNK 219
           + N                   L++ P+ G+A+ +++   +  LD    H  CPV  GNK
Sbjct: 215 AANMNISAVPWWNELSECAKQGLSLKPKMGNALLFWSTRPDATLDPSSLHGSCPVIRGNK 274

Query: 220 WG 221
           W 
Sbjct: 275 WS 276


>gi|195379214|ref|XP_002048375.1| GJ13932 [Drosophila virilis]
 gi|194155533|gb|EDW70717.1| GJ13932 [Drosophila virilis]
          Length = 444

 Score = 78.2 bits (191), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 48/202 (23%), Positives = 92/202 (45%), Gaps = 32/202 (15%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV 78
           + + C+Y++    FL + PLKVE L  +P V   HD IY++EI +++ +    +   +  
Sbjct: 264 TQMYCYYQNSKEPFLILAPLKVELLNTEPYVALYHDVIYENEIKKLLSIDLASMRHDRTA 323

Query: 79  NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYG 138
           ++ +++   T   ++  +             +  R+ DMT + +  E+ +     + NYG
Sbjct: 324 DHKNSVKYTTVTRELNDV-------------LNHRVMDMTAMNVASEKDF----LLINYG 366

Query: 139 LGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAH 198
           +GGH     +                 L++V  GG TI P L + +  +KG+A+  ++  
Sbjct: 367 IGGHIRALSEQQ---------------LSEVPQGGDTILPELEIAIKSKKGAALVTHHLD 411

Query: 199 ANTLLDYRMYHSGCPVALGNKW 220
               +D    H  CPV +G+ W
Sbjct: 412 KQLKIDLSSDHLSCPVLVGSMW 433


>gi|42567428|ref|NP_195306.2| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|332661174|gb|AEE86574.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 290

 Score = 78.2 bits (191), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 66/236 (27%), Positives = 109/236 (46%), Gaps = 34/236 (14%)

Query: 12  SVPEDIKSNLKCFYE--SYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSK 69
           S+P D+ + ++   E  S+ +     G   +E +  +PR    H+ + + E   +I L+K
Sbjct: 50  SMPMDLTTIVQTIQERESFGDEEDGNGDRWLEVISWEPRAFVYHNFLTNEECEHLISLAK 109

Query: 70  GKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREER 127
             + + KVV+   G +I    R S   FL     G    + +I+ RI D T +     E 
Sbjct: 110 PSMMKSKVVDVKTGKSIDSRVRTSSGTFLN---RGHDEIVEEIENRISDFTFIP---PEN 163

Query: 128 YKGPLQINNYGLGGHYDLH----CDATPRDEGLWRLASFMFYLTDVELGGATIFPSLN-- 181
            +G LQ+ +Y +G  Y+ H     D     +G  R+A+ + YL+DV+ GG T+FP+    
Sbjct: 164 GEG-LQVLHYEVGQRYEPHHDYFFDEFNVRKGGQRIATVLMYLSDVDEGGETVFPAAKGN 222

Query: 182 -----------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
                            L+V P+K  A+ +++   +  LD    H GCPV  GNKW
Sbjct: 223 VSDVPWWDELSQCGKEGLSVLPKKRDALLFWSMKPDASLDPSSLHGGCPVIKGNKW 278


>gi|406665340|ref|ZP_11073114.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
 gi|405387266|gb|EKB46691.1| hypothetical protein B857_00901 [Bacillus isronensis B3W22]
          Length = 211

 Score = 78.2 bits (191), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 54/186 (29%), Positives = 91/186 (48%), Gaps = 13/186 (6%)

Query: 38  LKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLY 97
           +  E L+ +P +VK  + + D E   +I+ +  ++ER K+     +     R S   F  
Sbjct: 21  ITAEVLHEEPLIVKFLNVLSDEECQNLIDCASSRLERSKLAKKEIS---SIRTSSGMFFE 77

Query: 98  PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDE 154
                ++P + +I+ RI  + +L I   E  +G LQ+ +Y  G  +  H D         
Sbjct: 78  E---NENPLISEIEKRISSLMHLPI---EHAEG-LQVLHYEPGQEFKAHFDFFGPNHPSS 130

Query: 155 GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 214
              R+++ + YL DVE GG T FP+L +   P+KG+AV++   + +  L+    HSG PV
Sbjct: 131 SNNRISTLVVYLNDVEEGGVTTFPNLGIVNVPKKGTAVYFEYFYNDQKLNELTLHSGEPV 190

Query: 215 ALGNKW 220
             G KW
Sbjct: 191 IQGEKW 196


>gi|290243077|ref|YP_003494747.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
 gi|288945582|gb|ADC73280.1| Procollagen-proline dioxygenase [Thioalkalivibrio sp. K90mix]
          Length = 575

 Score = 77.8 bits (190), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 63/226 (27%), Positives = 96/226 (42%), Gaps = 23/226 (10%)

Query: 12  SVPEDIKSNLKCFYESYNNTFLKIGPLKVEE------LYLDPRVVKIHDAIYDSEINRII 65
           SV E + S+L    E      ++   +  E       L  DP VV + + +   E   +I
Sbjct: 16  SVREAVTSSLPVVAEEAEPERVERNRMPAERYDGMETLSQDPLVVYLDEFLEPGECEALI 75

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
            L++G+++R  V   G +     R     +L    + + P   +I  R+       +   
Sbjct: 76  HLAQGRMKRALVSLDGSSGVSQGRTGSNCWLR---YQEEPLARRIGERVAKRVGFPL--- 129

Query: 126 ERYKGPLQINNYGLGGHYDLHCDA----TPRD-----EGLWRLASFMFYLTDVELGGATI 176
             Y  PLQ+ +YG    Y  H DA    TPR      +G  R+ + + YL +VE GGAT 
Sbjct: 130 -EYAEPLQVIHYGHEQEYRPHYDAYDLDTPRGLRCTRQGGQRMVTALLYLNEVEEGGATA 188

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDY-RMYHSGCPVALGNKWG 221
           FP+  + V P KG    + N  A+    + R  H G PV  G KW 
Sbjct: 189 FPNAGVEVAPRKGRIAIFNNVGADPGRPHPRSLHGGMPVKSGEKWA 234


>gi|357478545|ref|XP_003609558.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
 gi|355510613|gb|AES91755.1| Prolyl 4-hydroxylase subunit alpha-1 [Medicago truncatula]
          Length = 299

 Score = 77.8 bits (190), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 60/213 (28%), Positives = 90/213 (42%), Gaps = 34/213 (15%)

Query: 35  IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
           I P KV+++   PR       + D E + +I L+K +++R  V +   GD+   D R S 
Sbjct: 32  INPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSDVRTSS 91

Query: 93  VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH----CD 148
             F+        P +  I+ RI   T L     E     +Q+  Y  G  YD H     D
Sbjct: 92  GMFISKN---KDPIVSGIEDRISAWTFLPKENGE----DIQVLRYEHGQKYDPHYDYFAD 144

Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN---------------------LTVFPE 187
                +G  RLA+ + YLT+V  GG T+FP                        + V P 
Sbjct: 145 KVNIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIAVKPR 204

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +G A+ +++   N + D    H+GCPV  G KW
Sbjct: 205 RGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKW 237


>gi|281206564|gb|EFA80750.1| putative prolyl 4-hydroxylase alpha subunit [Polysphondylium
           pallidum PN500]
          Length = 251

 Score = 77.8 bits (190), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 60/191 (31%), Positives = 83/191 (43%), Gaps = 20/191 (10%)

Query: 39  KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYP 98
           K+ E+   PR+ +I   + D E   +IE SK K++    ++ G         S       
Sbjct: 56  KLIEVSQKPRIYRIPKFLTDEECEHLIETSKNKLKPCNEISSG------VHRSGWGLFMK 109

Query: 99  EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG----GHYDLHCDATPRDE 154
           E   DHP    I  R++   NL    E      +Q+  Y  G     H+D     T    
Sbjct: 110 EGEEDHPVTQNIFNRMKTFVNLTESSEV-----MQVIRYNPGEETSAHFDYFNPLTTNGA 164

Query: 155 ---GLW--RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYH 209
              GL+  R+ + + YL DVE GG T FP +N+ V P KG AV +YN   N  +D    H
Sbjct: 165 MKIGLYGQRICTILMYLADVEEGGETSFPEVNVKVKPIKGDAVLFYNCKPNGEVDPLSLH 224

Query: 210 SGCPVALGNKW 220
            G PV  G KW
Sbjct: 225 QGDPVIKGTKW 235


>gi|388492638|gb|AFK34385.1| unknown [Medicago truncatula]
          Length = 299

 Score = 77.8 bits (190), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 60/213 (28%), Positives = 90/213 (42%), Gaps = 34/213 (15%)

Query: 35  IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
           I P KV+++   PR       + D E + +I L+K +++R  V +   GD+   D R S 
Sbjct: 32  INPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSDVRTSS 91

Query: 93  VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH----CD 148
             F+        P +  I+ RI   T L     E     +Q+  Y  G  YD H     D
Sbjct: 92  GMFISKN---KDPIVSGIEDRISAWTFLPKENGE----DIQVLRYEHGQKYDPHYDYFAD 144

Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN---------------------LTVFPE 187
                +G  RLA+ + YLT+V  GG T+FP                        + V P 
Sbjct: 145 KVNIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIAVKPR 204

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +G A+ +++   N + D    H+GCPV  G KW
Sbjct: 205 RGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKW 237


>gi|255637501|gb|ACU19077.1| unknown [Glycine max]
          Length = 318

 Score = 77.8 bits (190), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 61/214 (28%), Positives = 91/214 (42%), Gaps = 31/214 (14%)

Query: 31  TFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDT 88
           + +K  P +V +L   PR       + D E + +I L+K K+E+  V +   G +I  + 
Sbjct: 47  SSVKFDPTRVTQLSWSPRAFLYKGFLSDEECDHLITLAKDKLEKSMVADNESGKSIMSEV 106

Query: 89  RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYD 144
           R S   FL          +  I+ RI   T L I   E     +QI +Y  G     H+D
Sbjct: 107 RTSSGMFLNK---AQDEIVAGIEARIAAWTFLPIENGE----SMQILHYENGQKYEPHFD 159

Query: 145 LHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL------------------TVFP 186
              D   +  G  R+A+ + YL+DVE GG TIF +                      V P
Sbjct: 160 YFHDKANQVMGGHRIATVLMYLSDVEKGGETIFSNAKAKLLQPKDESWSECAHKGYAVKP 219

Query: 187 EKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            KG A+ +++ H +   D +  H  CPV  G KW
Sbjct: 220 RKGDALLFFSLHLDASTDNKSLHGSCPVIEGEKW 253


>gi|325267002|ref|ZP_08133672.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
 gi|324981502|gb|EGC17144.1| 2OG-Fe(II) oxygenase [Kingella denitrificans ATCC 33394]
          Length = 279

 Score = 77.8 bits (190), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 58/186 (31%), Positives = 86/186 (46%), Gaps = 20/186 (10%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYV---DTRLSKVYFLYPEIFGD 103
           P VV + + I   E  ++I L++GKVE   VV+     +V   D       F   E    
Sbjct: 91  PEVVVLDNFITAEECAQLIALAEGKVEDATVVDPATGEFVKHQDRTSMNAAFARAE---- 146

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDE 154
           HP + +++ RI    +      E  +G +Q+  Y  GG Y  H D               
Sbjct: 147 HPLIARLEARIAAAIHWPA---ENGEG-MQVLRYRSGGEYKAHFDYFDTQSEGGRKNMQT 202

Query: 155 GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 214
           G  R+ +F+ YL DV+ GGAT FP+LN  + P+KG A+F+ N   N   +    H+G PV
Sbjct: 203 GGQRVGTFLVYLCDVDAGGATRFPALNFEIRPKKGMALFFANTLPNGEGNPLTLHAGVPV 262

Query: 215 ALGNKW 220
             G K+
Sbjct: 263 VSGVKY 268


>gi|449495423|ref|XP_004159836.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 304

 Score = 77.4 bits (189), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 57/215 (26%), Positives = 89/215 (41%), Gaps = 35/215 (16%)

Query: 35  IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
           + P KV+++   PR       + D E + +I L+K +++R  V +   G +   + R S 
Sbjct: 36  VNPAKVKQISWSPRAFVYEGFLTDLECDHLISLAKAELKRSSVADNLSGKSKVSEVRTSS 95

Query: 93  VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH----CD 148
             F++       P +  I+ +I   T L     E     +Q+  Y  G  YD H     D
Sbjct: 96  GAFIHK---AKDPIVSGIEDKIAAWTFLPKDNGE----DIQVLRYEYGQKYDAHFDYFAD 148

Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIF----------------------PSLNLTVFP 186
                 G  R+A+ + YL+DVE GG T+F                          + V P
Sbjct: 149 KVNIARGGHRMATVLMYLSDVEKGGETVFLLRRSESQRRQASETNEDLSDCAKKGIAVKP 208

Query: 187 EKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
            KG A+ +++ H N + D    H GCPV  G KW 
Sbjct: 209 RKGDALLFFSLHPNAIPDTSSLHGGCPVIEGEKWS 243


>gi|21593091|gb|AAM65040.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
          Length = 291

 Score = 77.4 bits (189), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 61/210 (29%), Positives = 93/210 (44%), Gaps = 32/210 (15%)

Query: 36  GPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKV 93
           G   VE +  +PR V  H+ + + E   +I L+K  + +  VV+       D+R+  S  
Sbjct: 76  GERWVEVISWEPRAVVYHNFLTNEECEHLISLAKPSMVKSTVVDEKTGGSKDSRVRTSSG 135

Query: 94  YFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDA 149
            FL     G    +  I+ RI D T + +   E     LQ+ +Y +G     HYD   D 
Sbjct: 136 TFLR---RGHDEVVEVIEKRISDFTFIPVENGE----GLQVLHYQVGQKYEPHYDYFLDE 188

Query: 150 TPRDEGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGS 190
                G  R+A+ + YL+DV+ GG T+FP+                     L+V P+   
Sbjct: 189 FNTKNGGQRIATVLMYLSDVDDGGETVFPAARGNISAVPWWNELSKCGKEGLSVLPKXRD 248

Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           A+ ++N   +  LD    H GCPV  GNKW
Sbjct: 249 ALLFWNMRPDASLDPSSLHGGCPVVKGNKW 278


>gi|330799463|ref|XP_003287764.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
 gi|325082219|gb|EGC35708.1| hypothetical protein DICPUDRAFT_151895 [Dictyostelium purpureum]
          Length = 220

 Score = 77.4 bits (189), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 60/198 (30%), Positives = 88/198 (44%), Gaps = 20/198 (10%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFL 96
           P+K+ EL   PRV +I + + + E N +I+ SK K+     ++ G         S     
Sbjct: 22  PIKLIELSQKPRVYRIPEFLTEEECNHLIDTSKNKLRPCNEISSG------VHRSGWGLF 75

Query: 97  YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG----GHYDLHCDATPR 152
             E   +HP    I  ++++  N+    E      +QI  Y  G     HYD     T  
Sbjct: 76  MKEGEEEHPVTKNIFNKMKNFVNISDSCE-----VMQIIRYNPGEETSAHYDYFNPLTTN 130

Query: 153 DE---GLW--RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM 207
                GL+  R+ + + YL DVE GG T FP + + V P +G AV +YN   N  +D   
Sbjct: 131 GSMKIGLYGQRICTILMYLCDVEEGGETSFPEVGIKVKPIRGDAVLFYNCKPNGDVDPLS 190

Query: 208 YHSGCPVALGNKWGKLLL 225
            H G PV  G KW  + L
Sbjct: 191 LHQGDPVTKGTKWVAIKL 208


>gi|297850430|ref|XP_002893096.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297338938|gb|EFH69355.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 287

 Score = 77.4 bits (189), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 69/244 (28%), Positives = 101/244 (41%), Gaps = 33/244 (13%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           ++ L    + S P D+    +   E  +    K G    E L  +PR    H+ +   E 
Sbjct: 39  VFSLPINNDESSPIDLSYFRRAATER-SEGLGKRGDQWTEVLSWEPRAFVYHNFLSKEEC 97

Query: 62  NRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
             +I L+K  + +  VV+       D+R+  S   FL     G    +  I+ RI D T 
Sbjct: 98  EYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRR---GRDKIIKTIEKRIADYTF 154

Query: 120 LVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDEGLWRLASFMFYLTDVELGGAT 175
           +     E     LQI +Y  G     HYD   D      G  R+A+ + YL+DVE GG T
Sbjct: 155 IPADHGE----GLQILHYEAGQKYEPHYDYFVDEFNTKNGGQRMATMLMYLSDVEEGGET 210

Query: 176 IFPSLN-------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVAL 216
           +FP+ N                   L+V P  G A+ +++   +  LD    H GCPV  
Sbjct: 211 VFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDATLDPTSLHGGCPVIR 270

Query: 217 GNKW 220
           GNKW
Sbjct: 271 GNKW 274


>gi|414870899|tpg|DAA49456.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
          Length = 364

 Score = 77.4 bits (189), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 59/204 (28%), Positives = 95/204 (46%), Gaps = 28/204 (13%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
            E L  +PR    H+ +   E + +I L+K  +++  VV+       D+R+     ++  
Sbjct: 153 TEVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGGSKDSRVRTSSGMFLR 212

Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD----EG 155
             G    +  I+ RI D T + +   E+ +G LQ+ +Y +G  Y+ H D    D     G
Sbjct: 213 -RGQDKIIRTIEKRIADYTFIPV---EQGEG-LQVLHYEVGQKYEPHFDYFHDDYNTKNG 267

Query: 156 LWRLASFMFYLTDVELGGATIFPSL-------------------NLTVFPEKGSAVFWYN 196
             R+A+ + YL+DVE GG T+FPS                     L+V P+ G A+ +++
Sbjct: 268 GQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALLFWS 327

Query: 197 AHANTLLDYRMYHSGCPVALGNKW 220
              +  LD    H GCPV  GNKW
Sbjct: 328 MKPDGSLDPTSLHGGCPVIKGNKW 351


>gi|357137804|ref|XP_003570489.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
           distachyon]
          Length = 318

 Score = 77.0 bits (188), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 59/204 (28%), Positives = 92/204 (45%), Gaps = 28/204 (13%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
            E +  +PR    H+ +   E   +I L+K ++E+  VV+       D+R+     ++  
Sbjct: 107 TEVISWEPRAFVYHNFLSKEECEYLIGLAKPRMEKSTVVDSTTGKSKDSRVRTSSGMFLR 166

Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDEG 155
             G    +  I+ RI D T +     E  +G LQ+ +Y +G     H+D   D      G
Sbjct: 167 -RGRDKVIRAIERRIADYTFIPA---EHGEG-LQVLHYEVGQKYEPHFDYFLDEFNTKNG 221

Query: 156 LWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYN 196
             R+A+ + YL+DVE GG TIFP  N                   L V P+ G A+ +++
Sbjct: 222 GQRMATILMYLSDVEEGGETIFPDANVNSSSLPWHNELSECARKGLAVKPKMGDALLFWS 281

Query: 197 AHANTLLDYRMYHSGCPVALGNKW 220
            + +  LD    H GCPV  GNKW
Sbjct: 282 MNPDATLDPLSLHGGCPVIRGNKW 305


>gi|212720775|ref|NP_001131953.1| uncharacterized protein LOC100193348 [Zea mays]
 gi|194693016|gb|ACF80592.1| unknown [Zea mays]
 gi|347978798|gb|AEP37741.1| prolyl 4-hydroxylase 1 [Zea mays]
 gi|414870898|tpg|DAA49455.1| TPA: hypothetical protein ZEAMMB73_536273 [Zea mays]
          Length = 307

 Score = 77.0 bits (188), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 58/204 (28%), Positives = 93/204 (45%), Gaps = 28/204 (13%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
            E L  +PR    H+ +   E + +I L+K  +++  VV+       D+R+     ++  
Sbjct: 96  TEVLSWEPRAFVYHNFLSKEECDHLISLAKPHMKKSTVVDSATGGSKDSRVRTSSGMFLR 155

Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD----EG 155
             G    +  I+ RI D T + + + E     LQ+ +Y +G  Y+ H D    D     G
Sbjct: 156 -RGQDKIIRTIEKRIADYTFIPVEQGEG----LQVLHYEVGQKYEPHFDYFHDDYNTKNG 210

Query: 156 LWRLASFMFYLTDVELGGATIFPSL-------------------NLTVFPEKGSAVFWYN 196
             R+A+ + YL+DVE GG T+FPS                     L+V P+ G A+ +++
Sbjct: 211 GQRIATLLMYLSDVEDGGETVFPSSTTNSSSSPFYNELSECAKGGLSVKPKMGDALLFWS 270

Query: 197 AHANTLLDYRMYHSGCPVALGNKW 220
              +  LD    H GCPV  GNKW
Sbjct: 271 MKPDGSLDPTSLHGGCPVIKGNKW 294


>gi|390570433|ref|ZP_10250698.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
 gi|389937613|gb|EIM99476.1| procollagen-proline dioxygenase [Burkholderia terrae BS001]
          Length = 285

 Score = 77.0 bits (188), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 61/229 (26%), Positives = 99/229 (43%), Gaps = 21/229 (9%)

Query: 6   ACQGNLSVPEDIKSNL---KCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           A    ++ PED         C   + N        + V   +  P+V+   D +   E  
Sbjct: 52  AVAAVIASPEDEARAYHYDACPVAAGNTVHAHDRDVTVRIRFERPQVIAFDDVLSGEECA 111

Query: 63  RIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            +IE ++ +++R   VN   G    +  R S+ ++       +  F+ ++  RI  + N 
Sbjct: 112 ELIERARHRLKRSTTVNPENGSEDVIQLRTSEGFWFQR---CEDAFIERLDHRISALMNW 168

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---------GLWRLASFMFYLTDVEL 171
            +   E  +G LQI +Y  GG Y  H D  P  +         G  R+A+ + YL+DVE 
Sbjct: 169 PL---EHGEG-LQILHYRQGGEYRPHFDYFPPGQNGSVLHTARGGQRVATLIVYLSDVEG 224

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GG T+FP   L V   +G A+++   +    LD    H G PV  G+KW
Sbjct: 225 GGETVFPDAGLAVMARQGGAIYFRYMNGRRQLDPLTLHGGAPVTSGDKW 273


>gi|420246706|ref|ZP_14750139.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
 gi|398073616|gb|EJL64785.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. BT03]
          Length = 282

 Score = 77.0 bits (188), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 61/229 (26%), Positives = 99/229 (43%), Gaps = 21/229 (9%)

Query: 6   ACQGNLSVPEDIKSNL---KCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           A    ++ PED         C   + N        + V   +  P+V+   D +   E  
Sbjct: 49  AVAAVIASPEDEARAYHYDACPVAAGNTVHAHDRDVTVRIRFERPQVIAFDDVLSGEECA 108

Query: 63  RIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            +IE ++ +++R   VN   G    +  R S+ ++       +  F+ ++  RI  + N 
Sbjct: 109 ELIERARHRLKRSTTVNPENGSEDVIQLRTSEGFWFQR---CEDAFIERLDHRISALMNW 165

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---------GLWRLASFMFYLTDVEL 171
            +   E  +G LQI +Y  GG Y  H D  P  +         G  R+A+ + YL+DVE 
Sbjct: 166 PL---EHGEG-LQILHYRQGGEYRPHFDYFPPGQNGSVLHTARGGQRVATLIVYLSDVEG 221

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GG T+FP   L V   +G A+++   +    LD    H G PV  G+KW
Sbjct: 222 GGETVFPDAGLAVMARQGGAIYFRYMNGRRQLDPLTLHGGAPVTSGDKW 270


>gi|357517885|ref|XP_003629231.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523253|gb|AET03707.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 279

 Score = 77.0 bits (188), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 58/210 (27%), Positives = 93/210 (44%), Gaps = 38/210 (18%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFL- 96
           VE +  +PRV   H+ +   E   +I ++K  V++  VV+   G ++    R S   F+ 
Sbjct: 70  VEIVSWEPRVFLYHNFLAKEECEHLINIAKPDVQKSTVVDDTTGKSVNSSARTSSGTFID 129

Query: 97  --YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----AT 150
             Y +I  D      I+ RI D T + +   E     + I +Y +G  YD H D      
Sbjct: 130 RGYDKILSD------IEKRIADFTFIPVEHGED----VNILHYEVGQKYDFHTDYFEDEV 179

Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSA 191
               G  R+A+ + YL+DVE GG T+FPS                     L++ P+ G+A
Sbjct: 180 NTKHGGERIATMLMYLSDVEEGGETVFPSAKGNFSSVPWWNELSDCGKKGLSIKPKMGNA 239

Query: 192 VFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
           + ++    +  +D    H  CPV  G+KW 
Sbjct: 240 ILFWGMKPDATVDPLSVHGACPVIKGDKWS 269


>gi|393200372|ref|YP_006462214.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
 gi|327439703|dbj|BAK16068.1| prolyl 4-hydroxylase [Solibacillus silvestris StLB046]
          Length = 211

 Score = 77.0 bits (188), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 54/186 (29%), Positives = 90/186 (48%), Gaps = 13/186 (6%)

Query: 38  LKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLY 97
           +  E L+ +P +VK  + + D E   +I+ +  ++ER K+     +     R S   F  
Sbjct: 21  ITAEVLHEEPLIVKFLNVLSDEECQNLIDCASSRLERSKLAKKEIS---SIRTSSGMFFE 77

Query: 98  PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---ATPRDE 154
                ++P + +I+ RI  + +L I   E  +G LQ+ +Y  G  +  H D         
Sbjct: 78  E---NENPLISEIEKRISSLMHLPI---EHAEG-LQVLHYEPGQEFKPHFDFFGPNHPSS 130

Query: 155 GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 214
              R+ + + YL DVE GG T FP+L +   P+KG+AV++   + +  L+    HSG PV
Sbjct: 131 SNNRICTLVVYLNDVEEGGVTTFPNLGIVNVPKKGTAVYFEYFYNDQKLNELTLHSGEPV 190

Query: 215 ALGNKW 220
             G KW
Sbjct: 191 IQGEKW 196


>gi|125542543|gb|EAY88682.1| hypothetical protein OsI_10157 [Oryza sativa Indica Group]
          Length = 321

 Score = 76.6 bits (187), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 61/212 (28%), Positives = 90/212 (42%), Gaps = 41/212 (19%)

Query: 47  PRVVKIHDAIYDSEINRIIELSK-GKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGD 103
           PR       + D+E + +I L+K GK+E+  VV+   G+++    R S   FL  +    
Sbjct: 49  PRAFLYEGFLSDAECDHLISLAKQGKMEKSTVVDGESGESVTSKVRTSSGMFLDKK---Q 105

Query: 104 HPFLYKIQTRIQDMT-------------NLVIGREERYKGPLQINNYGLGGHYDLHCDAT 150
              + +I+ RI   T             N  I +       +QI  YG G  Y+ H D  
Sbjct: 106 DEVVARIEERIAAWTMLPTECIIFYCFANFAILKLSENGESMQILRYGQGEKYEPHFDYI 165

Query: 151 PRDEGLWR----LASFMFYLTDVELGGATIFPSLN------------------LTVFPEK 188
              +G  R    +A+ + YL++V++GG TIFP                       V P K
Sbjct: 166 SGRQGSTREGDRVATVLMYLSNVKMGGETIFPDCEARLSQPKDETWSDCAEQGFAVKPAK 225

Query: 189 GSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GSAV +++ H N  LD    H  CPV  G KW
Sbjct: 226 GSAVLFFSLHPNATLDTDSLHGSCPVIEGEKW 257


>gi|225459748|ref|XP_002285898.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2 [Vitis vinifera]
 gi|302141716|emb|CBI18919.3| unnamed protein product [Vitis vinifera]
          Length = 288

 Score = 76.6 bits (187), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 58/210 (27%), Positives = 92/210 (43%), Gaps = 28/210 (13%)

Query: 34  KIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKV 93
           K G    E +  +PR    H+ +   E   +I L+K  +++  VV+       D+R+   
Sbjct: 71  KRGEQWTEIVSWEPRAFIYHNFLSKEECEYMISLAKPYMKKSTVVDSETGRSKDSRVRTS 130

Query: 94  YFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG----GHYDLHCDA 149
             ++    G    +  I+ RI D T + +   E     LQ+ +Y +G     HYD   D 
Sbjct: 131 SGMFLRR-GRDKIIRDIEKRIADFTFIPVEHGE----GLQVLHYEVGQKYDAHYDYFLDE 185

Query: 150 TPRDEGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGS 190
                G  R+A+ + YL+DVE GG T+FP+                     L+V P+ G 
Sbjct: 186 FNTKNGGQRIATLLMYLSDVEEGGETVFPATKANFSSVPWWNELSECGKKGLSVKPKMGD 245

Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           A+ +++   +  LD    H GCPV  GNKW
Sbjct: 246 ALLFWSMRPDATLDPSSLHGGCPVIKGNKW 275


>gi|21537370|gb|AAM61711.1| putative prolyl 4-hydroxylase, alpha subunit [Arabidopsis thaliana]
          Length = 287

 Score = 76.6 bits (187), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 68/244 (27%), Positives = 101/244 (41%), Gaps = 33/244 (13%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           ++ L    + S P D+    +   E  +    K G    E L  +PR    H+ +   E 
Sbjct: 39  VFSLPINNDESSPIDLSYFRRAATER-SEGLGKRGDQWTEVLSWEPRAFVYHNFLSKEEC 97

Query: 62  NRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
             +I L+K  + +  VV+       D+R+  S   FL     G    +  I+ RI D T 
Sbjct: 98  EYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRR---GRDKIIKTIEKRIADYTF 154

Query: 120 LVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDEGLWRLASFMFYLTDVELGGAT 175
           +     E     LQ+ +Y  G     HYD   D      G  R+A+ + YL+DVE GG T
Sbjct: 155 IPADHGE----GLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRMATMLMYLSDVEEGGET 210

Query: 176 IFPSLN-------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVAL 216
           +FP+ N                   L+V P  G A+ +++   +  LD    H GCPV  
Sbjct: 211 VFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDATLDPTSLHGGCPVIR 270

Query: 217 GNKW 220
           GNKW
Sbjct: 271 GNKW 274


>gi|449513594|ref|XP_002191636.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like, partial
           [Taeniopygia guttata]
          Length = 346

 Score = 76.6 bits (187), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 56/176 (31%), Positives = 89/176 (50%), Gaps = 19/176 (10%)

Query: 3   YPLACQGN-LSVPEDIKSNLKC-FYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  L +    +  L C +Y+   N    +GP+K E+ +  PR+V+  D I D E
Sbjct: 178 YEMLCRGEGLKMTPRRQKRLFCRYYDGNRNPRYILGPVKQEDEWDKPRIVRFLDIISDEE 237

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + EL+K ++ R  V +   G       R+SK  +L      + P + +I TRIQD+T
Sbjct: 238 IETVKELAKPRLSRATVHDPETGKLTTAHYRVSKSAWLSG---YESPVVSRINTRIQDLT 294

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--------RLASFMFYL 166
            L +   E     LQ+ NYG+GG Y+ H D   +DE           R+A+++FY+
Sbjct: 295 GLDVSTAEE----LQVANYGVGGQYEPHFDFARKDEPDAFKELGTGNRIATWLFYV 346


>gi|18394842|ref|NP_564109.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
 gi|9558598|gb|AAF88161.1|AC026234_12 Contains similarity to a prolyl 4-hydroxylase alpha subunit protein
           from Gallus gallus gi|212530 [Arabidopsis thaliana]
 gi|90962978|gb|ABE02413.1| At1g20270 [Arabidopsis thaliana]
 gi|332191835|gb|AEE29956.1| 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein
           [Arabidopsis thaliana]
          Length = 287

 Score = 76.6 bits (187), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 68/244 (27%), Positives = 101/244 (41%), Gaps = 33/244 (13%)

Query: 2   IYPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEI 61
           ++ L    + S P D+    +   E  +    K G    E L  +PR    H+ +   E 
Sbjct: 39  VFSLPINNDESSPIDLSYFRRAATER-SEGLGKRGDQWTEVLSWEPRAFVYHNFLSKEEC 97

Query: 62  NRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
             +I L+K  + +  VV+       D+R+  S   FL     G    +  I+ RI D T 
Sbjct: 98  EYLISLAKPHMVKSTVVDSETGKSKDSRVRTSSGTFLRR---GRDKIIKTIEKRIADYTF 154

Query: 120 LVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDEGLWRLASFMFYLTDVELGGAT 175
           +     E     LQ+ +Y  G     HYD   D      G  R+A+ + YL+DVE GG T
Sbjct: 155 IPADHGE----GLQVLHYEAGQKYEPHYDYFVDEFNTKNGGQRMATMLMYLSDVEEGGET 210

Query: 176 IFPSLN-------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVAL 216
           +FP+ N                   L+V P  G A+ +++   +  LD    H GCPV  
Sbjct: 211 VFPAANMNFSSVPWYNELSECGKKGLSVKPRMGDALLFWSMRPDATLDPTSLHGGCPVIR 270

Query: 217 GNKW 220
           GNKW
Sbjct: 271 GNKW 274


>gi|255551575|ref|XP_002516833.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223543921|gb|EEF45447.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 297

 Score = 76.6 bits (187), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 58/213 (27%), Positives = 91/213 (42%), Gaps = 34/213 (15%)

Query: 35  IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
           I P KV+++   PR       + D E + +I L+K +++R  V +   G +   + R S 
Sbjct: 31  IDPSKVKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNESGKSKLSEVRTSS 90

Query: 93  VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH----CD 148
             F+     G  P +  I+ +I   T L     E     LQ+  Y  G  YD H     D
Sbjct: 91  GMFIAK---GKDPIIAGIEEKISTWTFLPKENGED----LQVLRYEHGQKYDPHYDYFAD 143

Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN---------------------LTVFPE 187
                 G  R+A+ + YL+DV  GG T+FP+                       ++V P 
Sbjct: 144 KINIARGGHRMATVLMYLSDVVKGGETVFPNAEEPPRRKATESHEDLSECAKKGISVKPR 203

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +G A+ +++ H   + D    H+GCPV  G KW
Sbjct: 204 RGDALLFFSLHPTAIPDPNSLHAGCPVIEGEKW 236


>gi|159487421|ref|XP_001701721.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158280940|gb|EDP06696.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 336

 Score = 76.6 bits (187), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 58/207 (28%), Positives = 94/207 (45%), Gaps = 31/207 (14%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
           V+++ L PR    H+ +  +E   ++ ++  K++R  VV       VD  +   Y ++  
Sbjct: 19  VQQVGLHPRAYYFHNFLTKAERAHLVRVAAPKLKRSTVVGGKGEGVVDD-IRTSYGMFIR 77

Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGL--- 156
              D P + +I+ RI   T+L +  +E     +QI  Y  G  Y  H D+    + +   
Sbjct: 78  RLSD-PVVTRIEKRISLWTHLPVEHQED----IQILRYAHGQTYGAHYDSGASSDHVGPK 132

Query: 157 WRLASFMFYLTDVELGGATIFP----------------------SLNLTVFPEKGSAVFW 194
           WRLA+F+ YL+DVE GG T FP                        ++   P+ G AV +
Sbjct: 133 WRLATFLMYLSDVEEGGETAFPHNSVWADPSIPEQVGDKFSDCAKGHVAAKPKAGDAVLF 192

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKWG 221
           Y+ + N  +D    H+GCPV  G KW 
Sbjct: 193 YSFYPNNTMDPASMHTGCPVIKGVKWA 219


>gi|384429387|ref|YP_005638747.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
           campestris pv. raphani 756C]
 gi|341938490|gb|AEL08629.1| procollagen-proline, 2-oxoglutarate-4-dioxygenase [Xanthomonas
           campestris pv. raphani 756C]
          Length = 286

 Score = 76.6 bits (187), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 68/224 (30%), Positives = 100/224 (44%), Gaps = 26/224 (11%)

Query: 10  NLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELY--LDPRVVKIHDAIYDSEINRIIEL 67
            L VP  + + L+    S     L +G  +V+ L   + PRVV +   + D E + +I L
Sbjct: 61  GLPVPVRVPAPLQADASS----LLDLGDRQVQVLVSLMLPRVVVLGGLLSDDECDALIAL 116

Query: 68  SKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
           ++ ++ R + V+   G  I    R S    L P   G      +I+ RI  +    +   
Sbjct: 117 ARPQLARSRTVDNRDGSEIVHAARTSHSMALQP---GQDALCQRIEARIARLLEWPV--- 170

Query: 126 ERYKGPLQINNYGLGGHYDLHCD-------ATP--RDEGLWRLASFMFYLTDVELGGATI 176
           E  +G LQ+  Y  G  Y  H D        TP     G  R+AS + YL   E GGAT 
Sbjct: 171 EHGEG-LQVLRYATGAQYAPHYDYFEPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATR 229

Query: 177 FPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           FP ++L V   KG+AVF+     + +   R  H+G PV  G KW
Sbjct: 230 FPDVHLDVAAVKGNAVFFSYDRPHPM--TRTLHAGAPVLAGEKW 271


>gi|326316001|ref|YP_004233673.1| procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
           ATCC 19860]
 gi|323372837|gb|ADX45106.1| Procollagen-proline dioxygenase [Acidovorax avenae subsp. avenae
           ATCC 19860]
          Length = 298

 Score = 76.6 bits (187), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 63/203 (31%), Positives = 95/203 (46%), Gaps = 26/203 (12%)

Query: 33  LKIGPLKVEELYL--DPRVVKIHDAIYDSEINRIIELSKGKVERGKVV--NYGDTIYVDT 88
           + +G  +V+ L     PRVV   + +   E + II+ ++ ++ R   V    G     D 
Sbjct: 95  IDVGDRRVDVLMAMAQPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGGEEVNDD 154

Query: 89  RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD 148
           R S   F   E   ++P + K++ RI  + N  +   E  +G LQ+ +Y  G  Y  H D
Sbjct: 155 RTSNGMFFQRE---ENPMVAKLEARIARLVNWPL---ENGEG-LQVLHYRPGAEYKPHYD 207

Query: 149 -------ATPR--DEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVF--WYNA 197
                   TP     G  R+A+ + YL D E GG T FP ++L V P +G+AVF  +   
Sbjct: 208 YFDPTEPGTPTILRRGGQRVATIVIYLNDPEKGGGTTFPDVHLEVAPRRGNAVFFSYERP 267

Query: 198 HANTLLDYRMYHSGCPVALGNKW 220
           H +T    R  H G PV  G+KW
Sbjct: 268 HPST----RTLHGGAPVVAGDKW 286


>gi|326489721|dbj|BAK01841.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 315

 Score = 76.3 bits (186), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 58/204 (28%), Positives = 92/204 (45%), Gaps = 28/204 (13%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
            E +  +PR    H+ +   E   +IEL+K ++ +  VV+       D+R+     ++ +
Sbjct: 104 TEVISWEPRAFVYHNFLSKEECEYLIELAKPRMVKSTVVDSETGKSKDSRVRTSSGMFLQ 163

Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDEG 155
             G    +  I+ RI D T +     E  +G LQ+ +Y +G     H+D   D      G
Sbjct: 164 -RGRDKVIRAIERRIADYTFIPA---EHGEG-LQVLHYEVGQKYEPHFDYFLDEFNTKNG 218

Query: 156 LWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYN 196
             R+A+ + YL+D+E GG TIFP  N                   L V P+ G A+ +++
Sbjct: 219 GQRMATILMYLSDIEEGGETIFPDANVNSSSLPWYNELSECARKGLAVKPKMGDALLFWS 278

Query: 197 AHANTLLDYRMYHSGCPVALGNKW 220
              +  LD    H GCPV  GNKW
Sbjct: 279 MKPDATLDPLSLHGGCPVIKGNKW 302


>gi|302791635|ref|XP_002977584.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
 gi|300154954|gb|EFJ21588.1| hypothetical protein SELMODRAFT_106693 [Selaginella moellendorffii]
          Length = 296

 Score = 76.3 bits (186), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 58/208 (27%), Positives = 91/208 (43%), Gaps = 29/208 (13%)

Query: 35  IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
           + P KV +L   PR       +  +E + +++++K K+++  V +   G ++  + R S 
Sbjct: 37  VDPTKVIQLSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVLSNIRTSS 96

Query: 93  VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCD 148
             FL     G    + +I+ RI   T L     E     +Q+  Y  G     HYD   D
Sbjct: 97  GMFLSK---GQDEVINRIEERIAAWTFLPKENGE----AIQVLRYEFGEKYEPHYDYFHD 149

Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN----------------LTVFPEKGSAV 192
              +  G  R+A+ + YL+DV  GG T+FPS                  + V P KG A+
Sbjct: 150 KYNQALGGHRIATVLMYLSDVVKGGETVFPSSEDTTVKDDSWSDCAKKGIAVKPRKGDAL 209

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
            +Y+ H +   D    H GCPV  G KW
Sbjct: 210 LFYSLHPDATPDESSLHGGCPVIEGEKW 237


>gi|297818456|ref|XP_002877111.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297322949|gb|EFH53370.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 316

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 56/209 (26%), Positives = 88/209 (42%), Gaps = 31/209 (14%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
           P +V +L   PR       + D E +  I+L+KGK+E+  V +   G+++  + R S   
Sbjct: 53  PTRVTQLSWTPRAFLYKGFLSDEECDHFIKLAKGKLEKSMVADNDSGESVESEVRTSSGM 112

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
           FL          +  ++ ++   T +     E     +QI +Y  G     H+D   D  
Sbjct: 113 FLSKR---QDDIVANVEAKLAAWTFI----PEENGESMQILHYENGQKYEPHFDYFHDQA 165

Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFP------------------SLNLTVFPEKGSAV 192
             + G  R+A+ + YL++VE GG T+FP                       V P KG A+
Sbjct: 166 NLELGGHRIATVLMYLSNVEKGGETVFPMWKGKTTQLKDDSWTECAKQGYAVKPRKGDAL 225

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKWG 221
            ++N H N   D    H  CPV  G KW 
Sbjct: 226 LFFNLHPNATTDSNSLHGSCPVVEGEKWS 254


>gi|77761111|ref|YP_241833.2| hypothetical protein XC_0735 [Xanthomonas campestris pv. campestris
           str. 8004]
          Length = 288

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 63/208 (30%), Positives = 96/208 (46%), Gaps = 22/208 (10%)

Query: 26  ESYNNTFLKIGPLKVEELY--LDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--G 81
           ++  ++ L +G  +V+ L   + PRVV +   + D E + +I L++ ++ R + V+   G
Sbjct: 75  QADASSLLDLGDRQVQVLVSLMLPRVVVLGGLLADDECDALIALARPQLARSRTVDNRDG 134

Query: 82  DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG 141
             I    R S    L P   G      +I+ RI  +    +   E  +G LQ+  Y  G 
Sbjct: 135 SEIVHAARTSHSMALQP---GQDALCQRIEARIAQLLEWPV---EHGEG-LQVLRYATGA 187

Query: 142 HYDLHCD-------ATP--RDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
            Y  H D        TP     G  R+AS + YL   E GGAT FP ++L V   KG+AV
Sbjct: 188 QYAPHYDYFEPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRFPDVHLDVAAVKGNAV 247

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           F+     + +   R  H+G PV  G KW
Sbjct: 248 FFSYDRPHPM--TRTLHAGAPVLAGEKW 273


>gi|66572403|gb|AAY47813.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris str. 8004]
          Length = 308

 Score = 76.3 bits (186), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 63/208 (30%), Positives = 96/208 (46%), Gaps = 22/208 (10%)

Query: 26  ESYNNTFLKIGPLKVEELY--LDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--G 81
           ++  ++ L +G  +V+ L   + PRVV +   + D E + +I L++ ++ R + V+   G
Sbjct: 95  QADASSLLDLGDRQVQVLVSLMLPRVVVLGGLLADDECDALIALARPQLARSRTVDNRDG 154

Query: 82  DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG 141
             I    R S    L P   G      +I+ RI  +    +   E  +G LQ+  Y  G 
Sbjct: 155 SEIVHAARTSHSMALQP---GQDALCQRIEARIAQLLEWPV---EHGEG-LQVLRYATGA 207

Query: 142 HYDLHCD-------ATP--RDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
            Y  H D        TP     G  R+AS + YL   E GGAT FP ++L V   KG+AV
Sbjct: 208 QYAPHYDYFEPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRFPDVHLDVAAVKGNAV 267

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           F+     + +   R  H+G PV  G KW
Sbjct: 268 FFSYDRPHPM--TRTLHAGAPVLAGEKW 293


>gi|224102545|ref|XP_002312720.1| predicted protein [Populus trichocarpa]
 gi|222852540|gb|EEE90087.1| predicted protein [Populus trichocarpa]
          Length = 300

 Score = 75.9 bits (185), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 59/214 (27%), Positives = 91/214 (42%), Gaps = 36/214 (16%)

Query: 35  IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
           I P KV+++   PR       + D E + +I L+K +++R  V +   G +   + R S 
Sbjct: 34  INPAKVKQVSWKPRAFVYEGFLTDLECDHLISLAKSELKRSAVADNESGKSKLSEVRTSS 93

Query: 93  VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGP-LQINNYGLGGHYDLH----C 147
             F+        P +  I+ +I   T L      R  G  +Q+  Y  G  YD H     
Sbjct: 94  GMFITK---AKDPIVAGIEDKIATWTFL-----PRENGEDIQVLRYEHGQKYDPHYDYFS 145

Query: 148 DATPRDEGLWRLASFMFYLTDVELGGATIFPSLN---------------------LTVFP 186
           D      G  R+A+ + YLTDVE GG T+FPS                       + V P
Sbjct: 146 DKVNIARGGHRVATVLMYLTDVEKGGETVFPSAEELPRRKASVSHEDLSECARKGIAVKP 205

Query: 187 EKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            +G A+ +++ +   + D    H+GCPV  G KW
Sbjct: 206 RRGDALLFFSLYPTAVPDTSSIHAGCPVIEGEKW 239


>gi|363543369|ref|NP_001241694.1| prolyl 4-hydroxylase 8-4 [Zea mays]
 gi|347978838|gb|AEP37761.1| prolyl 4-hydroxylase 8-4 [Zea mays]
          Length = 307

 Score = 75.9 bits (185), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 57/204 (27%), Positives = 91/204 (44%), Gaps = 28/204 (13%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
            E +  +PR    H+ +   E   +I L+K  + +  VV+       D+R+     ++ +
Sbjct: 96  TEVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQ 155

Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDEG 155
             G +  +  I+ RI D T + +   E     LQ+ +Y +G     H+D   D      G
Sbjct: 156 -RGRNKVIRAIEKRIADYTFIPVDHGEG----LQVLHYEVGQKYEPHFDYFLDEFNTKNG 210

Query: 156 LWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYN 196
             R+A+ + YL+DVE GG TIFP  N                   L+V P+ G A+ +++
Sbjct: 211 GQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWS 270

Query: 197 AHANTLLDYRMYHSGCPVALGNKW 220
              +  LD    H GCPV  GNKW
Sbjct: 271 MKPDATLDPLSLHGGCPVIKGNKW 294


>gi|340357957|ref|ZP_08680560.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
 gi|339616017|gb|EGQ20677.1| prolyl 4-hydroxylase [Sporosarcina newyorkensis 2681]
          Length = 211

 Score = 75.5 bits (184), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 51/177 (28%), Positives = 87/177 (49%), Gaps = 11/177 (6%)

Query: 46  DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHP 105
           +P +V + + + D E + +I+L+  KV+R K+    +    + R S   F+  +   ++ 
Sbjct: 32  EPLIVVLGNVLSDEECDELIQLAGDKVKRSKIGTTREE--NELRTSSSMFIEDD---ENL 86

Query: 106 FLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW--RLASFM 163
            + +++ RI  +  + +   E     LQI  Y  G  Y  H D    D  +   R+++ +
Sbjct: 87  IVTRVKKRISAIMKIPMEHGE----GLQILRYTPGQQYKAHHDFFSSDSKITNNRISTLV 142

Query: 164 FYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            YL DVE GG T FP L  +V P KG AV++   +++  L+    H G PV  G KW
Sbjct: 143 MYLNDVEQGGETFFPHLKFSVSPRKGMAVYFEYFYSDQTLNDFTLHGGAPVVEGEKW 199


>gi|148701599|gb|EDL33546.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha II polypeptide, isoform CRA_d [Mus
           musculus]
          Length = 545

 Score = 75.5 bits (184), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 59/183 (32%), Positives = 92/183 (50%), Gaps = 18/183 (9%)

Query: 1   EIYPLACQGN-LSVPEDIKSNLKCFYESYNNT-FLKIGPLKVEELYLDPRVVKIHDAIYD 58
           ++Y   C+G  + +    +  L C Y   N    L I P K E+ +  P +V+ +D + D
Sbjct: 363 DVYESLCRGEGVKLTPRRQKKLFCRYHHGNRVPQLLIAPFKEEDEWDSPHIVRYYDVMSD 422

Query: 59  SEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQD 116
            EI RI E++K K+ R  V +   G       R+SK  +L  +   D P + ++  R+Q 
Sbjct: 423 EEIERIKEIAKPKLARATVRDPKTGVLTVASYRVSKSSWLEED---DDPVVARVNRRMQH 479

Query: 117 MTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPR--DEGLW----RLASFMFYLTDVE 170
           +T L +   E     LQ+ NYG+GG Y+ H D + R  D GL     RLA+F+ Y++   
Sbjct: 480 ITGLTVKTAEL----LQVANYGMGGQYEPHFDFSRRPFDSGLKTEGNRLATFLNYVS-TG 534

Query: 171 LGG 173
           LGG
Sbjct: 535 LGG 537


>gi|168060785|ref|XP_001782374.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666166|gb|EDQ52828.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 211

 Score = 75.5 bits (184), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 59/207 (28%), Positives = 90/207 (43%), Gaps = 32/207 (15%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLY 97
           VE L  +PR    H  +   E N +IE++K  + +  V++       D+R+  S   FL 
Sbjct: 2   VEVLSWEPRAFLYHHFLTQVECNHLIEVAKPSLVKSTVIDSATGKSKDSRVRTSSGTFL- 60

Query: 98  PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
             + G    + +I+ RI D T + + + E     LQ+  Y        HYD   DA    
Sbjct: 61  --VRGQDHIIKRIEKRIADFTFIPVEQGE----GLQVLQYRESEKYEPHYDYFHDAFNTK 114

Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
            G  R+A+ + YL+DVE GG T+FP+                     L+V P  G A+ +
Sbjct: 115 NGGQRIATVLMYLSDVEKGGETVFPASKVNASEVPDWDQRSECAKRGLSVRPRMGDALLF 174

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKWG 221
           ++   +  LD    H  CPV  G KW 
Sbjct: 175 WSMKPDAKLDPTSLHGACPVIQGTKWS 201


>gi|149068803|gb|EDM18355.1| procollagen-proline, 2-oxoglutarate 4-dioxygenase (proline
           4-hydroxylase), alpha polypeptide III [Rattus
           norvegicus]
          Length = 266

 Score = 75.5 bits (184), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 57/179 (31%), Positives = 87/179 (48%), Gaps = 20/179 (11%)

Query: 7   CQGNLSVPEDIK-SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           CQ   S P   +  +L C YE+ ++ +L + P + E ++L P V   HD + D E  +I 
Sbjct: 43  CQTLGSQPTHYQIPSLYCSYETNSSPYLLLQPARKEVIHLRPLVALYHDFVSDEEAQKIR 102

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
           EL++  ++R  V +    + V+ R+SK  +L   +    P L  +  RI  +T L I  +
Sbjct: 103 ELAEPWLQRSVVASGEKQLQVEYRISKSAWLKDTV---DPVLVTLDRRIAALTGLDI--Q 157

Query: 126 ERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTV 184
             Y   LQ+ NYG+GGHY+ H D                 L+ VE GGAT F   N +V
Sbjct: 158 PPYAEYLQVVNYGIGGHYEPHFDHA--------------TLSSVEAGGATAFIYGNFSV 202


>gi|48716447|dbj|BAD23054.1| putative prolyl 4-hydroxylase [Oryza sativa Japonica Group]
          Length = 310

 Score = 75.5 bits (184), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 62/238 (26%), Positives = 99/238 (41%), Gaps = 28/238 (11%)

Query: 6   ACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           A  G  + P D +          +    + G    E +  +PR    H+ +   E + +I
Sbjct: 65  AAAGGDAEPADPRPPRTRARRDLSEGLGERGAQWTEVISWEPRAFVYHNFLSKEECDYLI 124

Query: 66  ELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGRE 125
            L+K  + +  VV+       D+R+     ++ +  G    +  I+ RI D T + +   
Sbjct: 125 GLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQ-RGRDKVIRAIEKRIADYTFIPMEHG 183

Query: 126 ERYKGPLQINNYGLGG----HYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLN 181
           E     LQ+ +Y +G     H+D   D      G  R+A+ + YL+DVE GG TIFP  N
Sbjct: 184 EG----LQVLHYEVGQKYEPHFDYFLDEYNTKNGGQRMATLLMYLSDVEEGGETIFPDAN 239

Query: 182 -------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
                              L V P+ G A+ +++   +  LD    H GCPV  GNKW
Sbjct: 240 VNSSSLPWYNELSECARKGLAVKPKMGDALLFWSMKPDATLDPLSLHGGCPVIKGNKW 297


>gi|388500582|gb|AFK38357.1| unknown [Medicago truncatula]
          Length = 299

 Score = 75.5 bits (184), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 59/213 (27%), Positives = 89/213 (41%), Gaps = 34/213 (15%)

Query: 35  IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
           I P KV+++   PR       + D E + +I L+K +++R  V +   GD+   D R S 
Sbjct: 32  INPSKVKQISWIPRAFVYQGFLTDLECDHLISLAKSELKRSAVADNLSGDSQLSDVRTSS 91

Query: 93  VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH----CD 148
              +        P +  I+ RI   T L     E     +Q+  Y  G  YD H     D
Sbjct: 92  GMLISKN---KDPIVSGIEDRISAWTFLPKENGE----DIQVLRYEHGQKYDPHYDYFAD 144

Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN---------------------LTVFPE 187
                +G  RLA+ + YLT+V  GG T+FP                        + V P 
Sbjct: 145 KVNIVQGGHRLATVLMYLTNVTKGGETVFPEAEEPPRRRGSKKSSDLSECAKKGIAVKPR 204

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +G A+ +++   N + D    H+GCPV  G KW
Sbjct: 205 RGDALLFFSLDTNAIPDTNSLHAGCPVLEGEKW 237


>gi|328710203|ref|XP_001949232.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Acyrthosiphon
           pisum]
          Length = 500

 Score = 75.5 bits (184), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 72/230 (31%), Positives = 107/230 (46%), Gaps = 31/230 (13%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           +  ACQ   S  + I    KC Y      +L IGPL+ E + L P +   H+ +YD EI 
Sbjct: 283 FKTACQ---STTDFIYPKFKCRYYHGGRKYLMIGPLREEIVSLIPSMKLYHNVLYDDEIK 339

Query: 63  RIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYP---EIFGD-HPFLYKIQTRIQDMT 118
           +I EL+  K+E+  +    DT   D  L KV        ++F   H  L +I ++    T
Sbjct: 340 KIKELANPKLEKLSI----DT-NEDISLRKVASFRKHNDQVFETIHHRLAQISSK--PTT 392

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLHC---DATPRDEGLWRLASFMFYLTDVELGGAT 175
           N+V    ++Y     + NYG+GGHY  H    D         R A  + ++ DV  GGAT
Sbjct: 393 NIV----DKY----VVTNYGIGGHYLPHTKYIDDNHLINSKRRDAIVIIHMDDVPEGGAT 444

Query: 176 IFPSLNLTVFPEKGSAVFWYNAH-----ANTLLDYRMYHSGCPVALGNKW 220
           + P++   V   KGSA+  Y+          L ++  Y S CP+  G+KW
Sbjct: 445 VLPNVEFCVPSVKGSALVIYSTRNTLPPIKELFEFAQYGS-CPIVYGDKW 493


>gi|357467077|ref|XP_003603823.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355492871|gb|AES74074.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
          Length = 291

 Score = 75.5 bits (184), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 60/202 (29%), Positives = 92/202 (45%), Gaps = 27/202 (13%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLY 97
            E L  +PR    H+ +   E   +I L+K  ++R  VV+   G  I    R S   FL 
Sbjct: 85  TEVLSSEPRASMYHNFLSKEECEHLINLAKPFMQRSLVVDGVTGQGILNSVRTSSGTFLE 144

Query: 98  PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
               G    +  ++ RI D+T++ I   E     LQI +Y +G     HYD + +    +
Sbjct: 145 R---GKDKIVQNVERRIADITSIPIENGE----GLQIIHYEVGQKFEPHYDYNFNWRITN 197

Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN--------------LTVFPEKGSAVFWYNAHA 199
            G  R+A+ + YL+DVE GG T+FP+                L V P+ G A+ +++   
Sbjct: 198 NGGPRVATVLMYLSDVEEGGETVFPNAKPNFNSVSKYHPGKGLVVKPKMGDALLFWSVKP 257

Query: 200 NTLLDYRMYHSGCPVALGNKWG 221
           +  LD    H G PV  G+KW 
Sbjct: 258 DGSLDTASLHGGSPVIRGSKWA 279


>gi|326495334|dbj|BAJ85763.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 300

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 59/206 (28%), Positives = 92/206 (44%), Gaps = 32/206 (15%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLY 97
            E L  +PR    H+ +   E   +I L+K  +++  VV+       D+R+  S   FL 
Sbjct: 89  TEVLSWEPRAFIYHNFLSKEECEYLISLAKPHMKKSTVVDSATGGSKDSRVRTSSGTFLR 148

Query: 98  PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD---- 153
               G    +  I+ RI D T + +   E     LQ+ +Y +G  Y+ H D    D    
Sbjct: 149 ---RGQDKIVRTIEKRISDFTFIPVENGE----GLQVLHYEVGQKYEPHFDYFHDDFNTK 201

Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
            G  R+A+ + YL+DVE GG T+FPS                     ++V P+ G A+ +
Sbjct: 202 NGGQRIATVLMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRGISVKPKMGDALLF 261

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
           ++   +  LD    H GCPV  G+KW
Sbjct: 262 WSMRPDGTLDPTSLHGGCPVIKGDKW 287


>gi|344253558|gb|EGW09662.1| Glucose 1,6-bisphosphate synthase [Cricetulus griseus]
          Length = 904

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 58/185 (31%), Positives = 90/185 (48%), Gaps = 20/185 (10%)

Query: 1   EIYPLACQGNLSVPEDIKS-NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDS 59
           + Y   CQ   S P   ++  L C YE+ ++ +L + P + E ++L P V   HD + D+
Sbjct: 691 DTYEGLCQTLGSQPTHYQNPRLYCSYETNSSPYLLLQPARKEVIHLRPFVALYHDFVSDA 750

Query: 60  EINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTN 119
           E  +I EL++  ++R  V +    + V+ R+SK  +L   +    P L  +  RI  +T 
Sbjct: 751 EAQKIRELAEPWLQRSVVASGEKQLPVEYRISKSAWLKDTV---DPMLGTLDHRIAALTG 807

Query: 120 LVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPS 179
           L I  +  Y   LQ+ NYG+GGHY+ H D                 L+ VE GGAT F  
Sbjct: 808 LDI--QPPYAEYLQVVNYGIGGHYEPHFDHAT--------------LSAVEAGGATAFIY 851

Query: 180 LNLTV 184
            N +V
Sbjct: 852 ANFSV 856


>gi|120609859|ref|YP_969537.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
 gi|120588323|gb|ABM31763.1| 2OG-Fe(II) oxygenase [Acidovorax citrulli AAC00-1]
          Length = 309

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 62/203 (30%), Positives = 95/203 (46%), Gaps = 26/203 (12%)

Query: 33  LKIGPLKVEELYL--DPRVVKIHDAIYDSEINRIIELSKGKVERGKVV--NYGDTIYVDT 88
           + +G  +V+ L     PRVV   + +   E + II+ ++ ++ R   V    G     D 
Sbjct: 106 IDVGDRRVDVLMAMAQPRVVLFGNLLSPEECDAIIDAARPRMARSLTVATRTGGEEVNDD 165

Query: 89  RLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD 148
           R S   F   E   ++P + +++ RI  + N  +   E  +G LQ+ +Y  G  Y  H D
Sbjct: 166 RTSNGMFFQRE---ENPVVARLEARIARLVNWPL---ENGEG-LQVLHYRPGAEYKPHYD 218

Query: 149 -------ATPR--DEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVF--WYNA 197
                   TP     G  R+A+ + YL D E GG T FP ++L V P +G+AVF  +   
Sbjct: 219 YFDPAEPGTPTILRRGGQRVATIVIYLNDPEKGGGTTFPDVHLEVAPRRGNAVFFSYERP 278

Query: 198 HANTLLDYRMYHSGCPVALGNKW 220
           H +T    R  H G PV  G+KW
Sbjct: 279 HPST----RTLHGGAPVVAGDKW 297


>gi|195352174|ref|XP_002042589.1| GM14934 [Drosophila sechellia]
 gi|194124473|gb|EDW46516.1| GM14934 [Drosophila sechellia]
          Length = 438

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 57/200 (28%), Positives = 93/200 (46%), Gaps = 24/200 (12%)

Query: 21  LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY 80
           L C Y  +   FLK+ PLK+EEL + P +   +  +   +I  +   S+ K++R K ++ 
Sbjct: 256 LVCRYVDWTQ-FLKLAPLKMEELSMKPHISIFYGFLGQKDIEVLKNASRPKLQRVKHLSG 314

Query: 81  GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG 140
             +  +    S            H  + K+   I D+T    G   +    L++ NYG+ 
Sbjct: 315 NCSCKIGNLSS----------SSHDVVRKVNELILDIT----GFPSKGNQMLEVINYGIA 360

Query: 141 GHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHAN 200
           G+Y+    A P+   +   A+   +L +   GG  +FPS +L V P KGS +FW N    
Sbjct: 361 GNYNPEDTAKPK---IHNKANAFIFLENAGKGGEIVFPSRHLKVRPRKGSMLFWEN---- 413

Query: 201 TLLDYRMYHSGCPVALGNKW 220
            L +  +YH  CP+  GN W
Sbjct: 414 -LKNSVIYHQ-CPILKGNMW 431


>gi|357125236|ref|XP_003564301.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 293

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 55/206 (26%), Positives = 90/206 (43%), Gaps = 31/206 (15%)

Query: 39  KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFL 96
           +V +L   PR       +  +E + +++L+KG++++  V +   G ++    R S   FL
Sbjct: 30  RVTQLSWRPRAFLYSGFLSHAECDHLVKLAKGRLQKSMVADNDSGKSVMSQVRTSSGTFL 89

Query: 97  YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----ATPR 152
                 +   +  I+ R+   T L     E     +Q+ +Y +G  YD H D       +
Sbjct: 90  NKH---EDEIISGIEKRVAAWTFL----PEENAESIQVLHYEVGQKYDAHFDYFHDKNNQ 142

Query: 153 DEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAVFW 194
             G  R+A+ + YLTDV+ GG T+FP+                    L V P KG A+ +
Sbjct: 143 KLGGHRVATVLMYLTDVKKGGETVFPNAEGRHLQHKDETWSECARSGLAVKPRKGDALLF 202

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
           ++ H N   D    H  CPV  G KW
Sbjct: 203 FSLHINATTDPSSLHGSCPVIEGEKW 228


>gi|259490206|ref|NP_001159002.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
 gi|195626402|gb|ACG35031.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
 gi|347978830|gb|AEP37757.1| prolyl 4-hydroxylase 8 [Zea mays]
 gi|347978832|gb|AEP37758.1| prolyl 4-hydroxylase 8-1 [Zea mays]
 gi|413939569|gb|AFW74120.1| prolyl 4-hydroxylase alpha-2 subunit isoform 1 [Zea mays]
 gi|413939570|gb|AFW74121.1| prolyl 4-hydroxylase alpha-2 subunit isoform 2 [Zea mays]
 gi|413939571|gb|AFW74122.1| prolyl 4-hydroxylase alpha-2 subunit isoform 3 [Zea mays]
 gi|413939572|gb|AFW74123.1| prolyl 4-hydroxylase alpha-2 subunit isoform 4 [Zea mays]
          Length = 307

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 57/204 (27%), Positives = 90/204 (44%), Gaps = 28/204 (13%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
            E +  +PR    H+ +   E   +I L+K  + +  VV+       D+R+     ++ +
Sbjct: 96  TEVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQ 155

Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDEG 155
             G    +  I+ RI D T + +   E     LQ+ +Y +G     H+D   D      G
Sbjct: 156 -RGRDKVIRAIEKRIADYTFIPVDHGEG----LQVLHYEVGQKYEPHFDYFLDEFNTKNG 210

Query: 156 LWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYN 196
             R+A+ + YL+DVE GG TIFP  N                   L+V P+ G A+ +++
Sbjct: 211 GQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWS 270

Query: 197 AHANTLLDYRMYHSGCPVALGNKW 220
              +  LD    H GCPV  GNKW
Sbjct: 271 MKPDATLDPLSLHGGCPVIKGNKW 294


>gi|449522594|ref|XP_004168311.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Cucumis
           sativus]
          Length = 313

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 56/208 (26%), Positives = 91/208 (43%), Gaps = 31/208 (14%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
           P +V +L   PR       + D+E + +I+L+K K+E+  V +   G ++  + R S   
Sbjct: 50  PTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGM 109

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
           FL          +  ++ RI   T L     E     +QI +Y  G     H+D   D  
Sbjct: 110 FLRK---AQDEVVAGVEARIAAWTLLPAENGE----SIQILHYENGQKYEPHFDFFHDKV 162

Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLNL------------------TVFPEKGSAV 192
            ++ G  R+A+ + YL++VE GG TIFP+                      V  +KG A+
Sbjct: 163 NQELGGHRIATVLMYLSNVEKGGETIFPNSEFKESQAKDESWSDCSRKGYAVKAQKGDAL 222

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
            +++ + +   D R  H  CPV  G KW
Sbjct: 223 LFFSLNLDATTDERSLHGSCPVIAGEKW 250


>gi|159487763|ref|XP_001701892.1| predicted protein [Chlamydomonas reinhardtii]
 gi|158281111|gb|EDP06867.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 259

 Score = 75.1 bits (183), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 61/205 (29%), Positives = 92/205 (44%), Gaps = 32/205 (15%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDT-RLSKVYFLYP 98
           +E +   PRV   H+ I + E   +IEL+  +++R  VV  G     D  R S   FL  
Sbjct: 1   IEHVAWKPRVFIYHNFITEVEAKHLIELAAPQMKRSTVVGAGGKSVEDNYRTSYGTFL-- 58

Query: 99  EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWR 158
           + + D   + +I+ R+   T + +  +E      QI  YGLG  Y +H D    +E   R
Sbjct: 59  KRYQDE-IVERIENRVAAWTQIPVAHQED----TQILRYGLGQQYKVHADTLRDEEAGVR 113

Query: 159 LASFMFYLTDVELGGATIFPSL---------------------NLTVFPEKGSA-VFWY- 195
           +A+ + YL + + GG T FPS                      ++   P++G A +FW  
Sbjct: 114 VATVLIYLNEPDGGGETAFPSSEWVNPQLAKTLGANFSDCAKNHVAFAPKRGDALLFWSI 173

Query: 196 NAHANTLLDYRMYHSGCPVALGNKW 220
           N   NT  D    H+GCPV  G KW
Sbjct: 174 NPDGNT-EDTHASHTGCPVLSGVKW 197


>gi|307725787|ref|YP_003909000.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
 gi|307586312|gb|ADN59709.1| Procollagen-proline dioxygenase [Burkholderia sp. CCGE1003]
          Length = 313

 Score = 74.7 bits (182), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 55/185 (29%), Positives = 88/185 (47%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           P+V+   + +   E   +IE S+ +++R  +V+   G    +  R S+  +      G+ 
Sbjct: 124 PQVIVFGNVLSPDECAEMIERSRHRLKRSTIVDPATGREDVIRNRTSEGIWYQ---RGED 180

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---------G 155
             + ++  RI  + N  +   E  +G LQI +YG  G Y  H D  P D+         G
Sbjct: 181 ALIERLDQRIASLMNWPL---ENGEG-LQILHYGPSGEYRPHFDYFPPDQPGSAVHTARG 236

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+A+ + YL DV  GG TIFP   L+V  ++G AV++   +    LD    H G PV 
Sbjct: 237 GQRVATLVVYLNDVPDGGETIFPEAGLSVAAQQGGAVYFRYMNGRRQLDPLTLHGGAPVL 296

Query: 216 LGNKW 220
            G+KW
Sbjct: 297 SGDKW 301


>gi|226529219|ref|NP_001151238.1| LOC100284871 [Zea mays]
 gi|195645242|gb|ACG42089.1| prolyl 4-hydroxylase alpha-2 subunit precursor [Zea mays]
 gi|347978812|gb|AEP37748.1| prolyl 4-hydroxylase 5 [Zea mays]
 gi|413923983|gb|AFW63915.1| prolyl 4-hydroxylase alpha-2 subunit [Zea mays]
          Length = 308

 Score = 74.7 bits (182), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 57/204 (27%), Positives = 90/204 (44%), Gaps = 28/204 (13%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
            E +  +PR    H+ +   E   +I L+K  + +  VV+       D+R+     ++ +
Sbjct: 97  TEVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQ 156

Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDEG 155
             G    +  I+ RI D T + +   E     LQ+ +Y +G     H+D   D      G
Sbjct: 157 -RGRDKVIRVIEKRIADYTFIPVDHGEG----LQVLHYEVGQKYEPHFDYFLDEFNTKNG 211

Query: 156 LWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYN 196
             R+A+ + YL+DVE GG TIFP  N                   L+V P+ G A+ +++
Sbjct: 212 GQRMATLLMYLSDVEEGGETIFPDANVNVSSLPWYNELSECAKRGLSVKPKMGDALLFWS 271

Query: 197 AHANTLLDYRMYHSGCPVALGNKW 220
              +  LD    H GCPV  GNKW
Sbjct: 272 MKPDATLDPLSLHGGCPVIRGNKW 295


>gi|449491267|ref|XP_004158845.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 287

 Score = 74.7 bits (182), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 58/206 (28%), Positives = 94/206 (45%), Gaps = 32/206 (15%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLY 97
           VE +  +PR    H+ +   E   +I L+K  +++  VV+       D+R+  S   FL 
Sbjct: 76  VEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQSKDSRVRTSSGTFL- 134

Query: 98  PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
           P   G    +  I+ R+ D + + +   E     LQ+ +Y +G     H+D   D     
Sbjct: 135 PR--GRDKTVRTIEKRLSDFSFIPVEHGE----GLQVLHYEVGQKYEPHFDYFLDEYNTK 188

Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
            G  R+A+ + YL+DVE GG T+FP+                     L+V P++G A+ +
Sbjct: 189 NGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLF 248

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
           ++   +  LD    H GCPV  GNKW
Sbjct: 249 WSMKPDASLDPSSLHGGCPVIKGNKW 274


>gi|449434114|ref|XP_004134841.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 287

 Score = 74.7 bits (182), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 58/206 (28%), Positives = 94/206 (45%), Gaps = 32/206 (15%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLY 97
           VE +  +PR    H+ +   E   +I L+K  +++  VV+       D+R+  S   FL 
Sbjct: 76  VEVISWEPRAFVYHNFLTKEECEYLISLAKPHMQKSTVVDSETGQSKDSRVRTSSGTFL- 134

Query: 98  PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
           P   G    +  I+ R+ D + + +   E     LQ+ +Y +G     H+D   D     
Sbjct: 135 PR--GRDKTVRTIEKRLSDFSFIPVEHGE----GLQVLHYEVGQKYEPHFDYFLDEYNTK 188

Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
            G  R+A+ + YL+DVE GG T+FP+                     L+V P++G A+ +
Sbjct: 189 NGGQRIATVLMYLSDVEEGGETVFPAAKGNFSSVPWWNELSDCGKKGLSVKPKRGDALLF 248

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
           ++   +  LD    H GCPV  GNKW
Sbjct: 249 WSMKPDASLDPSSLHGGCPVIKGNKW 274


>gi|307111754|gb|EFN59988.1| hypothetical protein CHLNCDRAFT_49444 [Chlorella variabilis]
          Length = 344

 Score = 74.7 bits (182), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 65/227 (28%), Positives = 96/227 (42%), Gaps = 37/227 (16%)

Query: 24  FYESYNNTFLKIGPL-------KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGK 76
           F  S+ N+     P        +V+ L+ D R+   H+ + D E + II+L++  + R  
Sbjct: 40  FAASFGNSSCASEPACDPSRSPRVQVLHEDARIFLYHNFLTDEECDHIIKLAEPTMARSG 99

Query: 77  VV--NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQI 134
           VV  + G +   + R SK  FL     G    +  I+ RI   T +  G  E     LQ+
Sbjct: 100 VVETDSGKSKIDNVRTSKGTFLN---RGHDSVIADIEARIAKWTLMPAGNGEG----LQV 152

Query: 135 NNYGLG----GHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLN--------- 181
             Y  G    GHYD          G  R  + + YL DVE GG T FP++          
Sbjct: 153 LKYEHGQEYEGHYDYFFHKAGTANGGNRYLTVLMYLNDVEEGGETCFPNIPSPNGDNGPE 212

Query: 182 --------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
                   L   P+KG+AV +++      L+ R  H+ CPV  G KW
Sbjct: 213 FSECARKVLAAKPKKGNAVLFHSIKPTGELERRSLHTACPVIKGVKW 259


>gi|302786814|ref|XP_002975178.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
 gi|300157337|gb|EFJ23963.1| hypothetical protein SELMODRAFT_174666 [Selaginella moellendorffii]
          Length = 283

 Score = 74.7 bits (182), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 57/209 (27%), Positives = 90/209 (43%), Gaps = 30/209 (14%)

Query: 35  IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
           + P KV +L   PR       +  +E + +++++K K+++  V +   G ++  + R S 
Sbjct: 23  VDPTKVIQLSWKPRAFLYKGFMSAAECDHVVKMAKDKLQKSMVADNESGKSVLSNIRTSS 82

Query: 93  VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCD 148
             FL     G    + +I+ RI   T L     E     +Q+  Y  G     HYD   D
Sbjct: 83  GMFLSK---GQDEVINRIEERIAAWTFLPKENGE----AIQVLRYEFGEKYEPHYDYFHD 135

Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN-----------------LTVFPEKGSA 191
              +  G  R+A+ + YL+D   GG T+FPS                   + V P KG A
Sbjct: 136 KYNQALGGHRIATVLMYLSDAVKGGETVFPSSEEDTTVKDDSWSDCAKKGIAVKPRKGDA 195

Query: 192 VFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + +Y+ H +   D    H GCPV  G KW
Sbjct: 196 LLFYSLHPDATPDESSLHGGCPVIEGEKW 224


>gi|241710335|ref|XP_002412046.1| prolyl 4-hydroxylase alpha subunit 1, putative [Ixodes scapularis]
 gi|215505101|gb|EEC14595.1| prolyl 4-hydroxylase alpha subunit 1, putative [Ixodes scapularis]
          Length = 65

 Score = 74.7 bits (182), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 31/63 (49%), Positives = 43/63 (68%)

Query: 158 RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALG 217
           R+A+ M Y++DVE GGAT+FP L + + P+KG A FW+N  AN   +    H+GCPV  G
Sbjct: 3   RVATLMIYMSDVEEGGATVFPYLGVRLTPQKGDAAFWWNLKANGEGEVLTTHAGCPVLYG 62

Query: 218 NKW 220
           +KW
Sbjct: 63  SKW 65


>gi|115482738|ref|NP_001064962.1| Os10g0497800 [Oryza sativa Japonica Group]
 gi|78708853|gb|ABB47828.1| prolyl 4-hydroxylase alpha subunit, putative, expressed [Oryza
           sativa Japonica Group]
 gi|113639571|dbj|BAF26876.1| Os10g0497800 [Oryza sativa Japonica Group]
 gi|215767852|dbj|BAH00081.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218184821|gb|EEC67248.1| hypothetical protein OsI_34188 [Oryza sativa Indica Group]
          Length = 321

 Score = 74.3 bits (181), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 61/206 (29%), Positives = 90/206 (43%), Gaps = 32/206 (15%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLY 97
            E L  +PR    H+ +   E   +I L+K  +++  VV+       D+R+  S   FL 
Sbjct: 110 TEVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSRVRTSSGMFLG 169

Query: 98  PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
               G    +  I+ RI D T + +   E     LQ+ +Y +G     H+D   D     
Sbjct: 170 R---GQDKIIRTIEKRISDYTFIPVENGE----GLQVLHYEVGQKYEPHFDYFHDEFNTK 222

Query: 154 EGLWRLASFMFYLTDVELGGATIFPSL-------------------NLTVFPEKGSAVFW 194
            G  R+A+ + YL+DVE GG TIFPS                     L V P+ G A+ +
Sbjct: 223 NGGQRIATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALLF 282

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
           ++   +  LD    H GCPV  GNKW
Sbjct: 283 WSMRPDGSLDATSLHGGCPVIKGNKW 308


>gi|363543371|ref|NP_001241695.1| prolyl 4-hydroxylase 8-5 [Zea mays]
 gi|347978840|gb|AEP37762.1| prolyl 4-hydroxylase 8-5 [Zea mays]
          Length = 307

 Score = 74.3 bits (181), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 57/204 (27%), Positives = 89/204 (43%), Gaps = 28/204 (13%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
            E +  +PR    H+ +   E   +I L+K  + +  VV+       D+R+     ++ +
Sbjct: 96  TEVISWEPRAFVYHNFLSKDECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQ 155

Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDEG 155
             G    +  I+ RI D T + +   E     LQ+ +Y +G     H+D   D      G
Sbjct: 156 -RGRDKVIRAIEKRIADYTFIPVDHGEG----LQVLHYEVGQKYEPHFDYFLDEFNTKNG 210

Query: 156 LWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYN 196
             R+A+ + YL+DVE GG TIFP  N                   L+V P+ G A+ +++
Sbjct: 211 GQRIATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSDCAKRGLSVKPKMGDALLFWS 270

Query: 197 AHANTLLDYRMYHSGCPVALGNKW 220
                 LD    H GCPV  GNKW
Sbjct: 271 MKPGATLDPLSLHGGCPVIKGNKW 294


>gi|186474111|ref|YP_001861453.1| procollagen-proline dioxygenase [Burkholderia phymatum STM815]
 gi|184196443|gb|ACC74407.1| Procollagen-proline dioxygenase [Burkholderia phymatum STM815]
          Length = 305

 Score = 74.3 bits (181), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 53/185 (28%), Positives = 88/185 (47%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           P+V+   D +   E + +IE ++ +++R   VN   G    +  R S+ ++       + 
Sbjct: 116 PQVIVFDDVLSRDECDELIERARHRLKRSTTVNPESGREDVIQLRTSEGFWFQ---RCED 172

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---------G 155
            F+ ++  RI  + N  +   E  +G LQI +Y  GG Y  H D  P  +         G
Sbjct: 173 AFIERLDRRISALMNWPL---EHGEG-LQILHYTKGGEYRPHFDYFPPSQSGSVLHTSRG 228

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+A+ + YL+DV  GG T+FP+  L V   +G A+++   + +  LD    H G PV 
Sbjct: 229 GQRVATLIVYLSDVAGGGETVFPNAGLAVMARQGGAIYFRYLNGHRQLDPLTLHGGAPVT 288

Query: 216 LGNKW 220
            G KW
Sbjct: 289 NGEKW 293


>gi|357447555|ref|XP_003594053.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
 gi|355483101|gb|AES64304.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
          Length = 303

 Score = 74.3 bits (181), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 59/223 (26%), Positives = 93/223 (41%), Gaps = 36/223 (16%)

Query: 27  SYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTI 84
           SY  T   I P KV+++   PR       + D E + +I ++K +++R  V +   G++ 
Sbjct: 27  SYAGTSAIIDPTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESK 86

Query: 85  YVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYD 144
             + R S   F+          +  I+ +I   T L     E     +Q+  Y  G  YD
Sbjct: 87  LSEVRTSSGMFISKN---KDAIVSGIEDKISSWTFLPKENGED----IQVLRYEHGQKYD 139

Query: 145 LH----CDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL------------------ 182
            H     D      G  R+A+ + YLT+V  GG T+FP+  L                  
Sbjct: 140 PHYDYFADKVNIARGGHRVATVLMYLTNVTKGGETVFPNAELQESPRHKLSETDEDLSEC 199

Query: 183 -----TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
                 V P +G A+ +++ H N + D    H+GCPV  G KW
Sbjct: 200 GKKGVAVKPRRGDALLFFSLHPNAIPDTLSLHAGCPVIEGEKW 242


>gi|148653656|ref|YP_001280749.1| procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
 gi|148572740|gb|ABQ94799.1| Procollagen-proline dioxygenase [Psychrobacter sp. PRwf-1]
          Length = 268

 Score = 74.3 bits (181), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 60/217 (27%), Positives = 105/217 (48%), Gaps = 24/217 (11%)

Query: 19  SNLKCFYESYNNTFLKIGPLKVEELYL--DPRVVKIHDAIYDSEINRIIELSKGKVERGK 76
           SN K  + +  N ++++   +V   ++   P V  I+D +   E + +I  +  K++  +
Sbjct: 49  SNQKIPHINMTNNYVELSDKRVSLSFVCYKPFVTVINDFLSPEECDALISDADQKLKASR 108

Query: 77  VVNYGDTIYVD----TRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPL 132
           VV+  D  +V+    T  S  Y       G+   +  I+ RI D+ N  +   E     L
Sbjct: 109 VVDPEDGSFVEHSARTSTSTGYHR-----GEIDIIKTIEARIADLINWPVDHGE----GL 159

Query: 133 QINNYGLGGHYDLHCD------ATPR---DEGLWRLASFMFYLTDVELGGATIFPSLNLT 183
           Q+  Y  GG Y  H D       + R    +G  R+ +F+ YL++V+ GG+T FP+LN  
Sbjct: 160 QVLRYEDGGEYRPHFDFFDPAKKSSRLVTKQGGQRVGTFLMYLSEVDSGGSTRFPNLNFE 219

Query: 184 VFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + P KGSA+++ N +    ++    H+G PV  G K+
Sbjct: 220 IRPNKGSALYFANTNLKAEIEPLTLHAGMPVTEGVKY 256


>gi|319786559|ref|YP_004146034.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
 gi|317465071|gb|ADV26803.1| Procollagen-proline dioxygenase [Pseudoxanthomonas suwonensis 11-1]
          Length = 289

 Score = 74.3 bits (181), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 66/229 (28%), Positives = 99/229 (43%), Gaps = 32/229 (13%)

Query: 4   PLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINR 63
           PL C+    VP  I  N     ++ +     +  L +      PRVV +   + D E + 
Sbjct: 69  PLPCR----VPAPIGLNGPALLDAGDRQVQLLASLML------PRVVVLGGLLSDEECDA 118

Query: 64  IIELSKGKVERGKVVNY---GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           ++ELS+ ++ R   V+    G  ++ D R S+  F      G HP    I+ RI  +   
Sbjct: 119 LVELSRPRLRRSTTVDAQTGGSQVHAD-RTSRGTFFE---RGAHPVCATIEARIARLLEW 174

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDE---------GLWRLASFMFYLTDVEL 171
            +   E  +G LQ+ +Y  G  +  H D    DE         G  R+A+ + YL     
Sbjct: 175 PV---ENGEG-LQVLHYPPGAEFRPHYDYFDPDEPGAEVLLRQGGQRVATVVMYLNTPAR 230

Query: 172 GGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           GGAT FP  +L V   KG+AVF+     + +   R  H G PV  G KW
Sbjct: 231 GGATTFPDAHLEVAAVKGNAVFFSYDRPHPM--TRTLHGGAPVTEGEKW 277


>gi|384251901|gb|EIE25378.1| hypothetical protein COCSUDRAFT_35772 [Coccomyxa subellipsoidea
           C-169]
          Length = 222

 Score = 74.3 bits (181), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 57/204 (27%), Positives = 95/204 (46%), Gaps = 30/204 (14%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLY 97
           +E L  +PR    H+ + ++E + +++  K  +E+ +VV+   G +     R S   FL 
Sbjct: 1   MEVLSWEPRAYLYHNFLTEAEADYLVQKGKPHMEKSEVVDNETGKSAPSKVRTSSGMFLN 60

Query: 98  PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
               G+   + +I+ RI   T +    +E  +G LQI +Y        H+D   D     
Sbjct: 61  ---RGEDDVIERIEARIAKYTAIP---KENGEG-LQILHYQASEEYRPHFDYFHDNFNTQ 113

Query: 154 EGLWRLASFMFYLTDVELGGATIFP-----------------SLNLTVFPEKGSAVFWYN 196
            G  R+A+ + YL+DVE GG T+FP                        P+KG A+F+Y+
Sbjct: 114 NGGQRIATMLMYLSDVEDGGETVFPESSDKPNVGNTKFSQCAQAGAAAKPKKGDALFFYS 173

Query: 197 AHANTLLDYRMYHSGCPVALGNKW 220
              +  +D +  H+GCPV  G+KW
Sbjct: 174 LTPDGRMDEKSLHAGCPVMKGDKW 197


>gi|156352046|ref|XP_001622583.1| predicted protein [Nematostella vectensis]
 gi|156209154|gb|EDO30483.1| predicted protein [Nematostella vectensis]
          Length = 497

 Score = 74.3 bits (181), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 60/220 (27%), Positives = 107/220 (48%), Gaps = 29/220 (13%)

Query: 3   YPLACQGNLSVPEDIK--SNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y   C+G    P  ++    L+C+Y+S ++  L++ P K+E L  D +++ + D I +S+
Sbjct: 284 YERLCRGQ---PNKVRIPKQLRCYYKS-SHPLLRLKPAKIEVLDPDRQILLLRDVINESQ 339

Query: 61  INRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
           +  I EL+  KV    +     +   + R S   +L      D   +  +  RI+ +T+ 
Sbjct: 340 MQFIKELAAPKVSSLHLSPTNRSP-SERRFSSSAWLGD---ADGAPIAALSRRIEAITDF 395

Query: 121 VIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSL 180
            +  +      LQ+ ++G+GGH++      PR    +   +  F    V+ GG+ +F   
Sbjct: 396 HVTGDS--AESLQVVHFGIGGHFE------PR----YGYNALNF----VDAGGSNVFLDS 439

Query: 181 NLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            L+V P+KGSAVFW N   +        H+ CPV +G+KW
Sbjct: 440 ELSVSPQKGSAVFWLNMRRSG---KETLHAACPVIVGHKW 476


>gi|357146834|ref|XP_003574128.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-2-like [Brachypodium
           distachyon]
          Length = 306

 Score = 74.3 bits (181), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 59/206 (28%), Positives = 91/206 (44%), Gaps = 32/206 (15%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLY 97
            E L  +PR    H+ +   E   +I L+K  +++  VV+       D+R+  S   FL 
Sbjct: 95  TEVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDSATGGSKDSRVRTSSGTFLR 154

Query: 98  PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD---- 153
               G    +  I+ RI D T +     E     LQ+ +Y +G  Y+ H D    D    
Sbjct: 155 ---RGQDKVIRTIEKRISDFTFIPAENGE----GLQVLHYEVGQKYEPHFDYFHDDFNTK 207

Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
            G  R+A+ + YL+DVE GG T+FPS                     ++V P+ G A+ +
Sbjct: 208 NGGQRIATLLMYLSDVEEGGETVFPSAKVNSSSIPFYNELSECAKRGISVKPKMGDALLF 267

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
           ++   +  LD    H GCPV  G+KW
Sbjct: 268 WSMRPDGTLDPTSLHGGCPVIKGDKW 293


>gi|30689216|ref|NP_189490.2| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
 gi|332643931|gb|AEE77452.1| Oxoglutarate/iron-dependent oxygenase [Arabidopsis thaliana]
          Length = 288

 Score = 74.3 bits (181), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 58/211 (27%), Positives = 90/211 (42%), Gaps = 32/211 (15%)

Query: 35  IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV---NYGDTIYVDTRLS 91
           + P ++ +L   PR       + D E + +I+L+KGK+E+  VV   + G++   + R S
Sbjct: 27  VDPTRITQLSWTPRAFLYKGFLSDEECDHLIKLAKGKLEKSMVVADVDSGESEDSEVRTS 86

Query: 92  KVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD--- 148
              FL          +  ++ ++   T L     E     LQI +Y  G  YD H D   
Sbjct: 87  SGMFLTKR---QDDIVANVEAKLAAWTFL----PEENGEALQILHYENGQKYDPHFDYFY 139

Query: 149 -ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKG 189
                + G  R+A+ + YL++V  GG T+FP+                      V P KG
Sbjct: 140 DKKALELGGHRIATVLMYLSNVTKGGETVFPNWKGKTPQLKDDSWSKCAKQGYAVKPRKG 199

Query: 190 SAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            A+ ++N H N   D    H  CPV  G KW
Sbjct: 200 DALLFFNLHLNGTTDPNSLHGSCPVIEGEKW 230


>gi|224013908|ref|XP_002296618.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|220968970|gb|EED87314.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 601

 Score = 74.3 bits (181), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 57/187 (30%), Positives = 81/187 (43%), Gaps = 16/187 (8%)

Query: 47  PRVVKIHDAIYDSEINRIIELS-------KGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
           P VV +   + D E +R+++L          KV+  K  N  D    + R S   +    
Sbjct: 400 PWVVSLEGFLSDEEADRLVQLGNQQGYKRSTKVQTHKGGNSIDAGITEDRTSHNTWCQEP 459

Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW-- 157
              D P +  I  RI  +T       E     LQ+  Y  G  Y  H D  P+   +   
Sbjct: 460 SCYDDPLVAPIIERIAMLTKSSANHSEH----LQLLQYTEGQFYKQHNDYIPQQRDMACG 515

Query: 158 -RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHAN--TLLDYRMYHSGCPV 214
            R+ +   YL DVE GG T FP L+LTV P++G+A+ W +   +     D R  H   PV
Sbjct: 516 PRIMTLFLYLNDVEEGGGTRFPLLDLTVQPKRGNAILWASVRDDDPEEKDIRTDHEALPV 575

Query: 215 ALGNKWG 221
           A G K+G
Sbjct: 576 AKGMKYG 582


>gi|224117220|ref|XP_002331751.1| predicted protein [Populus trichocarpa]
 gi|222874448|gb|EEF11579.1| predicted protein [Populus trichocarpa]
          Length = 266

 Score = 74.3 bits (181), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 63/206 (30%), Positives = 97/206 (47%), Gaps = 32/206 (15%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLY 97
           VE +  +PR    H+ +  +E + +I L+K  +++  VV+       D+R+  S   FL 
Sbjct: 55  VEAISWEPRAFIYHNFLTKAECDYLINLAKPHMQKSMVVDSSSGKSKDSRVRTSSGTFL- 113

Query: 98  PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD---- 153
           P   G    +  I+ RI D + +     E  +G LQI +Y +G  Y+ H D    D    
Sbjct: 114 PR--GRDKIIRDIEKRIADFSFIP---SEHGEG-LQILHYEVGQKYEPHFDYFMDDYNTE 167

Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
            G  R+A+ + YL+DVE GG T+FPS                     L+V P+ G A+ +
Sbjct: 168 NGGQRIATVLMYLSDVEEGGETVFPSAKGNISSVPWWNELSECGKGGLSVKPKMGDALLF 227

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
           ++   +  LD    H GCPV  GNKW
Sbjct: 228 WSMKPDASLDPSSLHGGCPVIRGNKW 253


>gi|412992163|emb|CCO19876.1| predicted protein [Bathycoccus prasinos]
          Length = 350

 Score = 74.3 bits (181), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 56/213 (26%), Positives = 92/213 (43%), Gaps = 40/213 (18%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLY 97
            E +   PR   +H  + + E   I+ ++K  ++R  VV+   G+      R SK  FL 
Sbjct: 80  TEPISWQPRAFVLHSILSEEECEEILRIAKPMMKRSTVVDSITGEIKTDPIRTSKQTFL- 138

Query: 98  PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGP-LQINNYGLGGHYDLHCDATPRD--- 153
               G +P + +++ R+   T L       Y G  +QI +YG+G  Y  H D   ++   
Sbjct: 139 --ARGKYPVVTRVEERLSRFTMLPW-----YNGEDMQILSYGVGEKYSAHHDVGEKNTKS 191

Query: 154 ------EGLWRLASFMFYLTDVELGGATIFP-------------------SLNLTVF-PE 187
                 +G  R+A+ + YL D E GG T FP                   + N   F P+
Sbjct: 192 GQQLSADGGQRVATVLLYLQDTEEGGETAFPDSEWIEPESEYAQQKFSECAKNGVAFKPK 251

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +G  + +++      +D +  H+GCPV  G KW
Sbjct: 252 RGDGLLFFSITPEGDIDQKSMHAGCPVVKGTKW 284


>gi|388496942|gb|AFK36537.1| unknown [Lotus japonicus]
          Length = 302

 Score = 74.3 bits (181), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 57/215 (26%), Positives = 91/215 (42%), Gaps = 36/215 (16%)

Query: 35  IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
           I P KV+++   PR       + + E + +I L+K +++R  V +   GD+   D R S 
Sbjct: 36  IDPSKVKQVSWKPRAFVYKGFLTELECDHLISLAKSELKRSAVADNLSGDSKLSDVRTSS 95

Query: 93  VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCD 148
             F+        P +  I+ +I   T L     E     +Q+  Y  G     HYD   D
Sbjct: 96  GMFISKN---KDPIVAGIEDKISSWTFLPKENGED----IQVLRYEHGQKYDPHYDFFAD 148

Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVF----------------------- 185
                 G  R+A+ + YLT+V  GG T+FP+  +  F                       
Sbjct: 149 KVNIARGGHRVATVLMYLTNVTRGGETVFPNAEVEEFPRHRGSETIDDLSECAKKGIAVK 208

Query: 186 PEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           P +G A+ +++ + N + D    H+GCPV  G KW
Sbjct: 209 PRRGDALLFFSLYPNAVPDTMSLHAGCPVIEGEKW 243


>gi|242063586|ref|XP_002453082.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
 gi|241932913|gb|EES06058.1| hypothetical protein SORBIDRAFT_04g038020 [Sorghum bicolor]
          Length = 307

 Score = 74.3 bits (181), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 57/204 (27%), Positives = 89/204 (43%), Gaps = 28/204 (13%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
            E +  +PR    H+ +   E   +I L+K  + +  VV+       D+R+     ++ +
Sbjct: 96  TEVISWEPRAFVYHNFLSKEECEYLIGLAKPHMVKSTVVDSTTGKSKDSRVRTSSGMFLQ 155

Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRDEG 155
             G    +  I+ RI D T +     E     LQ+ +Y +G     H+D   D      G
Sbjct: 156 -RGRDKVIRAIEKRIADYTFIPADHGEG----LQVLHYEVGQKYEPHFDYFLDEFNTKNG 210

Query: 156 LWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYN 196
             R+A+ + YL+DVE GG TIFP  N                   L+V P+ G A+ +++
Sbjct: 211 GQRMATLLMYLSDVEEGGETIFPDANVNASSLPWYNELSECAKRGLSVKPKMGDALLFWS 270

Query: 197 AHANTLLDYRMYHSGCPVALGNKW 220
              +  LD    H GCPV  GNKW
Sbjct: 271 MKPDATLDPLSLHGGCPVIRGNKW 294


>gi|222613083|gb|EEE51215.1| hypothetical protein OsJ_32038 [Oryza sativa Japonica Group]
          Length = 222

 Score = 73.9 bits (180), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 61/206 (29%), Positives = 90/206 (43%), Gaps = 32/206 (15%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLY 97
            E L  +PR    H+ +   E   +I L+K  +++  VV+       D+R+  S   FL 
Sbjct: 11  TEVLSWEPRAFLYHNFLSKEECEYLISLAKPHMKKSTVVDASTGGSKDSRVRTSSGMFLG 70

Query: 98  PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
               G    +  I+ RI D T + +   E     LQ+ +Y +G     H+D   D     
Sbjct: 71  R---GQDKIIRTIEKRISDYTFIPVENGE----GLQVLHYEVGQKYEPHFDYFHDEFNTK 123

Query: 154 EGLWRLASFMFYLTDVELGGATIFPSL-------------------NLTVFPEKGSAVFW 194
            G  R+A+ + YL+DVE GG TIFPS                     L V P+ G A+ +
Sbjct: 124 NGGQRIATLLMYLSDVEEGGETIFPSSKANSSSSPFYNELSECAKKGLAVKPKMGDALLF 183

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
           ++   +  LD    H GCPV  GNKW
Sbjct: 184 WSMRPDGSLDATSLHGGCPVIKGNKW 209


>gi|357447553|ref|XP_003594052.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
 gi|355483100|gb|AES64303.1| Prolyl 4-hydroxylase alpha subunit-like protein [Medicago
           truncatula]
          Length = 301

 Score = 73.9 bits (180), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 58/221 (26%), Positives = 93/221 (42%), Gaps = 34/221 (15%)

Query: 27  SYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTI 84
           SY  T   I P KV+++   PR       + D E + +I ++K +++R  V +   G++ 
Sbjct: 27  SYAGTSAIIDPTKVKQVSWKPRAFVYKGFLTDLECDHLISIAKSELKRSAVADNLSGESK 86

Query: 85  YVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYD 144
             + R S   F+          +  I+ +I   T L     E     +Q+  Y  G  YD
Sbjct: 87  LSEVRTSSGMFISKN---KDAIVSGIEDKISSWTFLPKENGED----IQVLRYEHGQKYD 139

Query: 145 LH----CDATPRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------- 181
            H     D      G  R+A+ + YLT+V  GG T+FP+                     
Sbjct: 140 PHYDYFADKVNIARGGHRVATVLMYLTNVTKGGETVFPNAEESPRHKLSETDEDLSECGK 199

Query: 182 --LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
             + V P +G A+ +++ H N + D    H+GCPV  G KW
Sbjct: 200 KGVAVKPRRGDALLFFSLHPNAIPDTLSLHAGCPVIEGEKW 240


>gi|344199983|ref|YP_004784309.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrivorans SS3]
 gi|343775427|gb|AEM47983.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrivorans SS3]
          Length = 212

 Score = 73.9 bits (180), Expect = 4e-11,   Method: Compositional matrix adjust.
 Identities = 43/122 (35%), Positives = 64/122 (52%), Gaps = 9/122 (7%)

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDA----TPRDE-GLWR 158
           +P +  ++ RI    +L IG  E  + PLQ+ +Y  GG YD+H D+    +P+ E G  R
Sbjct: 68  YPIIKAVRRRI----SLFIGVAEENQEPLQVLHYTRGGRYDIHYDSFLEGSPQLENGGNR 123

Query: 159 LASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGN 218
           + + + YL DVE GG T FP +   + P  G+ + + N  A  L      H+G PV  G 
Sbjct: 124 MLTVLLYLNDVEQGGWTQFPHIMANIVPNVGTGILFRNTDAQNLQLRESLHAGLPVIDGE 183

Query: 219 KW 220
           KW
Sbjct: 184 KW 185


>gi|449461905|ref|XP_004148682.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 295

 Score = 73.9 bits (180), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 56/211 (26%), Positives = 92/211 (43%), Gaps = 34/211 (16%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
           P +V +L   PR       + D+E + +I+L+K K+E+  V +   G ++  + R S   
Sbjct: 29  PTRVTQLSWQPRAFLYKGFLSDAECDHLIDLAKDKLEKSMVADNDSGKSVSSEVRTSSGM 88

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
           FL          +  ++ RI   T L     E     +QI +Y  G     H+D   D  
Sbjct: 89  FLRK---AQDEVVAGVEARIAAWTLLPAENGE----SIQILHYENGQKYEPHFDFFHDKV 141

Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLNL---------------------TVFPEKG 189
            ++ G  R+A+ + YL++VE GG TIFP+  +                      V  +KG
Sbjct: 142 NQELGGHRIATVLMYLSNVEKGGETIFPNSEVWYGSESQAKDESWSDCSRKGYAVKAQKG 201

Query: 190 SAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            A+ +++ + +   D R  H  CPV  G KW
Sbjct: 202 DALLFFSLNLDATTDERSLHGSCPVIAGEKW 232


>gi|385206010|ref|ZP_10032880.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
 gi|385185901|gb|EIF35175.1| 2OG-Fe(II) oxygenase superfamily enzyme [Burkholderia sp. Ch1-1]
          Length = 296

 Score = 73.9 bits (180), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 49/185 (26%), Positives = 88/185 (47%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           P  + + D +  +E  ++I L++ ++ R  VV+   G  +    R S   F      G+ 
Sbjct: 102 PAAILLDDFLSANECEQLISLARPRLSRSTVVDPVTGRNVVAGHRSSDGMFFR---LGET 158

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDE-----G 155
           P + +++ RI ++T L +   E  +G LQ+ +Y +G     H D      P ++      
Sbjct: 159 PLIARLEARIAELTGLPV---ENGEG-LQLLHYEVGAESTPHVDYLIAGNPANQESIARS 214

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+ + + YL DVE GG T+FP    +V P +G A+++   +   L D    H+  P+ 
Sbjct: 215 GQRVGTLLMYLNDVEGGGETMFPQTGWSVVPRRGQALYFEYGNRFGLADPSSLHTSTPLR 274

Query: 216 LGNKW 220
           +G KW
Sbjct: 275 VGEKW 279


>gi|428671901|gb|EKX72816.1| conserved hypothetical protein [Babesia equi]
          Length = 234

 Score = 73.9 bits (180), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 62/211 (29%), Positives = 98/211 (46%), Gaps = 27/211 (12%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELY--LDPRVVKIHDAIYDSEINRIIELSKGK-----V 72
           N+ C+  S   T +    +K+ +LY  L+P +  I + +    I  +++ S+GK      
Sbjct: 26  NIHCYKRS---TLIINDSIKLMKLYIHLNPEISMIFNVLEPEWIQHMMDASEGKWVKSQT 82

Query: 73  ERGKVVNYGDT---IYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYK 129
            RG    + DT      +TR S+      E   +   + KI+ R+     LV G    + 
Sbjct: 83  SRGLSSGHPDTYQTTVSETRKSQSAIFEHE---ETDVIAKIERRVA----LVAGIGVEFL 135

Query: 130 GPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
             L +  Y  G ++  H D      G +R A+ + YL DVE GG T+FP+L L + P   
Sbjct: 136 EKLVMVKYNPGDYFKEHHD------GSFRTATILLYLNDVE-GGETVFPNLGLAIKPVGN 188

Query: 190 SAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           SAVFW N +    +D RM H+G    +G K+
Sbjct: 189 SAVFWRNLNGENEMDERMIHAGTTPKVGTKY 219


>gi|114796723|gb|ABI79328.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
          Length = 297

 Score = 73.6 bits (179), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 56/223 (25%), Positives = 92/223 (41%), Gaps = 35/223 (15%)

Query: 27  SYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTI 84
           S N++  K+ P KV ++   PR       + D E + +I ++K +++R  V +   G + 
Sbjct: 24  SSNDSIFKLNPSKVRQISWKPRAFVYEGFLTDEECDHLISIAKTELKRSAVADNESGKSQ 83

Query: 85  YVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG---- 140
             + R S   F+          + +I+ ++   T L I   E     +Q+  Y  G    
Sbjct: 84  VSEVRTSSGAFISK---AKDAIVQRIEEKLATWTFLPIENGE----DIQVLRYEEGQKYE 136

Query: 141 GHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL------------------ 182
            H+D   D      G  R A+ + YL++VE GG T+FP+  L                  
Sbjct: 137 NHFDFFSDKVNIARGGHRYATVLMYLSNVEKGGDTVFPNAELSERQKAAIAANDDLSECA 196

Query: 183 ----TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
               +V P KG A+ +++       D    H GCPV  G KW 
Sbjct: 197 KRGISVKPRKGDALLFFSLTPTATPDQLSLHGGCPVIEGEKWS 239


>gi|15239594|ref|NP_197391.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
 gi|21593296|gb|AAM65245.1| prolyl 4-hydroxylase alpha subunit-like protein [Arabidopsis
           thaliana]
 gi|332005243|gb|AED92626.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis
           thaliana]
          Length = 298

 Score = 73.6 bits (179), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 55/213 (25%), Positives = 90/213 (42%), Gaps = 34/213 (15%)

Query: 35  IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
           + P KV+++   PR       + + E + ++ L+K  ++R  V +   G++ + + R S 
Sbjct: 32  VNPSKVKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRTSS 91

Query: 93  VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---- 148
             F+     G  P +  I+ +I   T L     E     +Q+  Y  G  YD H D    
Sbjct: 92  GTFISK---GKDPIVSGIEDKISTWTFLPKENGE----DIQVLRYEHGQKYDAHFDYFHD 144

Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL---------------------TVFPE 187
                 G  R+A+ + YL++V  GG T+FP   +                      V P 
Sbjct: 145 KVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENKEDLSDCAKRGIAVKPR 204

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           KG A+ ++N H + + D    H GCPV  G KW
Sbjct: 205 KGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKW 237


>gi|20260280|gb|AAM13038.1| unknown protein [Arabidopsis thaliana]
 gi|22136524|gb|AAM91340.1| unknown protein [Arabidopsis thaliana]
          Length = 298

 Score = 73.6 bits (179), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 55/213 (25%), Positives = 90/213 (42%), Gaps = 34/213 (15%)

Query: 35  IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
           + P KV+++   PR       + + E + ++ L+K  ++R  V +   G++ + + R S 
Sbjct: 32  VNPSKVKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRTSS 91

Query: 93  VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---- 148
             F+     G  P +  I+ +I   T L     E     +Q+  Y  G  YD H D    
Sbjct: 92  GTFISK---GKDPIVSGIEDKISTWTFLPKENGE----DIQVLRYEHGQKYDAHFDYFHD 144

Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL---------------------TVFPE 187
                 G  R+A+ + YL++V  GG T+FP   +                      V P 
Sbjct: 145 KVNIVRGGHRMATILMYLSNVTKGGETVFPDAEIPSRRVLSENEEDLSDCAKRGIAVKPR 204

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           KG A+ ++N H + + D    H GCPV  G KW
Sbjct: 205 KGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKW 237


>gi|415977972|ref|ZP_11559036.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
           [Acidithiobacillus sp. GGI-221]
 gi|339834153|gb|EGQ61937.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein
           [Acidithiobacillus sp. GGI-221]
          Length = 215

 Score = 73.6 bits (179), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 54/191 (28%), Positives = 87/191 (45%), Gaps = 20/191 (10%)

Query: 45  LDPRVVKIHDAIYDSEINRIIELSKGK--VERGKVVNYGDTIYVDTRLSKVY-------- 94
           + P+++ ++D I       ++ L +    +  G V +   ++ VD      Y        
Sbjct: 3   MGPKILSVNDTIGLVHFKGLLSLDECAELIAIGSVSDAKPSVVVDGASDAAYETPGRCST 62

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDA----T 150
            + P +   +P + +I+ RI+    L  G  +  + PLQI +Y  GG YD+H DA    +
Sbjct: 63  VVAPSVDA-YPIILEIRRRIE----LFSGISQENQEPLQILHYTRGGKYDIHYDAFSDGS 117

Query: 151 PR-DEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYH 209
           P+   G  RL + + YL DVE GG T FP +   + P  GS + + N  A         H
Sbjct: 118 PQLRNGGNRLLTVLLYLNDVEYGGWTQFPHIMANIVPNAGSGILFRNTDAQNRQLRESLH 177

Query: 210 SGCPVALGNKW 220
           +G PV  G KW
Sbjct: 178 AGLPVTHGEKW 188


>gi|198284815|ref|YP_002221136.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 53993]
 gi|218668131|ref|YP_002427500.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 23270]
 gi|198249336|gb|ACH84929.1| 2OG-Fe(II) oxygenase [Acidithiobacillus ferrooxidans ATCC 53993]
 gi|218520344|gb|ACK80930.1| oxidoreductase, 2OG-Fe(II) oxygenase family [Acidithiobacillus
           ferrooxidans ATCC 23270]
          Length = 213

 Score = 73.6 bits (179), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 54/191 (28%), Positives = 87/191 (45%), Gaps = 20/191 (10%)

Query: 45  LDPRVVKIHDAIYDSEINRIIELSKGK--VERGKVVNYGDTIYVDTRLSKVY-------- 94
           + P+++ ++D I       ++ L +    +  G V +   ++ VD      Y        
Sbjct: 1   MGPKILSVNDTIGLVHFKGLLSLDECAELIAIGSVSDAKPSVVVDGASDAAYETPGRCST 60

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDA----T 150
            + P +   +P + +I+ RI+    L  G  +  + PLQI +Y  GG YD+H DA    +
Sbjct: 61  VVAPSVDA-YPIILEIRRRIE----LFSGISQENQEPLQILHYTRGGKYDIHYDAFSDGS 115

Query: 151 PR-DEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYH 209
           P+   G  RL + + YL DVE GG T FP +   + P  GS + + N  A         H
Sbjct: 116 PQLRNGGNRLLTVLLYLNDVEYGGWTQFPHIMANIVPNAGSGILFRNTDAQNRQLRESLH 175

Query: 210 SGCPVALGNKW 220
           +G PV  G KW
Sbjct: 176 AGLPVTHGEKW 186


>gi|356555587|ref|XP_003546112.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 2
           [Glycine max]
          Length = 297

 Score = 73.6 bits (179), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 56/209 (26%), Positives = 93/209 (44%), Gaps = 30/209 (14%)

Query: 35  IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
           I P KV+++   PR       + + E + +I ++K +++R  V +   G++   + R S 
Sbjct: 35  IDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSEVRTSS 94

Query: 93  VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH----CD 148
             F+ P+     P +  ++ +I   T L     E     +Q+  Y  G  YD H     D
Sbjct: 95  GMFI-PK--NKDPIVAGVEDKISSWTLLPKENGED----IQVLRYEHGQKYDPHYDYFAD 147

Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL-----------------TVFPEKGSA 191
                 G  R+A+ + YLTDV  GG T+FP+  L                  V P +G A
Sbjct: 148 KVNIARGGHRVATVLMYLTDVTKGGETVFPNAELKSSETKEDLSECAQKGIAVKPRRGDA 207

Query: 192 VFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + +++ + N + D    H+GCPV  G KW
Sbjct: 208 LLFFSLYPNAIPDTMSLHAGCPVIEGEKW 236


>gi|77747935|ref|NP_638775.2| hypothetical protein XCC3429 [Xanthomonas campestris pv. campestris
           str. ATCC 33913]
          Length = 288

 Score = 73.6 bits (179), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 62/208 (29%), Positives = 95/208 (45%), Gaps = 22/208 (10%)

Query: 26  ESYNNTFLKIGPLKVEELY--LDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--G 81
           ++  ++ L +G  +V+ L   + PRVV +   + D E + +I L++ ++ R + V+   G
Sbjct: 75  QADASSLLDLGDRQVQVLVSLMLPRVVVLGGLLADDECDALIALARPQLARSRTVDNRDG 134

Query: 82  DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG 141
             I    R S    L P   G      +I+ RI  +    +   E  +G LQ+  Y  G 
Sbjct: 135 SEIVHAARTSHSMALQP---GQDALCQRIEARIAQLLEWPV---EHGEG-LQVLRYATGA 187

Query: 142 HYDLHCD-------ATP--RDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
            Y  H D        TP     G  R+AS + YL   E GGAT  P ++L V   KG+AV
Sbjct: 188 QYAPHYDYFEPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRVPDVHLDVAAVKGNAV 247

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           F+     + +   R  H+G PV  G KW
Sbjct: 248 FFSYDRPHPM--TRTLHAGAPVLAGEKW 273


>gi|337280547|ref|YP_004620019.1| hypothetical protein Rta_28970 [Ramlibacter tataouinensis TTB310]
 gi|334731624|gb|AEG94000.1| conserved hypothetical protein [Ramlibacter tataouinensis TTB310]
          Length = 286

 Score = 73.6 bits (179), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 57/189 (30%), Positives = 88/189 (46%), Gaps = 26/189 (13%)

Query: 46  DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY---GDTIYVDTRLSKVYFLYPEIFG 102
           +PRVV     + D E  ++I L+K ++ R   V     G+ +  D   S ++F      G
Sbjct: 98  NPRVVVFGSLLSDQECEQLIGLAKPRLARSLTVATKTGGEEVNEDRTSSGMFFQR----G 153

Query: 103 DHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-------ATPR--D 153
           ++  + +I+ RI  + N  +   E  +G LQ+ +Y  G  Y  H D        TP    
Sbjct: 154 ENELVARIEARIARLVNWPV---ENGEG-LQVLHYRPGAEYKPHYDYFDPAEPGTPTILK 209

Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVF--WYNAHANTLLDYRMYHSG 211
            G  R+ + + YL + E GG T FP ++L V P++G  VF  +   H +T    R  H G
Sbjct: 210 RGGQRVGTLVMYLGEPEKGGGTTFPDVHLEVAPKRGHGVFFSYERPHPST----RTLHGG 265

Query: 212 CPVALGNKW 220
            PV  G KW
Sbjct: 266 APVLAGEKW 274


>gi|21114687|gb|AAM42699.1| conserved hypothetical protein [Xanthomonas campestris pv.
           campestris str. ATCC 33913]
          Length = 308

 Score = 73.6 bits (179), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 62/208 (29%), Positives = 95/208 (45%), Gaps = 22/208 (10%)

Query: 26  ESYNNTFLKIGPLKVEELY--LDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--G 81
           ++  ++ L +G  +V+ L   + PRVV +   + D E + +I L++ ++ R + V+   G
Sbjct: 95  QADASSLLDLGDRQVQVLVSLMLPRVVVLGGLLADDECDALIALARPQLARSRTVDNRDG 154

Query: 82  DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG 141
             I    R S    L P   G      +I+ RI  +    +   E  +G LQ+  Y  G 
Sbjct: 155 SEIVHAARTSHSMALQP---GQDALCQRIEARIAQLLEWPV---EHGEG-LQVLRYATGA 207

Query: 142 HYDLHCD-------ATP--RDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
            Y  H D        TP     G  R+AS + YL   E GGAT  P ++L V   KG+AV
Sbjct: 208 QYAPHYDYFEPDAPGTPVLLQHGGQRVASLVMYLNTPERGGATRVPDVHLDVAAVKGNAV 267

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           F+     + +   R  H+G PV  G KW
Sbjct: 268 FFSYDRPHPM--TRTLHAGAPVLAGEKW 293


>gi|90704797|dbj|BAE92293.1| putative prolyl 4-hydroxylase, alpha subunit [Cryptomeria japonica]
          Length = 302

 Score = 73.6 bits (179), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 56/208 (26%), Positives = 91/208 (43%), Gaps = 32/208 (15%)

Query: 39  KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFL 96
           +VE L  +PR    H+ +   E   +I ++K  + +  VV+   G ++  + R S  +FL
Sbjct: 90  RVEVLSWEPRAFLYHNFLAKDECEYLINIAKPHMVKSMVVDSKTGGSMDSNVRTSSGWFL 149

Query: 97  YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGL----GGHYDLHCDATPR 152
                G    + +I+ RI D +++ +   E     L + +Y +      HYD   D    
Sbjct: 150 NR---GQDKIIRRIEKRIADFSHIPVEHGE----GLHVLHYEVEQKYDAHYDYFSDTINV 202

Query: 153 DEGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVF 193
             G  R A+ + YL+DVE GG T+FP                      L+V P+ G A+ 
Sbjct: 203 KNGGQRGATMLMYLSDVEKGGETVFPQSKVNSSSVPWWDELSECGRSGLSVRPKMGDALL 262

Query: 194 WYNAHANTLLDYRMYHSGCPVALGNKWG 221
           +++   +  LD    H  CPV  GNKW 
Sbjct: 263 FWSVKPDASLDPSSLHGSCPVIQGNKWS 290


>gi|50845214|gb|AAT84604.1| prolyl 4-hydroxylase [Dianthus caryophyllus]
          Length = 316

 Score = 73.6 bits (179), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 65/243 (26%), Positives = 107/243 (44%), Gaps = 33/243 (13%)

Query: 3   YPLACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEIN 62
           +P +C G L+  +  KS L+   E+  ++ + + P  V +L   PR       +   E +
Sbjct: 20  HPSSC-GWLNNVKKGKSVLRLKSENVPSS-VGVDPSHVTQLSWKPRAFLYEGFLTHEECD 77

Query: 63  RIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNL 120
            +I+++K K+E+  V +   G +I  + R S   FL          +  I+ RI   T L
Sbjct: 78  HLIDMAKDKLEKSMVADNESGKSIPSEVRTSSGMFLQK---AQDDVVAAIEARIAAWTFL 134

Query: 121 VIGREERYKGPLQINNYGLGG----HYDLHCDATPRDEGLWRLASFMFYLTDVELGGATI 176
            I   E     +QI +Y  G     H+D   D   +  G  R+A+ + YL++VE GG T+
Sbjct: 135 PIENGE----AMQILHYERGQKYEPHFDYFHDKVNQQLGGHRIATVLMYLSNVEEGGETV 190

Query: 177 FPSLNL------------------TVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGN 218
           FP+                     +V P+KG A+ +++ H +   D    H  CPV  G 
Sbjct: 191 FPNAEAKLQLANNESLSDCAKGGYSVKPKKGDALLFFSLHPDASTDSLSLHGSCPVIEGE 250

Query: 219 KWG 221
           KW 
Sbjct: 251 KWS 253


>gi|195494572|ref|XP_002094895.1| GE22068 [Drosophila yakuba]
 gi|194180996|gb|EDW94607.1| GE22068 [Drosophila yakuba]
          Length = 438

 Score = 73.6 bits (179), Expect = 6e-11,   Method: Compositional matrix adjust.
 Identities = 54/200 (27%), Positives = 93/200 (46%), Gaps = 24/200 (12%)

Query: 21  LKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY 80
           L C Y  +   FLK+ PLK+EEL + P +   +  +   +I  +  +S+ K++R + ++ 
Sbjct: 256 LVCHYVDWT-PFLKLAPLKMEELSMKPHISIFYGFLGPKDIEVLKNVSRPKLQRNEHLSA 314

Query: 81  GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG 140
             +  +    S            H  + K+   I D+T    G   +    +++ NYG+ 
Sbjct: 315 NCSCKIGNLFS----------SSHDVVRKVNELILDIT----GFPSKGNEMVEVINYGIA 360

Query: 141 GHYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHAN 200
           G+Y+    A PR     +  +F+F L +   GG  +FPS +L + P KGS + W N   +
Sbjct: 361 GNYNPDDTAQPRKHN--KANAFIF-LGNAGKGGEIVFPSRDLKIRPRKGSMIVWENLKKS 417

Query: 201 TLLDYRMYHSGCPVALGNKW 220
            +     YH  CP+  GN W
Sbjct: 418 VI-----YHQ-CPILKGNLW 431


>gi|297812067|ref|XP_002873917.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
 gi|297319754|gb|EFH50176.1| oxidoreductase [Arabidopsis lyrata subsp. lyrata]
          Length = 298

 Score = 73.6 bits (179), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 57/213 (26%), Positives = 92/213 (43%), Gaps = 34/213 (15%)

Query: 35  IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
           I P KV+++   PR       + + E + ++ L+K  ++R  V +   G++ + + R S 
Sbjct: 32  INPSKVKQVSSKPRAFVYEGFLTELECDHMVSLAKASLKRSAVADNDSGESKFSEVRTSS 91

Query: 93  VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---- 148
             F+ P+  G  P +  I+ +I   T L     E     +Q+  Y  G  YD H D    
Sbjct: 92  GTFI-PK--GKDPIVSGIEDKISTWTFLPKENGED----IQVLRYEHGQKYDAHFDYFHD 144

Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN---------------------LTVFPE 187
                 G  R+A+ + YL++V  GG T+FP                        + V P 
Sbjct: 145 KVNIVRGGHRIATVLMYLSNVTKGGETVFPDAEVPSCRVLSENKEDLSDCAKRGIAVKPR 204

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           KG A+ ++N H + + D    H GCPV  G KW
Sbjct: 205 KGDALLFFNLHPDAIPDPLSLHGGCPVIEGEKW 237


>gi|124267278|ref|YP_001021282.1| hypothetical protein Mpe_A2091 [Methylibium petroleiphilum PM1]
 gi|124260053|gb|ABM95047.1| conserved hypothetical protein [Methylibium petroleiphilum PM1]
          Length = 289

 Score = 73.2 bits (178), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 60/196 (30%), Positives = 87/196 (44%), Gaps = 24/196 (12%)

Query: 38  LKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYF 95
           ++V     DPRV+     + D+E + I+ L+  ++ R   V+   G +     R S   F
Sbjct: 93  VRVVMAMRDPRVIVFSGLLSDAECDEIVALAGARLARSHTVDTATGASEVNAARTSDGMF 152

Query: 96  LYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD------- 148
                 G+HP   + + RI  + N  +   E  +G LQ+ +Y  G  Y  H D       
Sbjct: 153 F---TRGEHPVCARFEARIAALLNWPV---ENGEG-LQVLHYRPGAEYKPHYDYFDPDQP 205

Query: 149 ATPR--DEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWY--NAHANTLLD 204
            TP     G  R+A+ + YL     GG T FP + L V P KG AVF+     H +T   
Sbjct: 206 GTPAVLRRGGQRVATLVTYLNTPTRGGGTTFPDIGLEVTPLKGHAVFFSYDRPHPST--- 262

Query: 205 YRMYHSGCPVALGNKW 220
            R  H G PV  G+KW
Sbjct: 263 -RSLHGGAPVLEGDKW 277


>gi|383757171|ref|YP_005436156.1| putative prolyl 4-hydroxylase alpha subunit [Rubrivivax gelatinosus
           IL144]
 gi|381377840|dbj|BAL94657.1| putative prolyl 4-hydroxylase alpha subunit homologue
           oxidoreductase protein [Rubrivivax gelatinosus IL144]
          Length = 279

 Score = 73.2 bits (178), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 56/185 (30%), Positives = 84/185 (45%), Gaps = 20/185 (10%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           PRVV     + D E + ++ L++ ++ R + V+   G +     R S   F      G+ 
Sbjct: 92  PRVVVFGGLLSDEECDELVALARPRLARSETVDNSTGGSEVNAARTSDGMFFE---RGEK 148

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
           P + +I+ RI ++    +   ER +G LQ+  Y  G  Y  H D         A     G
Sbjct: 149 PLIERIERRIAELVRWPV---ERGEG-LQVLRYRPGAQYKPHHDFFDPAHPGTANILRRG 204

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+ + + YL     GGAT FP + L V P KG+AVF+  ++   L   R  H G PV 
Sbjct: 205 GQRVGTVVMYLNTPAGGGATTFPEVGLEVQPVKGNAVFF--SYERPLASTRTLHGGAPVL 262

Query: 216 LGNKW 220
            G KW
Sbjct: 263 DGEKW 267


>gi|303287328|ref|XP_003062953.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226455589|gb|EEH52892.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 259

 Score = 73.2 bits (178), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 59/214 (27%), Positives = 97/214 (45%), Gaps = 37/214 (17%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLY 97
           VE +   PR   +H+ + D+E + ++EL++ +V R  VV+   G++     R S+  FL 
Sbjct: 1   VEPISWHPRAFHLHNIMTDAECDEVLELARTRVRRSTVVDSTTGESKVDPIRTSEQCFLN 60

Query: 98  PEIFGDHPFLYKIQTRIQDMTNLVI-GREERYKGPLQINNYGLGGHYDLHCDATPRD--- 153
               G  P +  I+ R++  T L     E+    P ++  Y  G  YD H D    D   
Sbjct: 61  ---RGHFPIVSVIEKRLERYTMLPWYNGEDLQARPSRVLKYSNGQKYDAHHDVGELDTAS 117

Query: 154 ------EGLWRLASFMFYLTDVEL--GGATIFPSL-------------------NLTVFP 186
                 EG  R+A+ + YL+DV+   GG T FP                     ++ V P
Sbjct: 118 GKQLAAEGGHRVATVLLYLSDVDDDGGGETAFPDSEWIDPTADRGSGWSECAEDHVAVKP 177

Query: 187 EKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +KG  + +++     ++D +  H+GCPV LG  W
Sbjct: 178 KKGDGLLFWSITPEGVIDQQSMHAGCPV-LGKSW 210


>gi|215490183|dbj|BAG86625.1| type 2 proly 4-hydroxylase [Nicotiana tabacum]
          Length = 318

 Score = 73.2 bits (178), Expect = 8e-11,   Method: Compositional matrix adjust.
 Identities = 56/211 (26%), Positives = 88/211 (41%), Gaps = 31/211 (14%)

Query: 35  IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
           I P +V ++   PR     + + D E +  I L+K K+E+  V +   G ++  + R S 
Sbjct: 57  IDPTRVTQISWRPRAFVYRNFLTDEECDHFITLAKHKLEKSMVADNESGKSVESEVRTSS 116

Query: 93  VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCD 148
             F           +  ++ RI   T L     E     +QI +Y  G     H+D   D
Sbjct: 117 GMFFRK---AQDQVVANVEARIAAWTFL----PEENGESIQILHYEHGQKYEPHFDYFHD 169

Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLNL------------------TVFPEKGS 190
              ++ G  R+A+ + YL+DVE GG T+FP+                      V P KG 
Sbjct: 170 KVNQELGGHRVATVLMYLSDVEKGGETVFPNSEAKKTQAKGDDWSDCAKKGYAVKPRKGD 229

Query: 191 AVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
           A+ +++ H +   D    H  CPV  G KW 
Sbjct: 230 ALLFFSLHPDATTDPLSLHGSCPVIEGEKWS 260


>gi|224133600|ref|XP_002327635.1| predicted protein [Populus trichocarpa]
 gi|222836720|gb|EEE75113.1| predicted protein [Populus trichocarpa]
          Length = 291

 Score = 73.2 bits (178), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 59/206 (28%), Positives = 92/206 (44%), Gaps = 32/206 (15%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLY 97
            E +   PR    H+ +  +E   +I L+K ++++  VV+   G +     R S   FL 
Sbjct: 80  AEVISWKPRAFVYHNFLTKAECEYLINLAKPRMQKSTVVDSSTGKSKDSKVRTSSGTFL- 138

Query: 98  PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
           P   G    +  I+ RI D + + +   E     LQI +Y +G     H+D   D     
Sbjct: 139 PR--GRDKIVRDIEKRIADFSFIPVEHGE----GLQILHYEVGQRYEPHFDYFMDEYNTK 192

Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
            G  R+A+ + YL+DVE GG T+FPS                     L+V P+ G A+ +
Sbjct: 193 NGGQRIATVLMYLSDVEEGGETVFPSAEGNISAVPWWNELSECGKGGLSVKPKMGDALLF 252

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
           ++ + +   D    H GCPV  GNKW
Sbjct: 253 WSMNPDGSPDPSSLHGGCPVIRGNKW 278


>gi|308812133|ref|XP_003083374.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
           [Ostreococcus tauri]
 gi|116055254|emb|CAL57650.1| oxidoreductase, 2OG-Fe(II) oxygenase family protein (ISS)
           [Ostreococcus tauri]
          Length = 311

 Score = 73.2 bits (178), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 61/216 (28%), Positives = 95/216 (43%), Gaps = 43/216 (19%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLY 97
           +E++   PR     + + D+E +R+IE +   +E  +V +   G+    D R S   ++ 
Sbjct: 68  IEKISDSPRAYVFREFLTDAECDRVIERAYPTMEASEVTDDDSGEARPDDARSSIGGWVS 127

Query: 98  PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDE--- 154
            +   D   +  I+ R      L + R E     +Q+  Y  G  YD H D    DE   
Sbjct: 128 GD---DDEVIRNIELRASTWAMLPMNRGE----TMQVLRYEKGQKYDAHDDFF-HDEHNV 179

Query: 155 --GLWRLASFMFYLTDVELGGATIFP------------------------SLN----LTV 184
             G  R+A+ + YL+DVE GG T+FP                        S N    L V
Sbjct: 180 KNGGQRVATILMYLSDVEEGGETVFPLGTPLGGRDPEKSGVTGDNACELASQNDPRVLAV 239

Query: 185 FPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            P +G A+ ++NAH +  +D +  H+GCPV  G KW
Sbjct: 240 KPRRGDALLFFNAHLSGEMDEKANHAGCPVNRGTKW 275


>gi|255539064|ref|XP_002510597.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223551298|gb|EEF52784.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 289

 Score = 72.8 bits (177), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 55/205 (26%), Positives = 88/205 (42%), Gaps = 28/205 (13%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPE 99
            E +  +PR    H+ +   E   +I L+K  + +  VV+       D+R+     ++  
Sbjct: 78  TEIISWEPRAFVYHNFLSKEECEYLIALAKPHMVKSTVVDSKTGRSKDSRVRTSSGMFLR 137

Query: 100 IFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG----GHYDLHCDATPRDEG 155
             G    +  I+ RI D + + I   E     LQ+ +Y +G     HYD   D      G
Sbjct: 138 -RGRDKIIRNIEKRIADFSFIPIEHGE----GLQVLHYEVGQKYEAHYDYFLDEFNTKNG 192

Query: 156 LWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFWYN 196
             R A+ + YL+DVE GG T+FP+                     L+V P+ G+A+ +++
Sbjct: 193 GQRTATLLMYLSDVEEGGETVFPAAKANISNVPSWNELSECARQGLSVKPKMGNALLFWS 252

Query: 197 AHANTLLDYRMYHSGCPVALGNKWG 221
              +  LD    H  CPV  GNKW 
Sbjct: 253 TRPDATLDPASLHGSCPVIRGNKWS 277


>gi|224011205|ref|XP_002295377.1| predicted protein [Thalassiosira pseudonana CCMP1335]
 gi|209583408|gb|ACI64094.1| predicted protein [Thalassiosira pseudonana CCMP1335]
          Length = 207

 Score = 72.8 bits (177), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 56/188 (29%), Positives = 82/188 (43%), Gaps = 19/188 (10%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--------SKVYFLYP 98
           P VV I   + D E NR IEL   + ER     Y  T+ +D           +       
Sbjct: 9   PWVVAIEGFLSDEECNRFIELGGDRYERS--TEYASTMNLDGTFDSKESSGRTSTNTWCG 66

Query: 99  EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW- 157
           E   D P + K+  R++ +T +     E     LQ+  Y +G  Y+ H D +   EG   
Sbjct: 67  EGCRDDPIIKKVIERMESLTGIPYANFE----DLQLVRYEIGQRYEEHHDYSSSHEGTQY 122

Query: 158 --RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNA--HANTLLDYRMYHSGCP 213
             R+ +  FYL DVE GG T F  L+    P++G A+ W +    A  ++D   +H   P
Sbjct: 123 GPRILTVFFYLNDVEEGGGTQFDELDFVTEPKRGMALIWPSTTNEAPDVMDDWTWHEALP 182

Query: 214 VALGNKWG 221
           V  G K+G
Sbjct: 183 VTKGIKYG 190


>gi|260806885|ref|XP_002598314.1| hypothetical protein BRAFLDRAFT_204780 [Branchiostoma floridae]
 gi|229283586|gb|EEN54326.1| hypothetical protein BRAFLDRAFT_204780 [Branchiostoma floridae]
          Length = 282

 Score = 72.8 bits (177), Expect = 9e-11,   Method: Compositional matrix adjust.
 Identities = 37/93 (39%), Positives = 53/93 (56%), Gaps = 5/93 (5%)

Query: 133 QINNYGLGGHYDLHCDATPRD-----EGLWRLASFMFYLTDVELGGATIFPSLNLTVFPE 187
           Q+ NYGLGG Y+ H D    +         R+ +F+FYL++VE GGAT+F   N+ V   
Sbjct: 167 QVLNYGLGGQYEPHYDHLKEEVSRTLMAANRILTFLFYLSEVEAGGATVFTEANIAVPVV 226

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           K SAV + N +   +      H+GCPV +G+KW
Sbjct: 227 KNSAVLFENTNKALVRSRASVHAGCPVLIGSKW 259


>gi|359477455|ref|XP_002278454.2| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 1 [Vitis
           vinifera]
          Length = 296

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 61/230 (26%), Positives = 94/230 (40%), Gaps = 33/230 (14%)

Query: 17  IKSNLKCFYESYNNTF-LKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERG 75
           I S +  F  SY +     +   KV ++   PR       + + E + +I L+K +++R 
Sbjct: 13  ISSTILEFSSSYADAAGSNVSAAKVRQISWKPRAFVYEGFLSEEECDHLISLAKSELKRS 72

Query: 76  KVVNYGDTIYVDTRLSKVYFLYPEIFGD--HPFLYKIQTRIQDMTNLVIGREERYKGPLQ 133
            V    D +   +RLS+V        G    P +  I+ +I   T L     E     +Q
Sbjct: 73  AVA---DNVSGKSRLSEVRTSSGMFIGKGKDPIVAGIEDKIAAWTFLPKDNGED----MQ 125

Query: 134 INNYGLGGHYDLH----CDATPRDEGLWRLASFMFYLTDVELGGATIFPSLNLT------ 183
           +  Y  G  YD H     D      G  R+A+ + YL+DV  GG T+FP   ++      
Sbjct: 126 VLRYEPGQKYDAHYDYFVDKVNIARGGHRIATVLMYLSDVVKGGETVFPMAEVSSSTLPT 185

Query: 184 -------------VFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
                        V P KG A+ +++ H   + D    H GCPV  G KW
Sbjct: 186 NDDLSECARKGIAVKPRKGDALLFFSLHPTAIPDPMSLHGGCPVIEGEKW 235


>gi|195627276|gb|ACG35468.1| prolyl 4-hydroxylase [Zea mays]
          Length = 298

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 59/208 (28%), Positives = 91/208 (43%), Gaps = 31/208 (14%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
           P +V +L   PR       + D+E + +I L+K K+E+  V +   G ++  + R S   
Sbjct: 32  PSRVVQLSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSEVRTSSGM 91

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
           FL  +       + +I+ RI   T L     E     +QI +Y  G     HYD   D  
Sbjct: 92  FLEKK---QDEVVTRIEERISAWTFLPPENGE----AIQILHYQNGEKYEPHYDYFHDKN 144

Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAV 192
            +  G  R+A+ + YL++VE GG TIFP+                      V P KG A+
Sbjct: 145 NQALGGHRIATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDAL 204

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
            +++ H ++  D    H  CPV  G KW
Sbjct: 205 LFFSLHPDSTTDSDSLHGSCPVIEGQKW 232


>gi|388567209|ref|ZP_10153646.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
 gi|388265592|gb|EIK91145.1| procollagen-proline dioxygenase [Hydrogenophaga sp. PBC]
          Length = 296

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 60/188 (31%), Positives = 84/188 (44%), Gaps = 26/188 (13%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY---GDTIYVDTRLSKVYFLYPEIFGD 103
           PRVV + + +   E + IIE +K K+ R   V     G+ +  D   S ++F      G 
Sbjct: 109 PRVVVLGNLLSAEECDAIIESAKPKLARSLTVQTATGGEELNADRTSSGMFFTR----GQ 164

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-------ATPR--DE 154
            P +  ++ RI  +    +   E  +G LQ+ +Y  G  Y  H D        TP     
Sbjct: 165 TPEVTAVERRIARLVGWPV---ENGEG-LQVLHYRPGAEYKPHYDYFDPKEAGTPTILKR 220

Query: 155 GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWY--NAHANTLLDYRMYHSGC 212
           G  R+A+ + YL +   GG T FP + L V P KGSAVF+     H  T    R  H G 
Sbjct: 221 GGQRVATLVMYLNEPARGGGTTFPDVGLEVAPVKGSAVFFSYDRPHPTT----RSLHGGA 276

Query: 213 PVALGNKW 220
           PV  G KW
Sbjct: 277 PVLEGEKW 284


>gi|159464219|ref|XP_001690339.1| hypothetical protein CHLREDRAFT_114525 [Chlamydomonas reinhardtii]
 gi|158279839|gb|EDP05598.1| predicted protein [Chlamydomonas reinhardtii]
          Length = 244

 Score = 72.8 bits (177), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 54/194 (27%), Positives = 89/194 (45%), Gaps = 28/194 (14%)

Query: 48  RVVKIHDAIYDSEINRIIELSKGKVER-GKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPF 106
           R+  I   + D E + I+++S+ ++ER G V   G +     R S   FL     G+ P 
Sbjct: 1   RIFLIEHFLTDEEADHIVQVSERRLERSGVVATNGGSEESQIRTSFGVFLE---RGEDPV 57

Query: 107 LYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLW----RLASF 162
           +  ++ RI  +T + +G  E     LQ+  Y     YD H D     +G+     R A+ 
Sbjct: 58  VKGVEERISALTLMPVGNGE----GLQVLRYQKEQKYDAHWDYFFHKDGIANGGNRYATV 113

Query: 163 MFYLTDVELGGATIFPSL----------------NLTVFPEKGSAVFWYNAHANTLLDYR 206
           + YL D E GG T+FP++                +L   P+KG+A+ +++      L+ +
Sbjct: 114 LMYLVDTEEGGETVFPNIAAPGGENVGFSECARYHLAAKPKKGTAILFHSIKPTGELERK 173

Query: 207 MYHSGCPVALGNKW 220
             H+ CPV  G KW
Sbjct: 174 SLHTACPVIKGIKW 187


>gi|221482398|gb|EEE20746.1| prolyl 4-hydroxylase alpha subunit, putative [Toxoplasma gondii
           GT1]
 gi|221504447|gb|EEE30120.1| prolyl 4-hydroxylase alpha subunit, putative [Toxoplasma gondii
           VEG]
          Length = 401

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 52/189 (27%), Positives = 90/189 (47%), Gaps = 16/189 (8%)

Query: 38  LKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTI----YVDTRL-SK 92
           +++  ++ +P V  I + + DS+  R+++L +G+ ER K      T     Y  ++  S+
Sbjct: 203 IQILAIHENPEVFLIPELLTDSDCERLLQLCEGRWERSKTSTGYATAEPRDYTSSKSPSR 262

Query: 93  VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPR 152
             +  P    +     +I   I+ + +   G    +  PL +  Y  G ++ LH D    
Sbjct: 263 TSWSVPLAIAE----TEIVENIERIVSAFAGMPVEHLEPLVVVRYEEGQYFKLHSD---- 314

Query: 153 DEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT-LLDYRMYHSG 211
             G +R  + + YL DVE GG T F +L   V P KG+ V W N++  T  +D R+ H+G
Sbjct: 315 --GGFRPKTILLYLNDVEAGGETSFENLGFRVAPMKGAGVLWNNSYPGTNEIDPRLIHAG 372

Query: 212 CPVALGNKW 220
            P   G K+
Sbjct: 373 LPPEKGVKF 381


>gi|15077349|gb|AAK83137.1| prolyl 4-hydroxylase alpha subunit [Cavia porcellus]
          Length = 141

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 48/148 (32%), Positives = 77/148 (52%), Gaps = 11/148 (7%)

Query: 3   YPLACQGN-LSVPEDIKSNLKCFYESYN-NTFLKIGPLKVEELYLDPRVVKIHDAIYDSE 60
           Y + C+G  + +    +  L C Y   N N    + P K E+ +  PR+++ HD I D+E
Sbjct: 1   YEMLCRGEGIKMTPRRQKKLFCRYHDGNRNPKFILAPAKQEDEWDKPRIIRFHDIISDAE 60

Query: 61  INRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMT 118
           I  + +L+K ++ R  + N   GD   V  R+SK  +L      ++P + +I  RIQD+T
Sbjct: 61  IEIVKDLAKPRLRRATISNPITGDLETVHYRISKSAWLS---GYENPVVSRINMRIQDLT 117

Query: 119 NLVIGREERYKGPLQINNYGLGGHYDLH 146
            L +   E     LQ+ NYG+GG Y+ H
Sbjct: 118 GLDVSTAEE----LQVANYGVGGQYEPH 141


>gi|237841319|ref|XP_002369957.1| 2OG-Fe(II) oxygenase family protein, putative [Toxoplasma gondii
           ME49]
 gi|211967621|gb|EEB02817.1| 2OG-Fe(II) oxygenase family protein, putative [Toxoplasma gondii
           ME49]
          Length = 401

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 52/189 (27%), Positives = 90/189 (47%), Gaps = 16/189 (8%)

Query: 38  LKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTI----YVDTRL-SK 92
           +++  ++ +P V  I + + DS+  R+++L +G+ ER K      T     Y  ++  S+
Sbjct: 203 IQILAIHENPEVFLIPELLTDSDCERLLQLCEGRWERSKTSTGYATAEPRDYTSSKSPSR 262

Query: 93  VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPR 152
             +  P    +     +I   I+ + +   G    +  PL +  Y  G ++ LH D    
Sbjct: 263 TSWSVPLAIAE----TEIVENIERIVSAFAGMPVEHLEPLVVVRYEEGQYFKLHSD---- 314

Query: 153 DEGLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANT-LLDYRMYHSG 211
             G +R  + + YL DVE GG T F +L   V P KG+ V W N++  T  +D R+ H+G
Sbjct: 315 --GGFRPKTILLYLNDVEAGGETSFENLGFRVAPMKGAGVLWNNSYPGTNEIDPRLIHAG 372

Query: 212 CPVALGNKW 220
            P   G K+
Sbjct: 373 LPPEKGVKF 381


>gi|357417854|ref|YP_004930874.1| procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
 gi|355335432|gb|AER56833.1| Procollagen-proline dioxygenase [Pseudoxanthomonas spadix BD-a59]
          Length = 283

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 62/208 (29%), Positives = 94/208 (45%), Gaps = 22/208 (10%)

Query: 26  ESYNNTFLKIGPLKVEELY--LDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YG 81
           E      L+ G  +V+ L   L PRV+   + +   E + +I L++ +++R  V +   G
Sbjct: 73  ERNGPALLQAGDRQVQVLASLLHPRVIVFGNLLAAEECDALIALARRQIKRSPVFDPDTG 132

Query: 82  DTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG 141
                  R S+  F      G +P   +++ RI  + N  +   E  +G LQ+  YG G 
Sbjct: 133 QDQQHQARTSEGMFFG---RGANPLCARVEARIAALLNWPL---ENGEG-LQVLRYGPGA 185

Query: 142 HYDLHCD----ATPRDE-----GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAV 192
            Y+ H D    A P  E     G  R+AS + YL     GGAT FP  +L V P KG+AV
Sbjct: 186 QYEPHYDYFDPARPGAEVALRRGGQRVASLVIYLNTPTQGGATTFPDAHLEVAPIKGNAV 245

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
           ++     + +      H G PV  G KW
Sbjct: 246 YFSYDRPHPMTG--TLHGGAPVVEGEKW 271


>gi|242047772|ref|XP_002461632.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
 gi|241925009|gb|EER98153.1| hypothetical protein SORBIDRAFT_02g005750 [Sorghum bicolor]
          Length = 307

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 56/206 (27%), Positives = 89/206 (43%), Gaps = 31/206 (15%)

Query: 39  KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFL 96
           +V+ +   PR+      + D+E + ++ L+K K++R  V +   G ++  + R S   FL
Sbjct: 43  RVKAVSWQPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNQSGKSVMSEVRTSSGMFL 102

Query: 97  YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPR 152
                   P + +I+ RI   T L     E     +QI  Y  G     H+D   D   +
Sbjct: 103 NKR---QDPVVSRIEERIAAWTFLPQENAEN----MQILRYEHGQKYEPHFDYFHDKINQ 155

Query: 153 DEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAVFW 194
             G  R A+ + YL+ V+ GG T+FP+                    L V P KG AV +
Sbjct: 156 VRGGHRYATVLMYLSTVDKGGETVFPNAKGWESQPKDDTFSECAHQGLAVKPVKGDAVLF 215

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
           ++ H + + D    H  CPV  G KW
Sbjct: 216 FSLHVDGVPDPLSLHGSCPVIQGEKW 241


>gi|226495689|ref|NP_001149322.1| LOC100282945 precursor [Zea mays]
 gi|194697650|gb|ACF82909.1| unknown [Zea mays]
 gi|194708468|gb|ACF88318.1| unknown [Zea mays]
 gi|195626376|gb|ACG35018.1| oxidoreductase [Zea mays]
 gi|347978842|gb|AEP37763.1| prolyl 4-hydroxylase 9 [Zea mays]
 gi|413945802|gb|AFW78451.1| oxidoreductase [Zea mays]
          Length = 308

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 58/207 (28%), Positives = 84/207 (40%), Gaps = 30/207 (14%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
           P    ++   PRV      + D E N +I L++ +++R  V +   G +   + R S   
Sbjct: 48  PHHSRQISCKPRVFLYQHFLSDDEANHLISLARAELKRSAVADNMSGKSTLSEVRTSSGT 107

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG----GHYDLHCDAT 150
           FL     G  P +  I+ +I   T L     E     +Q+  Y  G     HYD   D  
Sbjct: 108 FLRK---GQDPIVEGIEDKIAAWTFLPKENGED----IQVLRYKHGEKYEPHYDYFTDNV 160

Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFP-----------------SLNLTVFPEKGSAVF 193
               G  R A+ + YLTDV  GG T+FP                    + V P KG A+ 
Sbjct: 161 NTVRGGHRYATVLLYLTDVPEGGETVFPLAEEPDDAKDATLSECAQKGIAVRPRKGDALL 220

Query: 194 WYNAHANTLLDYRMYHSGCPVALGNKW 220
           ++N + +   D    H GCPV  G KW
Sbjct: 221 FFNLNPDGTTDSVSLHGGCPVIKGEKW 247


>gi|255552788|ref|XP_002517437.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223543448|gb|EEF44979.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 311

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 59/211 (27%), Positives = 91/211 (43%), Gaps = 37/211 (17%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
           P +V +L   PR       +   E + +I+L++ K+E+  V +   G +I  + R S   
Sbjct: 46  PTRVTQLSWHPRAFLYKGFLSYEECDHLIDLARDKLEKSMVADNESGKSIESEVRTSSGM 105

Query: 95  FLYP---EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHC 147
           F+     EI  D      I+ RI   T L     E     +QI +Y  G     H+D   
Sbjct: 106 FIAKAQDEIVAD------IEARIAAWTFL----PEENGESMQILHYEHGQKYEPHFDYFH 155

Query: 148 DATPRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKG 189
           D   ++ G  R+A+ + YL++VE GG T+FP+                      V PEKG
Sbjct: 156 DKANQELGGHRVATVLMYLSNVEKGGETVFPNAEGKLSQPKEDSWSDCAKGGYAVKPEKG 215

Query: 190 SAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            A+ +++ H +   D    H  CPV  G KW
Sbjct: 216 DALLFFSLHPDATTDSDSLHGSCPVIEGEKW 246


>gi|78046308|ref|YP_362483.1| 2OG-Fe(II) oxygenase [Xanthomonas campestris pv. vesicatoria str.
           85-10]
 gi|78034738|emb|CAJ22383.1| putative 2OG-Fe(II) oxygenase superfamily protein [Xanthomonas
           campestris pv. vesicatoria str. 85-10]
          Length = 296

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 61/214 (28%), Positives = 93/214 (43%), Gaps = 22/214 (10%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELY--LDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
            +    +  + + L +G   V  L   L PRVV +   + D E + +I L++ ++ R + 
Sbjct: 77  RVPALQQDADASLLALGDRDVRVLVSLLLPRVVVLGGFLSDEECDALIALARPRLARSRT 136

Query: 78  VNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQIN 135
           V+   G+ +    R S    L     G      +I+ RI  + +  +   E     LQ+ 
Sbjct: 137 VDNANGEHVVHAARTSDSMCLR---LGQDALCQRIEARIARLLDWPVDHGEG----LQVL 189

Query: 136 NYGLGGHYDLHCD-------ATPR--DEGLWRLASFMFYLTDVELGGATIFPSLNLTVFP 186
            Y  G  Y  H D        TP     G  R+AS + YL   E GGAT FP  +L V  
Sbjct: 190 RYATGAEYRPHYDYFDPDAAGTPVLVQAGGQRVASLVMYLNTPERGGATRFPDAHLDVAA 249

Query: 187 EKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            KG+AVF+     + +   R  H+G PV  G+KW
Sbjct: 250 VKGNAVFFSYDRPHPM--TRSLHAGAPVLAGDKW 281


>gi|363807814|ref|NP_001242181.1| uncharacterized protein LOC100782154 [Glycine max]
 gi|255644463|gb|ACU22735.1| unknown [Glycine max]
          Length = 285

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 63/238 (26%), Positives = 102/238 (42%), Gaps = 38/238 (15%)

Query: 11  LSVPEDIKSNLKCFY--ESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELS 68
           LS P D+ S  +  +  E  NN   +     VE +  +PR    H+ +   E   +I  +
Sbjct: 56  LSKPNDLNSVPRNTHVSEGENNRVKRW----VEVMSWEPRAFLYHNFLTKEECEYLINTA 111

Query: 69  KGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREE 126
              + +  V++   G+ I    R S  Y +     G    +  I+ RI D+T + I   E
Sbjct: 112 TPNMLKSLVIDNESGEGIETSYRTSTEYVVE---RGKDKIVRNIEKRIADVTFIPIEHGE 168

Query: 127 RYKGPLQINNYGLGGHYDLHCDATPRD----EGLWRLASFMFYLTDVELGGATIFPSLN- 181
               PL +  Y +G +Y+ H D    +     G  R+A+ + YL++VE GG T+FP  N 
Sbjct: 169 ----PLHVIRYAVGQYYEPHVDYFEEEFSLVNGGQRIATMLMYLSNVEGGGETVFPIANA 224

Query: 182 ------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
                             L++ P+ G A+ +++   +  LD    H  CPV  GNKW 
Sbjct: 225 NFSSVPWWNELSECGQTGLSIKPKMGDALLFWSMKPDATLDPLTLHRACPVIKGNKWS 282


>gi|66820122|ref|XP_643703.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
 gi|60471803|gb|EAL69758.1| hypothetical protein DDB_G0275385 [Dictyostelium discoideum AX4]
          Length = 221

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 58/193 (30%), Positives = 83/193 (43%), Gaps = 20/193 (10%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFL 96
           P+K+ EL   PR+ +I   + D E   +I+ SK K+     ++ G         S     
Sbjct: 22  PVKLIELSQAPRIYRIPGFLTDEECEFLIDTSKNKLRPCNEISSG------VHRSGWGLF 75

Query: 97  YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG----GHYDLHCDATPR 152
             E   DH     I  +++   N+    E      +Q+  Y  G     H+D     T  
Sbjct: 76  MKEGEEDHQITKNIFNKMKSFVNISESCE-----VMQVIRYNQGEETSSHFDYFNPLTTN 130

Query: 153 DE---GLW--RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRM 207
                GL+  R+ + + YL DVE GG T FP + + V P KG AV +YN   N  +D   
Sbjct: 131 GSMKIGLYGQRVCTILMYLCDVEEGGETTFPEVGIKVKPIKGDAVLFYNCKPNGDVDPLS 190

Query: 208 YHSGCPVALGNKW 220
            H G PV  GNKW
Sbjct: 191 LHQGDPVLKGNKW 203


>gi|330821584|ref|YP_004350446.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           gladioli BSR3]
 gi|327373579|gb|AEA64934.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           gladioli BSR3]
          Length = 302

 Score = 72.4 bits (176), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 50/185 (27%), Positives = 82/185 (44%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           P  V +   +   E  ++IEL++ ++ R  VV+   G  I    R S   F      G+ 
Sbjct: 102 PAAVLLDGFLSAGECRQLIELARPRLNRSTVVDPVTGRNIVAGHRSSDGMFFR---LGET 158

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
           P + +I+ RI  +T   +   E  +G LQ+ +Y  G     H D         A      
Sbjct: 159 PLISRIEQRIAALTGFPV---ENGEG-LQMLHYEAGAESTPHVDYLVPGNPANAESIARS 214

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+ + + YL DVE GG T+FP +  +V P +G A ++   + +   D    H+  P+ 
Sbjct: 215 GQRVGTLLMYLNDVESGGETLFPQVGCSVVPRRGQAFYFEYGNGSGRSDPASLHASSPIG 274

Query: 216 LGNKW 220
            G+KW
Sbjct: 275 SGDKW 279


>gi|47210159|emb|CAF93191.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 78

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 29/55 (52%), Positives = 38/55 (69%)

Query: 166 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           ++DVE GGAT+FP     ++P KG+AVFWYN   +   DYR  H+ CPV +GNKW
Sbjct: 1   MSDVEAGGATVFPDFGAAIWPRKGTAVFWYNLFRSGEGDYRTRHAACPVLVGNKW 55


>gi|194751833|ref|XP_001958228.1| GF10815 [Drosophila ananassae]
 gi|190625510|gb|EDV41034.1| GF10815 [Drosophila ananassae]
          Length = 273

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 33/64 (51%), Positives = 42/64 (65%)

Query: 161 SFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           S +  L+DVE GG T+FP LNL V  +KGS + WYN  +N   D R+ H+ CPV +GNKW
Sbjct: 207 SLLKNLSDVEQGGDTVFPHLNLKVPAQKGSLMVWYNLLSNGTTDSRVLHASCPVLMGNKW 266

Query: 221 GKLL 224
            K L
Sbjct: 267 SKYL 270



 Score = 42.4 bits (98), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 41/66 (62%), Gaps = 8/66 (12%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN 79
           +LKC Y    + FLK+ P+K+E+++LDP +   HD I + EI+ +  LS   VE+G    
Sbjct: 166 HLKCQYLK-ASPFLKLAPIKMEKVFLDPPMSIYHDLINEKEISLLKNLS--DVEQG---- 218

Query: 80  YGDTIY 85
            GDT++
Sbjct: 219 -GDTVF 223


>gi|328876967|gb|EGG25330.1| putative prolyl 4-hydroxylase alpha subunit [Dictyostelium
           fasciculatum]
          Length = 244

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 59/196 (30%), Positives = 88/196 (44%), Gaps = 20/196 (10%)

Query: 39  KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYP 98
           K+ E+   PRV ++ D +  +E   +I++SK K+     ++ G         S       
Sbjct: 25  KLIEMSQCPRVYRVPDFLSPAECEHLIDISKNKLRPCNEISSG------VHRSGWGLFMK 78

Query: 99  EIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG----GHYDLHCDATPRDE 154
           E   DH  + KI  R++ + NL    E      +Q+  Y  G     HYD     T    
Sbjct: 79  EGEEDHDVVKKIFQRMKMLVNLTENCEV-----MQVIRYHPGEETSAHYDYFNPLTTNGA 133

Query: 155 ---GLW--RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYH 209
              GL+  R+ + + YL++VE GG T FP + + V P KG AV +YN   N  +D    H
Sbjct: 134 MKIGLYGQRVCTILMYLSEVEEGGETSFPEVGVKVKPVKGDAVLFYNCKPNGEVDPLSLH 193

Query: 210 SGCPVALGNKWGKLLL 225
            G PV  G KW  + L
Sbjct: 194 QGDPVIKGTKWVAIKL 209


>gi|363543295|ref|NP_001241863.1| prolyl 4-hydroxylase 4 precursor [Zea mays]
 gi|347978806|gb|AEP37745.1| prolyl 4-hydroxylase 4 [Zea mays]
 gi|414591890|tpg|DAA42461.1| TPA: hypothetical protein ZEAMMB73_637248 [Zea mays]
          Length = 274

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 56/206 (27%), Positives = 88/206 (42%), Gaps = 31/206 (15%)

Query: 39  KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFL 96
           +V+ +   PR+      + D+E + ++ L+K K++R  V +   G ++  + R S   FL
Sbjct: 44  RVKAVSWHPRIFVYKGFLSDAECDHLVTLAKKKIQRSMVADNESGKSVKSEVRTSSGMFL 103

Query: 97  YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPR 152
                   P + +I+ RI   T L     E     +Q+  Y  G     H+D   D   +
Sbjct: 104 DKR---QDPVVSRIEERIAAWTFLPQENAEN----MQVLRYEPGQKYEPHFDYFHDRVNQ 156

Query: 153 DEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAVFW 194
             G  R A+ + YL+ V  GG T+FP+                    L V P KG AV +
Sbjct: 157 ARGGHRYATVLMYLSTVREGGETVFPNAKGWESQPKDATFSECAHKGLAVKPVKGDAVLF 216

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
           ++ HA+   D    H  CPV  G KW
Sbjct: 217 FSLHADGTPDPLSLHGSCPVIRGEKW 242


>gi|160900716|ref|YP_001566298.1| procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
 gi|160366300|gb|ABX37913.1| Procollagen-proline dioxygenase [Delftia acidovorans SPH-1]
          Length = 294

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 63/232 (27%), Positives = 105/232 (45%), Gaps = 26/232 (11%)

Query: 5   LACQGNLSVPEDI--KSNLKCFYESYNNTFLKIGPLKVEEL--YLDPRVVKIHDAIYDSE 60
           L  QG +S P  +   S+L     + + + + +G  +V+ L    +PR+V   + +   E
Sbjct: 61  LPEQGEVSPPAVVVSASSLPEPDLAQDPSSIDVGDRQVQVLVSMRNPRIVVFGNLLSHEE 120

Query: 61  INRIIELSKGKVERGKVV---NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
            + II  ++ ++ R   V   + G+ I  D   + ++F      G+   + +++ RI  +
Sbjct: 121 CDAIIAAARPRMARSLTVATQSGGEEINDDRTSNGMFFQR----GETGIVSQLEERIARL 176

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCD-------ATPR--DEGLWRLASFMFYLTD 168
               +   E     LQ+ +YG G  Y  H D        TP     G  R+ + + YL +
Sbjct: 177 LRWPLDHGEG----LQVLHYGPGAEYKPHHDYFAPGEPGTPTILKRGGQRVGTLVIYLNE 232

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            E GGATIFP + L V P +G+AVF+     +     R  H G PV  G KW
Sbjct: 233 PERGGATIFPEVPLQVVPRRGNAVFFSYERPDP--STRTLHGGAPVLAGEKW 282


>gi|91779740|ref|YP_554948.1| procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
 gi|91692400|gb|ABE35598.1| Procollagen-proline,2-oxoglutarate-4- dioxygenase [Burkholderia
           xenovorans LB400]
          Length = 296

 Score = 72.0 bits (175), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 50/185 (27%), Positives = 85/185 (45%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           P  V + D +  +E  ++I L++ ++ R  VV+   G  +    R S   F      G+ 
Sbjct: 102 PAAVLLDDFLSANECEQLIALARPRLSRSTVVDPVTGRNVVAGHRSSDGMFFR---LGET 158

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD----ATPRDE-----G 155
           P + +++ RI ++T L +   E  +G LQ+ +Y  G     H D      P +       
Sbjct: 159 PLIARLEARIAELTGLPV---ENGEG-LQLLHYEAGAESTPHVDYLIAGNPANRESIARS 214

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+ + + YL DVE GG T+FP    +V P +G A+++   +   L D    H+  P+ 
Sbjct: 215 GQRVGTLLMYLNDVEGGGETMFPQTGWSVVPRRGQALYFEYGNRFGLADPSSLHTSTPLR 274

Query: 216 LGNKW 220
            G KW
Sbjct: 275 AGEKW 279


>gi|359477453|ref|XP_003631980.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 isoform 2 [Vitis
           vinifera]
 gi|297736941|emb|CBI26142.3| unnamed protein product [Vitis vinifera]
          Length = 298

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 61/232 (26%), Positives = 93/232 (40%), Gaps = 35/232 (15%)

Query: 17  IKSNLKCFYESYNNTF-LKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERG 75
           I S +  F  SY +     +   KV ++   PR       + + E + +I L+K +++R 
Sbjct: 13  ISSTILEFSSSYADAAGSNVSAAKVRQISWKPRAFVYEGFLSEEECDHLISLAKSELKRS 72

Query: 76  KVVNYGDTIYVDTRLSKVYFLYPEIFGD--HPFLYKIQTRIQDMTNLVIGREERYKGPLQ 133
            V    D +   +RLS+V        G    P +  I+ +I   T L     E     +Q
Sbjct: 73  AVA---DNVSGKSRLSEVRTSSGMFIGKGKDPIVAGIEDKIAAWTFLPKDNGED----MQ 125

Query: 134 INNYGLGGHYDLH----CDATPRDEGLWRLASFMFYLTDVELGGATIFPSLN-------- 181
           +  Y  G  YD H     D      G  R+A+ + YL+DV  GG T+FP           
Sbjct: 126 VLRYEPGQKYDAHYDYFVDKVNIARGGHRIATVLMYLSDVVKGGETVFPMAEEPSRRKPL 185

Query: 182 -------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
                        + V P KG A+ +++ H   + D    H GCPV  G KW
Sbjct: 186 PTNDDLSECARKGIAVKPRKGDALLFFSLHPTAIPDPMSLHGGCPVIEGEKW 237


>gi|293337056|ref|NP_001169835.1| uncharacterized protein LOC100383727 precursor [Zea mays]
 gi|224031897|gb|ACN35024.1| unknown [Zea mays]
 gi|347978800|gb|AEP37742.1| prolyl 4-hydroxylase 2 [Zea mays]
 gi|414871435|tpg|DAA49992.1| TPA: hypothetical protein ZEAMMB73_500506 [Zea mays]
          Length = 299

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 59/208 (28%), Positives = 90/208 (43%), Gaps = 31/208 (14%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
           P +V +L   PR       + D+E + +I L+K K+E+  V +   G ++  + R S   
Sbjct: 33  PSRVVQLSWRPRAFLHKGFLSDAECDHLIALAKDKLEKSMVADNESGKSVQSEVRTSSGM 92

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
           FL  +       + +I+ RI   T L     E     +QI +Y  G     HYD   D  
Sbjct: 93  FLERK---QDEVVTRIEERISAWTFLPPENGES----IQILHYQNGEKYEPHYDYFHDKK 145

Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAV 192
            +  G  R+A+ + YL++VE GG TIFP+                      V P KG A+
Sbjct: 146 NQALGGHRIATVLMYLSNVEKGGETIFPNAEGKLLQPKDNTWSDCARNGYAVKPVKGDAL 205

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
            +++ H +   D    H  CPV  G KW
Sbjct: 206 LFFSLHPDATTDSDSLHGSCPVIEGQKW 233


>gi|171059332|ref|YP_001791681.1| procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
 gi|170776777|gb|ACB34916.1| Procollagen-proline dioxygenase [Leptothrix cholodnii SP-6]
          Length = 287

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 57/188 (30%), Positives = 89/188 (47%), Gaps = 24/188 (12%)

Query: 46  DPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGD 103
           DPRVV     +   E + ++ L++ ++ R + V+   G +   + R S+  F    + G+
Sbjct: 99  DPRVVVFGGFLSHDECDALVALAQPRLARSETVDNDTGGSEVNEARTSQGMFF---MRGE 155

Query: 104 HPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-------ATPR--DE 154
              + +I+ RI  + +  +   E  +G +Q+ +Y  G  Y  H D        TP     
Sbjct: 156 GELISRIEARIAALLDWPL---ENGEG-VQVLHYRPGAEYKPHYDYFDPAQPGTPTILKR 211

Query: 155 GLWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVF--WYNAHANTLLDYRMYHSGC 212
           G  R+ + + YL   E GG T FP +NL V P KG+AVF  +  AH +T    R  H G 
Sbjct: 212 GGQRVGTLVMYLNTPERGGGTTFPDVNLEVAPIKGNAVFFSYERAHPST----RSLHGGA 267

Query: 213 PVALGNKW 220
           PV  G KW
Sbjct: 268 PVLAGEKW 275


>gi|443686890|gb|ELT90009.1| hypothetical protein CAPTEDRAFT_129682, partial [Capitella teleta]
          Length = 93

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 36/86 (41%), Positives = 48/86 (55%), Gaps = 7/86 (8%)

Query: 133 QINNYGLGGHYDLHCDATPRDE-------GLWRLASFMFYLTDVELGGATIFPSLNLTVF 185
           Q +NYG+GGHY+ H D   R E          R+A+FM Y+  V  GGAT+FP + L   
Sbjct: 1   QTSNYGIGGHYEPHYDHDERSEVAPEVALSGDRIATFMIYMNHVNAGGATVFPKIGLYAK 60

Query: 186 PEKGSAVFWYNAHANTLLDYRMYHSG 211
           PEK +A+FWYN   +   D    H+G
Sbjct: 61  PEKNAAIFWYNYKKSGESDANTLHAG 86


>gi|255579590|ref|XP_002530636.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
 gi|223529809|gb|EEF31744.1| prolyl 4-hydroxylase alpha subunit, putative [Ricinus communis]
          Length = 287

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 57/206 (27%), Positives = 90/206 (43%), Gaps = 32/206 (15%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRL--SKVYFLY 97
            E +  +PR    H+ +   E   +I L+K  +++  VV+       D+R+  S   FL 
Sbjct: 76  AEVISWEPRAFVYHNFLTKEECEYLINLAKPNMQKSTVVDSETGRSKDSRVRTSSGTFLS 135

Query: 98  PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
               G    +  I+ RI D + + +   E     LQ+ +Y +G     H+D   D     
Sbjct: 136 R---GRDKKIRDIEKRIADFSFIPVEHGE----GLQVLHYEVGQKYEPHFDYFNDEFNTK 188

Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
            G  R+A+ + YL+DVE GG T+FP+                     L+V P  G A+ +
Sbjct: 189 NGGQRVATLLMYLSDVEEGGETVFPAAKGNFSAVPWWNELSECGKKGLSVKPNMGDALLF 248

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
           ++   +  LD    H GCPV  GNKW
Sbjct: 249 WSMKPDATLDPSSLHGGCPVINGNKW 274


>gi|357517897|ref|XP_003629237.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|355523259|gb|AET03713.1| Prolyl 4-hydroxylase alpha-2 subunit [Medicago truncatula]
 gi|388513409|gb|AFK44766.1| unknown [Medicago truncatula]
 gi|388516345|gb|AFK46234.1| unknown [Medicago truncatula]
          Length = 275

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 53/207 (25%), Positives = 92/207 (44%), Gaps = 32/207 (15%)

Query: 40  VEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLY 97
           V+ +  +PR    H+ +   E   +I ++K  + + +V++   G ++    R S   FL 
Sbjct: 66  VQIISWEPRAFLYHNFLTKEECEHLINIAKPSMHKSEVIDEKTGKSLNSSIRTSSGTFLD 125

Query: 98  PEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPRD 153
            E  GD   +  I+ RI D T + +   E +     + +Y +G     HYD   D     
Sbjct: 126 RE--GDE-IVSNIEKRIADFTFIPVEHGESF----NVLHYEVGQKYEPHYDYFLDTFSTR 178

Query: 154 EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEKGSAVFW 194
               R+A+ + YL+DVE GG T+FP+                     L++ P+ G+A+ +
Sbjct: 179 HAGQRIATMLMYLSDVEEGGETVFPNAKGNFSSVPWWNELSDCGKGGLSIKPKMGNAILF 238

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKWG 221
           ++   +  LD    H  CPV  G+KW 
Sbjct: 239 WSMKPDATLDPSSLHGACPVIKGDKWS 265


>gi|357140446|ref|XP_003571778.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Brachypodium
           distachyon]
          Length = 298

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 59/208 (28%), Positives = 89/208 (42%), Gaps = 31/208 (14%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
           P +V +L   PR       + + E + +IEL+K K+E+  V +   G ++  + R S   
Sbjct: 32  PSRVVQLSWRPRAFLHKGFLSEPECDHMIELAKDKLEKSMVADNESGKSVQSEVRTSSGM 91

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
           FL          + +I+ RI   T L     E     +QI +Y  G     HYD   D  
Sbjct: 92  FLEKR---QDEVVARIEERIAAWTFLPSENGES----IQILHYKNGEKYEPHYDYFHDKN 144

Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAV 192
            +  G  R+A+ + YL++VE GG TIFP+                      V P KG A+
Sbjct: 145 NQALGGHRIATVLMYLSNVEKGGETIFPNAEGKLTQHKDETASECAKNGYAVKPMKGDAL 204

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
            +++ H +   D    H  CPV  G KW
Sbjct: 205 LFFSLHPDATTDPDSLHGSCPVIEGQKW 232


>gi|110289076|gb|ABB47602.2| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
          Length = 309

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 59/209 (28%), Positives = 88/209 (42%), Gaps = 32/209 (15%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
           P +V +L   PR       + D+E   +I L+K K+E+  V +   G ++  + R S   
Sbjct: 42  PSRVVQLSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSEVRTSSGM 101

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
           FL  +       + +I+ RI   T L     E     +QI +Y  G     HYD   D  
Sbjct: 102 FLEKK---QDEVVARIEERIAAWTFLPPDNGES----IQILHYQNGEKYEPHYDYFHDKN 154

Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLNL-------------------TVFPEKGSA 191
            +  G  R+A+ + YL+DV  GG TIFP   +                    V P KG A
Sbjct: 155 NQALGGHRIATVLMYLSDVGKGGETIFPEAEVGKLLQPKDDTWSDCAKNGYAVKPVKGDA 214

Query: 192 VFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           + +++ H +   D    H  CPV  G KW
Sbjct: 215 LLFFSLHPDATTDSDSLHGSCPVIEGQKW 243


>gi|302762452|ref|XP_002964648.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
 gi|300168377|gb|EFJ34981.1| hypothetical protein SELMODRAFT_82355 [Selaginella moellendorffii]
          Length = 225

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/198 (25%), Positives = 91/198 (45%), Gaps = 30/198 (15%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPF 106
           PR   +H+ + D E + +I ++   +++  VV+       D+R+     ++    G    
Sbjct: 21  PRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRDSRVRTSSGMFLN-RGQDRV 79

Query: 107 LYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLWRLAS 161
           + +I+ +I  +T +     E     +Q+ +Y  G  YD H D        R+ G  R+A+
Sbjct: 80  ISEIEDKIAKLTFIPKDHGE----GIQVLHYEPGQKYDAHHDFFYDTVNTRNGGQ-RIAT 134

Query: 162 FMFYLTDVELGGATIFPS-------------------LNLTVFPEKGSAVFWYNAHANTL 202
            + YLTDVE GG T+FP                      ++V P++G A+ +++   +  
Sbjct: 135 LLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMSPDAQ 194

Query: 203 LDYRMYHSGCPVALGNKW 220
           LD+   H GCPV  G+KW
Sbjct: 195 LDHSSLHGGCPVIKGDKW 212


>gi|209522122|ref|ZP_03270769.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
 gi|209497434|gb|EDZ97642.1| Procollagen-proline dioxygenase [Burkholderia sp. H160]
          Length = 296

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 50/185 (27%), Positives = 87/185 (47%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDH 104
           P  V + D +   E  ++I L++ +++R  VV+   G  +    R S   F      G+ 
Sbjct: 102 PAAVHLADFLSADECEQLIALAQPRLDRSTVVDPVTGRNVVAGHRSSHGMFFR---LGET 158

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGL--- 156
           P + +I+ RI  +T   +   E  +G LQ+ +Y  G     H D          E +   
Sbjct: 159 PLIVRIEARIAALTGTPV---ENGEG-LQMLHYEEGAESTPHVDYLITGNEANRESIARS 214

Query: 157 -WRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+ + + YL DVE GG T+FP +  +V P++G A+++   +   L D    H+  P+ 
Sbjct: 215 GQRMGTLLMYLKDVEGGGETVFPQIGWSVAPQRGHALYFEYGNRFGLCDPSSLHASTPLR 274

Query: 216 LGNKW 220
           +G+KW
Sbjct: 275 VGDKW 279


>gi|148537204|dbj|BAF63493.1| prolyl 4-hydroxylase [Potamogeton distinctus]
          Length = 246

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 56/185 (30%), Positives = 81/185 (43%), Gaps = 31/185 (16%)

Query: 60  EINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
           E + +I L K K+E+  V +   G ++  + R S   FL  E   D   + +I+ RI   
Sbjct: 8   ECDHLIALGKDKLEKSMVADNESGKSVMSEIRTSSGMFL--ERRQDET-ITRIEKRIAAW 64

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLH----CDATPRDEGLWRLASFMFYLTDVELGG 173
           T L     E    P+QI +Y  G  YD H     D   +  G  R+A+ + YL+DV+ GG
Sbjct: 65  TFLP----EENGEPIQILHYEKGQKYDAHYDYFHDKNNQRVGGHRMATVLMYLSDVKKGG 120

Query: 174 ATIFPSLN------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
            T+FP                       V P KG A+ +++ H N   D    H+ CPV 
Sbjct: 121 ETVFPDAEGKLLQVKDDTWSDCARSGYAVKPRKGDALLFFSCHPNATTDPNSLHASCPVI 180

Query: 216 LGNKW 220
            G KW
Sbjct: 181 EGEKW 185


>gi|302823087|ref|XP_002993198.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
 gi|300138968|gb|EFJ05718.1| hypothetical protein SELMODRAFT_431327 [Selaginella moellendorffii]
          Length = 269

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 57/207 (27%), Positives = 90/207 (43%), Gaps = 19/207 (9%)

Query: 32  FLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDT-----IYV 86
            L+IG +K E L   PR++ +H  +   E + +I ++  ++ +  VV+         I  
Sbjct: 52  LLRIGLVKPEVLNWSPRIILLHKFLSAEECDYLIAIAGPRLAKSTVVDTSTGKARHGIES 111

Query: 87  DTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH 146
             R S   FL       +P +  I+ RI   + + +   E  +      N     H+D  
Sbjct: 112 KVRTSTGMFL-SNYDRRYPMIQAIERRIAVYSMIPVENGELLQVLRYEPNQYYKPHHDYF 170

Query: 147 CDATPRDEGLWRLASFMFYLTDVELGGATIFPSL-------------NLTVFPEKGSAVF 193
            D      G  R+A+ + YL+DVE GG TIFPS+              L V P KG A+ 
Sbjct: 171 SDQFNLKRGGQRVATVLMYLSDVEEGGETIFPSVGDGECECGGELRKGLCVKPRKGDAIL 230

Query: 194 WYNAHANTLLDYRMYHSGCPVALGNKW 220
           +++A  +  +D    H GC V  G KW
Sbjct: 231 FWSAALDGNVDSNSLHGGCSVLRGEKW 257


>gi|145341735|ref|XP_001415959.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144576182|gb|ABO94251.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 254

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 54/212 (25%), Positives = 89/212 (41%), Gaps = 38/212 (17%)

Query: 39  KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFL 96
           +VE L   PR   + DA+ +++   ++  ++ +V R  VV+   G++     R SK  FL
Sbjct: 2   RVEPLSWYPRAFALRDALTEAQCEAVLRATRARVRRSTVVDSVTGESKVDPIRTSKQTFL 61

Query: 97  YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRD--- 153
                 D   + +I   +  +T L     E     +Q+  Y +G  YD H D    D   
Sbjct: 62  N----RDEEVVREIYDALSAVTMLPWTHNED----MQVLEYRVGEKYDAHEDVGAEDSLS 113

Query: 154 ------EGLWRLASFMFYLTDVELGGATIFPSLN-------------------LTVFPEK 188
                 +G  R+A+ + YL + E GG T FP                      + + P +
Sbjct: 114 GRELSKDGGKRVATVLLYLEEPEAGGETAFPDSEWIDPKMAEGTSWSKCAEHRVAMKPRR 173

Query: 189 GSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           G  + +++   N  +D+R  H GCPV  G KW
Sbjct: 174 GDGLIFWSVDPNGKIDHRALHVGCPVVAGVKW 205


>gi|302815629|ref|XP_002989495.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
 gi|300142673|gb|EFJ09371.1| hypothetical protein SELMODRAFT_129912 [Selaginella moellendorffii]
          Length = 213

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 51/198 (25%), Positives = 91/198 (45%), Gaps = 30/198 (15%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYPEIFGDHPF 106
           PR   +H+ + D E + +I ++   +++  VV+       D+R+     ++    G    
Sbjct: 9   PRASLVHNFLTDDECDHLIRVAMPLMQKSTVVDSQTGGSRDSRVRTSSGMFLN-RGQDRV 67

Query: 107 LYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD-----ATPRDEGLWRLAS 161
           + +I+ +I  +T +     E     +Q+ +Y  G  YD H D        R+ G  R+A+
Sbjct: 68  ISEIEDKIAKLTFIPKDHGE----GIQVLHYEPGQKYDAHHDFFYDTVNTRNGGQ-RIAT 122

Query: 162 FMFYLTDVELGGATIFPS-------------------LNLTVFPEKGSAVFWYNAHANTL 202
            + YLTDVE GG T+FP                      ++V P++G A+ +++   +  
Sbjct: 123 LLMYLTDVEEGGETVFPKSAKNSSSLPWHNQLSECGRRGVSVRPKRGDALLFWSMSPDAQ 182

Query: 203 LDYRMYHSGCPVALGNKW 220
           LD+   H GCPV  G+KW
Sbjct: 183 LDHSSLHGGCPVIKGDKW 200


>gi|333912984|ref|YP_004486716.1| procollagen-proline dioxygenase [Delftia sp. Cs1-4]
 gi|333743184|gb|AEF88361.1| Procollagen-proline dioxygenase [Delftia sp. Cs1-4]
          Length = 294

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 62/232 (26%), Positives = 106/232 (45%), Gaps = 26/232 (11%)

Query: 5   LACQGNLSVPEDI--KSNLKCFYESYNNTFLKIGPLKVEEL--YLDPRVVKIHDAIYDSE 60
           L  QG+++ P  +   S+L     + + + + +G  +V+ L    +PR+V   + +   E
Sbjct: 61  LPEQGDVAPPAVVISASSLPEPDLAQDPSSIDVGDRQVQVLVSMRNPRIVVFGNLLSHEE 120

Query: 61  INRIIELSKGKVERGKVV---NYGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDM 117
            + II  ++ ++ R   V   + G+ I  D   + ++F      G+   + +++ RI  +
Sbjct: 121 CDAIIAAARPRMARSLTVATQSGGEEINDDRTSNGMFFQR----GETGIVSQLEERIARL 176

Query: 118 TNLVIGREERYKGPLQINNYGLGGHYDLHCD-------ATPR--DEGLWRLASFMFYLTD 168
               +   E     LQ+ +YG G  Y  H D        TP     G  R+ + + YL +
Sbjct: 177 LRWPLDHGEG----LQVLHYGPGAEYKPHHDYFAPGEPGTPTILKRGGQRVGTLVIYLNE 232

Query: 169 VELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            E GGATIFP + L V P +G+AVF+     +     R  H G PV  G KW
Sbjct: 233 PERGGATIFPEVPLQVVPRRGNAVFFSYERPDP--STRTLHGGAPVLAGEKW 282


>gi|254254263|ref|ZP_04947580.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
 gi|124898908|gb|EAY70751.1| hypothetical protein BDAG_03558 [Burkholderia dolosa AUO158]
          Length = 285

 Score = 71.6 bits (174), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 54/185 (29%), Positives = 85/185 (45%), Gaps = 18/185 (9%)

Query: 47  PRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDH 104
           P++V   + +   E + +I+ S  K+E+   VN   G    +  R S   +      G+ 
Sbjct: 96  PQIVVFGNVLDQDECDEMIQRSMHKLEQSTTVNAETGTQEVIRHRTSHGTWFQ---NGED 152

Query: 105 PFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD---------ATPRDEG 155
             + +I+TR+  + N  +   E  +G LQ+  Y  GG Y  H D          T    G
Sbjct: 153 ALIRRIETRLAALMNCPV---ENGEG-LQVLRYTPGGEYRSHYDYFQPTAAGSLTHVRTG 208

Query: 156 LWRLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVA 215
             R+A+ + YL DV  GG T+FP   ++V P +G AV++   +    LD    H+G PV 
Sbjct: 209 GQRVATLIVYLNDVPSGGETVFPEAGISVVPRRGDAVYFRYMNRLRQLDPATLHAGAPVR 268

Query: 216 LGNKW 220
            G KW
Sbjct: 269 DGEKW 273


>gi|448930198|gb|AGE53763.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus IL-3A]
 gi|448931603|gb|AGE55164.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus MA-1E]
          Length = 239

 Score = 71.2 bits (173), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 62/189 (32%), Positives = 97/189 (51%), Gaps = 26/189 (13%)

Query: 51  KIHDAIYDSEINRIIE--LSKG--KVERGKVVNYGDTIYVD--TRLSKVYFLYPEIFGDH 104
           ++HD + D+E + +I   + KG  K E G   +  D I +D  +R S+  +  P   G+H
Sbjct: 54  ELHDFLSDAECDVLINAAIKKGLIKSEVGGATD-DDPIKLDPKSRNSEQTWFTP---GEH 109

Query: 105 PFLYKIQTRIQDMTNLVIGREERYK-GPLQINNYGLGGHYDLH-----CD-ATPRDEGLW 157
             + KIQ + +++ N      ++Y    +Q+  Y  G +Y  H     CD A P+D+   
Sbjct: 110 KIIDKIQKKTRELLNSKKHCIDKYNFEDVQVARYKPGQYYYHHYDGDDCDDACPKDQ--- 166

Query: 158 RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYR-MYHSGCPV-- 214
           RLA+ M YL   + GG T FP+LN  V P+KG AVF++ A   T   Y+   H+G PV  
Sbjct: 167 RLATLMVYLKAPKEGGETDFPTLNTQVLPKKGKAVFFWVADPATRKLYKETLHAGLPVKN 226

Query: 215 ---ALGNKW 220
               + N+W
Sbjct: 227 GVKVIANQW 235


>gi|448927821|gb|AGE51393.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus CviKI]
          Length = 239

 Score = 71.2 bits (173), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 62/189 (32%), Positives = 97/189 (51%), Gaps = 26/189 (13%)

Query: 51  KIHDAIYDSEINRIIE--LSKG--KVERGKVVNYGDTIYVD--TRLSKVYFLYPEIFGDH 104
           ++HD + D+E + +I   + KG  K E G   +  D I +D  +R S+  +  P   G+H
Sbjct: 54  ELHDFLSDAECDVLINAAIKKGLIKSEVGGATD-DDPIKLDPKSRNSEQTWFTP---GEH 109

Query: 105 PFLYKIQTRIQDMTNLVIGREERYK-GPLQINNYGLGGHYDLH-----CD-ATPRDEGLW 157
             + KIQ + +++ N      ++Y    +Q+  Y  G +Y  H     CD A P+D+   
Sbjct: 110 KIIDKIQKKTRELLNSKKHCIDKYNFEDVQVARYKPGQYYYHHYDGDDCDDACPKDQ--- 166

Query: 158 RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYR-MYHSGCPV-- 214
           RLA+ M YL   + GG T FP+LN  V P+KG AVF++ A   T   Y+   H+G PV  
Sbjct: 167 RLATLMVYLKAPKEGGETDFPTLNTQVLPKKGKAVFFWVADPATRKLYKETLHAGLPVKN 226

Query: 215 ---ALGNKW 220
               + N+W
Sbjct: 227 GVKVIANQW 235


>gi|448928822|gb|AGE52391.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus CvsA1]
          Length = 239

 Score = 71.2 bits (173), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 63/189 (33%), Positives = 97/189 (51%), Gaps = 26/189 (13%)

Query: 51  KIHDAIYDSEINRIIE--LSKG--KVERGKVVNYGDTIYVD--TRLSKVYFLYPEIFGDH 104
           ++HD + D+E + +I   + KG  K E G   +  D I +D  +R S+  +  P   G+H
Sbjct: 54  ELHDFLSDAECDVLINAAIKKGLIKSEVGGATD-DDPIKLDPKSRNSEQTWFTP---GEH 109

Query: 105 PFLYKIQTRIQDMTNLVIGREERYK-GPLQINNYGLGGHYDLH-----CD-ATPRDEGLW 157
             + KIQ + +++ N      ++Y    +Q+  Y  G +Y  H     CD A P+D+   
Sbjct: 110 EVIDKIQNKTRELLNNKKHCIDKYIFEDVQVARYKPGQYYYHHYDGDDCDDACPKDQ--- 166

Query: 158 RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYR-MYHSGCPV-- 214
           RLA+ M YL   E GG T FP+LN  V P+KG AVF++ A   T   Y+   H+G PV  
Sbjct: 167 RLATLMVYLKAPEEGGETDFPTLNTQVLPKKGKAVFFWVADPATRKLYKETLHAGLPVKN 226

Query: 215 ---ALGNKW 220
               + N+W
Sbjct: 227 GVKVIANQW 235


>gi|325915062|ref|ZP_08177391.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
           ATCC 35937]
 gi|325538760|gb|EGD10427.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas vesicatoria
           ATCC 35937]
          Length = 286

 Score = 71.2 bits (173), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 59/217 (27%), Positives = 98/217 (45%), Gaps = 22/217 (10%)

Query: 17  IKSNLKCFYESYNNTFLKIGPLKVEELYLD--PRVVKIHDAIYDSEINRIIELSKGKVER 74
           +   +    +  + + L +G  +V  L     PRV+ +   + D+E + +I L++ ++ R
Sbjct: 64  VPVRVPALLQDSDASLLDLGDRQVHVLMRMQLPRVMVLGGFLSDAECDAMIALAQPRLAR 123

Query: 75  GKVVNYGDTIYV--DTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPL 132
            + V+  +  +V    R S    L     G      +I+ RI  + +  +   E  +G L
Sbjct: 124 SRTVDNANGAHVVHAARTSDSMCLQ---LGQDALCQRIEARIARLLDWPV---ENGEG-L 176

Query: 133 QINNYGLGGHYDLHCD-------ATP--RDEGLWRLASFMFYLTDVELGGATIFPSLNLT 183
           Q+  YG G  Y  H D        TP     G  R+AS + YL   + GGAT FP ++L 
Sbjct: 177 QVLRYGTGAEYQPHYDYFDPDAAGTPVLLQAGGQRVASLVMYLNTPDRGGATRFPDVHLD 236

Query: 184 VFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +   KG+AVF+     + +   R  H+G PV  G KW
Sbjct: 237 IAAIKGNAVFFSYDRPHPM--TRSLHAGAPVLAGEKW 271


>gi|448924767|gb|AGE48348.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus AN69C]
 gi|448933638|gb|AGE57193.1| prolyl 4-hydroxylase [Paramecium bursaria Chlorella virus NE-JV-4]
          Length = 239

 Score = 71.2 bits (173), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 62/189 (32%), Positives = 97/189 (51%), Gaps = 26/189 (13%)

Query: 51  KIHDAIYDSEINRIIE--LSKG--KVERGKVVNYGDTIYVD--TRLSKVYFLYPEIFGDH 104
           ++HD + D+E + +I   + KG  K E G   +  D I +D  +R S+  +  P   G+H
Sbjct: 54  ELHDFLSDAECDILINAAIKKGLIKSEVGGATD-DDPIKLDPKSRNSEQTWFTP---GEH 109

Query: 105 PFLYKIQTRIQDMTNLVIGREERYK-GPLQINNYGLGGHYDLH-----CD-ATPRDEGLW 157
             + KIQ + +++ +      ++Y    +Q+  Y  G +Y  H     CD A P+D+   
Sbjct: 110 KIIDKIQNKTRELLDSKKHCIDKYNFEDVQVARYKPGQYYYHHYDGDDCDDACPKDQ--- 166

Query: 158 RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYR-MYHSGCPV-- 214
           RLA+ M YL   E GG T FP+LN  V P+KG AVF++ A   T   Y+   H+G PV  
Sbjct: 167 RLATLMVYLKAPEEGGETDFPTLNTQVLPKKGKAVFFWVADPATRKLYKETLHAGLPVKN 226

Query: 215 ---ALGNKW 220
               + N+W
Sbjct: 227 GVKVIANQW 235


>gi|260802724|ref|XP_002596242.1| hypothetical protein BRAFLDRAFT_117983 [Branchiostoma floridae]
 gi|229281496|gb|EEN52254.1| hypothetical protein BRAFLDRAFT_117983 [Branchiostoma floridae]
          Length = 527

 Score = 71.2 bits (173), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 46/155 (29%), Positives = 81/155 (52%), Gaps = 14/155 (9%)

Query: 1   EIYPLACQGN----LSVPEDIKSNLKCFYESYNN-TFLKIGPLKVEELYLDPRVVKIHDA 55
           EIY L CQ       ++      +LKC Y + NN   L + P K+E+++  P++   H+ 
Sbjct: 306 EIYELLCQAEQPDMFNITPSRAKHLKCRYFTNNNHPRLLLAPQKLEQVFDKPKMWIFHNI 365

Query: 56  IYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR 113
           + D E+  I +L++ ++ R  + N   G+  +   R+SK  +L      +H  + ++  R
Sbjct: 366 LTDPEMKVIKDLAQPRLRRATIQNSITGELEHASYRISKSAWLQG---WEHKVIRRVNQR 422

Query: 114 IQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD 148
           ++D+T L +   E     LQ+ NYG+GGHY+ H D
Sbjct: 423 VEDVTGLTMETAEE----LQVVNYGMGGHYEPHFD 453


>gi|308799555|ref|XP_003074558.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
 gi|116000729|emb|CAL50409.1| putative oxidoreductase (ISS) [Ostreococcus tauri]
          Length = 274

 Score = 71.2 bits (173), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 54/213 (25%), Positives = 90/213 (42%), Gaps = 38/213 (17%)

Query: 38  LKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYF 95
           + VE L   PR   + +A+ ++E+  I+ L++ +V R  V++   G ++    R SK  F
Sbjct: 7   IAVEPLSWYPRAFALRNALDETEMRAILALARTRVARSTVIDSESGKSVVNPIRTSKQTF 66

Query: 96  LYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPR--- 152
           L      + P + K+  R+  +T+L     E     LQ+  Y  G  YD H D       
Sbjct: 67  LS----RNDPVVRKVLERMSSVTHLPWYHCED----LQVLEYSAGEKYDAHEDVGEEGTK 118

Query: 153 ------DEGLWRLASFMFYLTDVELGGATIFPS-------------------LNLTVFPE 187
                   G  R+A+ + YL + E GG T FP                      + + P 
Sbjct: 119 SGDQLSKNGGKRVATILLYLEEPEEGGETAFPDSEWIDPERAKTETWSKCAHRRVAMKPT 178

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +G  + +++   +  +D+R  H GCP   G KW
Sbjct: 179 RGDGLMFWSVRPDGTIDHRALHVGCPPTRGTKW 211


>gi|212720650|ref|NP_001132477.1| uncharacterized protein LOC100193935 precursor [Zea mays]
 gi|194694488|gb|ACF81328.1| unknown [Zea mays]
 gi|347978828|gb|AEP37756.1| prolyl 4-hydroxylase 7 [Zea mays]
 gi|413934218|gb|AFW68769.1| prolyl 4-hydroxylase [Zea mays]
          Length = 298

 Score = 71.2 bits (173), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 58/208 (27%), Positives = 90/208 (43%), Gaps = 31/208 (14%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
           P +V +L   PR       + D+E + +I L+K K+E+  V +   G ++  + R S   
Sbjct: 32  PSRVVQLSWRPRAFLHKGFLLDAECDHLIALAKDKLEKSMVADNKSGKSVQSEVRTSSGM 91

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
           FL  +       + +I+ RI   T L     E     +QI +Y  G     HYD   D  
Sbjct: 92  FLEKK---QDEVVTRIEERISAWTFLPPENGE----AIQILHYQNGEKYEPHYDYFHDKN 144

Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAV 192
            +  G  R+A+ + YL++VE GG TIFP+                      V P KG A+
Sbjct: 145 NQALGGHRIATVLMYLSNVEKGGETIFPNAEGKLLQPKDDTWSDCARNGYAVKPVKGDAL 204

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
            +++ H ++  D    H  CP   G KW
Sbjct: 205 LFFSLHPDSTTDSDSLHGSCPAIEGQKW 232


>gi|198417608|ref|XP_002125299.1| PREDICTED: similar to prolyl-4-hydroxylase-alpha EFB CG31022-PA
           [Ciona intestinalis]
          Length = 471

 Score = 71.2 bits (173), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 31/64 (48%), Positives = 43/64 (67%)

Query: 158 RLASFMFYLTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALG 217
           R+A+ + YL++V+ GG+T F   N+   P KGSAVFWYN + +  LD R  H+ CPV +G
Sbjct: 381 RIATALVYLSEVQKGGSTAFFYPNIVAEPIKGSAVFWYNLYPSGALDKRTLHAACPVLIG 440

Query: 218 NKWG 221
           NKW 
Sbjct: 441 NKWA 444


>gi|388495016|gb|AFK35574.1| unknown [Lotus japonicus]
          Length = 297

 Score = 71.2 bits (173), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 56/213 (26%), Positives = 89/213 (41%), Gaps = 34/213 (15%)

Query: 35  IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSK 92
           I P KV+++   PR       +   E + +I L+K +++R  V +   GD+   + R S 
Sbjct: 31  INPSKVKQVSWKPRAFVYEGFLTGLECDHLISLAKSELKRSAVADNLPGDSKLSEVRTSS 90

Query: 93  VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH----CD 148
             F+  +     P +  I+ +I   T L     E     +Q+  Y  G  YD H     D
Sbjct: 91  GMFISKK---KDPIVAGIEDKISAWTFLPKENGED----MQVLRYEHGQKYDPHYDYFTD 143

Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFP---------------------SLNLTVFPE 187
                 G  R+A+ + YLT+V  GG T+FP                        + V P 
Sbjct: 144 KVNIVRGGHRMATVLLYLTNVTRGGETVFPVAEEPPRRRGLETNSDLSECAKKGIAVKPR 203

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +G A+ +++ H   + D    H+GCPV  G KW
Sbjct: 204 RGDALLFFSLHTTAIPDTDSLHAGCPVIEGEKW 236


>gi|307102963|gb|EFN51228.1| hypothetical protein CHLNCDRAFT_141231 [Chlorella variabilis]
          Length = 313

 Score = 71.2 bits (173), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 58/186 (31%), Positives = 82/186 (44%), Gaps = 28/186 (15%)

Query: 56  IYDSEINRIIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTR 113
           + + E + I+ L+K  +ER  VV+   G +   D R SK  FL     G    +  I+ R
Sbjct: 48  LTEEECDHIVALAKPHLERSGVVDTATGGSEISDIRTSKGMFLE---RGHDDTVAAIEER 104

Query: 114 IQDMTNLVIGREERYKGPLQINNYGLGGHYD--LHCDATPRDEGLWRLASFMFYLTDVEL 171
           I   T L +G  E     LQ+ NY  G  YD            G  R A+ + YL  VE 
Sbjct: 105 IARWTLLPVGNGEG----LQVLNYHPGEKYDDYFFDKVNGESNGGNRYATVLMYLNTVEE 160

Query: 172 GGATIFPSL-----------------NLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPV 214
           GG T+FP++                 +L   P KGSAV +++   +  L+ R  H+ CPV
Sbjct: 161 GGETVFPNIPAPGGDNGPTFTECARRHLAAKPTKGSAVLFHSIKPSGDLERRSLHTACPV 220

Query: 215 ALGNKW 220
             G KW
Sbjct: 221 VKGEKW 226


>gi|449459442|ref|XP_004147455.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
 gi|449515722|ref|XP_004164897.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like [Cucumis
           sativus]
          Length = 319

 Score = 71.2 bits (173), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 58/213 (27%), Positives = 92/213 (43%), Gaps = 30/213 (14%)

Query: 31  TFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVV-NYGDTIYVDTR 89
           + + I P +V +L   PR       +   E   +I  +KGK+ +  V    G ++    R
Sbjct: 51  SAMTIDPTRVIQLSSKPRAFLYKGFLSAEECQHLINSAKGKLHQSLVAAGTGQSVTSKER 110

Query: 90  LSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCD- 148
            S   FL+         + +I++RI   T L +   E    P+QI  Y  G  Y+ H D 
Sbjct: 111 TSTGMFLHK---AQDEIVARIESRIAAWTFLPLDNGE----PIQILRYENGQKYEPHFDF 163

Query: 149 -ATPRDE--GLWRLASFMFYLTDVELGGATIFPS------------------LNLTVFPE 187
              P +   G  R+A+ + YL++VE GG T+FP+                  +   V P+
Sbjct: 164 FQDPGNIAIGGHRIATILMYLSNVEKGGETVFPNSPVKLSEEEKADLSECGKVGYGVRPK 223

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            G A+ +++ + N   D   YH  CPV  G KW
Sbjct: 224 LGDALLFFSMNPNVTPDTTSYHGSCPVIEGEKW 256


>gi|388520325|gb|AFK48224.1| unknown [Lotus japonicus]
          Length = 188

 Score = 71.2 bits (173), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 54/182 (29%), Positives = 81/182 (44%), Gaps = 32/182 (17%)

Query: 64  IIELSKGKVERGKVVNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLV 121
           +I L+K  + +  VV+   G ++    R S   FL     G    +  I+ RI D   + 
Sbjct: 1   MINLAKPHMAKSSVVDSQTGKSVGSRVRTSSGMFLKR---GKDKVIQTIEKRIADFAFIP 57

Query: 122 IGREERYKGPLQINNYGLGG----HYDLHCDATPRDEGLWRLASFMFYLTDVELGGATIF 177
           +   E     LQ+ +Y +G     HYD   D      G  R+A+ + YL+DVE GG TIF
Sbjct: 58  VENGEG----LQVLHYEVGQKYEPHYDYFLDEFNTKNGGQRIATVLMYLSDVEEGGETIF 113

Query: 178 PSLN-------------------LTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGN 218
           P+                     L+V P++G A+ +++   +  LD    H GCPV  GN
Sbjct: 114 PAAKANFSSVPWYNDLSVCAKKGLSVKPKRGDALLFWSIRPDATLDPSSLHGGCPVIRGN 173

Query: 219 KW 220
           KW
Sbjct: 174 KW 175


>gi|356555585|ref|XP_003546111.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1-like isoform 1
           [Glycine max]
          Length = 301

 Score = 71.2 bits (173), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 55/213 (25%), Positives = 93/213 (43%), Gaps = 34/213 (15%)

Query: 35  IGPLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSK 92
           I P KV+++   PR       + + E + +I ++K +++R  V +   G++   + R S 
Sbjct: 35  IDPSKVKQVSWKPRAFVYEGFLTELECDHLISIAKSELKRSAVADNLSGESKLSEVRTSS 94

Query: 93  VYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLH----CD 148
             F+ P+     P +  ++ +I   T L     E     +Q+  Y  G  YD H     D
Sbjct: 95  GMFI-PK--NKDPIVAGVEDKISSWTLLPKENGED----IQVLRYEHGQKYDPHYDYFAD 147

Query: 149 ATPRDEGLWRLASFMFYLTDVELGGATIFPSLN---------------------LTVFPE 187
                 G  R+A+ + YLTDV  GG T+FP+                       + V P 
Sbjct: 148 KVNIARGGHRVATVLMYLTDVTKGGETVFPNAEESPRHRGSETKEDLSECAQKGIAVKPR 207

Query: 188 KGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           +G A+ +++ + N + D    H+GCPV  G KW
Sbjct: 208 RGDALLFFSLYPNAIPDTMSLHAGCPVIEGEKW 240


>gi|284035817|ref|YP_003385747.1| 2OG-Fe(II) oxygenase [Spirosoma linguale DSM 74]
 gi|283815110|gb|ADB36948.1| 2OG-Fe(II) oxygenase [Spirosoma linguale DSM 74]
          Length = 328

 Score = 71.2 bits (173), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 55/176 (31%), Positives = 83/176 (47%), Gaps = 14/176 (7%)

Query: 50  VKIHDAIYDS-EINRIIELSKGKV--ERGKVVNYGDTI-YVDTRLSKVYFLYPEIFGDHP 105
           V+IH   + + E   II+ ++ K    R ++    +T+   DTR S   FL       HP
Sbjct: 152 VQIHPHFFSADECAYIIQYAEEKTLFTRSQLEYDDNTVNESDTRTSYSAFLKDR---QHP 208

Query: 106 FLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGGHYDLHCDATPRDEGLWRLASFMFY 165
               I  R+     + +     Y  PLQ   YG G  +  H D+   +    RL + + Y
Sbjct: 209 VFQAIYERVAASLKVDLN----YIEPLQCVRYGEGQQFKPHFDSMSANH---RLHTMLVY 261

Query: 166 LTDVELGGATIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKWG 221
           L D  +GG T FP LN+ V P++GSA+++ N   N LL     H+G P+A G K+ 
Sbjct: 262 LNDDFVGGETYFPELNMNVHPKRGSALYFLNRDDNNLLLLNSVHAGLPIAQGMKYA 317


>gi|332526359|ref|ZP_08402485.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
 gi|332110495|gb|EGJ10818.1| procollagen-proline dioxygenase [Rubrivivax benzoatilyticus JA2]
          Length = 224

 Score = 70.9 bits (172), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 61/226 (26%), Positives = 98/226 (43%), Gaps = 26/226 (11%)

Query: 6   ACQGNLSVPEDIKSNLKCFYESYNNTFLKIGPLKVEELYLDPRVVKIHDAIYDSEINRII 65
           A  G  SVPE   +       + +     +  + +      PRVV     + + E + ++
Sbjct: 2   APAGAASVPEPALAGAPGVLRAGDREVHVLATMAL------PRVVVFGGLLSEQECDELV 55

Query: 66  ELSKGKVERGKVVN--YGDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIG 123
            L++ ++ R + V+   G +     R S   F      G+ P + +I+ RI ++ +  + 
Sbjct: 56  ALAQPRLLRSETVDNSTGGSEVNAARTSDGMFFE---RGETPLIERIERRIAELVHWPV- 111

Query: 124 REERYKGPLQINNYGLGGHYDLHCD---------ATPRDEGLWRLASFMFYLTDVELGGA 174
             ER +G LQ+ +Y  G  Y  H D         A     G  R+ + + YL     GGA
Sbjct: 112 --ERGEG-LQVLHYRPGAQYKPHHDFFDPAHPGTANILRRGGQRVGTVVIYLNTPAGGGA 168

Query: 175 TIFPSLNLTVFPEKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
           T FP + L V P KG+AVF+  ++   L   R  H G PV  G KW
Sbjct: 169 TTFPEVGLEVQPIKGNAVFF--SYERPLASTRTLHGGAPVLDGEKW 212


>gi|218199253|gb|EEC81680.1| hypothetical protein OsI_25242 [Oryza sativa Indica Group]
          Length = 487

 Score = 70.9 bits (172), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 58/206 (28%), Positives = 87/206 (42%), Gaps = 31/206 (15%)

Query: 39  KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVYFL 96
           +V  +   PRV      + D E + +++L K K++R  V +   G ++  + R S   FL
Sbjct: 56  RVRAVSWRPRVFVYKGFLSDDECDHLVKLGKRKMQRSMVADNKSGKSVMSEVRTSSGMFL 115

Query: 97  YPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDATPR 152
                   P + +I+ RI   T L     E     +QI  Y  G     H+D   D   +
Sbjct: 116 DKR---QDPVVSRIEKRIAAWTFL----PEENAENIQILRYEHGQKYEPHFDYFHDKVNQ 168

Query: 153 DEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAVFW 194
             G  R A+ + YL+ VE GG T+FP+                    L V P KG AV +
Sbjct: 169 ALGGHRYATVLMYLSTVEKGGETVFPNAEGWENQPKDDTFSECAQKGLAVKPVKGDAVLF 228

Query: 195 YNAHANTLLDYRMYHSGCPVALGNKW 220
           ++ H + + D    H  CPV  G KW
Sbjct: 229 FSLHIDGVPDPLSLHGSCPVIEGEKW 254


>gi|321466507|gb|EFX77502.1| hypothetical protein DAPPUDRAFT_25542 [Daphnia pulex]
          Length = 92

 Score = 70.9 bits (172), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 40/92 (43%), Positives = 54/92 (58%), Gaps = 5/92 (5%)

Query: 134 INNYGLGGHYDLHCD--ATPRDEGLWR--LASFMFYLTDVELGGATIFPSLNLTVFPEKG 189
           I +YG+GGH+  HCD     R E      LA+ + YL +VE GGAT+FP +   V P KG
Sbjct: 1   ILSYGVGGHFSPHCDYIRNKRIEAKTGNILATLIIYLNEVENGGATVFPIVKTRVKPVKG 60

Query: 190 SAVFWYNAHA-NTLLDYRMYHSGCPVALGNKW 220
           SA+FWYN +  N   +    H+ CP+  G+KW
Sbjct: 61  SALFWYNLNPDNGEGNPTTLHASCPILSGSKW 92


>gi|307102975|gb|EFN51240.1| hypothetical protein CHLNCDRAFT_28187 [Chlorella variabilis]
          Length = 322

 Score = 70.9 bits (172), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 60/216 (27%), Positives = 98/216 (45%), Gaps = 38/216 (17%)

Query: 39  KVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVNYGDTIYVDTRLSKVYFLYP 98
           ++E +   PR + +H  +  SE + +I L++ ++E  KVV+   +  +D+  ++      
Sbjct: 14  RIELVSWKPRALLLHGFLAHSECDHMISLAEARLEPSKVVSRDGSGKLDSVRTRQGLSSS 73

Query: 99  EIF---GDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLG----GHYDLHCD--- 148
             F        +  ++ RI+  T+L     E+    LQ+  Y LG     HYD+H     
Sbjct: 74  GTFLTKRQDSVVAGVEDRIELATHLPFSHSEQ----LQVLKYELGQKYSAHYDVHGSNEQ 129

Query: 149 ---ATPRDE-GLWRLASFMFYLTDVELGGATIFP-------------------SLNLTVF 185
              A  R E G  R A+ + YL+DVE GG T FP                   S  + V 
Sbjct: 130 AQLAIRRGEQGGSRYATMLMYLSDVEEGGETSFPHGRWIDEGAQAQPPYSECGSRGVAVK 189

Query: 186 PEKGSAVFWYNAHAN-TLLDYRMYHSGCPVALGNKW 220
           P KG A+ +Y+  ++    D+   H+GCPVA G K+
Sbjct: 190 PRKGDAILFYSLKSDGQSKDFFSLHAGCPVAKGVKY 225


>gi|325925807|ref|ZP_08187179.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
           91-118]
 gi|325543793|gb|EGD15204.1| 2OG-Fe(II) oxygenase superfamily enzyme [Xanthomonas perforans
           91-118]
          Length = 286

 Score = 70.9 bits (172), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 61/214 (28%), Positives = 92/214 (42%), Gaps = 22/214 (10%)

Query: 20  NLKCFYESYNNTFLKIGPLKVEELY--LDPRVVKIHDAIYDSEINRIIELSKGKVERGKV 77
            +    +  + + L +G   V  L   L PRVV +   + D E + +I L++  + R + 
Sbjct: 67  RVPALQQDADASLLALGDRDVRVLVSLLLPRVVVLGGFLSDEECDALIALARPHLARSRT 126

Query: 78  VNY--GDTIYVDTRLSKVYFLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQIN 135
           V+   G+ +    R S    L     G      +I+ RI  + +  +   E     LQ+ 
Sbjct: 127 VDNANGEHVVHAARTSDSMCLR---LGQDALCQRIEARIARLLDWPVDHGEG----LQVL 179

Query: 136 NYGLGGHYDLHCD-------ATPR--DEGLWRLASFMFYLTDVELGGATIFPSLNLTVFP 186
            Y  G  Y  H D        TP     G  R+AS + YL   E GGAT FP  +L V  
Sbjct: 180 RYATGAEYRPHYDYFDPDAAGTPVLVQAGGQRVASLVMYLNTPERGGATRFPDAHLDVAA 239

Query: 187 EKGSAVFWYNAHANTLLDYRMYHSGCPVALGNKW 220
            KG+AVF+     + +   R  H+G PV  G+KW
Sbjct: 240 VKGNAVFFSYDRPHPM--TRSLHAGAPVLAGDKW 271


>gi|225452614|ref|XP_002281420.1| PREDICTED: prolyl 4-hydroxylase subunit alpha-1 [Vitis vinifera]
 gi|296087745|emb|CBI35001.3| unnamed protein product [Vitis vinifera]
          Length = 316

 Score = 70.9 bits (172), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 57/208 (27%), Positives = 88/208 (42%), Gaps = 31/208 (14%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
           P +V +L   PR       + + E + +I L+K K+E+  V +   G +I  + R S   
Sbjct: 51  PTRVTQLSWRPRAFLYKGFLSEEECDHLITLAKDKLEKSMVADNESGKSIMSEVRTSSGM 110

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
           FL   +      +  I+ RI   T L +   E     +QI +Y  G     H+D   D  
Sbjct: 111 FL---LKAQDEIVADIEARIAAWTFLPVENGES----IQILHYENGEKYEPHFDYFHDKV 163

Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAV 192
            +  G  R+A+ + YL  VE GG T+FP+                      V P+KG A+
Sbjct: 164 NQLLGGHRIATVLMYLATVEEGGETVFPNSEGRFSQPKDDSWSDCAKKGYAVNPKKGDAL 223

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
            +++ H +   D    H  CPV  G KW
Sbjct: 224 LFFSLHPDATTDPSSLHGSCPVIAGEKW 251


>gi|115481998|ref|NP_001064592.1| Os10g0413500 [Oryza sativa Japonica Group]
 gi|110289075|gb|ABG66075.1| prolyl 4-hydroxylase, putative, expressed [Oryza sativa Japonica
           Group]
 gi|113639201|dbj|BAF26506.1| Os10g0413500 [Oryza sativa Japonica Group]
 gi|215692577|dbj|BAG87997.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|222612821|gb|EEE50953.1| hypothetical protein OsJ_31503 [Oryza sativa Japonica Group]
          Length = 308

 Score = 70.9 bits (172), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 59/208 (28%), Positives = 87/208 (41%), Gaps = 31/208 (14%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
           P +V +L   PR       + D+E   +I L+K K+E+  V +   G ++  + R S   
Sbjct: 42  PSRVVQLSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSEVRTSSGM 101

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
           FL  +       + +I+ RI   T L     E     +QI +Y  G     HYD   D  
Sbjct: 102 FLEKK---QDEVVARIEERIAAWTFLPPDNGES----IQILHYQNGEKYEPHYDYFHDKN 154

Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAV 192
            +  G  R+A+ + YL+DV  GG TIFP                       V P KG A+
Sbjct: 155 NQALGGHRIATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCAKNGYAVKPVKGDAL 214

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
            +++ H +   D    H  CPV  G KW
Sbjct: 215 LFFSLHPDATTDSDSLHGSCPVIEGQKW 242


>gi|218184507|gb|EEC66934.1| hypothetical protein OsI_33548 [Oryza sativa Indica Group]
          Length = 308

 Score = 70.9 bits (172), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 59/208 (28%), Positives = 87/208 (41%), Gaps = 31/208 (14%)

Query: 37  PLKVEELYLDPRVVKIHDAIYDSEINRIIELSKGKVERGKVVN--YGDTIYVDTRLSKVY 94
           P +V +L   PR       + D+E   +I L+K K+E+  V +   G ++  + R S   
Sbjct: 42  PSRVVQLSWRPRAFLHKGFLTDAECEHLISLAKDKLEKSMVADNESGKSVMSEVRTSSGM 101

Query: 95  FLYPEIFGDHPFLYKIQTRIQDMTNLVIGREERYKGPLQINNYGLGG----HYDLHCDAT 150
           FL  +       + +I+ RI   T L     E     +QI +Y  G     HYD   D  
Sbjct: 102 FLEKK---QDEVVARIEERIAAWTFLPPDNGES----IQILHYQNGEKYEPHYDYFHDKN 154

Query: 151 PRDEGLWRLASFMFYLTDVELGGATIFPSLN------------------LTVFPEKGSAV 192
            +  G  R+A+ + YL+DV  GG TIFP                       V P KG A+
Sbjct: 155 NQALGGHRIATVLMYLSDVGKGGETIFPEAEGKLLQPKDDTWSDCAKNGYAVKPVKGDAL 214

Query: 193 FWYNAHANTLLDYRMYHSGCPVALGNKW 220
            +++ H +   D    H  CPV  G KW
Sbjct: 215 LFFSLHPDATTDSDSLHGSCPVIEGQKW 242


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.322    0.143    0.443 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,778,808,045
Number of Sequences: 23463169
Number of extensions: 158162085
Number of successful extensions: 358937
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1390
Number of HSP's successfully gapped in prelim test: 503
Number of HSP's that attempted gapping in prelim test: 354759
Number of HSP's gapped (non-prelim): 2047
length of query: 227
length of database: 8,064,228,071
effective HSP length: 137
effective length of query: 90
effective length of database: 9,144,741,214
effective search space: 823026709260
effective search space used: 823026709260
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 74 (33.1 bits)
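

Note on the statistics above: the footer values are related by simple arithmetic. The sketch below is not part of the BLAST output. It assumes the standard Karlin-Altschul relations (bit score = (Lambda * S - ln K) / ln 2, and E = search space * 2^-bits) and uses the rounded gapped Lambda and K printed in the report, so results match the reported figures only approximately. The function names are hypothetical, and the effective database length is taken as reported rather than recomputed.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 227
EFF_HSP_LEN = 137
EFF_DB_LEN = 9_144_741_214      # "effective length of database", as reported
GAPPED_LAMBDA = 0.267           # second Lambda/K/H block (gapped), rounded
GAPPED_K = 0.0410


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits (Karlin-Altschul normalization)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


# Effective query length and search space, as printed in the footer.
eff_query_len = QUERY_LEN - EFF_HSP_LEN
search_space = eff_query_len * EFF_DB_LEN

if __name__ == "__main__":
    bits = bit_score(173, GAPPED_LAMBDA, GAPPED_K)
    print(f"effective query length: {eff_query_len}")
    print(f"effective search space: {search_space}")
    print(f"raw score 173 -> {bits:.1f} bits")
    print(f"E-value at 71.2 bits: {expect(71.2, search_space):.0e}")
```

For the hits reported with a raw score of 173, this should land close to the 71.2 bits and Expect = 3e-10 shown in the alignment headers. Small differences are expected because the printed Lambda and K are rounded and because compositional matrix adjustment is applied per alignment.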